20231018-access-logs: design doc #11

kaloyan-raev · 2023-10-23T12:02:15Z

No description provided.

ferristocrat · 2023-10-27T03:10:03Z

20231018-access-logs.md

+
+#### Scraping and annotating access logs
+
+Three services generate access logs: Linksharing, Gateway-MT, and the Satellite API Pods. Each of these services has multiple instances. A logging agent will run next to each instance and scrape the generated access logs. The logging agent may filter the log entries or re-arrange their log format, but its main task is to annotate the log entries with the Tenant ID and push them to the Grafana Loki destination.


Will we end up with "duplicate" logs for linksharing and gateway requests? In other words, won't there be a satellite log entry for every linksharing/gateway request, or are they somehow combined based on request ID?

There will be a different stream (or "label" in Grafana terms) of logs for Linksharing, Gateway-MT, and Satellite API. There should be a correlating log entry in the satellite stream with a matching request ID for each request in the edge services. However, this may not be the case in the future if we introduce edge caching.
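To make the stream layout concrete, here is a minimal sketch of what a logging agent might send to Loki's HTTP push endpoint (`/loki/api/v1/push`). It assumes the service name is carried in a hypothetical `service` label, that the Tenant ID travels in Loki's `X-Scope-OrgID` header, and that each log line is JSON with a `request_id` field. None of these choices are fixed by the design doc.

```python
import json


def build_push_request(service, tenant_id, entries):
    """Build headers and body for Loki's /loki/api/v1/push endpoint.

    entries: list of (timestamp_ns, request_id, message) tuples.
    The "service" label and the JSON line layout are illustrative assumptions.
    """
    values = [
        # Loki expects [<unix epoch in nanoseconds, as a string>, <log line>] pairs.
        [str(ts_ns), json.dumps({"request_id": request_id, "msg": message})]
        for ts_ns, request_id, message in entries
    ]
    body = {"streams": [{"stream": {"service": service}, "values": values}]}
    headers = {
        "Content-Type": "application/json",
        # Multi-tenant Loki reads the tenant from this header.
        "X-Scope-OrgID": tenant_id,
    }
    return headers, json.dumps(body)


def correlated_request_ids(edge_entries, satellite_entries):
    """Return request IDs present in both an edge stream and the satellite stream."""
    edge_ids = {request_id for _, request_id, _ in edge_entries}
    satellite_ids = {request_id for _, request_id, _ in satellite_entries}
    return edge_ids & satellite_ids
```

With separate streams per service, a request served through Linksharing would show up once in the `linksharing` stream and once in the satellite stream, and `correlated_request_ids` is the kind of join a customer could run to pair them up.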

ferristocrat · 2023-10-27T03:25:10Z

20231018-access-logs.md

+
+Storj will operate the Loki server in write-only mode (with `-target=write,compactor` flag), i.e. it won't allow querying the logs. We chose the write-only mode for easier operation.
+
+Customers will query their access logs not through the Storj Loki server but with Loki client tooling ([LogCLI](https://grafana.com/docs/grafana-cloud/monitor-infrastructure/logs/export/query-exported-logs/#querying-the-archive-using-logcli) or [read-only Loki server](https://grafana.com/docs/grafana-cloud/monitor-infrastructure/logs/export/query-exported-logs/#query-the-archive-using-loki-in-read-only-mode)) configured directly to their target Storj bucket.


Forcing Loki client tooling seems like an unnecessary step compared to dumping the logs in a more readily readable file format. What would change if we needed to provide a non-proprietary log format?

The Loki format is open source and described in their docs: https://grafana.com/docs/loki/latest/operations/storage/#chunk-format.

The raw log chunks are compressed and carry additional binary metadata. They can be converted to a readable text format with the Loki chunks-inspect tool: https://github.com/grafana/loki/tree/main/cmd/chunks-inspect.

Then, users can do whatever they want with them.

It's always nice to have human-readable logs without having to learn a new tool. I think people are going to want text logs readily available, but maybe that's worth testing in a beta to see if anybody cares.


pwilloughby · 2023-12-06T20:35:30Z

20231018-access-logs.md

+
+### Open question
+
+- Should we log linksharing requests beyond those to raw content like listing buckets and prefixes, displaying the object map, etc.?


I think that would be useful. Maybe we could make it clear where these requests are coming from with the user agent.

pwilloughby · 2023-12-06T20:39:21Z

20231018-access-logs.md

+
+#### Configuring a bucket for access logs
+
+The customer will be able to turn on access logs per bucket. By default, a bucket does not generate access logs. The customer may decide later to turn off the access logs for the bucket.


Which part of the stack knows which buckets have logging enabled? How does that information make it down to the Loki server?

This is yet to be decided. It could be another column in the satellite's bucket_metainfo table, or a separate registry.

For the MVP, we can manually configure the respective components:

- Linksharing: the list of project-bucket pairs to generate access logs for.
- The Loki distribution job: the S3 credentials to the target customer bucket for each Tenant ID.

When we have some experience with the MVP, we'll know best how to improve the config and communicate it across the stack.
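As a rough illustration of the manual MVP configuration described above, the two pieces could be as simple as a set of project-bucket pairs and a per-tenant credentials map. All names here (`LOGGED_BUCKETS`, `TENANT_TARGETS`, `S3Target`, the example IDs) are hypothetical; the design doc does not define a config format.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class S3Target:
    """Hypothetical S3 destination for one tenant's log chunks."""
    bucket: str
    access_key_id: str
    secret_access_key: str


# Linksharing side: (project ID, bucket name) pairs with access logs enabled.
LOGGED_BUCKETS = {
    ("project-1", "photos"),
    ("project-2", "backups"),
}

# Distribution job side: Tenant ID -> credentials for the customer's target bucket.
TENANT_TARGETS = {
    "tenant-a": S3Target("tenant-a-logs", "EXAMPLE_KEY_ID", "EXAMPLE_SECRET"),
}


def should_log(project_id: str, bucket: str) -> bool:
    """Access logs are off by default; only listed pairs generate them."""
    return (project_id, bucket) in LOGGED_BUCKETS


def target_for(tenant_id: str) -> Optional[S3Target]:
    """Return the S3 destination for a tenant, or None if none is configured."""
    return TENANT_TARGETS.get(tenant_id)
```

Keeping the two lookups separate mirrors the split in the reply: Linksharing only needs to know whether to emit a log entry, while the distribution job only needs to know where a tenant's chunks go.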

kaloyan-raev · 2023-12-07T19:19:53Z

I updated the doc and added an MVP section at the end.

20231018-access-logs: design doc

b561d23

kaloyan-raev marked this pull request as draft October 23, 2023 12:02

ferristocrat reviewed Oct 27, 2023


ferristocrat mentioned this pull request Dec 4, 2023

Review Linksharing Log Technical Design storj/edge#379

Closed

pwilloughby reviewed Dec 6, 2023


pwilloughby reviewed Dec 6, 2023


Added MVP section and a few clarifications

b79028d

kaloyan-raev marked this pull request as ready for review December 7, 2023 19:18



