feat(blooms): Prefetch bloom blocks as soon as they are built #15050

salvacorts · 2024-11-21T08:51:41Z

What this PR does / why we need it:

This PR adds a new RPC endpoint to the bloom gateway service. Builders pass the blocks they have built to the gateway, which downloads them asynchronously. That way, blocks will likely already be present in the gateways at query time.

This can be enabled by setting the per-tenant bloom_prefetch_blocks limit to true.
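To illustrate the per-tenant gating, here is a minimal sketch of how a builder might decide whether to notify the gateway. Everything in it is hypothetical (the `Limits` map, the `Prefetcher` interface, `notifyBuilt`, and `recordingClient` are stand-ins, not the types used in this PR); only the limit name bloom_prefetch_blocks and the RPC name PrefetchBloomBlocks come from the PR itself.

```go
package main

import "fmt"

// Limits is a hypothetical stand-in for the per-tenant limits lookup.
// A true value models bloom_prefetch_blocks being enabled for that tenant.
type Limits map[string]bool

// PrefetchEnabled reports whether prefetching is enabled for the tenant.
func (l Limits) PrefetchEnabled(tenant string) bool { return l[tenant] }

// BlockRef is a hypothetical identifier for a built bloom block.
type BlockRef string

// Prefetcher is a hypothetical, simplified view of the gateway client.
type Prefetcher interface {
	PrefetchBloomBlocks(tenant string, blocks []BlockRef) error
}

// recordingClient records requests instead of calling a real gateway.
type recordingClient struct{ calls map[string][]BlockRef }

func (c *recordingClient) PrefetchBloomBlocks(tenant string, blocks []BlockRef) error {
	c.calls[tenant] = append(c.calls[tenant], blocks...)
	return nil
}

// notifyBuilt sends a prefetch request only when the tenant has the limit
// enabled and there is at least one block. It reports whether a request was sent.
func notifyBuilt(l Limits, p Prefetcher, tenant string, blocks []BlockRef) (bool, error) {
	if !l.PrefetchEnabled(tenant) || len(blocks) == 0 {
		return false, nil
	}
	return true, p.PrefetchBloomBlocks(tenant, blocks)
}

func main() {
	limits := Limits{"tenant-a": true}
	client := &recordingClient{calls: map[string][]BlockRef{}}
	for _, tenant := range []string{"tenant-a", "tenant-b"} {
		sent, err := notifyBuilt(limits, client, tenant, []BlockRef{"block-1"})
		fmt.Println(tenant, sent, err)
	}
}
```

Tenants without the limit set fall through to the default (disabled) in this sketch, so no request is sent for them.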

We are also increasing the gateway download queue from 10K to 100K.

Checklist

Reviewed the CONTRIBUTING.md guide (required)
Documentation added
Tests updated
Title matches the required conventional commits format, see here
- Note that Promtail is considered to be feature complete, and future development for logs collection will be in Grafana Alloy. As such, feat PRs are unlikely to be accepted unless a case can be made for the feature actually being a bug fix to existing behavior.
Changes that require user attention or interaction to upgrade are documented in docs/sources/setup/upgrade/_index.md
If the change is deprecating or removing a configuration option, update the deprecated-config.yaml and deleted-config.yaml files respectively in the tools/deprecated-config-checker directory. Example PR

Signed-off-by: Christian Haudum <[email protected]>

pkg/bloomgateway/bloomgateway.go

Signed-off-by: Christian Haudum <[email protected]>

salvacorts

LGTM

pkg/bloomgateway/client.go

salvacorts · 2024-11-21T13:15:13Z

pkg/bloomgateway/bloomgateway.go

+
+	bqs, err := g.bloomStore.FetchBlocks(
+		// We don't use the ctx passed to the handler since its canceled when the handler returns
+		context.Background(),


We don't pass a timeout here: a context with a timeout requires calling its cancel function before the handler returns, which would stop the download. This is not an issue because when the gateway stops, the queue is closed and any pending block downloads are stopped.

rfratto

LGTM. I'm interested in seeing how this works in practice.

Since prefetch is only ever triggered by the builder, I can think of a few cases where this wouldn't work:

- Gateways get restarted before they can finish prefetching (either from a rollout, an OOM, or something else)
- We scale up gateways; new replicas won't be told to prefetch

I'm not sure how much these matter in practice, but this PR is pretty small so I think it's fine for us to give it a shot.
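The two cases above amount to a replica that never received, or never finished, a prefetch. A minimal sketch of that situation follows, assuming the query path falls back to downloading a block it does not already hold; `blockCache` and its methods are hypothetical and only model the behavior being discussed.

```go
package main

import "fmt"

// blockCache is a hypothetical per-replica store of downloaded bloom blocks.
type blockCache struct {
	blocks    map[string][]byte
	downloads int // number of downloads this replica performed
}

func newBlockCache() *blockCache {
	return &blockCache{blocks: map[string][]byte{}}
}

// download simulates fetching a block from object storage.
func (c *blockCache) download(ref string) []byte {
	c.downloads++
	return []byte("bloom:" + ref)
}

// prefetch stores the block ahead of any query, as a builder-triggered
// prefetch would.
func (c *blockCache) prefetch(ref string) {
	c.blocks[ref] = c.download(ref)
}

// get returns the block and whether it was already present. On a miss the
// block is downloaded at query time.
func (c *blockCache) get(ref string) ([]byte, bool) {
	if b, ok := c.blocks[ref]; ok {
		return b, true
	}
	b := c.download(ref)
	c.blocks[ref] = b
	return b, false
}

func main() {
	warm := newBlockCache() // replica that was told to prefetch
	warm.prefetch("block-1")
	_, hit := warm.get("block-1")
	fmt.Println("prefetched replica hit:", hit)

	cold := newBlockCache() // restarted or newly scaled-up replica
	_, hit = cold.get("block-1")
	fmt.Println("new replica hit:", hit)
}
```

In this model a cold replica still answers the query, but it pays the download cost at query time, which is the cost prefetching is meant to avoid.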

rfratto · 2024-11-21T13:15:00Z

pkg/bloomgateway/bloomgateway.go

@@ -161,6 +161,40 @@ func (g *Gateway) stopping(_ error) error {
 	return services.StopManagerAndAwaitStopped(context.Background(), g.serviceMngr)
 }

+func (g *Gateway) PrefetchBloomBlocks(_ context.Context, req *logproto.PrefetchBloomBlocksRequest) (*logproto.PrefetchBloomBlocksResponse, error) {


Do we need prefetch-specific metrics so we can measure the impact of this? (for example, how many blocks were downloaded via a prefetch)

feat(blooms): Add PrefetchBloomBlocks to bloomgateway service

fd25955

pull-request-size bot added the size/S label Nov 21, 2024

Implement PrefetchBloomBlocks gRPC method on gateway

204acaa

Signed-off-by: Christian Haudum <[email protected]>

salvacorts changed the title ~~feat(blooms): Add (unimplemented) PrefetchBloomBlocks to the bloom-gateway service~~ feat(blooms): Add PrefetchBloomBlocks to the bloom-gateway service Nov 21, 2024

pull-request-size bot added size/M and removed size/S labels Nov 21, 2024

salvacorts commented Nov 21, 2024

chaudum added 2 commits November 21, 2024 10:13

fixup! Implement PrefetchBloomBlocks gRPC method on gateway

a90607d

Signed-off-by: Christian Haudum <[email protected]>

fixup! fixup! Implement PrefetchBloomBlocks gRPC method on gateway

b141863

Signed-off-by: Christian Haudum <[email protected]>

salvacorts marked this pull request as ready for review November 21, 2024 09:39

salvacorts requested a review from a team as a code owner November 21, 2024 09:39

salvacorts commented Nov 21, 2024

chaudum approved these changes Nov 21, 2024

salvacorts changed the title ~~feat(blooms): Add PrefetchBloomBlocks to the bloom-gateway service~~ feat(blooms): Prefetch bloom blocks as soon as they are built Nov 21, 2024

Builders call PrefetchBloomBlocks

94face5

pull-request-size bot added size/L and removed size/M labels Nov 21, 2024

github-actions bot added the type/docs Issues related to technical documentation; the Docs Squad uses this label across many repositories label Nov 21, 2024

fixup

6a79cee

chaudum reviewed Nov 21, 2024

salvacorts added 6 commits November 21, 2024 11:53

CR feedback

685e871

remove unused cfg

bb2c7c2

use running ctx to fetch blocks async

d3fc8be

pass background ctx

db765bb

increase queue size

7fc112c

Skip reporting hit/miss in cache

1679110

salvacorts commented Nov 21, 2024

rfratto approved these changes Nov 21, 2024

Metrics

a1ae1b0

salvacorts merged commit b406015 into main Nov 22, 2024
58 checks passed

salvacorts deleted the salvacorts/PrefetchBloomBlocks-definition branch November 22, 2024 12:57

This was referenced Dec 23, 2024

chore(k234): release 3.4.0 #15536

Open

chore(k235): release 3.4.0 #15555

Open

loki-gh-app bot mentioned this pull request Jan 6, 2025

chore(k236): release 3.4.0 #15595

Open

loki-gh-app bot mentioned this pull request Jan 13, 2025

chore(k237): release 3.4.0 #15705

Open
