
Bloom compactor: Load blocks lazily in batches #11919

Merged 1 commit into main on Feb 13, 2024

Conversation

chaudum (Contributor) commented on Feb 12, 2024

What this PR does / why we need it:

To avoid loading a potentially large number of blocks upfront, this PR introduces lazy loading of blocks in batches, using an iterator that loads blocks on demand.

This is the first part of making block loading truly lazy.
As a second part, the merge builder interface will be changed so that it accepts a single iterator.
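To illustrate the idea of batched lazy loading, here is a minimal, self-contained sketch. It is not the actual Loki implementation: names such as `blockRef`, `blockQuerier`, `fetchFunc`, and `batchedLoader` are illustrative stand-ins for `bloomshipper.BlockRef`, `bloomshipper.CloseableBlockQuerier`, the `blocksFetcher` dependency, and the PR's `batchedBlockLoader`.

```go
package main

import "fmt"

// blockRef and blockQuerier stand in for the real bloomshipper types.
type blockRef int
type blockQuerier struct{ ref blockRef }

// fetchFunc stands in for the blocksFetcher dependency.
type fetchFunc func(refs []blockRef) []*blockQuerier

// batchedLoader yields blocks one at a time, but fetches them in
// batches, and only when the previous batch is exhausted.
type batchedLoader struct {
	fetch     fetchFunc
	refs      []blockRef
	batchSize int
	batch     []*blockQuerier
	cur       *blockQuerier
}

func (l *batchedLoader) Next() bool {
	if len(l.batch) == 0 {
		if len(l.refs) == 0 {
			return false
		}
		n := l.batchSize
		if n > len(l.refs) {
			n = len(l.refs)
		}
		l.batch = l.fetch(l.refs[:n]) // lazy: fetch only this batch
		l.refs = l.refs[n:]
	}
	l.cur, l.batch = l.batch[0], l.batch[1:]
	return true
}

func (l *batchedLoader) At() *blockQuerier { return l.cur }

func main() {
	fetchCalls := 0
	fetch := func(refs []blockRef) []*blockQuerier {
		fetchCalls++ // count round trips to storage
		out := make([]*blockQuerier, len(refs))
		for i, r := range refs {
			out[i] = &blockQuerier{ref: r}
		}
		return out
	}
	itr := &batchedLoader{fetch: fetch, refs: []blockRef{1, 2, 3, 4, 5}, batchSize: 2}
	seen := 0
	for itr.Next() {
		seen++
	}
	fmt.Println(seen, fetchCalls) // 5 blocks seen, 3 fetch calls
}
```

The point of the design is visible in the counters: five blocks are consumed, but storage is hit only three times (once per batch), and never before the consumer asks for the next batch.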

@chaudum chaudum requested review from owen-d and salvacorts February 12, 2024 13:23
@chaudum chaudum marked this pull request as ready for review February 12, 2024 13:23
@chaudum chaudum requested a review from a team as a code owner February 12, 2024 13:23
```go
func newBatchedBlockLoader(ctx context.Context, fetcher blocksFetcher, blocks []bloomshipper.BlockRef) (*batchedBlockLoader, error) {
	return &batchedBlockLoader{
		ctx:       ctx,
		batchSize: 10, // make configurable?
```

(excerpt truncated)
Contributor (review comment): I agree this should be configurable
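One way the hard-coded `10` could later be made configurable is to read it from a config struct with a fallback default. This is a hypothetical sketch, not code from the PR; `compactorConfig` and `BlockBatchSize` are invented names for illustration.

```go
package main

import "fmt"

// compactorConfig is a hypothetical config struct; the real Loki
// compactor config is more elaborate.
type compactorConfig struct {
	BlockBatchSize int
}

// batchSizeOrDefault returns the configured batch size, falling back
// to the value currently hard-coded in the PR.
func batchSizeOrDefault(cfg compactorConfig) int {
	if cfg.BlockBatchSize <= 0 {
		return 10 // current hard-coded default
	}
	return cfg.BlockBatchSize
}

func main() {
	fmt.Println(batchSizeOrDefault(compactorConfig{}))                   // 10
	fmt.Println(batchSizeOrDefault(compactorConfig{BlockBatchSize: 32})) // 32
}
```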

```go
		err.Add(itr.CloseBatch())
	default:
		// close remaining loaded blocks
		for itr.Next() && itr.Err() == nil {
```

(excerpt truncated)
Contributor (review comment): Would we ever reach here? As far as I can see, `closeLoadedBlocks` is only called from `buildBlocks`, which will pass whatever `loadWorkForGap` returns.

chaudum (Author): Since the function accepts the interface `v1.CloseableIterator[*bloomshipper.CloseableBlockQuerier]` rather than a concrete type, a non-batched version of the iterator could be passed.
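The pattern discussed here, a type switch that uses a batch-level close when available and otherwise drains the iterator closing item by item, can be sketched as follows. This is an illustrative reconstruction, not the PR's code: `closeableIterator`, `block`, `batchCloser`, and `sliceIterator` are invented stand-ins for the real `v1.CloseableIterator`, querier, and loader types.

```go
package main

import "fmt"

// closeableIterator plays the role of v1.CloseableIterator here.
type closeableIterator interface {
	Next() bool
	At() *block
	Err() error
}

type block struct{ closed bool }

func (b *block) Close() error { b.closed = true; return nil }

// batchCloser is the extra capability only a batched loader provides.
type batchCloser interface {
	CloseBatch() error
}

// closeLoadedBlocks closes whatever the iterator still holds: via
// CloseBatch when the concrete type supports it, otherwise by draining
// the iterator and closing each remaining block individually.
func closeLoadedBlocks(itr closeableIterator) {
	switch c := itr.(type) {
	case batchCloser:
		_ = c.CloseBatch()
	default:
		// close remaining loaded blocks one by one
		for itr.Next() && itr.Err() == nil {
			_ = itr.At().Close()
		}
	}
}

// sliceIterator is a plain, non-batched iterator that exercises the
// default branch of the type switch.
type sliceIterator struct {
	blocks []*block
	cur    *block
}

func (s *sliceIterator) Next() bool {
	if len(s.blocks) == 0 {
		return false
	}
	s.cur, s.blocks = s.blocks[0], s.blocks[1:]
	return true
}
func (s *sliceIterator) At() *block { return s.cur }
func (s *sliceIterator) Err() error { return nil }

func main() {
	blocks := []*block{{}, {}}
	closeLoadedBlocks(&sliceIterator{blocks: blocks})
	fmt.Println(blocks[0].closed, blocks[1].closed) // true true
}
```

This shows why the default branch is reachable: any caller holding the interface can hand in an iterator that has no `CloseBatch`, and the drain loop is the only way to release its blocks.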

owen-d added a commit that referenced this pull request Feb 13, 2024
…rs (#11924)

While reviewing #11919, I figured it'd be nice to make `batchedLoader` generic so we can reuse its logic. This lets me test it more easily and remove a lot of now-unnecessary adapter code (interfaces, types).
@chaudum chaudum force-pushed the chaudum/lazyloading-bloomcompactor-work branch from dafa7f0 to 35c21ac on February 13, 2024 07:56
@chaudum chaudum enabled auto-merge (squash) February 13, 2024 07:58
@chaudum chaudum merged commit eb8464a into main Feb 13, 2024
8 checks passed
@chaudum chaudum deleted the chaudum/lazyloading-bloomcompactor-work branch February 13, 2024 08:10
rhnasc pushed a commit to inloco/loki that referenced this pull request Apr 12, 2024
…rs (grafana#11924)

While reviewing grafana#11919, I figured it'd be nice to make `batchedLoader` generic so we can reuse its logic. This lets me test it more easily and remove a lot of now-unnecessary adapter code (interfaces, types).
rhnasc pushed a commit to inloco/loki that referenced this pull request Apr 12, 2024
To avoid loading a potentially large number of blocks upfront, this PR introduces lazy loading of blocks in batches, using an iterator that loads blocks on demand.

Signed-off-by: Christian Haudum <[email protected]>