Ingest batches of ledgers in-memory before flushing to DB #5099

tamirms · 2023-11-01T10:55:43Z

In #4909 we have updated all the transaction processors to use the FastBatchInsertBuilder to insert rows into the history tables with the postgres COPY command. We also refactored the transaction processor interface to allow a single processor to accumulate records across multiple ledgers:

type horizonTransactionProcessor interface {
	ProcessTransaction(xdr.LedgerCloseMeta, ingest.LedgerTransaction) error
	Flush(ctx context.Context, session db.SessionInterface) error
}

Now we can build on this work to improve the performance of reingestion. #4909 improved the performance of ingesting a single ledger. But we can extract more performance gains by ingesting multiple ledgers within a single processor lifetime.

In the spike branch the dataflow for ingesting batches of ledgers within a single processor lifetime is:

	accountLoader := history.NewAccountLoader()
	cbLoader := history.NewClaimableBalanceLoader()
	lpLoader := history.NewLiquidityPoolLoader()
	assetLoader := history.NewAssetLoader()
	processors := buildTransactionProcessors(
		s.historyQ,
		accountLoader,
		cbLoader,
		lpLoader,
		assetLoader,
	)

       // apply all the ledgers in the batch on the processors
       for _, ledger := range ledgers {
		if err = s.runner.ApplyProcessorsOnLedger(processors, ledgerCloseMeta); err != nil {
			return err
		}
       }

       // use the loaders to lookup all the accounts, assets, claimable balances, and liquidity pools registered
       // by the processors
       err = func() error {
		if err := s.historyQ.Begin(); err != nil {
			return errors.Wrap(err, "Error starting a transaction")
		}
		defer s.historyQ.Rollback()

		if err := accountLoader.Exec(s.ctx, s.historyQ); err != nil {
			return err
		}
		if err := cbLoader.Exec(s.ctx, s.historyQ); err != nil {
			return err
		}
		if err := lpLoader.Exec(s.ctx, s.historyQ); err != nil {
			return err
		}
		if err := assetLoader.Exec(s.ctx, s.historyQ); err != nil {
			return err
		}
		if err := s.historyQ.Commit(); err != nil {
			return errors.Wrap(err, commitErrMsg)
		}
		return nil
	}()

        // flush the rows to the db, the processors will be able to obtain the integer ids from the loaders
	if err := s.historyQ.Begin(); err != nil {
		return errors.Wrap(err, "Error starting a transaction")
	}
	defer s.historyQ.Rollback()
	if err := processors.Commit(s.ctx, s.historyQ); err != nil {
		return err
	}
	if err := s.historyQ.Commit(); err != nil {
		return errors.Wrap(err, commitErrMsg)
	}

We will need to implement this new dataflow on the following states in the ingestion state machine:

reingestHistoryRangeState (we should make sure to test both single threaded reingestion and parallel reingestion)
historyRangeState

Note the resume state only ingests a single ledger so it is already covered by #4909 .

Also, the following bugs will need to be addressed when implementing this issue:

RebuildTradeAggregationBuckets() cannot be invoked concurrently during parallel ingestion because there will be duplicate key constraint errors when two workers invoke the function on adjacent buckets. That is because the buckets occur on minute boundaries and two adjacent ledger ranges will share the same trade aggregations bucket. We can fix this by modifying parallel reingestion so that the trade aggregation buckets are built once all the workers have completed their ingestion jobs. - services/horizon/ingest: RebuildTradeAggregationBuckets cannot be invoked concurrently during parallel ingestion #5127
We should remove the force flag because it is incompatible with the new data-flow of ingestion where we batch multiple ledgers in a single transaction. - services/horizon/ingest: remove the force flag on reingestion cmds #5128

The text was updated successfully, but these errors were encountered:

sreuland · 2023-11-09T19:42:38Z

reduced scope slightly, removed - verifyRangeState from list of states that need ranged ledger enablement, due to that state invoked change processors also, which we want to retain those processors to having a single ledger scope, we want to limit ledger ranged scope to only the tx processors.

… send batches of ledgers to processors

… instead

…o err

…st max flush size if lower than default of 100

… send batches of ledgers to tx processors (#5117) closes #5099: Ingest batches of ledgers in-memory before flushing to DB

tamirms added horizon performance issues aimed at improving performance labels Nov 1, 2023

tamirms added this to Platform Scrum Nov 1, 2023

github-project-automation bot moved this to Backlog in Platform Scrum Nov 1, 2023

mollykarcher moved this from Backlog to Next Sprint Proposal in Platform Scrum Nov 1, 2023

sreuland mentioned this issue Nov 1, 2023

Refactor ingestion data-flow #4909

Closed

8 tasks

sreuland self-assigned this Nov 2, 2023

sreuland moved this from Next Sprint Proposal to In Progress in Platform Scrum Nov 2, 2023

sreuland mentioned this issue Nov 6, 2023

services/horizon: Use COPY to speed up ClaimableBalanceChangeProcessor #5104

Merged

7 tasks

sreuland added a commit to sreuland/go that referenced this issue Nov 15, 2023

stellar#5099: changed historyRange and reingestHistoryRange states to…

fd46a14

… send batches of ledgers to processors

sreuland mentioned this issue Nov 15, 2023

services/horizon/ingest: historyRange and reingestHistoryRange states send batches of ledgers to tx processors #5117

Merged

7 tasks

sreuland added a commit to sreuland/go that referenced this issue Nov 16, 2023

stellar#5099: added 'ledgers-per-flush' reingest flag, default to 0.

e43e291

sreuland added a commit to sreuland/go that referenced this issue Nov 17, 2023

stellar#5099: fixed process runner unit tests

882443e

sreuland added a commit to sreuland/go that referenced this issue Nov 17, 2023

stellar#5099: fixed fmt warnings

fd713f8

sreuland moved this from In Progress to Needs Review in Platform Scrum Nov 17, 2023

sreuland added a commit to sreuland/go that referenced this issue Nov 27, 2023

stellar#5099: review feedback, remove maxflush command flag, calc max…

c2aea6a

… instead

sreuland added a commit to sreuland/go that referenced this issue Nov 27, 2023

stellar#5099: review feedback, commit already flushed batches prior t…

2386a20

…o err

sreuland added a commit to sreuland/go that referenced this issue Nov 28, 2023

stellar#5099: set max flush size default in NewSystem

d3a3926

sreuland added a commit to sreuland/go that referenced this issue Nov 28, 2023

stellar#5099: use parallelJobSize as the alternate default for reinge…

386891a

…st max flush size if lower than default of 100

This was referenced Nov 28, 2023

services/horizon/ingest: RebuildTradeAggregationBuckets cannot be invoked concurrently during parallel ingestion #5127

Closed

services/horizon/ingest: remove the force flag on reingestion cmds #5128

Closed

sreuland closed this as completed in #5117 Nov 28, 2023

sreuland added a commit that referenced this issue Nov 28, 2023

services/horizon/ingest: historyRange and reingestHistoryRange states…

eb5054f

… send batches of ledgers to tx processors (#5117) closes #5099: Ingest batches of ledgers in-memory before flushing to DB

github-project-automation bot moved this from Needs Review to Done in Platform Scrum Nov 28, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ingest batches of ledgers in-memory before flushing to DB #5099

Ingest batches of ledgers in-memory before flushing to DB #5099

tamirms commented Nov 1, 2023 •

edited by sreuland

Loading

sreuland commented Nov 9, 2023

Ingest batches of ledgers in-memory before flushing to DB #5099

Ingest batches of ledgers in-memory before flushing to DB #5099

Comments

tamirms commented Nov 1, 2023 • edited by sreuland Loading

sreuland commented Nov 9, 2023

tamirms commented Nov 1, 2023 •

edited by sreuland

Loading