Implement log compaction #36

allengeorge · 2014-01-26T18:55:08Z

The libraft log can grow without bound. Log compaction (i.e. snapshots) are necessary to address this. With compaction the current state is dumped to disk and log entries subsumed by this state are removed.

There are two high-level options here:

Snapshots are triggered without coordination on each server. Logs and snapshots may be different on different servers. Leaders have to send snapshots and followers have to process them. Snapshots have to be loaded on startup to prime the caller's state.
Leader triggers snapshot for the cluster. This maintains a high-degree of log coherency, but I don't think it adds much value. It requires coordination, and you still need all the messages and logic required for the first option.

Tasks

Take snapshots.
Load a mix of snapshots and log entries on startup.
Automatically trigger taking a snapshot.
Send snapshot to followers (RPCSender API, ordering, tests, etc.).
Receive snapshot from leader (RPCReciever API, ordering, tests, etc.).
Store new snapshot
Truncate log entries on receiving snapshot.
Clean up stale snapshots.
Truncate log entries on taking snapshot.

The text was updated successfully, but these errors were encountered:

ghost assigned allengeorge Jan 26, 2014

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement log compaction #36

Implement log compaction #36

allengeorge commented Jan 26, 2014

Implement log compaction #36

Implement log compaction #36

Comments

allengeorge commented Jan 26, 2014

Tasks