Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

keep N versions on cleanup #97

Open
hsitter opened this issue Sep 7, 2021 · 1 comment
Open

keep N versions on cleanup #97

hsitter opened this issue Sep 7, 2021 · 1 comment

Comments

@hsitter
Copy link
Contributor

hsitter commented Sep 7, 2021

It'd be cool if cleanup could be told to keep the N last processed data sets of a component instead of only the latest.

Maybe add an ingestion timestamp to the database entries and use that a measure for how old a given data set is?

In kde neon we have some repos continuously integrate from git. This results in a large number of versions but out of those versions the N latest are conceivably "relevant", not only the latest. Currently we can't really use cleanup because it'd drop everything that isn't the current version and break screenshot resolution for systems that are still on the metadata from (e.g.) an hour ago.

hsitter added a commit to pangea-project/pangea-tooling that referenced this issue Sep 7, 2021
@ximion
Copy link
Owner

ximion commented Dec 21, 2021

Usually this isn't an issue as data doesn't move that fast - for example, Debian runs the cleanup step once a week which is perfectly fine.
If you update hourly, this may become an issue though (if you also cleaned weekly, it would just be a less-noticeable issue).
Unfortunately, this is one of those feature requests that isn't actually trivial to implement and will need quite a bit of work (likely the way of least resistance is adding another subdatabase storing timestamps, but keeping that in sync will be annoying too).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants