Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support mutable datasets #149

Open
jonathanthiry opened this issue Feb 17, 2022 · 2 comments
Open

Support mutable datasets #149

jonathanthiry opened this issue Feb 17, 2022 · 2 comments

Comments

@jonathanthiry
Copy link
Contributor

jonathanthiry commented Feb 17, 2022

Mutable datasets would be especially useful for near real-time data. They could use UUIDs instead of PIDs based on the file's checksum.
Both files could be streamed at the same time to check that data has only been appended and that the new file includes at least the same data as the old one.

@DrMaggie
Copy link

Keeping the same PID for the changed digital object would not really be a good idea if the identifier string is constructed from the checksum of the original version. The setup we have now with assigning new PIDs to later versions (regardless of the reason for the version change - i.e. treating files with new appended values the same as those where there's been error corrections or similar - is much clearer. There is always the possibility to create identifiers that are always pointing to the "latest version" of data sets that we know a priori will be extended or otherwise updated on a regular basis- but then the associated PID strings should not be based on any fixity metadata! Related to this issue, we could also think of ways to display the reason(s) behind a version being created, as well as make it easy to see the full version history, on landing pages?

@jonathanthiry
Copy link
Contributor Author

Thanks, Maggie! Oleg also thought it would be better to use another identifier, like UUIDs, so we will do that.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants