Atomic publishing of dataset records #85

Open
LucaCinquini opened this issue Apr 27, 2016 · 0 comments

Who: Alyn

It seems that if a call to the publication service is interrupted, it
leaves a record in Solr with an incomplete list of files. This can
be seen in CoG as a discrepancy between the "number of files" in the
summary metadata and the number of files in the actual file list (see
the attached image, which shows the relevant part of a screenshot).
Indeed, when publishing a large dataset to the index (large enough that
the publication time is long relative to the master-slave sync interval
in Solr), I can watch the number of files grow in CoG.

This has left some records in the CEDA index in an inconsistent state,
because I was assuming that once a record had appeared, publication to
the index had succeeded. (Thanks to Katharina for noticing this.)
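
For illustration only, the discrepancy described above could be detected
with a check along these lines. This is a minimal sketch, not part of the
publication service: the function name is hypothetical, and the field
names `id`, `number_of_files` and `dataset_id` are assumptions about how
the dataset and file records are laid out once fetched from the index.

```python
def find_incomplete_datasets(datasets, files):
    """Return ids of datasets whose declared file count differs from
    the number of file records actually present.

    `datasets` and `files` are plain lists of dicts, assumed to have
    been fetched from the index beforehand (field names are assumptions).
    """
    # Count the file records found for each dataset id.
    indexed = {}
    for f in files:
        indexed[f["dataset_id"]] = indexed.get(f["dataset_id"], 0) + 1

    # A dataset is suspect when its summary count and the real count disagree.
    return [
        d["id"]
        for d in datasets
        if d["number_of_files"] != indexed.get(d["id"], 0)
    ]
```

Running such a check after publication would avoid relying on the mere
appearance of a record as evidence of success.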

Is it possible to make the publication service atomic? It probably
would not be a big problem if Solr cannot make the write totally
atomic over the short time it actually takes to write the Solr
documents (although I'd be surprised if it cannot), but could the
service at least gather all the necessary information and then write
it in a single call to Solr?
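
To make the request concrete, here is a rough sketch of "gather
everything, then write once". It assumes, purely for illustration, that
the dataset and file documents can be sent to one Solr update endpoint;
the function names, the update URL and the field names (`type`,
`number_of_files`, `dataset_id`) are hypothetical and not taken from the
publication service. It relies only on Solr's JSON update handler
accepting a JSON array of documents with `commit=true`.

```python
import json
import urllib.request


def build_update_payload(dataset, files):
    """Collect the dataset document and all its file documents in one list."""
    # The file count is derived from the gathered list, so it cannot drift
    # from what is actually sent (field names are assumptions).
    dataset_doc = dict(dataset, type="Dataset", number_of_files=len(files))
    file_docs = [dict(f, type="File", dataset_id=dataset["id"]) for f in files]
    return [dataset_doc] + file_docs


def publish_in_one_call(update_url, docs, opener=urllib.request.urlopen):
    """POST every document in a single request, committing once at the end.

    `update_url` is a hypothetical endpoint such as
    http://localhost:8983/solr/<core>/update.
    """
    request = urllib.request.Request(
        update_url + "?commit=true",
        data=json.dumps(docs).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with opener(request) as response:
        return response.status
```

If the call is interrupted before the request is sent, nothing is
written. Whether a partially processed request can still become visible
depends on how commits are configured on the server, so this narrows the
window rather than guaranteeing atomicity.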

@LucaCinquini LucaCinquini self-assigned this Apr 27, 2016
@LucaCinquini LucaCinquini added this to the Release 4.9.0 milestone Apr 27, 2016
@LucaCinquini LucaCinquini modified the milestones: Release 4.10, Release 4.9.0 May 23, 2016