You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
node operator - those putting data in the Registry
πͺ Motivation
For our high-volume missions, data comes to us (IMG) from the data provider in s3. We never have a complete copy on disk anywhere. Validations are done piecewise on a KDP cluster. We do an s3-to-s3 transfer to the public bucket, where it needs to be registered.
Currently, all the data must be downloaded somewhere, which is problematic for 10TB deliveries.
Downloading the labels is tractable, but the data isn't. The data in s3 has (or can have) an rclone-style md5 checksum which can be retrieved, which should obviate the need for the data itself.
Yes we can download piecewise, but that's just that many more steps that could go wrong and risks missing things (for example I would never trust the KDP piecewise processing as it has been proven to be unreliable).
π Additional Details
No response
Acceptance Criteria
Given When I perform Then I expect
βοΈ Engineering Details
No response
π I&T
No response
The text was updated successfully, but these errors were encountered:
Checked for duplicates
Yes - I've already checked
π§βπ¬ User Persona(s)
node operator - those putting data in the Registry
πͺ Motivation
For our high-volume missions, data comes to us (IMG) from the data provider in s3. We never have a complete copy on disk anywhere. Validations are done piecewise on a KDP cluster. We do an s3-to-s3 transfer to the public bucket, where it needs to be registered.
Currently, all the data must be downloaded somewhere, which is problematic for 10TB deliveries.
Downloading the labels is tractable, but the data isn't. The data in s3 has (or can have) an rclone-style md5 checksum which can be retrieved, which should obviate the need for the data itself.
Yes we can download piecewise, but that's just that many more steps that could go wrong and risks missing things (for example I would never trust the KDP piecewise processing as it has been proven to be unreliable).
π Additional Details
No response
Acceptance Criteria
Given
When I perform
Then I expect
βοΈ Engineering Details
No response
π I&T
No response
The text was updated successfully, but these errors were encountered: