-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add mechanism to cancel the operation of uploading directory #75
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Overall looks good, couple questions/comments.
This commit addresses #75 (comment)
This commit addresses #75 (comment)
This commit addresses #75 (comment)
/// In contrast, calling [`Self::abort`] attempts to cancel ongoing tasks more explicitly. | ||
/// It first calls `.abort_all` on the tasks it owns, and then invokes `AbortMultipartUpload` | ||
/// to abort any in-progress multipart uploads. Errors encountered during `AbortMultipartUpload` | ||
/// are logged, but do not affect the overall cancellation flow. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
trivial: Worth mentioning that the operation might already be complete, and we won't delete anything?
@@ -18,13 +42,61 @@ pub struct UploadObjectsHandle { | |||
|
|||
impl UploadObjectsHandle { | |||
/// Consume the handle and wait for the upload to complete | |||
/// | |||
/// When the `FailedTransferPolicy` is set to [`FailedTransferPolicy::Abort`], this method | |||
/// will return an error if any of the spawned tasks encounter one. The other tasks will |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/// will return an error if any of the spawned tasks encounter one. The other tasks will | |
/// will return the first error if any of the spawned tasks encounter one. The other tasks will |
@@ -617,4 +703,35 @@ mod unit { | |||
); | |||
} | |||
} | |||
|
|||
#[tokio::test] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think we should add a test that tries to do MPU but fails and calls abortMultipart instead of completeMultipart.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Good point. As discussed offline, a good place to add that test is probably an integration test file for uploading single object. Turns out upload_objects/worker.rs
is probably the most convenient place to write a test in, as done in PR#78.
This commit addresses #75 (comment) #75 (comment)
This commit addresses #75 (comment) #75 (comment)
This commit addresses #75 (comment)
* Incorporate post-merge review feedback from PR#75 This commit addresses #75 (comment) #75 (comment) * Add test for canceling upload object via MPU This commit addresses #75 (comment) * Fix memory leak in test detected by `LeakSanitizer` * Simulate the flow of CreateMPU -> upload cancellation -> AbortMPU This commit addresses #78 (comment) * Avoid a single letter variable name
Issue #, if available:
Resolves a subtask in #65
Description of changes:
This PR introduces a mechanism to cancel the upload of multiple objects within a directory. It distinguishes between aborting and dropping a handle for both
UploadHandle
andUploadObjectsHandle
. Note that this PR does not modify theUploadHandle
functionality, except for adding some comments to the struct.For both of the handles, the basic idea is
.drop()
is called explicitly. The tasks will be canceled at the points where their futures are waiting, and we don't have much control over the cleanup process, which is left to the underlying async runtime..abort()
on it) allows tasks to complete their current work while ignoring any future tasks. The word "complete" has different meanings depending on whether the handle is anUploadHandle
or anUploadObjectsHandle
.UploadHandle
, it means that the control flow attempts to abort ongoing read and upload tasks, and it will invokeAbortMultipartUpload
in the case of MPU.UploadObjectsHandle
, it means that each upload task either finishes uploading the current object or invokes.abort()
on its respective single upload operation down the line.For
UploadObjectsHandle
, this PR introduces a watch channel that signals when the operation enters cancellation mode and needs to be terminated gracefully. There are two ways to trigger cancellation:.abort()
on theUploadObjectsHandle
.Both cases are covered by newly added integration tests.
Design Discussions:
Drop
the same as calling.abort()
—for example, having fndrop(&mut self)
call.abort()
. However, since.drop
is a synchronous method and.abort()
is asynchronous, this led me to implement separate behaviors for the two..abort()
(for bothUploadHandle
andUploadObjectsHandle
) will only return the error that triggered the cancellation and will log any additional errors encountered along the way (e.g., errors from AbortMultipartUpload). We have TODO and I also left TODO to assess this behavior, but for now, the code after this PR "swallows" and logs any new errors encountered during the cancellation process.By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.