service/s3: HeadObject waiter always retries on error, regardless of type #2937

goro9 · 2024-12-22T05:59:25Z

Acknowledgements

I have searched (https://github.com/aws/aws-sdk/issues?q=is%3Aissue) for past instances of this issue
I have verified all of my SDK modules are up-to-date (you can perform a bulk update with go get -u github.com/aws/aws-sdk-go-v2/...)

Describe the bug

It seems that objectExistsStateRetryable, which NewObjectExistsWaiter uses by default, retries on errors other than NotFound.

aws-sdk-go-v2/service/s3/api_op_HeadObject.go

Lines 915 to 930 in 3cc2195

    
    func objectExistsStateRetryable(ctx context.Context, input *HeadObjectInput, output *HeadObjectOutput, err error) (bool, error) {
    	if err == nil {
    		return false, nil
    	}
    	if err != nil {
    		var errorType *types.NotFound
    		if errors.As(err, &errorType) {
    			return true, nil
    		}
    	}
    	return true, nil
    }

Regression Issue

Select this option if this issue appears to be a regression.

Expected Behavior

For NotFound errors: retry HeadObject
For errors other than NotFound: return err as is

Current Behavior

Always retry when an error occurs, regardless of the error type.

Reproduction Steps

options.Retryable = func(_ context.Context, _ *s3.HeadObjectInput, _ *s3.HeadObjectOutput, err error) (bool, error) {
	if err == nil {
		return false, nil
	}

	var errorType *types.NotFound
	if errors.As(err, &errorType) {
		return true, nil
	}

	return false, err
}

Possible Solution

No response

Additional Information/Context

No response

AWS Go SDK V2 Module Versions Used

	github.com/aws/aws-sdk-go-v2/service/s3 v1.71.1

Compiler and Version used

go version go1.23.4 darwin/arm64

Operating System and version

macOS Sonoma 14.1


adev-code · 2024-12-23T23:28:47Z

Hello @goro9, thanks for reaching out and reporting the issue. I have replicated this on my side, and yes, the Go SDK is not retrying as it should. Meanwhile, trying with the Python SDK, it does make a retry. I have brought this issue to the team for further review and will let you know as soon as I get any updates. If you have any questions, let me know. Thanks.

Madrigal · 2025-01-06T17:53:30Z

Based on the waiter workflow, it seems like we're not doing the right thing. We need to address this and figure out if it's safe to roll out.

We also need to do some investigation around waiters on other services.

lucix-aws · 2025-01-08T21:20:51Z

Waiter workflow step 4:

If none of the acceptors are matched and an error was encountered while calling the operation, then transition to the failure state and stop waiting.

Our implementation, for reasons probably lost to time, is just not doing this.

We can change it, trivially, but that comes with risk. Basically, any scenario where a waiter is called, some transient non-matching errors come back, and then a match does eventually come back would be broken. They would now fail on the unmatched error, as the spec says they should, instead of ignoring it. Whether or not anyone in the wild depends on that behavior, it's impossible to say.

The other option would be to let users enable the "on-spec" behavior through a new flag. Not at all a fan of having what would essentially be doThisCorrectly bool but it's the only way to avoid the aforementioned risk.

goro9 added the bug and needs-triage labels Dec 22, 2024

adev-code self-assigned this Dec 23, 2024

adev-code added the investigating and p3 labels and removed the needs-triage label Dec 23, 2024

adev-code added the needs-review label and removed the investigating label Dec 23, 2024

Madrigal added the p1 and queued labels and removed the needs-review and p3 labels Jan 6, 2025

lucix-aws changed the title ~~service/s3: Always retry when an error occurs, regardless of the error type~~ service/s3: HeadObject waiter always retries on error, regardless of type Jan 8, 2025

lucix-aws assigned lucix-aws and unassigned adev-code Jan 8, 2025
