-
Notifications
You must be signed in to change notification settings - Fork 1.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feature Store Runtime tcp_keep_alive not working as expected #4021
Comments
Hi @gpiotti thanks for reaching out and your patience here. I think the parameter you want to use here is
Can you try increasing that value when running the put_record command? If still seeing an issue, please share your debug logs (with sensitive info redacted) by adding |
hi @tim-finnigan thanks for your suggestion, I've just tried with no luck, the connection is still being dropped and recreated after exactly 60 seconds. Here are the redacted logs, let me know if you need any further data. This logs correspond after waiting 60 seconds:
if I do not wait 60 seconds then the
|
Thanks for following up — after searching for related issues internally, I found one where the service team mentioned that this limit is imposed on their side and that this is expected behavior. I want to reach out to the SageMaker Feature Store team regarding this issue, to see if they could increase that 60 second limit or at least document the limitation. I created a new issue for this (aws/aws-sdk#752) in or cross-SDK repository since APIs like this are used across SDKs. Please refer to that issue for updates going forward, and feel free to add any additional comments there. |
This issue is now closed. Comments on closed issues are hard for our team to see. |
Describe the bug
When utilizing the feature store client's put_record method, the Feature Store runtime client appears to terminate connections after 60 seconds, despite the tcp_keep_alive setting being enabled. Consequently, this behavior results in the need to reopen connections for each put_record invocation if the time gap between calls exceeds 60 seconds. This behavior contributes to high latencies, often exceeding 200 milliseconds.
Expected Behavior
The Feature Store runtime client should maintain connections for longer durations, as specified by the tcp_keep_alive setting, to prevent frequent re-opening of connections and reduce latency.
Current Behavior
Connections are terminated after 60 seconds, necessitating the reopening of connections for each put_record invocation beyond this timeframe, resulting in high latencies.
Reproduction Steps
Possible Solution
No response
Additional Information/Context
when waiting more than 60 seconds, the logging states:
DEBUG:urllib3.connectionpool:Resetting dropped connection: featurestore-runtime.sagemaker.us-east-1.amazonaws.com
SDK version used
boto3==1.33.13
Environment details (OS name and version, etc.)
maxOs 13.4.1
The text was updated successfully, but these errors were encountered: