You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
It would help if these errors were returned at 429s so that consuming libraries could implement their own retry logic. Should _invoke_bedrock check for ThrottlingException rather than returning a 500 for all errors?
Is your feature request related to a problem? Please describe.
When hitting throttling limits, the gateway throws errors causing end clients to wait for reply for a long time, and the reply never comes.
Above shows that the gateway are able to catch that case (ThrottlingException) and that underlying libraries have some ability to set retries.
So this feature request is about:
With #1 being a preferred option :)
The text was updated successfully, but these errors were encountered: