-
Notifications
You must be signed in to change notification settings - Fork 1.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Segment Replication] Revisit network timeout settings to avoid timeout exceptions #6027
Comments
I think all settings in RecoverySettings should be looked at, but a few other node level settings in particular where I think Segrep should have its own config & defaults:
We currently reuse these from recovery given the recovery code to send files was refactored to be reusable for segrep. |
Segment Replication uses the recovery settings for its operations. It is difficult to segregate these settings from Recovery because existing recovery settings are defined at
The other settings can be refactored into a separate SegmentReplicationSettings class but due to Recovery & SegRep both using common classes & pattern, using single settings makes sense. Previous Issue opened for timeout: #4392 |
Addressed concern of using hard-coded constant for identifying the request timeout for fetching segment files in #6523. This issue was opened to ensure segrep timeouts doesn't result in persistent recovery failures. Setting the max time for fetching files should address the timeout concern. |
Coming from discussion, as part of peer recovery a round of segment replication is performed to ensure the target is upto date with primary. For long running recovery process, it may be possible that segment replication request may time out. This is applicable for both primary and replica recovery process.
Related
#5242
#5313
The text was updated successfully, but these errors were encountered: