Use exponential backoff for connection retries #65

jrisc · 2024-11-18T09:09:49Z

Calls to socket.connect() are non-blocking, hence all subsequent calls to socket.sendall() will fail if the target KDC service is temporarily or indefinitely unreachable. Since the kdcproxy task uses busy-looping, it results in the journal to be flooded with warning logs.

This pull request introduces a per-socket reactivation delay which increases exponentially as the number of reties is incremented, until timeout is reached (i.e. 100ms, 200ms, 400ms, 800ms, 1.6s, 3.2s, ...).

(Also replaces the default root logger with a dedicated kdcproxy one)

simo5

LGTM

jrisc · 2024-11-18T17:46:34Z

I was trying to add the connection type, IP address, and port number in the warning message, but apparently such information cannot be fetched from the socket in Python 2. So I am doing to make some additional changes to report the timeout back to the parent function (where the address is available) to report the timeout rather than each connection failure.

simo5 · 2024-11-18T18:12:02Z

We still care about python 2 ?

jrisc · 2024-11-18T22:08:03Z

Yes, for RHEL 7 ELS until 2028.

jrisc · 2024-11-20T11:14:17Z

I made some improvements to log the timeout only once per Application.__await_reply() call. It is now logged this way:

WARNING:kdcproxy:Exchange with udp:[10.0.46.123]:88 failed: Timeout while sending packets after 2.00s and 4 tries: [Errno 32] Broken pipe

@simo5 I tested this code for Python 2. Could you enabled the CI tests so we can confirm it works with Python 3 too?

Calls to socket.connect() are non-blocking, hence all subsequent calls to socket.sendall() will fail if the target KDC service is temporarily or indefinitely unreachable. Since the kdcproxy task uses busy-looping, it results in the journal to be flooded with warning logs. This commit introduces a per-socket reactivation delay which increases exponentially as the number of reties is incremented, until timeout is reached (i.e. 100ms, 200ms, 400ms, 800ms, 1.6s, 3.2s, ...). Signed-off-by: Julien Rische <[email protected]>

Signed-off-by: Julien Rische <[email protected]>

jrisc force-pushed the unreachable_kdc branch 2 times, most recently from a8d5c15 to eee16d0 Compare November 18, 2024 09:14

simo5 approved these changes Nov 18, 2024

View reviewed changes

jrisc force-pushed the unreachable_kdc branch 2 times, most recently from 6cd07a4 to 4284384 Compare November 18, 2024 16:02

jrisc changed the title ~~Use exponential backoff for connection retries~~ [WIP] Use exponential backoff for connection retries Nov 18, 2024

jrisc force-pushed the unreachable_kdc branch from 4284384 to b33ebfa Compare November 20, 2024 11:09

jrisc changed the title ~~[WIP] Use exponential backoff for connection retries~~ Use exponential backoff for connection retries Nov 20, 2024

jrisc force-pushed the unreachable_kdc branch from b33ebfa to c28dc0b Compare November 21, 2024 15:26

simo5 approved these changes Nov 21, 2024

View reviewed changes

jrisc force-pushed the unreachable_kdc branch 2 times, most recently from c054d97 to 75eb76f Compare November 21, 2024 17:14

jrisc added 2 commits November 21, 2024 18:27

Use dedicated "kdcproxy" logger

c8a69db

Signed-off-by: Julien Rische <[email protected]>

jrisc force-pushed the unreachable_kdc branch from 75eb76f to c8a69db Compare November 21, 2024 17:27

Fix "doc" tox task

cde2416

Signed-off-by: Julien Rische <[email protected]>

jrisc merged commit 0606ca5 into latchset:main Nov 22, 2024
8 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use exponential backoff for connection retries #65

Use exponential backoff for connection retries #65

jrisc commented Nov 18, 2024 •

edited

Loading

simo5 left a comment

jrisc commented Nov 18, 2024

simo5 commented Nov 18, 2024

jrisc commented Nov 18, 2024

jrisc commented Nov 20, 2024

Use exponential backoff for connection retries #65

Use exponential backoff for connection retries #65

Conversation

jrisc commented Nov 18, 2024 • edited Loading

simo5 left a comment

Choose a reason for hiding this comment

jrisc commented Nov 18, 2024

simo5 commented Nov 18, 2024

jrisc commented Nov 18, 2024

jrisc commented Nov 20, 2024

jrisc commented Nov 18, 2024 •

edited

Loading