fix: experimental fix for runner connection failures #2116

Merged
1 commit merged into main from matt2e/experiment-runner-cache on Jul 22, 2024

matt2e
Collaborator

@matt2e matt2e commented Jul 19, 2024

We currently cache runner clients per endpoint. During an FTL upgrade, a controller may try to connect to a runner at an endpoint, fail, and still keep that client around. This may be the cause of the RBAC issues we've seen in production.

This changes the cache to be keyed by runner key instead of endpoint, so that a new runner at an old endpoint will not reuse an old client.
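
For readers skimming the thread, here is a minimal sketch of the idea, not the actual FTL code. `RunnerKey`, `Client`, `ClientCache`, and the `dial` callback are hypothetical names invented for illustration; the only point is that the map is keyed by runner key, so a replacement runner that comes up at the same endpoint gets a fresh client.

```go
package main

import "sync"

// RunnerKey and Client are hypothetical stand-ins, not FTL's real types.
type RunnerKey string

type Client struct {
	Endpoint string
}

// ClientCache keeps one client per runner key rather than per endpoint.
type ClientCache struct {
	mu      sync.Mutex
	clients map[RunnerKey]*Client
	dial    func(endpoint string) *Client // assumed constructor for a client
}

func NewClientCache(dial func(endpoint string) *Client) *ClientCache {
	return &ClientCache{clients: map[RunnerKey]*Client{}, dial: dial}
}

// Get returns the cached client for key, dialing a new one on a cache miss.
// The endpoint is only used to create the client; it is never the cache key.
func (c *ClientCache) Get(key RunnerKey, endpoint string) *Client {
	c.mu.Lock()
	defer c.mu.Unlock()
	if client, ok := c.clients[key]; ok {
		return client
	}
	client := c.dial(endpoint)
	c.clients[key] = client
	return client
}
```

With this shape, two lookups for the same runner key share a client, while a different runner key at the same endpoint triggers a new dial. Stale entries for runners that have gone away would still need to be cleaned up somewhere; the sketch leaves that out.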

@matt2e matt2e requested a review from alecthomas as a code owner July 19, 2024 03:14
@matt2e matt2e requested review from a team and worstell and removed request for a team July 19, 2024 03:14
@ftl-robot ftl-robot mentioned this pull request Jul 19, 2024
Contributor

@gak gak left a comment

LGTM

@wesbillman wesbillman merged commit 36a4b11 into main Jul 22, 2024
60 checks passed
@wesbillman wesbillman deleted the matt2e/experiment-runner-cache branch July 22, 2024 18:23
3 participants