Skip to content
This repository has been archived by the owner on Jan 31, 2022. It is now read-only.

label bot workers stop receiving pubsub messages; issue with workload identity? #105

Open
jlewi opened this issue Jan 17, 2020 · 3 comments
Labels

Comments

@jlewi
Copy link
Contributor

jlewi commented Jan 17, 2020

As part of #70 we deployed the workers on an update cluster which uses workload identity.

I'm observing that after the workers have been up for a long time they appear to stop receiving pubsub notifications.

This is visible in cloud console as a growing backlog of messages.

I suspect an issue related to credentials and workload identity. Bouncing the pods appears to fix it.

Related to #70

@issue-label-bot
Copy link

Issue-Label Bot is automatically applying the label kind/bug to this issue, with a confidence of 0.96. Please mark this comment with 👍 or 👎 to give our bot feedback!

Links: app homepage, dashboard and code for this bot.

@kf-label-bot-dev
Copy link

Issue-Label Bot is automatically applying the labels:

Label Probability
kind/bug 0.96

Please mark this comment with 👍 or 👎 to give our bot feedback!
Links: app homepage, dashboard and code for this bot.

@jlewi
Copy link
Contributor Author

jlewi commented Jan 17, 2020

To try to recover

  • Delete the gke metadata servers
kubectl -n kube-system  delete pods -l k8s-app=gke-metadata-server
  • Restart the label bot pods
kubectl delete pods -l app=label-bot 

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
Projects
None yet
Development

No branches or pull requests

1 participant