-
Notifications
You must be signed in to change notification settings - Fork 178
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Container deployment issues seen on a Swarm mode cluster with rebooted workers running on VMs #1112
Comments
Looking at the logs in
Can you send the docker daemon's logs when you see this issue? |
On today's call, there was an ask to see if this is an issue on K8s as well or just in Docker Swarm mode. |
@blaksmit This issue is seen Docker Swarm mode. Pretty sure that this issue cannot be seen in k8s as k8s does not even have the |
@vhosakot the comment was to see whether a similar VM scale issue is seen with K8s. |
@blaksmit I see, got it. We could test if this issue is seen when |
Please note the changed title. |
@sisudhir , Is this issue is seen at every iteration of your failure test ? Is it possible for you to share your setup with me . I can take a look at setup during error times |
Description
In a mixed swarm mode cluster (baremetal and VMs) with Contiv 1.1.7, docker service scale issues are seen on rebooting the worker VMs.
Before the reboot the cluster had the containers running on all the nodes (baremetal and VMs) using Contiv network and policy framework.
Expected Behavior
The VM reboot should not affect the performance with Contiv network.
Observed Behavior
On rebooting the VMs that were running containers, the containers moved successfully on the surviving worker nodes. But the Docker service scale takes unusually long time. Also, connection errors are seen in netmaster log as:
Error dial tcp 10.65.121.129:9002: getsockopt: no route to host connecting to 10.65.121.129:%!s(uint16=9002). Retrying..
Steps to Reproduce (for bugs)
Your Environment
contiv-logs.tar.gz
The text was updated successfully, but these errors were encountered: