Description
Previous related threads:
- network sandbox join failed: error creating vxlan interface: file exists #562
- What do I do with "subnet sandbox join failed for "10.0.0.0/24": error creating vxlan interface: file exists" #751
- Cannot start container: subnet sandbox join failed for "10.0.0.0/24": error creating vxlan interface: file exists #945
- Containers on overlay network and "error creating vxlan interface: file exists" moby#21482
- [1.13.0-rc1][Intermittent] docker: Error response from daemon: subnet sandbox join failed for "10.0.0.0/16": error creating vxlan interface: file exists. moby#28559
A comment at the current tail end of #945 recommends opening a new ticket. I couldn't find one opened by the original poster, so here we go.
I've been using swarm for the past couple of months and frequently hit this problem. I have a modest swarm (~8-9 nodes), all running Ubuntu 16.04, now with Docker 17.05-ce. There is not a great amount of container churn, but I do use a stack yaml file to deploy ~20 services across ~20 encrypted overlay networks.
I tend to find that after a couple of stack deploy / stack rm cycles, my containers get killed at startup with the "error creating vxlan interface: file exists" error. This prevents the containers from coming up on a host and forces them to attempt to relocate, which may or may not work.
I note from the issues above that the problem has been thought fixed several times over, yet it keeps creeping back in for various users.
To get past the issue, I have tried rebooting the node, restarting iptables, and removing and re-creating the stack. All of these work to varying degrees, but they are most definitely workarounds and not solutions.
I cannot think of a way to deliberately reproduce this error, but if anyone wants to suggest ways to debug, I am at your service.
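
As a starting point for debugging, here is a minimal sketch of one check that could be run on an affected node. It rests on an assumption that is not confirmed anywhere in this report: that "file exists" means a vxlan device from an earlier deploy is still present in the host network namespace. It also assumes the `ip` tool from iproute2 is installed. The helper names `parse_vxlan_links` and `host_vxlan_links` are hypothetical and exist only for illustration.

```python
import re
import subprocess

# Each one-line record from `ip -d -o link show type vxlan` starts with
# "<index>: <name>:" and, with -d, includes "vxlan id <VNI>" in the details.
_NAME = re.compile(r"^\d+:\s+([^:@\s]+)")
_VNI = re.compile(r"\bvxlan id (\d+)")


def parse_vxlan_links(text):
    """Return (interface_name, vni) pairs parsed from the command output."""
    links = []
    for line in text.splitlines():
        name = _NAME.match(line)
        vni = _VNI.search(line)
        if name and vni:
            links.append((name.group(1), int(vni.group(1))))
    return links


def host_vxlan_links():
    """List vxlan devices visible in the host network namespace.

    Assumption: a device that shows up here while no container is using
    the matching overlay network may be a leftover; this sketch only
    lists devices and does not delete anything.
    """
    out = subprocess.run(
        ["ip", "-d", "-o", "link", "show", "type", "vxlan"],
        check=True,
        capture_output=True,
        text=True,
    ).stdout
    return parse_vxlan_links(out)
```

Calling `host_vxlan_links()` right after a `stack rm`, once the containers are gone, would show whether any vxlan devices are still present on the host. Whether such devices are actually the cause of the error is exactly what would need confirming.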