-
Notifications
You must be signed in to change notification settings - Fork 30
1235.8.0 update has intermitent Docker bridge network #1785
Comments
I can confirm this is happening on one of my CoreOS hosts right after receiving the 1235.8.0 update. Here is the output from
Here is the output from a container with failed networking.
As you can see its missing Here is output from
Let me know if you would like any additional information. |
We have identified the issue (#1638 resurfaced) and have paused the automatic updates for now. |
Systems should begin updating to the fixed 1235.9.0 over the next day or so. |
I've just launched a new node with 1235.9.0 and can confirm the issue is gone. Thank you @dm0- for the quick response and fix. |
This was a relatively "big" bug, knocking several of my servers offline. I didn't see any mention of 1235.8.0 having network issues for containers on the blog/web site/releases web page. Is there any place we "broadcast" such priority/P0 problems when they arise? It could be helpful as a "first check". |
Those announcements were for verity in alpha. This bug is about Docker in stable. I believe this was announced in the release notes but not on the mailing lists. |
Ugh, sorry. Listen to @dm0-. |
The issue is still present on the beta channel:
|
@remh Start a container and leave it running with e.g. |
Thanks that was indeed the issue. Thanks again! |
Issue Report
After upgrading to 1235.8.0, ~10% of the spawned Docker containers don't have a working network (bridge mode).
Bug
CoreOS Version
Environment
What hardware/cloud provider/hypervisor is being used to run CoreOS?
Expected Behavior
All containers should have a working bridge network.
Actual Behavior
~10% of Docker containers boot with an non-working bridge network. (
--net host
works 100% of the time).Reproduction Steps
success
andfailed
outputs:while true; do docker run --rm alpine wget google.com 2>/dev/null && echo success || echo fail ; done
Other Information
I turned Docker debug mode on, but I did not found any info that would clarify the issue.
Environment
What hardware/cloud provider/hypervisor is being used to run CoreOS?
I was able to reproduce this in AWS instance, m4.large and t2.small with the HVM AMI.
I was also able to reproduce this in an DigitalOcean droplet.
Other Information
Please, point me out any other info that I could extract from the instance.
Sample outputs:
I've replaced those machines several times, all of them were affected.
The text was updated successfully, but these errors were encountered: