-
Notifications
You must be signed in to change notification settings - Fork 911
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ec2, hotplug: race between udev add event and IMDS data not being available #5373
Comments
Some kind of retry / waiting has to be added when required networking properties from the IMDS are missing. |
@aciba90 , so this has always been an issue with hotplug? We just happen to be seeing it now? |
@TheRealFalcon: I think so, the lack of #5271 was shadowing this race condition, which does not happen all times. I believe #5283 is due to this race. |
Tracking in: SF#00387392. |
Those checks are going to fail due to and until canonical#5373 is fixed. Disabled them to get a better feedback from integration tests.
Those checks are going to fail due to and until canonical#5373 is fixed. Disabled them to get a better feedback from integration tests.
It is pretty consistently failing due to canonical#5373 with no fix in sight.
It is pretty consistently failing due to #5373 with no fix in sight.
Test disabled in #5503. |
Per responses in SF#00387392, there is no better solution to synchronize with the IMDS, but implementing a wait / retry mechanism. Another solution, outlined by Noah in [1], would be to migrate our current implementation and configure PBR as a dhcp exit hook. This would have access to all required information, but we would need to implement the hooks for every dhcp client that cloud-init supports, or at least minimally for every default dhcp in the current Ubuntu supported releases:
In summary, I see two possible solutions:
[1] https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=963826 |
Removing from the 24.3 milestone |
It is pretty consistently failing due to canonical#5373 with no fix in sight.
It is pretty consistently failing due to #5373 with no fix in sight.
Tracking in SC-1850. |
Make tests more robust to temporary network failure. Document hotplug limitations. Fixes canonicalGH-5373
Make tests more robust to temporary network failure. Document hotplug limitations. Fixes canonicalGH-5373
Make tests more robust to temporary network failure. Document hotplug limitations. Fixes canonicalGH-5373
Make tests more robust to temporary network failure. Document hotplug limitations. Fixes canonicalGH-5373
Make tests more robust to temporary network failure. Document hotplug limitations. Fixes canonicalGH-5373
Bug report
Cloud-init hits what @nmeyerhans mentioned in #4799 (comment):
Sometimes, the IMDS does not have the full metadata about a hotplugged NIC when triggered by a hotplug udev event.
We can see the difference in instance network metadata after a reboot:
Steps to reproduce the problem
And observe:
Environment details
cloud-init logs
cloud-init.tar.gz
cloud-init-post-reboot.tar.gz
The text was updated successfully, but these errors were encountered: