CI jobs often fail with minikube error X Exiting due to GUEST_PROVISION #2264

Closed
nixpanic opened this issue Jul 9, 2021 · 27 comments
Labels
component/testing (Additional test cases or CI work), keepalive (This label can be used to disable stale bot activity in the repo), priority-1

nixpanic (Member) commented Jul 9, 2021

Describe the bug

When running CI jobs, minikube regularly fails with the following error:

* Failed to start kvm2 VM. Running "minikube delete" may fix it: creating host: create: Error creating machine: Error in driver during machine creation: IP not available after waiting: machine minikube didn't return IP after 1 minute

X Exiting due to GUEST_PROVISION: Failed to start host: creating host: create: Error creating machine: Error in driver during machine creation: IP not available after waiting: machine minikube didn't return IP after 1 minute

This prevents the job from continuing, and a /retest ... is needed.

Logs

From mini-e2e_k8s-1.20/1523:

* Creating kvm2 VM (CPUs=8, Memory=14336MB, Disk=20000MB) ...
* Deleting "minikube" in kvm2 ...
! StartHost failed, but will try again: creating host: create: Error creating machine: Error in driver during machine creation: IP not available after waiting: machine minikube didn't return IP after 1 minute
* Creating kvm2 VM (CPUs=8, Memory=14336MB, Disk=20000MB) ...
* Failed to start kvm2 VM. Running "minikube delete" may fix it: creating host: create: Error creating machine: Error in driver during machine creation: IP not available after waiting: machine minikube didn't return IP after 1 minute

X Exiting due to GUEST_PROVISION: Failed to start host: creating host: create: Error creating machine: Error in driver during machine creation: IP not available after waiting: machine minikube didn't return IP after 1 minute

nixpanic (Member, Author) commented Aug 3, 2021

#2343 (comment) failed again (logs)

nixpanic (Member, Author) commented Aug 3, 2021

#2308 (comment) failed here too (logs)

nixpanic (Member, Author) commented Aug 4, 2021

#2354 (comment) as well (logs)

nixpanic (Member, Author) commented Aug 4, 2021

failed in #2350 (comment) too (logs)

nixpanic (Member, Author) commented Aug 4, 2021

#2341 (comment) hit this too (logs)

Madhu-1 (Collaborator) commented Aug 4, 2021

@nixpanic as we are hitting this frequently, would it be a good idea to add a wrapper to retry?
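
For illustration, such a retry wrapper could look roughly like the minimal Python sketch below. It assumes the CI job starts the cluster with `minikube start --driver=kvm2` and that running `minikube delete` between attempts is acceptable (the error message itself suggests it). The function name, the attempt count, and the delay are hypothetical and not taken from the CI scripts.

```python
import subprocess
import time
from typing import Callable, Sequence


def run(cmd: Sequence[str]) -> int:
    """Run a command and return its exit code."""
    return subprocess.run(list(cmd)).returncode


def start_with_retry(
    start_cmd: Sequence[str] = ("minikube", "start", "--driver=kvm2"),
    cleanup_cmd: Sequence[str] = ("minikube", "delete"),
    attempts: int = 3,            # assumption: illustrative value
    delay_seconds: float = 30.0,  # assumption: illustrative value
    runner: Callable[[Sequence[str]], int] = run,
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Try to start minikube up to `attempts` times.

    After each failed attempt except the last, the cleanup command is run
    and the wrapper waits before trying again. Returns True on the first
    successful start, False if every attempt failed.
    """
    for attempt in range(1, attempts + 1):
        if runner(start_cmd) == 0:
            return True
        if attempt < attempts:
            # Remove the half-created VM before the next attempt.
            runner(cleanup_cmd)
            sleep(delay_seconds)
    return False
```

The `runner` and `sleep` parameters exist only so the wrapper can be exercised without a real minikube installation. A CI script would call `start_with_retry()` and fail the job when it returns `False`.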

nixpanic (Member, Author) commented Aug 4, 2021

#2322 (comment) hit this (logs)

nixpanic (Member, Author) commented Aug 4, 2021

> @nixpanic as we are hitting this frequently, would it be a good idea to add a wrapper to retry?

I don't know. There is already an automated retry by minikube. We should try to identify the cause and work on preventing it. Gathering the hostnames of the CentOS CI bare-metal machines might give a clue. Different groups of hosts have different hardware, and are in different subnets (https://wiki.centos.org/QaWiki/PubHardware).
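
As a rough sketch of how the hostname data could be evaluated once it is gathered: the Python snippet below counts, per CI host, how many runs contained the GUEST_PROVISION error. It assumes each run is available as a (hostname, log text) pair. How the hostname is recorded in a job is not covered here, and the host names in the example are made up.

```python
from collections import Counter
from typing import Dict, Iterable, Tuple

# Marker taken from the minikube error output quoted in this issue.
ERROR_MARKER = "Exiting due to GUEST_PROVISION"


def failures_by_host(runs: Iterable[Tuple[str, str]]) -> Dict[str, Tuple[int, int]]:
    """Map each hostname to (failed runs, total runs).

    `runs` yields (hostname, log_text) pairs; a run counts as failed
    when its log contains ERROR_MARKER.
    """
    failures: Counter = Counter()
    totals: Counter = Counter()
    for hostname, log_text in runs:
        totals[hostname] += 1
        if ERROR_MARKER in log_text:
            failures[hostname] += 1
    return {host: (failures[host], totals[host]) for host in totals}


if __name__ == "__main__":
    # Hypothetical sample data, for illustration only.
    sample = [
        ("host-a.example", "X Exiting due to GUEST_PROVISION: Failed to start host"),
        ("host-b.example", "* Done! kubectl is now configured"),
        ("host-a.example", "X Exiting due to GUEST_PROVISION: Failed to start host"),
    ]
    for host, (failed, total) in sorted(failures_by_host(sample).items()):
        print(f"{host}: {failed}/{total} runs failed")
```

If the failures cluster on one group of hosts, that would point at the hardware or the subnet of that group rather than at minikube itself.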

nixpanic (Member, Author) commented Aug 4, 2021

#2341 (comment) as well (logs)

nixpanic (Member, Author) commented Aug 4, 2021

#2339 (comment) hit this too (logs)

@github-actions

This issue has been automatically marked as stale because it has not had recent activity. It will be closed in a week if no further activity occurs. Thank you for your contributions.

@github-actions

This issue has been automatically closed due to inactivity. Please re-open if this still requires investigation.

nixpanic added the keepalive label and removed the wontfix label on Nov 1, 2021

nixpanic (Member, Author) commented Nov 1, 2021

This is definitely not fixed yet 😞

nixpanic (Member, Author) commented

Seems like kubernetes/minikube#11459 is reported too.

nixpanic (Member, Author) commented Jun 2, 2023

This has not happened in a long time.

nixpanic closed this as completed on Jun 2, 2023