Migrating to EKS 1.13 receiving FailedCreatePodSandBox #571

Closed
TechnicalMercenary opened this issue Aug 5, 2019 · 3 comments

Comments

@TechnicalMercenary

We have a Terraform configuration that sets up a cluster on 1.12, but after moving to 1.13 we are now getting the following error:

Warning FailedCreatePodSandBox 8m3s kubelet, ip-10-241-235-194.us-west-2.compute.internal Failed create pod sandbox: rpc error: code = Unknown desc = [failed to set up sandbox container "c5a0458bfd2d15dac31047ba459db66b2679c1497d97784d4a2dda66e4bb5a30" network for pod "bonk-pipeline-1565047792578-driver": NetworkPlugin cni failed to set up pod "bonk-pipeline-1565047792578-driver_gregory" network: rpc error: code = Unavailable desc = all SubConns are in TransientFailure, latest connection error: connection error: desc = "transport: Error while dialing dial tcp 127.0.0.1:50051: connect: connection refused", failed to clean up sandbox container "c5a0458bfd2d15dac31047ba459db66b2679c1497d97784d4a2dda66e4bb5a30" network for pod "bonk-pipeline-1565047792578-driver": NetworkPlugin cni failed to teardown pod "bonk-pipeline-1565047792578-driver_gregory" network: rpc error: code = Unavailable desc = all SubConns are in TransientFailure, latest connection error: connection error: desc = "transport: Error while dialing dial tcp 127.0.0.1:50051: connect: connection refused"]

The instances are r3.2xlarge nodes that start up when we trigger a scale-up. The only Pod we are deliberately scheduling is a Spark Worker, via the Spark Launcher.

I've attached the logs, as others with similar FailedCreatePodSandBox issues have done. We are using the latest 1.13 worker AMI.

aws-cni-support.tar.gz
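
For anyone triaging the same error: the log above shows the CNI plugin being refused on 127.0.0.1:50051 during both setup and teardown, so a reasonable first check is whether anything is listening on that port on the affected node. Below is a minimal Python sketch, assuming it is run on the node itself. The function name `can_connect` and the 2-second timeout are illustrative choices, and the check only covers TCP reachability, not whether the gRPC service behind the port is healthy.

```python
import socket


def can_connect(host: str = "127.0.0.1", port: int = 50051, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port can be opened.

    The defaults mirror the address in the FailedCreatePodSandBox message.
    This does not speak gRPC; it only checks that something accepts the
    connection.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers "connection refused", timeouts, and unreachable hosts.
        return False


if __name__ == "__main__":
    if can_connect():
        print("127.0.0.1:50051 is accepting connections")
    else:
        print("127.0.0.1:50051 refused or timed out, matching the error above")
```

If the port is refused, the local daemon that the CNI plugin talks to is probably not running or not ready yet, which is consistent with the error text but does not by itself identify the cause.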

@TechnicalMercenary
Author

And, as is tradition, right after posting I find this: #282

@mogren
Contributor

mogren commented Aug 5, 2019

@TechnicalMercenary Hey, sorry about that. I have a release candidate out with a potential fix for this issue. I'm planning to release a final v1.5.2 within the next few days.

@mogren
Contributor

mogren commented Aug 6, 2019

@TechnicalMercenary v1.5.2 is out. Please try this version if you are still seeing issues.
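
To confirm that a node is actually running a build at or above v1.5.2, one option is to compare the tag of the deployed CNI image against that version. This is a rough Python sketch rather than an official check: the helper names are made up for illustration, the image reference in the usage line is a hypothetical placeholder, and any suffix after the three numeric parts (such as a release-candidate marker) is ignored, so a candidate build compares equal to its final version.

```python
import re

# Matches tags such as "v1.5.2" or "1.5.2"; anything after the patch
# number (for example a release-candidate suffix) is ignored.
_VERSION_RE = re.compile(r"v?(\d+)\.(\d+)\.(\d+)")


def parse_version(tag: str) -> tuple:
    """Turn a tag like 'v1.5.2' into a comparable tuple (1, 5, 2)."""
    match = _VERSION_RE.match(tag.strip())
    if match is None:
        raise ValueError(f"not a recognisable version tag: {tag!r}")
    return tuple(int(part) for part in match.groups())


def image_tag(image: str) -> str:
    """Return the tag portion of an image reference 'repo/name:tag'."""
    _, sep, tag = image.rpartition(":")
    # A '/' after the last ':' means the colon belonged to a registry port.
    if not sep or "/" in tag:
        raise ValueError(f"image reference has no tag: {image!r}")
    return tag


def has_fix(image: str, minimum: str = "v1.5.2") -> bool:
    """True if the image tag is at or above the minimum version."""
    return parse_version(image_tag(image)) >= parse_version(minimum)


if __name__ == "__main__":
    # Hypothetical image reference, used only to demonstrate the call.
    print(has_fix("example.invalid/amazon-k8s-cni:v1.5.1"))
```

Comparing integer tuples rather than raw strings avoids ordering mistakes such as treating "v1.10.0" as older than "v1.5.2".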

mogren closed this as completed on Sep 28, 2019