-
Notifications
You must be signed in to change notification settings - Fork 749
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
networkPlugin cni failed to teardown pod "traefik-6d4b5f9c9f-7sfg7_default" network: invalid version "": the version is empty] #1412
Comments
Updating my Kubernetes worker nodes to the latest AWS EKS version 1.19 AMI and then rebuilding the worker nodes fixed the issue. So, gonna assume the problem was hardcoded into the EKS 1.19 AMI I was using previously. |
ah, turns out, after CNI was changed to require the |
Hi @nodesocket I would like to review kubelet logs, can you please share me (varavaj@amazon.com) the log dump by running this script on the instance - Thanks. |
@jayanthvn and @mgoltzsche I don't have the previous EKS EC2 worker node instances anymore. I recreated them using the latest EKS AMI of
Previously, which was broke with the above error, I was using the AMI |
Thanks @nodesocket We will try to repro this. Can you please share the previous EKS and CNI version prior to 1.19 upgrade? Since cluster upgrade as far as I know shouldn't upgrade CNI since manifests will be applied only for new cluster creates. Did you upgrade CNI after upgrading to 1.19? |
@jayanthvn I don't even know how to upgrade CNI. The previous Kubernetes version we were running on EKS was That resulted in the error / issue. Then I upgrade the Kubernetes workers to the latest AMI from |
Thanks @nodesocket , I will try to repro this. |
@jayanthvn just edited/added more to my original reply above. Want to make sure you see it. |
@jayanthvn I just looked at my logs from this cluster, and even though all the pods are running successfully, I am seeing the following
|
Can you please email (varavaj@amazon.com) me the logs from the script - Thanks. |
I looked into Justin's cluster, CNI version in the cluster is "1.5.0" and starting from k8s 1.16 onwards "CNIVersion" should be set (Ref #604 ) which is missing in the CNI spec. The fix to add CNIversion in CNI spec (#605) is in 1.5.4 onwards [https://github.com/aws/amazon-vpc-cni-k8s/releases/tag/v1.5.4]. Hence seeing the invalid version errors -
CNI spec -
|
Also updating the cluster, will not update the addons - Please ref this - "Amazon EKS doesn't modify any of your Kubernetes add-ons when you update a cluster. After updating your cluster, we recommend that you update your add-ons to the versions listed in the following table for the new Kubernetes version that you're updating to. Steps to accomplish this are included in the update procedures." https://docs.aws.amazon.com/eks/latest/userguide/update-cluster.html |
This was the problem, and fixed by manually adding the CNI addon and selecting the latest version. It would have great if there was some sort of indication in the web console about running an incompatible version of CNI when upgrading Kubernetes versions and how to fix this. I feel like lots of users are going to get bitten by this if they upgrade old EKS clusters. |
Glad it got fixed :) yes will take the feedback to see how we can have documentation/console warning updated. Closing the issue now, please do reach out if you need any more info. |
Just upgraded our EKS cluster to Kubernetes version
1.19
. The worker nodes are instance typest3a.xlarge
. When re-deploying Traefik via Helm getting the following error on the Traefik pods as well as a pod namedstoreconfig-job-1-x9cc5
.Warning FailedCreatePodSandBox 3m38s kubelet Failed to create pod sandbox: rpc error: code = Unknown desc = [failed to set up sandbox container "bd85f8205cf2b59a5dc0230f82c24aba121487f802b17519528897839b2b8290" network for pod "traefik-6d4b5f9c9f-7sfg7": networkPlugin cni failed to set up pod "traefik-6d4b5f9c9f-7sfg7_default" network: add cmd: failed to assign an IP address to container, failed to clean up sandbox container "bd85f8205cf2b59a5dc0230f82c24aba121487f802b17519528897839b2b8290" network for pod "traefik-6d4b5f9c9f-7sfg7": networkPlugin cni failed to teardown pod "traefik-6d4b5f9c9f-7sfg7_default" network: invalid version "": the version is empty]
Any ideas? This worked previously deploying Traefik via Helm on the old version of Kubernetes.
The text was updated successfully, but these errors were encountered: