Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

1.16.9: managed-premium, slow volume mounting/attaching: Unable to attach or mount volumes: timed out waiting for the condition #1663

Closed
bitsofinfo opened this issue Jun 10, 2020 · 3 comments
Labels

Comments

@bitsofinfo
Copy link

I constantly see 3-5 minute waits where pods are stuck initializing w/ this kind of warning shown on describe. After 3-5 minutes the warnings on pods like the below example go away then things function as normal.

  Warning  FailedMount       110s       kubelet, xyz-27390584-0  Unable to attach or mount volumes: unmounted volumes=[ws-xddtn], unattached volumes=[tekton-internal-workspace tekton-internal-home tekton-internal-secret-volume-yyapps-cicd-x-ssh-co-fxrfd tekton-internal-scripts tekton-creds-init-home tekton-internal-secret-volume-docker-x-2wx65 ws-xddtn tekton-internal-results yy-builder-token-8dhtc tekton-internal-tools tekton-internal-downward]: timed out waiting for the condition

My PVC looks like this:

              spec:
                storageClassName: managed-premium
                accessModes:
                  - ReadWriteOnce
                resources:
                  requests:
                    storage: 5Gi

The PV + PVC both look fine upon describe. But pods referencing the PVC constantly hang up and take forever to initialize.

2 nodes: (Standard D2s v3)

#884

@bitsofinfo
Copy link
Author

maybe related, #1464 the --node-status-update-frequency=5m on a cluster created

@bitsofinfo
Copy link
Author

bitsofinfo commented Jun 11, 2020

this definitely was the issue, reducing --node-status-update-frequency=10s massively improved the situation...

i.e. shell into each of your nodes and check the value in /etc/default/kubelet adjust and reboot

@bitsofinfo bitsofinfo changed the title 1.16.9: managed-premium, slow volume mounting/attaching 1.16.9: managed-premium, slow volume mounting/attaching: Unable to attach or mount volumes: timed out waiting for the condition Jun 11, 2020
@andyzhangx
Copy link
Contributor

We fixed this issue(node-status-update-frequency=5m) on early March 2020, so if your cluster was created before that date, pls add a new node pool, another workaround is using a daemonset to modify that kubelet flag and restart kubelet, here is an example:

https://raw.githubusercontent.com/andyzhangx/demo/master/dev/sysctl-node-status-update-frequency.yaml

@ghost ghost locked as resolved and limited conversation to collaborators Jul 22, 2020
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
Projects
None yet
Development

No branches or pull requests

2 participants