-
Notifications
You must be signed in to change notification settings - Fork 317
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
No space left on device #528
Comments
/cc @seanknox |
@edernucci can you provide some information about your cluster?
|
@edernucci, can you provide the following outputs: cat /proc/cgroups |
@seanknox sure!
All namespaces have LimitRange with cpu and memory quotas, and sometimes I found some OOMKilled containers. I'm suspecting we have cgroup leak when the container is killed. |
@dsalamancaMS unfortunately the error appear after some hours/days. But here is how it are now:
|
Still rebooting nodes on a daily basis to workaround this issue. |
@edernucci can you open an support ticket in the Azure Portal? That will get it in front of our engineering team. |
@seanknox Microsoft support stated (support id 118081018768501) that this issue is kubernetes-related or kernel-related and is out of scope of AKS support. Please reopen the issue in order to keep track on open-source ecosystem. Regards, |
@edernucci still facing this issue on AKS (version:
|
Take a look at this script to do it for all nodes https://gist.github.com/brendan-rius/5ac9ec3dd7e196222c8b8b356f8973d2 |
Microsoft finally found the root cause of this issue: #1373 |
@edernucci issue 1373 doesn't fix the file handle limits |
@edernucci issue should have fixed Azure/aks-engine#1801 |
Hi there,
I'm experiencing this issue and seems to be related (or exactly) this one:
moby/moby#29638
My machines have a good disk free space and a good inode usage count. Every time I face this issue on kubelet log my /proc/cgroups are using 2500+ cgroups and I have to drain and restart the node. After reboot the cgroups usage is at 100 again and after some hours (sometimes days) the error appear again.
The text was updated successfully, but these errors were encountered: