cadvisor daemonset with containerd goes into crashloopbackoff #2855

Open
madeinindiadot opened this issue Apr 27, 2021 · 8 comments

Comments

@madeinindiadot

I have deployed the cadvisor daemonset in my Kubernetes cluster. My runtime is containerd. I constantly see my pods going into CrashLoopBackOff. I tried versions v0.30.2, v0.37.5, and latest. kubectl logs doesn't show anything; the pods report "OOMKilled". Please advise what is being missed.

NAME READY STATUS RESTARTS AGE
cadvisor-4wl5l 1/1 Running 0 96s
cadvisor-95gqg 1/1 Running 0 96s
cadvisor-pfwpk 1/1 Running 0 98s
cadvisor-xgrp8 1/1 Running 0 99s
cadvisor-4wl5l 0/1 OOMKilled 0 2m
cadvisor-4wl5l 1/1 Running 1 2m1s
cadvisor-xgrp8 0/1 OOMKilled 0 2m9s
cadvisor-xgrp8 1/1 Running 1 2m10s
cadvisor-95gqg 0/1 OOMKilled 0 2m16s
cadvisor-pfwpk 0/1 OOMKilled 0 2m18s
cadvisor-95gqg 1/1 Running 1 2m17s
cadvisor-pfwpk 1/1 Running 1 2m19s
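For reference, the OOMKilled reason can be confirmed from the pod status itself. A minimal sketch, assuming the pods carry an app=cadvisor label (adjust names and namespace to your deployment):

# Show the last termination state of one of the restarting pods (should report OOMKilled)
kubectl describe pod cadvisor-4wl5l | grep -A 5 "Last State"

# Or print the last termination reason for every cadvisor pod
kubectl get pods -l app=cadvisor -o jsonpath='{range .items[*]}{.metadata.name}{"\t"}{.status.containerStatuses[0].lastState.terminated.reason}{"\n"}{end}'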

@skgsergio
Contributor

skgsergio commented May 3, 2021

You might have encountered the same issue I did. In my case, cAdvisor with containerd collects the environment variables of every container and exports them as metric labels, which results in a huge amount of memory usage.

You can add these flags to the cadvisor container in your daemonset; if that solves your issue, then it is probably the same as mine:

--store_container_labels=false
--whitelisted_container_labels=io.kubernetes.container.name,io.kubernetes.pod.name,io.kubernetes.pod.namespace

These flags are just a workaround for the issue. For Docker there is an explicit whitelist for env vars, which is not yet implemented for containerd (I made a PR, #2857, so it behaves the same way as with Docker).
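For anyone applying this, here is a minimal sketch of appending the two flags to an existing DaemonSet with a JSON patch. It assumes the DaemonSet and its container are named cadvisor, live in the cadvisor namespace, and that the pod spec already defines an args list for the first container; otherwise, edit the manifest directly and re-apply it.

# Append the workaround flags to the cadvisor container's args
kubectl -n cadvisor patch daemonset cadvisor --type=json -p='[
  {"op": "add", "path": "/spec/template/spec/containers/0/args/-", "value": "--store_container_labels=false"},
  {"op": "add", "path": "/spec/template/spec/containers/0/args/-", "value": "--whitelisted_container_labels=io.kubernetes.container.name,io.kubernetes.pod.name,io.kubernetes.pod.namespace"}
]'

With the default RollingUpdate strategy, the DaemonSet controller replaces the pods after the patch, so the new flags take effect without deleting the DaemonSet.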

@madeinindiadot
Author

Thanks @skgsergio. Let me check that.

@madeinindiadot
Author

Thanks @skgsergio. Adding those parameters worked for me on the image gcr.io/cadvisor/cadvisor:v0.37.5.
However, on the image k8s.gcr.io/cadvisor:v0.30.2, it says those parameters are not available. Is there a specific version of the Kubernetes GCR registry image (k8s.gcr.io/cadvisor) from which these parameters are available? I am currently running a daemonset on Kubernetes with containerd.

@skgsergio
Contributor

skgsergio commented May 5, 2021

If it worked, then it is 100% the same issue: cAdvisor is not filtering env vars for containerd containers.

Regarding the image, is there any reason you want to use the k8s.gcr.io repo? The images in that repo don't imply that there is a Kubernetes-specific version of cAdvisor; gcr.io/cadvisor/cadvisor:v0.37.5 works fine in Kubernetes (we are using it in prod).
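If you already have the DaemonSet deployed with the other image, a one-line sketch for switching it over (the DaemonSet, container, and namespace names are assumptions, adjust to yours):

# Point the DaemonSet at the upstream image; the rolling update replaces the pods
kubectl -n cadvisor set image daemonset/cadvisor cadvisor=gcr.io/cadvisor/cadvisor:v0.37.5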

@MonicaMagoniCom

I'm having the same issue. Following the suggestion, I added the flags and there are fewer OOM kills, but memory usage is still high. Consider the attached image: it shows the memory consumption of our 4 instances (the memory limit is set to 700 MiB).
[Screenshot from 2021-05-06 14:19: memory consumption of the 4 cadvisor instances]

@iwankgb
Collaborator

iwankgb commented May 7, 2021

@MonicaMagoniCom, can you provide the following information (a quick way to collect it is sketched below):

  • number of containers running on each host
  • cAdvisor command line parameters used
  • cAdvisor version used.

I agree that close to 700 MiB looks worrying.

CC: @Creatone @dashpole - it seems to be related to the increase discussed in #2853.
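A minimal sketch for collecting that information, assuming the cadvisor pods carry an app=cadvisor label and crictl is available on the nodes (both are assumptions, adjust as needed):

# Number of containers running on a host (run on the node itself)
crictl ps -q | wc -l

# Command line parameters passed to the cadvisor container
kubectl get pods -l app=cadvisor -o jsonpath='{.items[0].spec.containers[0].args}'

# cAdvisor version, taken from the image tag
kubectl get pods -l app=cadvisor -o jsonpath='{.items[0].spec.containers[0].image}'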

@MonicaMagoniCom

MonicaMagoniCom commented May 10, 2021


You can find all the information here: #2856

@skgsergio
Contributor

It looks like @MonicaMagoniCom is using GKE version 1.14.10-gke.42, and as far as I remember it was not until 1.19 that GKE started shipping Kubernetes with containerd, so it might not be related to this specific issue.
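For anyone unsure which runtime their nodes are using, kubectl reports it per node (no assumptions beyond cluster access):

# The CONTAINER-RUNTIME column shows the runtime and its version, e.g. containerd://1.4.x
kubectl get nodes -o wide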
