Telegraf seems to have a memory leak #111

mattk42 · 2016-06-17T16:44:11Z

In my cluster running the 2.0 official release I have noticed that telegraf works its way up to 2GB of memory usage before it seemingly gets cleaned up and starts over.

jchauncey · 2016-06-17T16:50:22Z

Are you noticing the pod restarting a lot? There was an issue in the 0.13.x version of telegraf that caused the binary to panic when sending data to influx - influxdata/telegraf#1268

Check the pod logs and see if you see a similar error message.

mattk42 · 2016-06-17T17:01:17Z

Unfortunately I actually killed off the DS. I have my own monitoring stuff in place, so I shut down most of the deis-monitor components this morning.

I don't believe that the container is getting restarted, though. If that were the case, the container IDs in the chart above would have changed.

titilambert · 2016-06-22T19:27:13Z

Hello, I think I have the same issue.
I suspect the prometheus plugin. It seems it doesn't close its connections :/
Run this on your apiservers: netstat -ntp | grep 8080 | wc -l
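
A slightly stricter version of that check can be sketched as a small shell helper. This is only an illustration: the function name `count_port_conns` and the default port are made up for this example, and it assumes the usual `netstat -nt` column layout, where each address is followed by whitespace. Unlike a plain `grep 8080`, it does not count unrelated ports such as 18080.

```shell
# Count connections that involve a given port, reading `netstat -nt` style
# output from stdin. The function name and the default port are illustrative.
count_port_conns() {
    port="${1:-8080}"
    # Match ":<port>" only at the end of an address field (followed by
    # whitespace), so that ports like 18080 or 48080 are not counted.
    # grep -c exits non-zero when the count is 0, hence the `|| true`.
    grep -c ":${port}[[:space:]]" || true
}

# Example use on an apiserver (not executed here):
#   netstat -ntp | count_port_conns 8080
```

Running it every minute or so and watching whether the number keeps growing would be one way to confirm that connections are piling up rather than being reused or closed.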

jchauncey · 2016-06-22T19:46:42Z

We have seen this happen (especially on larger clusters), so we disabled the prometheus plugin by default in the image (although the chart turns it on). With the plugin disabled, you will lose out on k8s metrics and container metrics. I will open an issue with telegraf and see if we can get it fixed.

jchauncey · 2016-06-22T20:03:01Z

See here - influxdata/telegraf#1405

titilambert · 2016-06-22T20:25:47Z

PR influxdata/telegraf#1406 created

jchauncey · 2016-06-24T19:54:58Z

I have rebuilt the image to include the latest changes from master. It seems to have fixed the memory leak problem, but I'm not 100% sure of that. If you want to redeploy telegraf and check it out, that would be awesome.

titilambert · 2016-06-27T16:28:34Z

@jchauncey I'll test today or tomorrow and let you know when I get results ;)

titilambert · 2016-06-28T12:58:11Z

The issue is fixed for me!

bacongobbler closed this as completed Jun 28, 2016
