
Memory leak in server? #5170

Closed
mrdavidlaing opened this issue Oct 22, 2015 · 65 comments
Labels
notabug: indicates that the issue wasn't a bug

Comments

@mrdavidlaing
Contributor

I'm running Kibana 4.2.beta2 as a Cloud Foundry app with 384MB of memory assigned to each app instance container.

I'm querying against a fairly small elasticsearch cluster, at a rate of about 20 requests / minute - basically very little load.

There appears to be some form of memory leak that causes the server-side nodejs portion of Kibana to gradually consume more and more memory until it exhausts the 384MB allocated to it in the container.

A couple of follow-on questions:

  1. What is the correct amount of RAM to allocate to the Kibana 4 server?
  2. Is this a known issue? Are other people running into something similar?
  3. What information / test case is most helpful for me to provide to help debug further?

Thanks!

@nestorsamelo

I am seeing the same. Memory consumption keeps increasing. I had to restart it at around 1.2 GB.

@rashidkpc
Contributor

What exactly occurs? Does the oom_killer slay the process? While it's possible there could be a memory leak, the server doesn't do all that much. It's possible this is just cache.
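
For anyone unsure how to check that: a quick way to see whether the OOM killer fired is to grep the kernel log, for example:

# look for OOM-killer activity in the kernel log
dmesg | grep -iE 'out of memory|killed process'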

@nestorsamelo

I have one instance running for 4 days and 18 hours:

root@host# ps -p [pid] -o etime=
 4-18:09:40

top shows:

  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND
[pid] root      20   0 1900m 1.1g 8316 S  0.0 27.6  18:31.08 node

proc memory statistics:

root@host# cat /proc/[pid]/statm
486466 279120 2079 4044 0 477276 0

According to the documentation:

    /proc/[pid]/statm
              Provides information about memory usage, measured in pages.
              The columns are:
                  size       (1) total program size (same as VmSize in /proc/[pid]/status)
                  resident   (2) resident set size (same as VmRSS in /proc/[pid]/status)
                  share      (3) shared pages (i.e., backed by a file)
                  text       (4) text (code)
                  lib        (5) library (unused in Linux 2.6)
                  data       (6) data + stack
                  dt         (7) dirty pages (unused in Linux 2.6)

In my case, it didn't get to a point where oom_killer had to step in, as it is running in a 4 GB VM.
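
As a cross-check, statm is measured in pages, so multiplying the resident column by the page size should roughly match top's RES (a small sketch, assuming the usual 4 KiB pages):

# 2nd statm field (resident pages) * page size = RSS in bytes
# here: 279120 * 4096 ≈ 1.1 GiB, matching the RES column in top
echo $(( $(awk '{print $2}' /proc/[pid]/statm) * $(getconf PAGESIZE) ))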

@Bennell

Bennell commented Oct 27, 2015

@rashidkpc: We can also confirm the memory leak.

We started the dev server locally with esvm also running. No extra config, no custom dashboards, clean slate from freshly cloned kibana repo.

We monitored the process's memory usage for about half an hour (we will let it run some more), and it is slowly creeping up even without the Kibana UI running. It leaks a lot faster if I keep refreshing the UI. So far the amount of memory used by the process has doubled since starting it.

I'm not sure what other details we could share. We have seen the memory leak happening on OSX and Linux, nodejs 0.12.7.
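
For anyone who wants to capture the same trend, a minimal sampling loop looks something like this (an illustrative sketch, not the exact tooling used above; the pgrep pattern assumes a dev server started via ./src/cli):

# log the Kibana node process's RSS once a minute until it exits
PID=$(pgrep -f 'node .*src/cli' | head -n1)
while kill -0 "$PID" 2>/dev/null; do
  printf '%s %s KiB\n' "$(date '+%F %T')" "$(ps -o rss= -p "$PID")"
  sleep 60
done >> kibana-rss.log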

@rashidkpc
Contributor

Growing memory isn't necessarily a sign of a leak.

The VM will only garbage collect at signs of memory pressure. You can confirm this by taking a heapdump of the process, which will force a garbage collection. If you then wait some time, or perform some actions, then take another heap dump, you can compare them.

These two heap dumps were taken ~10 minutes apart; as you can see, the memory actually goes down. Unless the process is dying I wouldn't worry about the growing memory. It will be freed as needed.
[screenshot: two heap snapshots taken ~10 minutes apart, the second one smaller]
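
For anyone who wants to take comparable snapshots without attaching dev tools, one option is the third-party node-heapdump module (a sketch only; it is not something Kibana ships with, and the -r preload flag needs a node build that supports it):

# install the third-party heapdump module next to Kibana
npm install heapdump
# start the server with the module preloaded
node -r heapdump ./src/cli
# each SIGUSR2 forces a GC and writes a heapdump-*.heapsnapshot file;
# load two of them in Chrome DevTools (Profiles tab) and compare
kill -USR2 <kibana-pid>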

@rashidkpc rashidkpc changed the title Memory leak in Kibana 4.2.beta2 server Memory leak in server Oct 27, 2015
@rashidkpc rashidkpc changed the title Memory leak in server Memory leak in server? Oct 27, 2015
@cromega

cromega commented Oct 27, 2015

@rashidkpc I can also confirm that freeing up memory doesn't happen. The app container has 384MB memory assigned which slowly keeps filling up until either the CloudFoundry runtime kills the container and starts a new one, or, and this happens more often, the kibana server just silently stops serving requests, breaking the UI.

I will try running the app with significantly more memory and see what happens.

@rashidkpc
Contributor

@cromega did 4.1.x behave differently? 4.2 switched to node 0.12 which is slightly more relaxed about GCing. I'm not familiar with how cloud foundry works unfortunately.

@cromega

cromega commented Oct 27, 2015

@rashidkpc: I am running 4.2-beta2, before that I used a 4.2-snapshot. I have no experience with 4.1. Thanks for the info, I will play around with different node versions.

@nestorsamelo

@rashidkpc: The memory footprint of 4.1.2 (103 MB) is smaller than 4.2.0-beta2 (473 MB).

top for 4.1.2 (uptime 1-03:33:45):

  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND
 1716 root      20   0  998m 103m 7136 S  0.0  2.6   0:02.31 node

top for 4.2.0-beta2 (uptime 6-00:28:29):

  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND
20998 root      20   0 1280m 473m 8316 S  0.3 12.0  23:33.94 node

This last one is the same instance as my previous comment, so (resident) memory consumption has indeed decreased as you suggested above, from 1.1 GB to 473 MB.

@PSiAU

PSiAU commented Nov 1, 2015

I'm not running Kibana as a Cloud Foundry app but am seeing the same issue with 4.2.0 (final release). Kibana is using 646 MB of memory. Surely Kibana shouldn't use that much memory?

  PID USER      PR  NI  VIRT  RES  SHR S  %CPU %MEM    TIME+  COMMAND                                                                                                                  
11150 root      20   0 1183m 646m 8720 S   0.0 32.1   1:01.90 node

Update:

8 hours later, it has grown to this:

  PID USER      PR  NI  VIRT  RES  SHR S  %CPU %MEM    TIME+  COMMAND                                                                                         
11150 root      20   0 1389m 854m 5264 S   0.0 42.5   2:08.03 node

Running on Debian 7.9

@PSiAU

PSiAU commented Nov 1, 2015

As a test, I restarted Kibana and left it running on the server overnight without logging into it at all on the browser. Memory usage still grows nonetheless.

  PID USER      PR  NI  VIRT  RES  SHR S  %CPU %MEM    TIME+  COMMAND                                                                               
18161 root      20   0 1426m 799m 4324 S   0.3 39.7   1:17.29 node 

@nfnty

nfnty commented Nov 3, 2015

I'm having the same problem running in a Docker container with 4.2.0 and Elasticsearch 2.0.0. No problem with 4.1.2. The 1 GiB container gets OOM-killed after approximately 14 hours.

nfnty/arch-kibana:latest with arguments

@ricardojdsilva87

Hello,

Same problem here.
After upgrading to Kibana 4.2.0 (with ES and Logstash upgrades too), I had problems with ES crashing after a few hours; the OOM killer was being invoked. I tried lowering ES_HEAP to 25% of available RAM yesterday, but today the server's free memory was too low again (only 50 MB remained).

After taking a closer look at the top output, I noticed that the node process was using about 40% of the available RAM, while ES and Logstash remained stable at ~40% and 7.5% respectively (see below).

[image: top output showing the node process at ~40% of RAM]

On the previous version of Kibana (4.1.2), with ES 1.7.2 and Logstash 1.5.4, I had no problems; the processes ran fine for a month until the upgrade. ES was configured with the recommended settings:

https://www.elastic.co/guide/en/elasticsearch/reference/current/setup-configuration.html

I'm going to leave it running for the rest of the day to check on the memory usage of each process; I'll update the thread later.

Thanks for the help

@ricardojdsilva87

Hello,

Just a quick update: Kibana 4.2.0 has been running for about 2 hours, and it seems node's memory usage has increased by 10% of the total RAM.

[image: top output after ~2 hours]

Meanwhile, ES and Logstash remained practically the same.

Best Regards

@ricardojdsilva87

Hello

After leaving the services running all night, this morning I have this scenario:

[image: top output after running overnight]

The node memory usage climbed to 40% of the server's RAM.
In a previous comment @rashidkpc said that the Kibana server now uses node v0.12. I'm going to investigate whether there are any known issues with this version, and maybe try a newer node version.

Best regards

@peterholak

Same here; see the attached picture. Kibana was started at the 22:00 mark (no other change happened on the server since then), and I had it open in a browser for only about 2 minutes in total. Kibana 4.2.0, ES 2.0, Ubuntu 15.10.

[graph: kibana memory usage climbing steadily from the 22:00 mark]

@rvrignaud

Hello,
Same problem here. Kibana 4.2.0 deployed in a container (using Ubuntu 15.04) with Elasticsearch 2.0. We have zero usage of Kibana (no one is using it); however, we see memory usage growing linearly.
Here is the memory usage:

[graph: container memory usage over time, with Kibana 4.1 then 4.2]

The first part of the graph shows the behaviour with Kibana 4.1. The top green line is the memory limit of the container. Each time Kibana reaches the memory limit, it gets OOM-killed. As the graph shows, we have increased the memory limit a few times, up to 1.5 GB, with no change in behaviour.

@mrdavidlaing
Contributor Author

@rashidkpc - You've labeled this not reproducible; how can we work together to get a test case you can use that reproduces the behaviour?

What development / test environment do you guys use - perhaps those of us on this ticket could put together something that replicates the error in your environment?

@spalger
Contributor

spalger commented Nov 6, 2015

@mrdavidlaing we primarily use the chrome dev tools for memory profiling. I'm not sure how Rashid was collecting his snapshots (perhaps by signalling the node process) but I was doing the same by running the server with iron-node.

npm install -g iron-node
cd ~/dev/kibana
iron-node ./src/cli

With iron-node you can use the memory profiling tools to collect snapshots and see the items that are not being garbage collected, or track memory over time while forcing constant garbage collection. These types of metrics are a lot more meaningful than the total process memory level, since v8 will only do light garbage collection until it detects serious memory pressure.

@PSiAU

PSiAU commented Nov 8, 2015

Curiosity got the better of me so I left the node process growing just to see what would happen. It started consuming just about all free memory on the instance (not cache memory but Active memory). IO Wait hit 100% as the OS tried to deal with the low memory situation. I had to restart node before the OOM killer kicked in.

@jraby

jraby commented Nov 9, 2015

Just for the record, we're running Kibana 4.2.0 on ubuntu 12.04 and we see the same behaviour.

At startup, the kibana node process takes ~100mb (resident size seen from ps aux) and it grows steadily until the OOM killer kills something: Kibana or ES if we're not lucky.

Our usage pattern is very light, I'm the only user of the kibana installation at the moment and I don't have any dashboard open.

I've enabled verbose logging; it logs the memory usage every 5 seconds, and I'll post it later.
For the moment, Kibana reported using 70 MB on startup and now reports 107 MB after 30 minutes.

RSS as reported by ps aux started at ~100 MB and is now at 170 MB.

@mrdavidlaing
Contributor Author

@jraby, @PSiAU - Out of interest, are you guys running Kibana inside a container, on an Ubuntu VM, or on bare metal?

@PSiAU

PSiAU commented Nov 10, 2015

I'm running it on Debian 7.9 (64-bit), in an EC2 instance.

@fpighi

fpighi commented Nov 10, 2015

Having the same issue reported by other people here with Kibana 4.2.0. Memory usage on the server is going up in a linear pattern.
We are using Kibana inside a Docker container running on an Amazon Linux AMI in EC2.

@jraby

jraby commented Nov 10, 2015

Ubuntu 12.04 on ec2.

@danrimal

I have an OOM killer log (the VM had 2 GB RAM):

[9827667.988558] Out of memory in UB 5517: OOM killed process 8709 (node) score 0 vm:5478276kB, rss:1712056kB, swap:1977324kB

Now the server has 6 GB RAM, and Kibana 4.2.0 with the Sense and Marvel apps climbs to 6 GB RAM after 24 hours.

@dngferreira

Using --max-old-space-size=150 seems to fix the problem in Docker.
Just to be on the safe side I have a 300 MB limit on the container, but it hasn't been triggered yet.
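
The general shape of that setup, for anyone else on Docker, looks roughly like this (a sketch only; the image name and CLI path are illustrative, not taken from this thread):

# cap V8's old generation well below the container limit, and keep the
# container limit itself as a backstop
docker run -d -m 300m --name kibana some/kibana-image \
  node --max-old-space-size=150 /opt/kibana/src/cli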

@PSiAU

PSiAU commented Nov 23, 2015

Looks like --max-old-space-size is working nicely, thanks @mrdavidlaing.
Note: I was originally adding --max-old-space-size to /etc/init.d/kibana in the args parameter instead of /opt/kibana/bin/kibana. You need to add it to /opt/kibana/bin/kibana or it will not work.
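
For reference, the flag has to reach the actual node invocation at the end of bin/kibana, something along these lines (a sketch; the exact stock script and variable names vary between releases):

# last line of /opt/kibana/bin/kibana -- add the flag before the CLI path
exec "${NODE}" --max-old-space-size=200 "${DIR}/src/cli" ${@}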

@thomasmarkus

I'm happy to report that the PR from @mrdavidlaing also fixes the memory leak for us. 👍

@FlorinAndrei

Can we get this fix into a release please? I've noticed that bin/kibana in the repo still doesn't limit memory usage.

@dbason

dbason commented Dec 4, 2015

We've also recently run into a similar issue with Kibana 4.3.

Edit: Will try the max-old-space-size fix.

@dyakhnov

dyakhnov commented Dec 4, 2015

Same issue here: Kibana 4.3 in Docker 1.6.2 on AWS EC2
It works like Cookie Monster -- nom, nom, nom...

@ricardclau

Same here; in my case I had a t2.micro instance running both Kibana 4.3 and Elasticsearch 2.1.

With this particular setup Elasticsearch ended up crashing (presumably killed by the OOM killer), but it seems the issue was with the Node.js application.

@mrdavidlaing saved the day for me :)

Any plans for a 4.3.1 with this patch anytime soon? 👍

@FlorinAndrei

--max-old-space-size=250 appears to fix it for me; the process size seems to hover around 300 MB, give or take a few, with no more OOMs. Using Kibana 4.2.1.

@mrdavidlaing
Contributor Author

I'm closing this because #5451 addresses this issue by allowing you to explicitly set node's GC settings via a NODE_OPTIONS environment variable. Should hopefully make the 4.4 release.

FWIW, in a low memory environment you should be able to prevent OOM errors by starting Kibana with:

NODE_OPTIONS="--max-old-space-size=200" bin/kibana

Adjust the 200 so that 100 MB + 200 MB is less than the memory (in MB) available in your container.
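
For example, using the round numbers from this thread (a rough sizing sketch, not an official guideline):

# 384 MB container (original report): 100 MB + 200 MB = 300 MB < 384 MB
NODE_OPTIONS="--max-old-space-size=200" bin/kibana

# 1.5 GB container: 100 MB + 1300 MB = 1400 MB < 1536 MB
NODE_OPTIONS="--max-old-space-size=1300" bin/kibana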

@rashidkpc
Contributor

Adding for posterity: nodejs/node#2683

@austinchambers

Given this issue was reproducible, perhaps the "not reproducible" tag should be removed?

@ageis

ageis commented May 10, 2017

See the latest comment in #5451; the systemd service is broken because of this. Who thought > was a valid comment character?

@tylersmalley
Contributor

@ageis, not us. Make sure you're using our APT repository.

anthraxx pushed a commit to anthraxx/arch-pkgbuilds that referenced this issue Jul 18, 2017