
event monitor causes infinite memory growth #165

Closed
minrk opened this issue Apr 23, 2018 · 9 comments

@minrk
Member

minrk commented Apr 23, 2018

The event reflector in #150 causes a memory leak, which brought down Binder last week.

I suspect we aren't properly cleaning up the reflector when we are done with it. We need to investigate and fix this before releasing kubespawner or pushing it to zero-to-jupyterhub.

@yuvipanda
Collaborator

@minrk it isn't being used for anything right now, so I reverted it in #166

@minrk
Member Author

minrk commented Apr 23, 2018

I tracked it down. Instantiating a Watch unconditionally instantiates an APIClient, which unconditionally spawns n_cpus threads. So when we create one reflector per pod, we are creating a huge number of threads.

It's the same as the kube-4.0 upgrade bug, but isolated to a more specific circumstance.
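
A minimal sketch of the effect, assuming the kubernetes 4.x client behavior described above (each `Watch()` eagerly builds its own `ApiClient`, whose thread pool starts one worker per CPU):

```python
# Sketch: watch the thread count grow as Watch objects are created.
# Assumes the kubernetes 4.x client, where Watch() constructs an ApiClient
# backed by a multiprocessing ThreadPool with one worker per CPU.
import threading

from kubernetes import watch

print("threads before:", threading.active_count())

# One reflector per pod means one Watch per pod, and therefore
# n_cpus extra threads per pod that are never reclaimed.
watchers = [watch.Watch() for _ in range(10)]

print("threads after:", threading.active_count())
```

Every Watch that is never cleaned up keeps its worker threads alive, which is where the unbounded memory growth comes from.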

@minrk
Member Author

minrk commented Apr 23, 2018

It does provide debug logging, and is used in #153. But I agree that we should probably revert it for now, or pin kubernetes-3 again.

@minrk
Member Author

minrk commented Apr 23, 2018

I opened a PR against swagger-codegen, which is responsible for the flood of threads.

@yuvipanda
Collaborator

@minrk let's just revert it and reintroduce it when it's needed. Does that sound right, @clkao?

@clkao
Contributor

clkao commented Apr 23, 2018

Yeah, please revert it for now, and I'll make it part of #153. Is there an alternative way to make the reflector sane before Watch() gets fixed in swagger-codegen?

@minrk
Member Author

minrk commented Apr 23, 2018

@clkao thanks. One option is to pin the kubernetes client back to 3.x, which doesn't have this issue. We might also be able to find a workaround: I don't think we're reliably cleaning these reflectors up, and fixing that could help.
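
For reference, a stopgap sketch of that cleanup. It leans on internals of the 4.x client (the private `_api_client` attribute and its `pool`), so it's illustrative, not a supported API:

```python
# Stopgap sketch: stop a Watch and release its ApiClient's worker threads.
# Relies on kubernetes 4.x internals (_api_client, pool); illustrative only.
from kubernetes import watch

w = watch.Watch()
try:
    pass  # ... consume w.stream(...) events here ...
finally:
    w.stop()  # ask the streaming loop to exit
    pool = getattr(getattr(w, "_api_client", None), "pool", None)
    if pool is not None:
        pool.terminate()  # reclaim the per-CPU worker threads
```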

@clkao
Contributor

clkao commented Apr 24, 2018

Hmm, I think the reflectors are stopped once the pod is running. Maybe something is still missing.

@consideRatio
Member

> I tracked it down. Instantiating a Watch unconditionally instantiates an APIClient, which unconditionally spawns n_cpus threads. So when we create one reflector per pod, we are creating a huge number of threads.

The reflectors we use are now singletons, so I'm assuming this is resolved.
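
A minimal sketch of that singleton idea (names are illustrative, not kubespawner's actual classes): every spawner shares one reflector, so only one Watch, and therefore one ApiClient thread pool, exists per process.

```python
# Illustrative sketch of a process-wide shared reflector; not kubespawner's
# actual implementation. One Watch total means one ApiClient thread pool total.
import threading

from kubernetes import watch


class SharedReflector:
    _instance = None
    _lock = threading.Lock()

    @classmethod
    def instance(cls, namespace="default"):
        # Hand every spawner the same reflector instead of one per pod.
        with cls._lock:
            if cls._instance is None:
                cls._instance = cls(namespace)
            return cls._instance

    def __init__(self, namespace):
        self.namespace = namespace
        self.watch = watch.Watch()  # the only Watch this process creates
```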
