
server: memory pressure notification hooks #64965

Open
jordanlewis opened this issue May 10, 2021 · 7 comments
Labels
- A-cc-enablement: Pertains to current CC production issues or short-term projects
- A-observability-inf
- C-investigation: Further steps needed to qualify. C-label will change.
- T-observability

Comments

@jordanlewis
Member

jordanlewis commented May 10, 2021

When CockroachDB runs out of memory, the failure usually manifests as a SIGKILL from the kernel's OOM killer, which gives the program no time to write crash dumps or take any other emergency action.

It appears that cgroups can be configured to send notifications before this happens, at a configurable percentage of memory used. See: https://www.kernel.org/doc/html/latest/admin-guide/cgroup-v2.html#memory-interface-files

It would be useful to let CockroachDB detect a close-to-OOM situation and run custom logic, such as writing a crash dump, dumping active goroutines and a memory profile, or even dropping unnecessary caches.

Even without cgroups, perhaps we could poll Go's memstats, compare against a configured maximum, and use that to trigger any hooks.
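A minimal sketch of what that polling approach might look like; the limit, interval, and hook here are illustrative names, not crdb APIs:

```go
package main

import (
	"runtime"
	"time"
)

// pollMemStats periodically reads Go's memstats and fires onPressure
// once the live heap crosses maxBytes. Note runtime.ReadMemStats briefly
// stops the world, so the interval shouldn't be too aggressive.
func pollMemStats(maxBytes uint64, interval time.Duration, onPressure func(runtime.MemStats)) {
	var m runtime.MemStats
	for range time.Tick(interval) {
		runtime.ReadMemStats(&m)
		if m.HeapAlloc > maxBytes {
			onPressure(m) // hook point: crash dump, goroutine dump, etc.
		}
	}
}
```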

Jira issue: CRDB-7366

jordanlewis added the C-bug label (Code not up to spec/doc, specs & docs deemed correct. Solution expected to change code/behavior.) on May 10, 2021
@ajwerner
Contributor

The Go folks have long wanted people to experiment with the prototype linked in golang/go#29696 (https://go-review.googlesource.com/c/go/+/46751). This exploration might be a good occasion to give it a shot.

knz added the C-investigation label (Further steps needed to qualify. C-label will change.) and removed the C-bug label on May 11, 2021
@abarganier
Contributor

@ajwerner thanks for linking this; it definitely seems worth experimenting with, especially since runtime experiments are already happening elsewhere, such as @knz's task group resource accounting.

In a hypothetical where we actually used the SetMaxHeap API in crdb, I imagine a goroutine responsible for crash dumps could wait on the SetMaxHeap notify channel, with bytes set to a very high value (one we'd expect to be a precursor to an OOM). On a send to the channel, it would immediately begin writing crash dump information to a file in anticipation of an OOM. Since you know crdb's internals much better than I do, what's your take on this idea, @jordanlewis?
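A rough sketch of that idea, assuming the prototype's signature from the CL (roughly debug.SetMaxHeap(bytes int, notify chan<- struct{}) int); this only builds against the patched toolchain from CL 46751, since SetMaxHeap never merged into a released Go, and writeCrashDump is a hypothetical placeholder:

```go
package main

import "runtime/debug"

// writeCrashDump is a placeholder for dumping active goroutines, a heap
// profile, etc. to disk before a potential OOM kill.
func writeCrashDump() {}

// watchHeap installs a soft heap limit via the prototype SetMaxHeap API
// (CL 46751; not part of any released Go version) and reacts to heap
// pressure notifications on a dedicated goroutine.
func watchHeap(limitBytes int) {
	notify := make(chan struct{}, 1)
	debug.SetMaxHeap(limitBytes, notify)
	go func() {
		for range notify {
			// The runtime signals we're near the configured limit:
			// write crash dump artifacts while we still can.
			writeCrashDump()
		}
	}()
}
```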

@knz
Contributor

knz commented May 25, 2021

The idea is sound.

@knz
Contributor

knz commented May 25, 2021

It might even be worth adding an explicit call to runtime.GC() in that handler; who knows, it may delay the OOM crash a bit further.

jlinder added the T-server-and-security label (DB Server & Security) on Jun 16, 2021
knz added the A-cc-enablement label (Pertains to current CC production issues or short-term projects) on Jul 29, 2021
@knz
Contributor

knz commented Jul 29, 2021

This is obs infra, not server.

@abarganier
Contributor

abarganier commented Jul 29, 2021

Just a quick update on this: through experimentation, I was unable to subscribe to the memory.pressure_level notification from nodes running in containers, because the container process's access to the cgroupfs is (understandably) read-only. Subscribing to these notifications requires write access to the cgroup.event_control file in the memory subsystem, and granting that access comes with a big security sacrifice (running the container privileged).

I haven't given up on this entirely, as it would be an exceptional heuristic for us to have access to from inside each crdb node. Perhaps we could explore subscribing to these notifications in the orchestration layer and delivering them to the relevant crdb node over the network; the sketch below shows the in-container registration that fails today. More experimentation is needed to determine whether that is a valid approach.
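For reference, a minimal sketch of the cgroup-v1 eventfd registration that was attempted, assuming the memory controller is mounted at /sys/fs/cgroup/memory; in a container with a read-only cgroupfs, the write to cgroup.event_control is exactly the step that fails:

```go
package main

import (
	"encoding/binary"
	"fmt"
	"os"

	"golang.org/x/sys/unix"
)

func main() {
	const cg = "/sys/fs/cgroup/memory" // assumed cgroup-v1 mount point

	// An eventfd the kernel will signal on memory pressure.
	efd, err := unix.Eventfd(0, 0)
	if err != nil {
		panic(err)
	}
	pressure, err := os.Open(cg + "/memory.pressure_level")
	if err != nil {
		panic(err)
	}

	// Register "<event_fd> <pressure_level_fd> <level>" with the memory
	// controller. This needs write access to cgroupfs, which containers
	// typically don't have.
	ctl := fmt.Sprintf("%d %d critical", efd, pressure.Fd())
	if err := os.WriteFile(cg+"/cgroup.event_control", []byte(ctl), 0o644); err != nil {
		panic(err) // fails with a read-only cgroupfs
	}

	buf := make([]byte, 8)
	for {
		// Blocks until the kernel signals the eventfd.
		if _, err := unix.Read(efd, buf); err != nil {
			panic(err)
		}
		fmt.Printf("memory pressure (count=%d): hook point for crash dumps\n",
			binary.LittleEndian.Uint64(buf))
	}
}
```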

@github-actions

We have marked this issue as stale because it has been inactive for 18 months. If this issue is still relevant, removing the stale label or adding a comment will keep it active. Otherwise, we'll close it in 10 days to keep the issue queue tidy. Thank you for your contribution to CockroachDB!
