Lighthouse OOM mitigations #7053

Open
michaelsproul opened this issue Feb 27, 2025 · 7 comments
Labels
optimization (Something to make Lighthouse run more efficiently.)
v7.0.0-beta.clean (Clean release post Holesky rescue)
v7.0.0 (New release c. Q1 2025)

Comments

@michaelsproul
Member

michaelsproul commented Feb 27, 2025

Short term plan:

  1. Move banned block checks higher in block verification to prevent repeat state lookups (before every instance of load_parent in block_verification.rs)
  2. Encourage use of --state-cache-size 4 to avoid bad state cache pruning logic that is keeping 128x 180MB epoch boundary states around (~24GB of states).
  3. (DONE) Remove block root lookups from status processing. We are getting killed looking up old states to compute the block root. We need a more aggressive version of this PR: Optimise status processing #5481.

Point (1) is intended to fix an OOM that happens to nodes that are in sync and forced to process junk.
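
Roughly, the point (1) check could look like the sketch below. This is only an illustration, not the real `block_verification.rs` code: `BannedBlocks` and the surrounding types are hypothetical stand-ins for whatever structure tracks known-invalid roots, and the real check would sit before every `load_parent` call.

```rust
use std::collections::HashSet;

// Stand-in for Lighthouse's Hash256 block root type.
type Hash256 = [u8; 32];

#[derive(Debug)]
enum BlockError {
    /// The block (or its parent) is already known to be invalid.
    KnownInvalid(Hash256),
}

struct BannedBlocks {
    /// Roots of blocks previously determined to be invalid.
    roots: HashSet<Hash256>,
}

impl BannedBlocks {
    /// O(1) check performed *before* any parent-state lookup, so a junk block
    /// descending from a known-invalid root is rejected without loading a
    /// (potentially ~180MB) beacon state from cache or disk.
    fn check(&self, block_root: Hash256, parent_root: Hash256) -> Result<(), BlockError> {
        for root in [block_root, parent_root] {
            if self.roots.contains(&root) {
                return Err(BlockError::KnownInvalid(root));
            }
        }
        Ok(())
    }
}
```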

Point (2) fixes OOMs during head sync due to lots of epoch boundary states being retained.

To investigate later:

  1. Why are epoch boundary state diffs so large (180MB+), given that we should be basing them off each other while syncing sequential blocks? Answer: balances and inactivity_scores.
  2. Is an earlier invalid block check sufficient to prevent OOM while synced? Are there other states or valid side chains that are forcing us to load states and use too much memory?
  3. Why is sync sending us so many copies of the invalid block? Is there parallelism that is causing the OOM near the head?

Future plans (long-term fixes):

  1. Implement the PromiseCache concept used for attestation committees for beacon states. This is quite subtle to get right; a version was previously attempted but abandoned (Unify and lower state caches #5313). Tracking issue: Improve & unify parallel de-duplication caches #5112
  2. Implement size-based pruning for the state cache (a rough sketch follows this list). This is possible with my WIP changes from State cache memory size WIP #6532. However, that code is quite immature and the pruning itself is expensive (1.5s-4s or more), so we cannot ship this quickly. There is also some subtlety around deciding which states to prune based on size (we could use a heuristic similar to the existing cull method on the 20% largest states).
  3. Re-think pruning logic in cull so that it doesn't hang on to so many useless epoch boundary states.
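
To make plan (2) a bit more concrete, here is a very rough sketch of size-based pruning. The cache layout, the per-entry size estimate, and the "largest 20%, least-recently-used first" heuristic are all assumptions for illustration; the real state cache and its cull method are more involved.

```rust
use std::collections::HashMap;

type Hash256 = [u8; 32];

struct CachedState {
    /// Approximate in-memory size of this state's unshared data, in bytes.
    approx_size: usize,
    /// Monotonic counter; larger means more recently used.
    last_used: u64,
}

struct StateCacheSketch {
    states: HashMap<Hash256, CachedState>,
    /// Soft memory budget for the whole cache, in bytes.
    max_bytes: usize,
}

impl StateCacheSketch {
    /// Evict states until the estimated total size fits the budget, picking
    /// victims from the ~20% largest entries, least-recently-used first.
    fn prune_by_size(&mut self) {
        let total: usize = self.states.values().map(|s| s.approx_size).sum();
        if total <= self.max_bytes {
            return;
        }

        // Rank entries by size (descending) and keep the top 20% as eviction candidates.
        let mut by_size: Vec<(Hash256, usize, u64)> = self
            .states
            .iter()
            .map(|(root, s)| (*root, s.approx_size, s.last_used))
            .collect();
        by_size.sort_by(|a, b| b.1.cmp(&a.1));
        let cutoff = (by_size.len() / 5).max(1);
        let mut candidates: Vec<_> = by_size.into_iter().take(cutoff).collect();

        // Among the largest states, evict least-recently-used first.
        candidates.sort_by_key(|&(_, _, last_used)| last_used);

        let mut freed = 0usize;
        for (root, size, _) in candidates {
            if total - freed <= self.max_bytes {
                break;
            }
            self.states.remove(&root);
            freed += size;
        }
    }
}
```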
@michaelsproul added the optimization label on Feb 27, 2025
@michaelsproul added the v7.0.0 and v7.0.0-beta.clean labels on Feb 27, 2025
@michaelsproul
Member Author

Merged the status processing fix to the holesky-rescue branch.

@michaelsproul
Member Author

Just thought of another source of unbounded state lookups: BlocksByRoot and BlocksByRange.

It might be time to build a dedicated in-memory DAG of block roots which we can use instead of the state-based block iterators.
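
A hedged sketch of what that could look like: a map from block root to (parent root, slot), populated from fork choice / the hot DB, that lets BlocksByRoot and BlocksByRange walk ancestors without ever loading a state. The names here are illustrative, not Lighthouse's.

```rust
use std::collections::HashMap;

type Hash256 = [u8; 32];
type Slot = u64;

/// In-memory DAG of block roots: child root -> (parent root, slot).
#[derive(Default)]
struct BlockRootDag {
    nodes: HashMap<Hash256, (Hash256, Slot)>,
}

impl BlockRootDag {
    fn insert(&mut self, block_root: Hash256, parent_root: Hash256, slot: Slot) {
        self.nodes.insert(block_root, (parent_root, slot));
    }

    /// Walk back from `head_root`, yielding (root, slot) pairs until the DAG
    /// runs out (e.g. at the finalized boundary). No state loads required.
    fn ancestors<'a>(&'a self, head_root: Hash256) -> impl Iterator<Item = (Hash256, Slot)> + 'a {
        std::iter::successors(
            self.nodes
                .get(&head_root)
                .map(|&(parent, slot)| (head_root, parent, slot)),
            move |&(_, parent, _)| {
                self.nodes
                    .get(&parent)
                    .map(|&(grandparent, slot)| (parent, grandparent, slot))
            },
        )
        .map(|(root, _, slot)| (root, slot))
    }
}
```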

cc @dapplion

@eserilev
Collaborator

eserilev commented Feb 27, 2025

Why are epoch boundary state diffs so large (180MB+), given that we should be basing them off each other while syncing sequential blocks?

Inactivity leak penalties are applied at each epoch boundary. The longer we're in non-finality, the heavier the penalties become, so not only are validator balances changing at epoch boundaries, but potentially effective balances as well. This could be one reason epoch boundary state diffs are so large, especially considering how large the validator set is on Holesky.

@michaelsproul
Member Author

michaelsproul commented Mar 3, 2025

Jimmy and I had a look at the diffs using lcli (based on the states from our experiment the other day, logs here). They are legitimately big.

Mostly balances and inactivity scores.

[2025-03-03T01:16:43Z INFO  lcli::skip_slots] Using mainnet spec
[2025-03-03T01:16:43Z INFO  lcli::skip_slots] Advancing 32 slots
[2025-03-03T01:16:43Z INFO  lcli::skip_slots] Doing 1 runs
[2025-03-03T01:16:43Z INFO  lcli::skip_slots] State path: "/home/michael/eth2/milhouse-diff-test/state_3714912.ssz"
[2025-03-03T01:16:44Z DEBUG lcli::transition_blocks] SSZ decoding /home/michael/eth2/milhouse-diff-test/state_3714912.ssz: 542.830343ms
[2025-03-03T01:16:49Z INFO  lcli::skip_slots] Post-state balances size (total/diff): 84594032-84594032 B/79783384 B
[2025-03-03T01:16:49Z INFO  lcli::skip_slots] Post-state validators size: 522924032-522924032 B/9432 B
[2025-03-03T01:16:49Z INFO  lcli::skip_slots] Post-state inactivity_scores size: 84594032-84594032 B/79783384 B

We've started working on a milhouse PR to intra-rebase a list on itself, i.e. to exploit internal structural sharing. If there are e.g. 1k identical inactivity scores in a row, then we can reuse memory for them.
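
This isn't milhouse's actual API, but the idea can be sketched with a toy example: chunk a flat list into fixed-size leaves and intern identical leaves behind a shared `Arc`, so a long run of equal inactivity scores costs a handful of allocations instead of one per chunk.

```rust
use std::collections::HashMap;
use std::sync::Arc;

const CHUNK_SIZE: usize = 1024;

/// Deduplicate a flat list into shared chunks: identical 1024-element leaves
/// (e.g. runs of equal inactivity scores) point at the same allocation.
fn dedup_into_chunks(values: &[u64]) -> Vec<Arc<Vec<u64>>> {
    let mut interner: HashMap<Vec<u64>, Arc<Vec<u64>>> = HashMap::new();
    values
        .chunks(CHUNK_SIZE)
        .map(|chunk| {
            interner
                .entry(chunk.to_vec())
                .or_insert_with(|| Arc::new(chunk.to_vec()))
                .clone()
        })
        .collect()
}

fn main() {
    // One million identical scores in a row.
    let scores = vec![7u64; 1_000_000];
    let chunks = dedup_into_chunks(&scores);

    // Count distinct backing allocations: one for the full chunks plus one
    // for the shorter tail chunk.
    let mut unique: Vec<*const Vec<u64>> = chunks.iter().map(|c| Arc::as_ptr(c)).collect();
    unique.sort();
    unique.dedup();
    println!("{} chunks share {} allocations", chunks.len(), unique.len());
}
```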

@michaelsproul
Member Author

Latest investigation reveals sources of state cache miss:

  • BlocksByRange requests that span across the finalized epoch. We can probably tweak our logic to avoid this requiring a state lookup.
  • Gossip blocks and advance head. We seem to be flushing good states from the cache, to the point of having to reload them.

Ideas to fix:

  • Splice block roots from freezer with fork choice for BlocksByRange (rough sketch below)
  • Maybe protect the head state in the state cache
  • Maybe avoid adding "ancestor states" to the state cache
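
For the splice idea, a minimal sketch of how the two sources could be stitched together per slot: the pre-finalization part of a BlocksByRange request is served from frozen (finalized) block roots, the rest from fork choice, with no beacon state loaded. The two lookup closures are stand-ins, not real Lighthouse calls.

```rust
type Hash256 = [u8; 32];
type Slot = u64;

/// Block roots for slots [start_slot, start_slot + count), spliced from two
/// state-free sources: the freezer DB below the finalized slot, and fork
/// choice (which tracks all unfinalized blocks) at and above it.
/// `None` entries represent skipped slots.
fn block_roots_for_range(
    start_slot: Slot,
    count: u64,
    finalized_slot: Slot,
    frozen_root: impl Fn(Slot) -> Option<Hash256>,       // stand-in for a freezer DB lookup
    fork_choice_root: impl Fn(Slot) -> Option<Hash256>,  // stand-in for a fork choice lookup
) -> Vec<Option<Hash256>> {
    (start_slot..start_slot + count)
        .map(|slot| {
            if slot < finalized_slot {
                frozen_root(slot)
            } else {
                fork_choice_root(slot)
            }
        })
        .collect()
}
```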

@realbigsean
Member

BlocksByRange requests that span across the finalized epoch. We can probably tweak our logic to avoid this requiring a state lookup.
Splice block roots from freezer with fork choice for BlocksByRange

Made a PR for this here #7066

@realbigsean
Member

I've made a PR here that tries to make the state cache more intelligent; it's a bigger/more complicated change:

https://github.com/sigp/lighthouse/pull/7069/files
