
Fix/Rocksdb cache not set #5578

Merged: 20 commits into master from cleanup/explicit-global-rocksdb-cache on Apr 30, 2023

Conversation

asdacap (Contributor) commented Apr 17, 2023

  • Originally, I thought of making the shared cache that we set across RocksDB tables explicit.
  • But it turns out that because the cache is set up after the options are built, in effect no block cache was set, causing every RocksDB table to fall back to the default 32 MB (or 8 MB, depending on the RocksDB version) of block cache. For reference, the state DB's block cache is supposed to be about 900 MB.
  • But when I set the block cache properly, block processing time became more spiky. The cache hit metric increased and less IO was observed, but block processing throughput suffered. In fact, less cache for state improved block processing time stability.
  • I suspected that the block size (16K) was too large, causing each node lookup to fetch 16K just to populate the cache.
  • True enough, setting a much lower block size (1K) significantly reduces IO (in terms of bps), by nearly 10x. However, a lower block size comes with the downside of a proportionally larger number of index blocks (higher memory consumption) and higher IOPS.
  • To compensate for that, TwoLevelIndex is turned on, which significantly reduces the in-memory part of the index. CacheIndexAndFilterBlocks is also advised, which makes the block index live in the block cache. With an existing DB, if CacheIndexAndFilterBlocks is set to true, memory usage decreases by about 1 GB at start, but performance suffers, likely because the larger index and block size make the cache less effective. CacheIndexAndFilterBlocks therefore remains turned off, but a freshly synced DB can turn it on to reduce memory.
  • A lower block size also comes with the downside of higher IOPS. Experiments show the difference in block processing time between 4K and 512 block sizes is negligible. IO in terms of bps reduces even further with 512, but in terms of ops it increases a little. The index cache hit rate goes down (512 has 8 times more index entries than 4K), and the data cache hit rate goes down too, making it less effective than 4K. A lower block size also increases the DB size a little (about 10 GB more for 512 compared to 4K).
  • Anyway, I've set it to 4K by default for the state DB and made it configurable (see the configuration sketch after this list).
  • Long story short: improved block processing time and consistency, and reduced memory usage.
  • Requires a resync/full prune to take effect, or waiting a few days for the whole DB to compact/rebuild.
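For reference, the resulting state DB table configuration maps to roughly the following knobs, sketched here against RocksDB's C++ API (Nethermind itself drives this through its C# RocksDB bindings, so names and plumbing differ; the helper name is invented, and the 900 MB figure is the state DB cache size mentioned above):

```cpp
#include <rocksdb/cache.h>
#include <rocksdb/options.h>
#include <rocksdb/table.h>

// Hypothetical helper; shows only the final knobs discussed above.
rocksdb::Options MakeStateDbOptions() {
  rocksdb::BlockBasedTableOptions table_options;

  // One shared LRU cache, created before the table factory is built,
  // i.e. the ordering that was silently broken before this PR.
  table_options.block_cache = rocksdb::NewLRUCache(900ull * 1024 * 1024);

  // 4K data blocks for the state DB (the default chosen here; configurable).
  table_options.block_size = 4 * 1024;

  // Two-level (partitioned) index: only the small top-level index stays
  // resident; index partitions are fetched on demand.
  table_options.index_type =
      rocksdb::BlockBasedTableOptions::kTwoLevelIndexSearch;

  // Moves index/filter blocks into the block cache. Per the notes above,
  // this helps a freshly synced DB but can hurt an existing one.
  table_options.cache_index_and_filter_blocks = true;

  rocksdb::Options options;
  options.table_factory.reset(
      rocksdb::NewBlockBasedTableFactory(table_options));
  return options;
}
```

As a rough sanity check on the IO claim: if each state lookup reads one data block, going from 16K to 1K blocks cuts the bytes read per lookup by 16x, which lines up with the observed ~10x bps reduction once the extra index reads are accounted for.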

Testing

Tested by running trace block over the past 2000 blocks in sequence. In-memory pruning is disabled. Single thread only.

6 runs in sequence:

  • Old config
  • Old config with the cache properly set up
  • Old config with CacheIndexAndFilterBlocks and cache(?)
  • 4K block size with the block hash index type (a point lookup optimization that does not seem to do anything). CacheIndexAndFilterBlocks on.
  • 4K block size. CacheIndexAndFilterBlocks on.
  • 512 block size. CacheIndexAndFilterBlocks on.

[Screenshot from 2023-04-21 00-44-00]

  • A lower block size consistently shows lower IO (in bps) and higher throughput (requests per second).
  • IOPS never go above 50k ops except with the 512 block size.
  • Something is limiting my trace block runs to 8 requests per second. I have no idea what.

Changes

  • Explicitly specify the shared block cache.
  • Copy the OptimizeForPointLookup code so that it actually takes effect (see the sketch after this list).
  • Allow specifying a separate block cache per table.
  • Allow skipping the memory hint setting completely.
  • Set the block size for the state DB to 4K.
  • Enable CacheIndexAndFilterBlocks and the two-level index.
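On the OptimizeForPointLookup item: in upstream RocksDB it is a helper that swaps in a point-lookup-friendly table factory. A simplified sketch of what recent RocksDB releases configure, in C++ (exact contents vary by version, so treat this as an approximation, not the PR's code):

```cpp
#include <rocksdb/cache.h>
#include <rocksdb/filter_policy.h>
#include <rocksdb/options.h>
#include <rocksdb/table.h>

// Approximation of rocksdb::ColumnFamilyOptions::OptimizeForPointLookup().
rocksdb::Options MakePointLookupOptions(uint64_t block_cache_size_mb) {
  rocksdb::BlockBasedTableOptions bbto;

  // Hash index inside data blocks: the "block hash index type"
  // tried in one of the testing runs above.
  bbto.data_block_index_type = rocksdb::BlockBasedTableOptions::
      DataBlockIndexType::kDataBlockBinaryAndHash;
  bbto.data_block_hash_table_util_ratio = 0.75;
  bbto.filter_policy.reset(rocksdb::NewBloomFilterPolicy(10));
  bbto.block_cache = rocksdb::NewLRUCache(block_cache_size_mb * 1024 * 1024);

  rocksdb::Options options;
  // Note it replaces any previously configured table factory, which is
  // why calling it at the wrong point can clobber other block-based
  // table options.
  options.table_factory.reset(rocksdb::NewBlockBasedTableFactory(bbto));
  options.memtable_prefix_bloom_size_ratio = 0.02;
  options.memtable_whole_key_filtering = true;
  return options;
}
```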

Types of changes

What types of changes does your code introduce?

  • Cleanup
  • Refactoring
  • Optimization

Testing

Requires testing

  • Yes
  • No

If yes, did you write tests?

  • Yes
  • No

asdacap commented Apr 17, 2023

Turns out that, because InitCache currently happens after BuildOptions, the passed-in _cache is 0, meaning the tables are probably using the default 8 MB block cache per table, which explains the strangely much higher cache hit rate.
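A hypothetical sketch of the ordering problem, in terms of the C++ API (function names invented for illustration; the real code is the C# InitCache/BuildOptions pair):

```cpp
#include <memory>
#include <rocksdb/cache.h>
#include <rocksdb/options.h>
#include <rocksdb/table.h>

std::shared_ptr<rocksdb::Cache> shared_cache;  // not created yet

// Equivalent of BuildOptions(): the table factory snapshots the table
// options at construction time, so it copies a null cache pointer here.
rocksdb::Options BuildOptionsTooEarly() {
  rocksdb::BlockBasedTableOptions table_options;
  table_options.block_cache = shared_cache;  // still null
  rocksdb::Options options;
  options.table_factory.reset(
      rocksdb::NewBlockBasedTableFactory(table_options));
  return options;
}

// Equivalent of InitCache() running afterwards: too late. The factory
// above never sees this cache, and RocksDB falls back to its small
// per-table default (8 MB on older releases, 32 MB on newer ones).
void InitCacheTooLate() {
  shared_cache = rocksdb::NewLRUCache(900ull * 1024 * 1024);
}
```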

Before -> After
[Screenshot from 2023-04-18 00-40-29]

Block processing is much more unstable, though. Going to try separating the state cache from the other caches.

asdacap commented Apr 18, 2023

Confusingly, lower block cache equals faster block processing time?

asdacap commented Apr 18, 2023

This seems to be caused by the block size, which could be too big, so the work of populating the cache is probably bottlenecking the fetch. Reducing the block size to 1024 and then setting up a partitioned index shows 8 to 10 times less IO (in terms of bytes per second) but increases IOPS. It also seems to use less memory, which is nice.
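The partitioned index mentioned here corresponds to roughly these knobs in RocksDB's C++ API (a sketch; metadata_block_size is the upstream option controlling index partition size, and 4096 is its upstream default, an assumption rather than something verified against this PR):

```cpp
#include <rocksdb/table.h>

// Hypothetical helper matching the 1K-block experiment described above.
rocksdb::BlockBasedTableOptions MakePartitionedIndexOptions() {
  rocksdb::BlockBasedTableOptions table_options;
  table_options.block_size = 1024;  // the 1K experiment above
  // Partitioned (two-level) index: partitions load on demand instead of
  // the whole index living in memory.
  table_options.index_type =
      rocksdb::BlockBasedTableOptions::kTwoLevelIndexSearch;
  table_options.metadata_block_size = 4096;  // target size per index partition
  table_options.cache_index_and_filter_blocks = true;  // partitions live in block cache
  return table_options;
}
```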

The graph shows, in order: 7 GB cache with 1K blocks, 1 GB cache with 1K blocks, forward-syncing a backup with 16K blocks, 16K blocks with 8 MB cache, and 16K blocks with 1 GB cache. All runs trace blocks for the past 500 blocks.
[Screenshot from 2023-04-19 03-45-57]

@benaadams mentioned this pull request Apr 18, 2023
benaadams (Member) commented:

Test errors look real:

```
Failed Cache_state_index("archive",False) [128 ms]
Error Message:
Expected propertyValue to be False because ropsten_archive.cfg: IDbConfig.CacheIndexAndFilterBlocks, but found True.

Failed Caches_in_fast_blocks("fast") [1 ms]
Error Message:
Expected propertyValue to be False because ropsten.cfg: IDbConfig.HeadersDbCacheIndexAndFilterBlocks, but found True.

Failed Cache_state_index("^archive",False) [145 ms]
Error Message:
Expected propertyValue to be False because ropsten.cfg: IDbConfig.CacheIndexAndFilterBlocks, but found True.
```

asdacap commented Apr 19, 2023

Yes, I'm turning that on together with the two-level index type. It seems to significantly reduce memory, as advertised. It does not seem to reduce memory with the standard index, though.

asdacap commented Apr 19, 2023

Actually, it does reduce memory, but it massively reduces performance, likely due to the very low block cache, which was not configured correctly.

@asdacap changed the title from Cleanup/explicit shared rocksdb cache to Fix/Rocksdb cache not set on Apr 20, 2023
@asdacap marked this pull request as ready for review April 20, 2023 20:52
@benaadams (Member) left a comment:

There is a merge conflict.

@asdacap merged commit f87fe3f into master Apr 30, 2023
@asdacap deleted the cleanup/explicit-global-rocksdb-cache branch April 30, 2023 22:44
asdacap commented Apr 30, 2023

For reference: 80% data cache hit rate, improved processing time, but not by much. Significantly improved p50 for state gets. Significantly lower IO and IOPS.

(after, before)
[Screenshot from 2023-05-01 06-49-48]

asdacap commented May 1, 2023

I made a mistake: the above result is with the caching store disabled. Not sure about the results with the caching store enabled.
