RPC call duration skyrocketed after upgrade to v2.7.2-stable #11503

lacasian · 2020-02-19T08:37:24Z

Parity Ethereum version: v2.7.2
Operating system: Linux
Installation: dockerhub
Fully synchronized: yes
Network: ethereum mainnet
Restarted: yes

Configuration:

  - --auto-update=none
  - --mode=active
  - --tracing=on
  - --pruning=archive
  - --db-compaction=ssd
  - --scale-verifiers
  - --num-verifiers=6
  - --jsonrpc-server-threads=5
  - --jsonrpc-threads=5
  - --cache-size=8000

After we upgraded from v2.5.13-stable to v2.7.2-stable, we've seen a huge spike in RPC call execution durations.

Here's a screenshot from our monitoring:

dvdplm · 2020-02-19T08:58:23Z

Thank you for the report, this looks worrying. Roughly how many RPC calls per second are you making here?

lacasian · 2020-02-19T09:06:53Z

@dvdplm It looks very worrying, indeed.

The RPC calls we're executing can be seen in the screenshot I posted. We're executing those for every new block. In the case of eth_getTransactionReceipt we're doing a batch call for all transactions included in the block. Also worth mentioning that those calls are executed in sequential order, not in parallel.

That being said + considering the avg durations for the calls after the upgrade, I'd say it's way less than 1 RPC per second.

lacasian · 2020-02-19T14:53:49Z

Small update here: it looks like it went back to normal after a while. Will continue to monitor it and see if the problems come back.

ordian · 2020-02-19T15:22:16Z

@kwix we have changed a few rocksdb parameters from 2.5 to 2.7, it could be a background rocksdb process doing a compaction, could you show us your rocksdb log file?
~/.local/share/io.parity.ethereum/chains/ethereum/db/<your account>/overlayrecent/db/LOG

lacasian · 2020-02-20T10:25:24Z

Hey @ordian !

This path ......./overlayrecent/db/LOG does not exist on my machine.
24h later, the RPC duration is still normal. Will close the issue for now.

ordian · 2020-02-20T10:34:48Z

@kwix sorry, I meant s/overlayrecent/archive (depends on your --pruning flag)
~/.local/share/io.parity.ethereum/chains/ethereum/db/<your account>/archive/db/LOG

lacasian · 2020-02-20T10:46:55Z

Here's the log you asked for:
LOG.log

From what I can see, it starts today but the event happened 2 days ago.

Thanks for looking into this!

ordian · 2020-02-20T10:57:56Z

Right, have you restarted the node today?

you can see a bunch of compaction_finished events in the log, my suspicion is that due to rocksdb cache settings change in 2.7 it triggered the whole db compaction/reorganization on disk, so all RPC requests were blocked on I/O. My recommendation is to add some monitoring for I/O (if you don't have it already).

If you encounter this issue again, please let us know.

valer-cara · 2020-02-20T13:00:31Z

Indeed the step-up in latency is correlated with a compaction at start time.

We zoomed out a little and noticed that there are very similar load increases correlated with container restarts in the past as well; so I think we can safely assume there's no correlation with this upgrade :)

Thanks! Closing this one 🙌

richardpringle mentioned this issue Feb 19, 2020

2.7.2 sync blocks extremely slow #11494

Open

dvdplm mentioned this issue Feb 19, 2020

pruning archive node cannot breach more than 2.0blk/s super slow 2.7.2 #11502

Closed

lacasian closed this as completed Feb 20, 2020

ordian mentioned this issue Feb 25, 2020

2.6.8 → 2.7.2: Stange disk usage pattern, allocating and freeing ~20GB in 48 hours #11516

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RPC call duration skyrocketed after upgrade to v2.7.2-stable #11503

RPC call duration skyrocketed after upgrade to v2.7.2-stable #11503

lacasian commented Feb 19, 2020

dvdplm commented Feb 19, 2020

lacasian commented Feb 19, 2020

lacasian commented Feb 19, 2020

ordian commented Feb 19, 2020

lacasian commented Feb 20, 2020

ordian commented Feb 20, 2020

lacasian commented Feb 20, 2020

ordian commented Feb 20, 2020

valer-cara commented Feb 20, 2020

RPC call duration skyrocketed after upgrade to v2.7.2-stable #11503

RPC call duration skyrocketed after upgrade to v2.7.2-stable #11503

Comments

lacasian commented Feb 19, 2020

dvdplm commented Feb 19, 2020

lacasian commented Feb 19, 2020

lacasian commented Feb 19, 2020

ordian commented Feb 19, 2020

lacasian commented Feb 20, 2020

ordian commented Feb 20, 2020

lacasian commented Feb 20, 2020

ordian commented Feb 20, 2020

valer-cara commented Feb 20, 2020