-
Notifications
You must be signed in to change notification settings - Fork 795
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Parachain block skipped by validators on Kusama #950
Comments
I think this is approval voting, as the parachain block can only be backed in the relay chain block that builds on the same relay chain block as the parachain block. So, I think this is probably networking related, cc @eskimor |
@crystalin can you describe the columns of your images? |
|
We should notify all node operators to perform certain performance upgrades on their nodes, including network upgrades. Then come to see if the problem is solved. |
Yes, scaling up hardware/network should definitely help. Longer term we obviously want to improve performance & get contextual execution in.
More availability distribution. Yes could be networking issues or heavy load, in case of the latter distribution taking longer is a good thing as it prevents more severe consequences (blocks not getting on chain at all for example). That being said, we do see networking problems on Kusama and it is quite likely that those are indeed related to this issue. They are still being investigated. |
@eskimor and @bkchr , I think for this case it might not be directly related to the load. I can enable some logs on our validators if you tell me which one would be interesting to track |
Can you check the following metrics on your internal network:
in comparison to:
That would be interesting, also how many nodes are running on that internal network? |
There are cases were a full validator group rotation (2 minutes) didn't include any block:
The block 529200 was quite big, PoV was ranging from 500kb to 1000kb for that block (different collators):
@eskimor I'll provide that information quickly |
Does this issue still exist? |
It is , and is also related to paritytech/substrate#10359 (and others) |
This ticket is to start thinking of improving the current situation on Kusama.
It happens from time to time that a parachain block is not included in the relaychain for many consecutive blocks.
On average, it happens of 1 minutes to have 1 slot missed (the block is ingested in the relay 18s after instead of the normal 12s).
Also, around once every hour, it misses many slots, taking around 1 minutes to ingest 1 block.
Description:
moonriver Block 1 [ 2%][Ext: 3(Eth: 4)(Z: ...)][Pool: 5(+ 6)(Z ...)][ 7s](hash: 8) by 9
I believe this is due to the validators not doing a good job including those blocks.
For information, all those blocks were produced by a different collator each time.
I checked the PoV and those blocks have around ~300kb (reported by PoV log line, so I think it is uncompressed)
It happens more when the blocks are containing many transaction, but the execution time is never over 300ms on decent hardware.
One possible reason could be that the validators are not fast enough to validate the parachain block but with the 300ms time to execute (and it usually takes less time to validate), I kind of doubt it
The text was updated successfully, but these errors were encountered: