Node gets banned during reorgs if mining #4799

sdbondi · 2022-10-12T10:55:54Z

WARN Could not provide requested block d54cd80ab95951fbc8cc605935ee11f9d8f9bdb2cd24ca319643a2bc16198241 to peer because not stored

This results in my node being banned (one fewer connection each time): we announced a block hash but could not provide the block later because of a reorg (a race condition). Full block requests happen a lot during reorgs because we check `header.prev_hash != *current_meta.best_block()` and, if that is true, request the full block immediately.

The reason for the ban is to prevent spamming of block hashes in the previous scheme.

I think we should:

Validate the header in reconcile block
if valid, we know PoW was done, so continue. If we then request the block and the peer no longer has it, don't ban; just discard the message
if invalid, ban the peer that sent it
Problem: A peer could send a non-tip header it mined that is not part of the chain. I think that we'd have to deal with this heuristically.
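
A minimal sketch of the decision proposed above, not the actual base node code. `Header`, `header_is_valid`, `PeerAction` and `decide` are hypothetical names, and the difficulty comparison is only a stand-in for real header validation:

```rust
/// What to do with a peer after we asked it for a full block.
#[derive(Debug, PartialEq, Eq)]
pub enum PeerAction {
    /// Header was valid and the block arrived: process it.
    ProcessBlock,
    /// Header was valid (PoW was done) but the peer no longer has the block,
    /// most likely because it reorged: drop the message, no ban.
    DiscardNoBan,
    /// Header failed validation: ban the sender.
    Ban,
}

/// Hypothetical, simplified header with only the fields this sketch needs.
pub struct Header {
    pub achieved_difficulty: u64,
    pub target_difficulty: u64,
}

/// Stand-in for real header validation (an assumption for illustration).
pub fn header_is_valid(header: &Header) -> bool {
    header.achieved_difficulty >= header.target_difficulty
}

/// Validate the header first; only an invalid header leads to a ban.
pub fn decide(header: &Header, block_received: bool) -> PeerAction {
    if !header_is_valid(header) {
        return PeerAction::Ban;
    }
    if block_received {
        PeerAction::ProcessBlock
    } else {
        PeerAction::DiscardNoBan
    }
}
```

This does not address the non-tip header problem noted above; a peer with a valid but off-chain header would still fall into the no-ban branch.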


stringhandler · 2022-10-12T10:58:11Z

Maybe a short ban?

sdbondi · 2022-10-13T08:35:19Z

Good point, we ban for 100s only so that is ok. I'm still concerned about network splits during reorgs and that this is a fairly common case even though there are no malicious nodes. I was mining at a faster rate, and won a few blocks fairly quickly. This resulted in almost 50% of my connections being lost. Perhaps that is ok since it is unusual to produce many blocks (~4) that quickly (within 30-40s).

There are a few options that I can think of (no particular order):

We ~~don't~~ put reorged blocks back into the orphan pool. This allows a sender to provide a reorged-out block and will reduce requests for previously reorged-out blocks. We did this previously but took it out I think?
We could accept the bans as is, and the risk that it could create more network splits during reorg situations (closes this issue).
Validate the header; if it is valid and within, say, 5 blocks of our tip (the "reorg zone"), we don't ban the peer if they cannot provide the full block.
We process one block at a time, so there could be a few nodes ahead in the queue, and this could take a few seconds to resolve. By that time the sending node has reorged to the better chain. We could perhaps ignore "stale" newblock messages.
We could ignore non-tip blocks and rely on sync for reorgs (potentially disruptive).
Keep track of recent block misses, and only ban the peer if it happens frequently (overly complex)
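
To make the "reorg zone" option concrete, here is a rough sketch under stated assumptions: the function names are hypothetical, the zone width of 5 is just the figure floated above, and I am assuming the zone is measured as the absolute height distance from our tip.

```rust
/// Assumed width of the "reorg zone" in blocks (5 is only the suggested figure).
pub const REORG_ZONE: u64 = 5;

/// True if the header's height is within REORG_ZONE blocks of our tip,
/// in either direction.
pub fn in_reorg_zone(tip_height: u64, header_height: u64) -> bool {
    tip_height.abs_diff(header_height) <= REORG_ZONE
}

/// Decide whether a peer that could not provide a full block should be banned.
/// A valid header inside the reorg zone is treated as an honest reorg race.
pub fn should_ban_for_missing_block(
    header_valid: bool,
    tip_height: u64,
    header_height: u64,
) -> bool {
    !(header_valid && in_reorg_zone(tip_height, header_height))
}
```

With a tip at height 100, a valid header at height 97 would not lead to a ban, while one at height 90 (or any invalid header) still would.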

I guess we could try selfish mining attacks to see if we can force significant network splits.

stringhandler · 2022-11-01T08:59:29Z

I would say:

Keep the block in the reorg pool. When a sync peer asks for it, return it so that you can prove that you have it. I initially thought that you might want to say "here it is, but it's not my longest chain", but I suppose the other node can work that out anyway.

stringhandler · 2022-11-01T09:00:12Z

IMPACT: This could cause a weird hard fork, so let's bump the priority

sdbondi · 2022-11-21T05:26:39Z

This might be related to the short bans for this case: I started merged mining with a sudden high hash rate and saw a sudden drop in peer connections.

Description
---
Remove the `FetchBlocksByHash` handler. It was only called from a single place and, although designed to handle multiple blocks, it only ever sent a single hash at once, making the multi-block functionality useless. Instead, use the existing `GetBlockByHash` handler and expand it to accept a new `orphans` flag. Passing this flag means we'll accept found blocks from the orphan pool.

Motivation and Context
---
Previously, if a node had re-orged after a sync had started, it might fail to provide the complete block for a block it claimed to have. This results in a brief ban. Make it also return blocks from the orphan pool and let the peer figure out what to do with it.

How Has This Been Tested?
---
Tests, and running nodes.

Fixes: #4799
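
As an illustration of the idea only (this is not the code from the PR), a lookup with an `orphans` flag could look roughly like the sketch below. `BlockStore`, `Block`, `insert_main` and `reorg_out` are hypothetical stand-ins; only the `orphans` flag and the `GetBlockByHash` name come from the description.

```rust
use std::collections::HashMap;

pub type BlockHash = [u8; 32];

/// Hypothetical, minimal block type for illustration.
#[derive(Debug, Clone, PartialEq, Eq)]
pub struct Block {
    pub height: u64,
}

/// Hypothetical in-memory store with a main chain and an orphan pool.
#[derive(Default)]
pub struct BlockStore {
    main_chain: HashMap<BlockHash, Block>,
    orphan_pool: HashMap<BlockHash, Block>,
}

impl BlockStore {
    pub fn insert_main(&mut self, hash: BlockHash, block: Block) {
        self.main_chain.insert(hash, block);
    }

    /// Move a block from the main chain into the orphan pool,
    /// as would happen when it is reorged out.
    pub fn reorg_out(&mut self, hash: &BlockHash) {
        if let Some(block) = self.main_chain.remove(hash) {
            self.orphan_pool.insert(*hash, block);
        }
    }

    /// Look up a block by hash. With `orphans` set, fall back to the orphan
    /// pool so a reorged-out block can still be served to the requester.
    pub fn get_block_by_hash(&self, hash: &BlockHash, orphans: bool) -> Option<&Block> {
        match self.main_chain.get(hash) {
            Some(block) => Some(block),
            None if orphans => self.orphan_pool.get(hash),
            None => None,
        }
    }
}
```

The main chain is checked first, so the flag only changes the outcome for blocks that have already been reorged out.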

stringhandler added this to the Stagenet Freeze milestone Nov 1, 2022

stringhandler added the C-bug (Category - fixes a bug, typically associated with an issue) and A-base_node (Area - The Tari base node executable and libraries) labels Nov 1, 2022

stringhandler added the S-high-severity (Severity - High) label Nov 1, 2022

stringhandler moved this to Must Do in Tari Esme Testnet Nov 15, 2022

stringhandler moved this from Must Do to Selected for development in Tari Esme Testnet Nov 15, 2022

brianp self-assigned this Nov 15, 2022

stringhandler moved this from Selected for development to In Progress in Tari Esme Testnet Nov 16, 2022

brianp mentioned this issue Nov 24, 2022

fix: node gets banned on reorg #4949

Merged

stringhandler closed this as completed in #4949 Nov 28, 2022

Repository owner moved this from In Progress to Done in Tari Esme Testnet Nov 28, 2022
