Purge offline validators #344
Comments
@aeyakovenko, can you recommend an entrypoint for this code?
We should see a RequestUpdate from nodes in the network.
@aeyakovenko, right now validators pick random peers to RequestUpdate from, so should we just set the timeout long enough that the probability of any two live peers having connected is very high? Alternatively, we could set a timeout period after which a validator sends an explicit heartbeat request to a node it suspects is offline.
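To put a rough number on "sufficiently long", here is a minimal sketch. It assumes (this is not stated in the thread) that each live node picks one peer uniformly at random from the other `n - 1` nodes every gossip round, so a live node stays silent towards a given validator for `k` rounds with probability `(1 - 1/(n - 1))^k`. The function name and parameters are hypothetical.

```rust
/// Smallest number of gossip rounds `k` such that a *live* peer has
/// failed to contact us for all `k` rounds with probability <= `eps`.
///
/// Assumption (not from the source): every round, each node selects one
/// of the other `n - 1` nodes uniformly at random.
fn rounds_for_false_positive(n: usize, eps: f64) -> Option<u64> {
    // Need at least two nodes and a probability strictly inside (0, 1).
    if n < 2 || !(eps > 0.0 && eps < 1.0) {
        return None;
    }
    // With a single peer there is only one choice, so one round suffices.
    if n == 2 {
        return Some(1);
    }
    // Probability that a given live peer does not pick us in one round.
    let miss = 1.0 - 1.0 / (n as f64 - 1.0);
    // Solve miss^k <= eps for k and round up to a whole round.
    Some((eps.ln() / miss.ln()).ceil() as u64)
}
```

Under that assumption the required wait grows roughly linearly with the network size, which is one argument for the explicit heartbeat request as a fallback on larger networks.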
@carllin initially we can set the timeout long enough, and eventually we can start gossiping availability stats between nodes and aggregate those values. The latter is tricky because we can’t trust rumors from adversarial nodes on the network.
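The simple timeout approach could be sketched as follows. This is only an illustration, not the project's actual code: `PubKey`, `PeerTable`, `record_update`, and `purge_offline` are hypothetical names, and "last seen" here means the last time any RequestUpdate arrived from that peer.

```rust
use std::collections::HashMap;
use std::time::{Duration, Instant};

/// Hypothetical stand-in for a validator's public key.
type PubKey = [u8; 32];

/// Tracks when each known peer was last heard from.
struct PeerTable {
    last_seen: HashMap<PubKey, Instant>,
    timeout: Duration,
}

impl PeerTable {
    fn new(timeout: Duration) -> Self {
        PeerTable { last_seen: HashMap::new(), timeout }
    }

    /// Record that a RequestUpdate from `peer` arrived at time `now`.
    fn record_update(&mut self, peer: PubKey, now: Instant) {
        self.last_seen.insert(peer, now);
    }

    /// Drop every peer that has been silent for longer than `timeout`
    /// and return the keys that were purged.
    fn purge_offline(&mut self, now: Instant) -> Vec<PubKey> {
        let timeout = self.timeout;
        let mut purged = Vec::new();
        self.last_seen.retain(|key, seen| {
            let alive = now.saturating_duration_since(*seen) <= timeout;
            if !alive {
                purged.push(*key);
            }
            alive
        });
        purged
    }
}
```

Passing `now` in explicitly, instead of calling `Instant::now()` inside the methods, keeps the purge logic deterministic and easy to test. An entry that never sends a message is removed once the timeout elapses after it was recorded.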
* Needed for #341. Create a dummy entry with public key 0..., but with a valid gossip address that we can ask for updates. This will allow validators to discover the full network by knowing only a single node's gossip address, without knowing anything else about its identity.
* Once we start removing dead validators, this entry should get purged, since we will never see a message from public key 0 (#344).
…olana-labs#344) Enable accountsdb_scan_account_storage_no_bank tests for hot storage.
Avalanche will split segments between all validators. If only a few validators are offline, erasure codes can be used to fill the gaps, but if too many are offline, the remaining validators will need to request the missing segments, which is very slow. Nodes should actively identify offline nodes and purge them.