[Merged by Bors] - v1.1.6 Fork Choice changes #2822

realbigsean · 2021-11-18T19:19:39Z

Issue Addressed

Resolves: #2741
Includes: #2853 so that we can get ssz static tests passing here on v1.1.6. If we want to merge that first, we can make this diff slightly smaller

Proposed Changes

- Changes the `justified_epoch` and `finalized_epoch` in the `ProtoArrayNode` each to an `Option<Checkpoint>`. The `Option` is necessary only for the migration, so it's not ideal, but it does allow us to default these fields to `None` during the database migration.
- Adds a database migration from a legacy fork choice struct to the new one, searching for all necessary block roots in fork choice by iterating through blocks in the db.
- Updates related to "always atomically update justified and finalized" (ethereum/consensus-specs#2727):
  - We will have to update the persisted forkchoice to make sure the justified checkpoint stored is correct according to the updated fork choice logic. This boils down to setting the forkchoice store's justified checkpoint to the justified checkpoint of the block that advanced the finalized checkpoint to the current one.
  - AFAICT there are no migration steps necessary for the update to allow applying attestations from prior blocks, but I would appreciate confirmation on that.
- I updated the consensus spec tests to v1.1.6 here, but they will fail until we also implement the proposer score boost updates. I confirmed that the previously failing scenario `new_finalized_slot_is_justified_checkpoint_ancestor` will now pass after the boost updates, but haven't confirmed all tests will pass because I just quickly stubbed out the proposer boost test scenario formatting.
- This PR now also includes proposer boosting: "Proposer LMD Score Boosting" (ethereum/consensus-specs#2730).
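The field change and migration described above can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: `LegacyProtoNode`, `ProtoNode`, `Checkpoint`, `Hash256` and `migrate_node` are simplified, hypothetical stand-ins, and the epoch-to-root map is assumed to have been built beforehand by iterating through blocks in the db.

```rust
use std::collections::HashMap;

/// Hypothetical 32-byte root type standing in for the real hash type.
pub type Hash256 = [u8; 32];

/// Simplified checkpoint: an epoch plus the block root at that epoch.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub struct Checkpoint {
    pub epoch: u64,
    pub root: Hash256,
}

/// Assumed legacy layout: only the epochs were stored on each node.
#[derive(Debug, Clone)]
pub struct LegacyProtoNode {
    pub root: Hash256,
    pub justified_epoch: u64,
    pub finalized_epoch: u64,
}

/// Assumed new layout: full checkpoints, wrapped in `Option` so the
/// migration has a default (`None`) when a root cannot be found.
#[derive(Debug, Clone, PartialEq, Eq)]
pub struct ProtoNode {
    pub root: Hash256,
    pub justified_checkpoint: Option<Checkpoint>,
    pub finalized_checkpoint: Option<Checkpoint>,
}

/// Convert a legacy node, looking up the root for each epoch in a map
/// that was (by assumption) collected from blocks in the database.
pub fn migrate_node(
    legacy: &LegacyProtoNode,
    roots_by_epoch: &HashMap<u64, Hash256>,
) -> ProtoNode {
    // `None` when no root is known for the epoch.
    let to_checkpoint = |epoch: u64| {
        roots_by_epoch
            .get(&epoch)
            .map(|root| Checkpoint { epoch, root: *root })
    };
    ProtoNode {
        root: legacy.root,
        justified_checkpoint: to_checkpoint(legacy.justified_epoch),
        finalized_checkpoint: to_checkpoint(legacy.finalized_epoch),
    }
}
```

In this sketch a node whose epoch has no known root simply keeps `None`, which is the only reason the fields need to be optional.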

Additional Info

I realized checking justified and finalized roots in fork choice makes it more likely that we trigger this bug: ethereum/consensus-specs#2727

It's possible that the combination of justified checkpoint and finalized checkpoint in the forkchoice store differs from that of every block in fork choice. So when trying to start up, our store's justified checkpoint seems invalid to the rest of fork choice (but it should be valid). When this happens we get an `InvalidBestNode` error and fail to start up. So I'm including that bugfix in this branch.
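To make the failure mode concrete, here is a small sketch of a viability check that compares full checkpoints. It is a simplification under stated assumptions: `Node`, `Store`, `node_is_viable_for_head` and `any_viable_head` are illustrative names, and the real check in `proto_array.rs` may include further conditions.

```rust
/// Simplified checkpoint: an epoch plus a 32-byte block root.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub struct Checkpoint {
    pub epoch: u64,
    pub root: [u8; 32],
}

/// Hypothetical fork choice node carrying the checkpoints of its block.
pub struct Node {
    pub justified: Option<Checkpoint>,
    pub finalized: Option<Checkpoint>,
}

/// Hypothetical fork choice store holding the current checkpoints.
pub struct Store {
    pub justified: Checkpoint,
    pub finalized: Checkpoint,
}

/// A node is treated as viable only if BOTH of its checkpoints match the store.
pub fn node_is_viable_for_head(node: &Node, store: &Store) -> bool {
    match (node.justified, node.finalized) {
        (Some(j), Some(f)) => j == store.justified && f == store.finalized,
        // Nodes without checkpoints are never viable in this sketch.
        _ => false,
    }
}

/// If this returns false, head selection has nothing valid to pick,
/// which corresponds to the startup failure described above.
pub fn any_viable_head(nodes: &[Node], store: &Store) -> bool {
    nodes.iter().any(|n| node_is_viable_for_head(n, store))
}
```

If the store pairs the justified checkpoint of one block with the finalized checkpoint of another, no node matches both, so no head is viable even though each checkpoint is individually valid.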

Todo:

- [x] Fix fork choice tests
- [x] Self review
- [x] Add fix for always atomically update justified and finalized ethereum/consensus-specs#2727
- [x] Rebase onto Kintsugi
- [x] Fix `num_active_validators` calculation as @michaelsproul pointed out
- [x] Clean up db migrations

realbigsean · 2021-12-01T22:08:01Z

Still failing an EF test. This implementation differs from the spec a bit in how this issue is handled: ethereum/consensus-specs#2757

So marking this as blocked for now

paulhauner · 2021-12-02T05:58:07Z

GitHub closed this automatically, not me. Sorry!

realbigsean · 2021-12-13T15:52:43Z

I really liked the use of superstruct and derivative to make things cleaner.

Props to @michaelsproul for this!

realbigsean · 2021-12-13T16:41:23Z

Ok all comments addressed! Diff here: https://github.com/sigp/lighthouse/compare/e266fe1539760faaaefbdcb8ddcd4f392207ec77..9d4692f6c

michaelsproul

Merge time!

bors r+

paulhauner

🚀

bors · 2021-12-13T22:51:38Z

Pull request successfully merged into unstable.

Build succeeded:

## Proposed Changes

With proposer boosting implemented (#2822) we have an opportunity to re-org out late blocks.

This PR adds three flags to the BN to control this behaviour:

* `--disable-proposer-reorgs`: turn aggressive re-orging off (it's on by default).
* `--proposer-reorg-threshold N`: attempt to orphan blocks with less than N% of the committee vote. If this parameter isn't set then N defaults to 20% when the feature is enabled.
* `--proposer-reorg-epochs-since-finalization N`: only attempt to re-org late blocks when the number of epochs since finalization is less than or equal to N. The default is 2 epochs, meaning re-orgs will only be attempted when the chain is finalizing optimally.

For safety Lighthouse will only attempt a re-org under very specific conditions:

1. The block being proposed is 1 slot after the canonical head, and the canonical head is 1 slot after its parent. i.e. at slot `n + 1` rather than building on the block from slot `n` we build on the block from slot `n - 1`.
2. The current canonical head received less than N% of the committee vote. N should be set depending on the proposer boost fraction itself, the fraction of the network that is believed to be applying it, and the size of the largest entity that could be hoarding votes.
3. The current canonical head arrived after the attestation deadline from our perspective. This condition was only added to support suppression of forkchoiceUpdated messages, but makes intuitive sense.
4. The block is being proposed in the first 2 seconds of the slot. This gives it time to propagate and receive the proposer boost.

## Additional Info

For the initial idea and background, see: ethereum/consensus-specs#2353 (comment)

There is also a specification for this feature here: ethereum/consensus-specs#3034

Co-authored-by: Michael Sproul <micsproul@gmail.com>
Co-authored-by: pawan <pawandhananjay@gmail.com>
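The re-org conditions and flag defaults listed above can be read as a single predicate. The following is a rough sketch only: `ReorgConfig`, `ReorgContext` and `should_attempt_reorg` are hypothetical names, weights are plain integers, and the real implementation may compute these inputs quite differently.

```rust
/// Hypothetical mirror of the three BN flags and their stated defaults.
#[derive(Debug, Clone, Copy)]
pub struct ReorgConfig {
    /// `false` corresponds to `--disable-proposer-reorgs`.
    pub enabled: bool,
    /// `--proposer-reorg-threshold N` (percent of the committee vote).
    pub threshold_percent: u64,
    /// `--proposer-reorg-epochs-since-finalization N`.
    pub max_epochs_since_finalization: u64,
}

impl Default for ReorgConfig {
    fn default() -> Self {
        Self { enabled: true, threshold_percent: 20, max_epochs_since_finalization: 2 }
    }
}

/// Assumed inputs describing the proposal and the current canonical head.
#[derive(Debug, Clone, Copy)]
pub struct ReorgContext {
    pub proposal_slot: u64,
    pub head_slot: u64,
    pub parent_slot: u64,
    pub head_weight: u64,
    pub committee_weight: u64,
    pub head_arrived_late: bool,
    pub ms_into_slot: u64,
    pub epochs_since_finalization: u64,
}

pub fn should_attempt_reorg(cfg: &ReorgConfig, ctx: &ReorgContext) -> bool {
    // Condition 1: proposing at n + 1, head at n, head's parent at n - 1.
    let single_slot = ctx.proposal_slot == ctx.head_slot + 1
        && ctx.head_slot == ctx.parent_slot + 1;
    // Condition 2: head received strictly less than N% of the committee vote.
    let head_is_weak =
        ctx.head_weight * 100 < ctx.committee_weight * cfg.threshold_percent;
    // Condition 4: proposal happens within the first 2 seconds of the slot.
    let early_enough = ctx.ms_into_slot < 2_000;
    // Flag: only while the chain is finalizing recently enough.
    let finalizing =
        ctx.epochs_since_finalization <= cfg.max_epochs_since_finalization;

    cfg.enabled
        && single_slot
        && head_is_weak
        && ctx.head_arrived_late // Condition 3
        && early_enough
        && finalizing
}
```

Every condition must hold at once, so any single failure (for example a head with exactly 20% of the vote under the default threshold) means the proposer builds on the canonical head as usual.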

## Issue Addressed

NA

## Proposed Changes

- Implements ethereum/consensus-specs#3290
- Bumps `ef-tests` to [v1.3.0-rc.4](https://github.com/ethereum/consensus-spec-tests/releases/tag/v1.3.0-rc.4).

The `CountRealizedFull` concept has been removed and the `--count-unrealized-full` and `--count-unrealized` BN flags now do nothing but log a `WARN` when used.

## Database Migration Debt

This PR removes the `best_justified_checkpoint` from fork choice. This field is persisted on-disk and the correct way to go about this would be to make a DB migration to remove the field. However, in this PR I've simply stubbed out the value with a junk value.

I've taken this approach because if we're going to do a DB migration I'd love to remove the `Option`s around the justified and finalized checkpoints on `ProtoNode` whilst we're at it. Those options were added in #2822 which was included in Lighthouse v2.1.0. The options were only put there to handle the migration and they've been set to `Some` ever since v2.1.0. There's no reason to keep them as options anymore.

I started adding the DB migration to this branch but I started to feel like I was bloating this rather critical PR with nice-to-haves. I've kept the partially-complete migration [over in my repo](https://github.com/paulhauner/lighthouse/tree/fc-pr-18-migration) so we can pick it up after this PR is merged.
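As an illustration of the deferred cleanup, a later migration that drops the `Option`s could look something like the sketch below. Everything here is hypothetical (`NodeWithOptions`, `NodeWithoutOptions`, `MigrationError`, `strip_options`); it only shows the shape of such a conversion, not the partially-complete migration linked above.

```rust
/// Simplified checkpoint: an epoch plus a 32-byte block root.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub struct Checkpoint {
    pub epoch: u64,
    pub root: [u8; 32],
}

/// Assumed current on-disk shape: checkpoints are still optional.
#[derive(Debug, Clone)]
pub struct NodeWithOptions {
    pub justified_checkpoint: Option<Checkpoint>,
    pub finalized_checkpoint: Option<Checkpoint>,
}

/// Assumed target shape: checkpoints are always present.
#[derive(Debug, Clone, PartialEq, Eq)]
pub struct NodeWithoutOptions {
    pub justified_checkpoint: Checkpoint,
    pub finalized_checkpoint: Checkpoint,
}

/// Hypothetical error for nodes that unexpectedly still hold `None`.
#[derive(Debug, PartialEq, Eq)]
pub enum MigrationError {
    MissingJustifiedCheckpoint,
    MissingFinalizedCheckpoint,
}

/// Unwrap both checkpoints, failing loudly rather than inventing a value.
pub fn strip_options(node: &NodeWithOptions) -> Result<NodeWithoutOptions, MigrationError> {
    Ok(NodeWithoutOptions {
        justified_checkpoint: node
            .justified_checkpoint
            .ok_or(MigrationError::MissingJustifiedCheckpoint)?,
        finalized_checkpoint: node
            .finalized_checkpoint
            .ok_or(MigrationError::MissingFinalizedCheckpoint)?,
    })
}
```

Returning an error on `None` is a design choice for the sketch: if the fields really have been `Some` since v2.1.0, the error path should never be hit, and hitting it would indicate an unexpectedly old database.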

realbigsean added the work-in-progress label Nov 18, 2021

realbigsean force-pushed the justified-checkpoint-root branch from 1707807 to 227462d November 29, 2021 18:33

realbigsean changed the base branch from unstable to kintsugi November 29, 2021 18:37

realbigsean added the ready-for-review label and removed the work-in-progress label Nov 29, 2021

paulhauner mentioned this pull request Nov 30, 2021

[Merged by Bors] - Retrospective invalidation of exec. payloads for opt. sync #2837

Closed

1 task

michaelsproul mentioned this pull request Nov 30, 2021

Proposer boosting in fork choice #2838

Closed

realbigsean added the blocked label and removed the ready-for-review label Dec 1, 2021

realbigsean changed the title ~~Justified checkpoint root~~ v1.1.6 Spec changes Dec 1, 2021

michaelsproul reviewed Dec 2, 2021

consensus/proto_array/src/proto_array.rs (outdated, resolved)

paulhauner force-pushed the kintsugi branch from 2d8650b to f3c237c December 2, 2021 03:32

paulhauner deleted the branch sigp:unstable December 2, 2021 05:51

paulhauner closed this Dec 2, 2021

paulhauner reopened this Dec 2, 2021

paulhauner changed the base branch from kintsugi to unstable December 2, 2021 05:58

realbigsean added 12 commits December 2, 2021 09:51

move Legacy structs to schema_change.rs

198aab7

update justified epoch to checkpoint in fork choice

7cc1009

Initial migration implementation

0257f8d

cleanup

bb69528

More cleanup, some bug fixes

ff37f62

update/refactor to include adding the finalized root to the fork choice

665cb6c

make sure node_mutator is applied to the finalized block

1b0c1a0

Don't update common ancestors

b229cd3

Correctly find heads

8bf19e1

add some comments, use correct index in node_mutator

02f702c

small cleanup

2b85351

Update comments

fa52b65

michaelsproul added the waiting-on-author label and removed the ready-for-merge label Dec 13, 2021

realbigsean and others added 6 commits December 13, 2021 10:54

Update beacon_node/beacon_chain/src/schema_change/migration_schema_v7.rs

267c989

Co-authored-by: Paul Hauner <paul@paulhauner.com>

Update beacon_node/beacon_chain/src/schema_change/migration_schema_v7.rs

fbe6ff9

Co-authored-by: Paul Hauner <paul@paulhauner.com>

Update consensus/proto_array/src/proto_array.rs

0f314ff

Co-authored-by: Paul Hauner <paul@paulhauner.com>

- Use max_by_key in update_store_justified_checkpoint

3700ad6

- Use a `HasSet` rather than a `Vec` in `map_relevant_epochs_to_roots`

Merge branch 'justified-checkpoint-root' of https://github.com/realbigsean/lighthouse into justified-checkpoint-root

9361952

# Conflicts: # beacon_node/beacon_chain/src/schema_change/migration_schema_v7.rs

comment update

9d4692f

realbigsean added the ready-for-review label and removed the waiting-on-author label Dec 13, 2021

michaelsproul approved these changes Dec 13, 2021

michaelsproul added the ready-for-merge label and removed the ready-for-review label Dec 13, 2021

paulhauner approved these changes Dec 13, 2021

bors bot changed the title ~~v1.1.6 Fork Choice changes~~ [Merged by Bors] - v1.1.6 Fork Choice changes Dec 13, 2021

bors bot closed this Dec 13, 2021

paulhauner mentioned this pull request Feb 13, 2023

[Merged by Bors] - Fork choice modifications and cleanup #3962

Closed

paulhauner mentioned this pull request Apr 26, 2023

DB migration for fork choice cleanup #4233

Closed

realbigsean deleted the justified-checkpoint-root branch November 21, 2023 16:15
