A new more complete test format for ETH 2.0 testing #877

protolambda · 2019-04-03T01:38:13Z

The format

See the specs/test_formats/README.md file in this PR. It includes a new testing glossary, configuration considerations, and the general format.

Test type formats will be documented in markdown files in the specs/test_formats/ folder.

Upgrade path:

1. First, we just need consensus on the general format.
2. Update the CI/pyspec PR (Combine specs and test-generators #851); it includes a base test generator to make life easy.
3. Upgrade the existing test generators, documenting them in the process.
4. Introduce new test types.

Test Types

The following test-types are proposed:

Existing types
- BLS
- SSZ
- Shuffling
Next up
- general state transition, (pre state, [block...]) -> post state
- transactions (operations), one handler per type (pre state, tx) -> post state
- epoch sub-transitions (not the ones that just process a batch)
To be looked into
- fork choice tests
- "deltas": test computed changes in balances
- suggestions welcome
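
To make the proposed "(pre state, [block...]) -> post state" shape concrete, here is a minimal sketch of how a runner might check one such case. `State`, `Block`, and `toy_transition` are hypothetical stand-ins invented for illustration; they are not the spec's BeaconState, BeaconBlock, or state transition function.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass(frozen=True)
class State:
    """Toy stand-in for a beacon state (assumption, not the spec type)."""
    slot: int
    balance: int


@dataclass(frozen=True)
class Block:
    """Toy stand-in for a block (assumption, not the spec type)."""
    slot: int
    reward: int


def toy_transition(state: State, block: Block) -> State:
    """Hypothetical transition: move to the block's slot and credit its reward."""
    if block.slot <= state.slot:
        raise ValueError("block slot must be later than the state slot")
    return State(slot=block.slot, balance=state.balance + block.reward)


def run_state_transition_case(
    pre: State,
    blocks: List[Block],
    post: State,
    transition: Callable[[State, Block], State] = toy_transition,
) -> bool:
    """Apply every block to the pre state in order and compare with the expected post state."""
    state = pre
    for block in blocks:
        state = transition(state, block)
    return state == post
```

A client would pass its own transition function as `transition`; the comparison against `post` is the only part a test format has to standardize.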

How it affects existing test types

BLS

To be updated to move the "sub types" to separate test suites, each with their own handler. I.e. it would look like:

├── bls
│   ├── signing
│   │   └── sign_msg.yml
│   ├── verify_single
│   │   ├── invalid.yml
│   │   └── valid.yml
│   ├── verify_multiple
│   │   ├── invalid.yml
│   │   └── valid.yml
│   ├── aggregate_sigs
│   │   ├── single.yml
│   │   ├── mixed_invalid.yml
│   │   └── multiple.yml
│   ├── aggregate_pubkeys
│   │   ├── single.yml
│   │   ├── two.yml
│   │   ├── multiple.yml
│   │   └── duplicate.yml
│   └── ...
...
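
A rough sketch of how a test runner could walk a layout like this one, assuming suites are stored as `<test type>/<handler>/<suite>.yml`. The function name `discover_suites` is made up for this example and is not part of the PR.

```python
from pathlib import Path
from typing import Iterator, Tuple


def discover_suites(root: Path) -> Iterator[Tuple[str, str, Path]]:
    """Yield (test_type, handler, suite_file) for every <type>/<handler>/<suite>.yml under root."""
    for suite_file in sorted(root.glob("*/*/*.yml")):
        handler_dir = suite_file.parent
        # The folder names double as the type and handler identifiers.
        yield handler_dir.parent.name, handler_dir.name, suite_file
```

Because the handler is derived from the folder name, a client can skip handlers it does not implement yet without opening any suite file.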

SSZ

The SSZ tests are compatible as is, and the base generator is based on the design of this generator.
It should be easy to port over to the new base generator.

Shuffling

Although the current testing is good, it sort of mixes validator activation with the shuffling algorithm.
We may decide to change it to just test shuffling based on a list of indices later down the road.
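
If shuffling tests were reduced to a plain list of indices plus a seed, a case check could look roughly like the sketch below. `stand_in_shuffle` is a placeholder built on Python's `random` module purely so the example runs; it is NOT the spec's shuffling algorithm, and a client would pass its own implementation instead.

```python
import random
from typing import Callable, List


def stand_in_shuffle(indices: List[int], seed: bytes) -> List[int]:
    """Placeholder shuffle (assumption): deterministic per seed, not the spec algorithm."""
    shuffled = list(indices)
    random.Random(seed).shuffle(shuffled)
    return shuffled


def check_shuffling_case(
    indices: List[int],
    seed: bytes,
    shuffle: Callable[[List[int], bytes], List[int]] = stand_in_shuffle,
) -> bool:
    """Check that the output is a permutation of the input and is reproducible for the same seed."""
    first = shuffle(indices, seed)
    second = shuffle(indices, seed)
    return sorted(first) == sorted(indices) and first == second
```

A real test vector would additionally pin the exact expected output for each seed; the checks here only cover properties that hold for any correct shuffle.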

protolambda · 2019-04-03T01:38:59Z

Work in progress, suggestions welcome. Aiming at fast-paced progress to get the new state transition tests in.

hwwhww

From your tweet:

With this, together with a "fork timeline" format, we can have a standardized way of initializing testing contexts, and test networks. 👀 Close to the standardized genesis format ETH 1.0 never got to have

Side note: eth 1.0 does have WIP standardized genesis format: ethereum/EIPs#1085

"Fork-timeline" looks like the chain configuration on the client side. Take the eth1 clients as an example, since they already have forks: geth, trinity, harmony. In eth2, it looks like we can define it in a list of Fork objects.

I wonder what the use case is for making forks/chain configurable in tests. It looks like setting up a short-lived testnet and running a script of given operations, but the thing we want to test could also be covered by normal slot-to-slot state transition tests, which might be more deterministic.

protolambda · 2019-04-05T08:48:12Z

@hwwhww

Side note: eth 1.0 does have WIP standardized genesis format

Yes, the timeline is very similar to the "params" section of the genesis format proposed for eth 1. But with slots, because we can :)

The "genesis" and "accounts" sections are not so compatible with eth 2: genesis of eth 2 is based on contract events on some eth-1-like chain. And I assume this is the case for a testnet as well. Either based on a connection with a real net, or just some raw contract log event data to inject in an eth 2 genesis setup.
We're not quite there yet with cross-client testnets, so I prefer to just focus on the forks timeline, and ensure the standard has a way of telling tests what fork to use when running e.g. a state-transition test with some arbitrary slot number in it.
Also, since tests are just based on input data, and there's no need for genesis data in most tests, maybe we should just keep it lean. We can already define test-nets with just a combination of a fork-timeline and a pointer to a deposit contract on a testnet or something.

protolambda · 2019-04-05T11:31:03Z

In eth2, it looks like we can define it in a list of Fork objects

@hwwhww Interesting, but stating both previous and current version for every entry in the list seems very verbose. Like spelling out a raw linked list. It does bring up an interesting point however: do we want forks based on epoch number (less flexible, current Fork format) or slot number (very exact, old Fork format)

hwwhww · 2019-04-05T12:26:28Z

The "genesis" and "accounts" sections are not so compatible with eth 2: genesis of eth 2 is based on contract events on some eth-1-like chain. And I assume this is the case for a testnet as well. Either based on a connection with a real net, or just some raw contract log event data to inject in an eth 2 genesis setup.

Yes, agreed that eth2 genesis format will be quite different from eth1. Just FYI. 🙂

a way of telling tests what fork to use when running e.g. a state-transition test with some arbitrary slot number in it.

For the state transition test, my gut tells me that one fork setting (i.e., specifying which fork this state transition happens under) is good enough for most cases. But not sure if there's any case of multi-fork timeline requirement?

but stating both previous and current version for every entry in the list seems very verbose

Right, but state transition tests probably only need one Fork object!

do we want forks based on epoch number (less flexible, current Fork format) or slot number (very exact, old Fork format)

I'd say based on current phase 0 design, it makes more sense to do with epoch only - unless we do another new fundamental change in the future... 😢

Overall, there are two different angles on test format design so far:

- Generic: as in this PR, one general format for all cases
- Categories: the current test formats, with different formats for different tests (e.g., bls, shuffling, ssz)

I do like this solution more! 👍

djrtwo · 2019-04-05T21:50:28Z

I'd say based on current phase 0 design, it makes more sense to do with epoch only - unless we do another new fundamental change in the future...

Agreed -- epoch number.

Also agree that using the Fork data structure to specify a chain is overkill. We just maintain current and prev version in the BeaconState to ensure we can process signatures from right before and after the fork. The fork timeline list will naturally fit into the Fork construct as needed.
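
As a sketch of how a timeline could be folded into the Fork construct on demand: the `Fork` fields below mirror the previous version, current version, and epoch mentioned in this thread, while the `(epoch, version)` timeline layout and the name `fork_for_epoch` are assumptions made for this example.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Fork:
    previous_version: bytes
    current_version: bytes
    epoch: int  # epoch at which current_version became active


def fork_for_epoch(timeline: List[Tuple[int, bytes]], epoch: int) -> Fork:
    """Derive the Fork that applies at `epoch` from (activation_epoch, version) pairs."""
    ordered = sorted(timeline)
    if not ordered or epoch < ordered[0][0]:
        raise ValueError("no fork is active at this epoch")
    # Before any later fork activates, previous and current are both the first version.
    previous = current = ordered[0]
    for entry in ordered[1:]:
        if entry[0] > epoch:
            break
        previous, current = current, entry
    return Fork(previous[1], current[1], current[0])
```

Only the two versions around the most recent fork are kept, which is all that is needed to process signatures from right before and right after it.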

But not sure if there's any case of multi-fork timeline requirement?

Might be useful to test a fork in rapid succession (kind of like the constantinople and petersburg forks).

protolambda · 2019-04-07T02:09:18Z

Changed fork timeline definition to use epoch numbers
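
For illustration, with forks keyed by epoch, choosing the fork for a test that carries an arbitrary slot number could work like the sketch below. The fork names, the activation epochs, and the `SLOTS_PER_EPOCH` value are assumptions picked for the example, not values taken from this PR.

```python
from typing import Dict

SLOTS_PER_EPOCH = 64  # assumed value, for illustration only

# Hypothetical timeline: fork name -> activation epoch.
FORK_TIMELINE: Dict[str, int] = {"phase0": 0, "example_fork": 100}


def fork_at_slot(slot: int, timeline: Dict[str, int] = FORK_TIMELINE) -> str:
    """Return the name of the latest fork whose activation epoch is at or before the slot's epoch."""
    epoch = slot // SLOTS_PER_EPOCH
    active = [(start, name) for name, start in timeline.items() if start <= epoch]
    if not active:
        raise ValueError("no fork is active at this slot")
    return max(active)[1]
```

A fork can only activate on an epoch boundary in this scheme, so every slot inside one epoch resolves to the same fork.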

hwwhww

LGTM :)

djrtwo

a couple of nitpicks, then ready to merge

A new more complete test format for ETH 2.0 testing

8006772

Add note on configuration of constants

96ab5a3

protolambda mentioned this pull request Apr 3, 2019

Combine specs and test-generators #851

Merged

djrtwo approved these changes Apr 3, 2019

djrtwo and others added 5 commits April 3, 2019 14:12

Update specs/test_formats/README.md

54eba8c

Co-Authored-By: protolambda <proto@protolambda.com>

Update specs/test_formats/README.md

04b9ce8

Co-Authored-By: protolambda <proto@protolambda.com>

Update specs/test_formats/README.md

5790af7

Co-Authored-By: protolambda <proto@protolambda.com>

Update specs/test_formats/README.md

55d21c1

Co-Authored-By: protolambda <proto@protolambda.com>

more explicit about relations between generator, runner, type, handler

9fe9a00

djrtwo requested a review from hwwhww April 3, 2019 03:39

ethereum deleted a comment Apr 4, 2019

hwwhww reviewed Apr 4, 2019

hwwhww and others added 2 commits April 5, 2019 19:24

Update specs/test_formats/README.md

13fc498

Co-Authored-By: protolambda <proto@protolambda.com>

remove confusing note

4bf20a1

consistent naming of network types

09cecca

JustinDrake added the scope:CI/tests/pyspec label Apr 6, 2019

forks are based on epoch numbers, as per spec

1c81638

include example configs and fork timelines, with format spec

c5ab543

hwwhww approved these changes Apr 7, 2019

djrtwo approved these changes Apr 7, 2019

protolambda added 2 commits April 7, 2019 16:17

include minimal testing constants from previous pytests

c5d2696

update comment, fix net naming

117e157

djrtwo merged commit 2baa242 into dev Apr 7, 2019

djrtwo deleted the sydney-test-format branch April 7, 2019 06:23
