ci: ab test firewood in reexecution benchmarks #4297

Elvis339 · 2025-09-18T16:30:44Z

Why this should be merged

This patch introduces a custom GitHub Actions workflow that enables automated performance testing and comparison of two arbitrary versions of Firewood running C-Chain re-execution benchmarks on our self-hosted runners. This capability is essential for:

Performance regression detection: Establishes a baseline comparison capability. Statistical analysis for more rigorous regression identification will come in a follow-up PR.
Release validation: Compares candidate releases against stable baselines before deployment

How this works

The workflow implements an A/B testing approach with the following architecture:

Runs baseline and candidate versions simultaneously on identical hardware
Flexible Version Support:
- Tagged releases (uses pre-built binaries when available for faster setup)
- Commit hashes (builds from source for development testing)
- Compatible AvalancheGo version pairing for each Firewood version
Automated Comparison: Extracts performance metrics (mgas/s) and calculates percentage changes
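
To make the comparison step concrete, here is a minimal sketch of the percentage-change calculation on two mgas/s readings. It is not the workflow's actual script: the function names are hypothetical, and it assumes each run yields a single throughput number.

```python
def percent_change(baseline_mgas: float, candidate_mgas: float) -> float:
    """Relative change of candidate vs. baseline throughput, in percent.

    Positive means the candidate processed more mgas/s than the baseline.
    """
    if baseline_mgas <= 0:
        raise ValueError("baseline throughput must be positive")
    return (candidate_mgas - baseline_mgas) * 100.0 / baseline_mgas


def summarize(baseline_mgas: float, candidate_mgas: float) -> str:
    """One-line report suitable for a job summary (format is illustrative)."""
    change = percent_change(baseline_mgas, candidate_mgas)
    return (
        f"baseline={baseline_mgas:.2f} mgas/s "
        f"candidate={candidate_mgas:.2f} mgas/s "
        f"change={change:+.2f}%"
    )
```

A single pair of readings says nothing about run-to-run noise, which is why the statistical analysis is deferred to the follow-up PR.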

Technical Implementation

The workflow exposes the following configuration options:

Runner Selection
- Choose optimal hardware for consistent benchmarking
Version Pairing Strategy
- The workflow requires both a Firewood and an AvalancheGo version for each test case, so that each pair satisfies the compatibility matrix (AvalancheGo <-depends-on-> Coreth -> Firewood)
Baseline Pair: Known stable versions for comparison reference
- firewood-baseline-version: Stable Firewood release (e.g., ffi/v0.0.12) or commit hash
- avalanchego-baseline-version: Compatible AvalancheGo version (defaults to master)
Candidate Pair: Versions under test
- firewood-candidate-version: New Firewood version to evaluate
- avalanchego-candidate-version: Corresponding AvalancheGo version
Benchmark Scope Control
- Block Range: Configurable test boundaries (start-block to end-block)
- Default Range: Blocks 101-200 provide a quick ~100-block validation
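
The inputs above can be modeled roughly as follows. This is an illustrative sketch, not the workflow's code: the class names, the regular expressions used to tell a tag from a commit hash, and the `master` default for the candidate AvalancheGo version are all assumptions (the description only states that default for the baseline).

```python
import re
from dataclasses import dataclass

# Assumed shapes: a tag like "ffi/v0.0.12" (optional prefix, then vX.Y.Z),
# or a 7-40 character lowercase hex commit hash.
_TAG_RE = re.compile(r"(?:[\w.-]+/)?v\d+\.\d+\.\d+")
_HASH_RE = re.compile(r"[0-9a-f]{7,40}")


def classify_ref(ref: str) -> str:
    """Return "tag" (pre-built binary may exist) or "commit" (build from source)."""
    if _TAG_RE.fullmatch(ref):
        return "tag"
    if _HASH_RE.fullmatch(ref):
        return "commit"
    raise ValueError(f"unrecognized Firewood ref: {ref!r}")


@dataclass(frozen=True)
class VersionPair:
    firewood: str
    avalanchego: str = "master"  # stated default for the baseline; assumed for the candidate


@dataclass(frozen=True)
class ABConfig:
    baseline: VersionPair
    candidate: VersionPair
    start_block: int = 101
    end_block: int = 200

    def __post_init__(self) -> None:
        # Reject an empty or inverted block range before any benchmark starts.
        if self.end_block < self.start_block:
            raise ValueError("end-block must be >= start-block")

    @property
    def block_count(self) -> int:
        # Counts both endpoints; whether the real workflow does is an assumption.
        return self.end_block - self.start_block + 1
```

For example, `classify_ref("ffi/v0.0.12")` yields `"tag"`, while a short hash such as `c8ad840` yields `"commit"`.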

How this was tested

Ran in CI

Need to be documented in RELEASES.md?

No. This is an internal development tool that doesn't affect end-user functionality or require release notes.

Compares performance between two Firewood versions by running C-Chain re-execution benchmarks. Supports both tagged releases (uses pre-built binaries when available) and source builds. Each version runs with its specified AvalancheGo commit for compatibility.

Elvis339 added 2 commits September 18, 2025 20:11

ci: temp disable other CI jobs and run only c-chain-reexecution-firewood 0.0.8 vs 0.0.12

c8ad840

github-project-automation bot added this to avalanchego Sep 18, 2025

Elvis339 changed the title ~~C chain reexecution firewood ab test~~ ci: ab test firewood in reexecution benchmarks Sep 18, 2025

Elvis339 added 18 commits September 18, 2025 20:48

ci: temp fix failing jobs

bdf2c1e

ci: temp fix install nix after avgo checkout in v0.0.8

54c1117

ci: temp fix install sys dependencies for building 0.0.8 firewood from source

fb2f78b

ci: temp fix 0.0.8 remove path from checkout

4eacf98

ci: temp fix 0.0.8 remove working directory

eebb32c

ci: temp fix 0.0.8 debug

54ddb2a

ci: temp fix 0.0.8 reorder checkout

d4d1906

ci: temp fix 0.0.8 copy target

d0f73df

ci: temp fix 0.0.8 resolve git issues

7b3ceef

ci: temp fix 0.0.12 resolve git issues

bf50489

ci: temp fix 0.0.12 dont pin ref

de29c38

ci: temp fix 0.0.12 git stash

21d3ebf

chore(build_firewood): remove unused script

4bfb6fe

ci(setup-firewood): add new composite action to setup firewood

829b233

ci(firewood-ab-test)

b58e7ce

ci: temp update ref for 0.0.8 Firewood

641d614

ci

54b11c4

ci

d238bfa

Elvis339 self-assigned this Sep 19, 2025

Elvis339 added 7 commits September 22, 2025 18:01

Merge branch 'master' into c-chain-reexecution-firewood-ab-test

ac6d6fa

ci(c-chain-reexecution-benchamrk): add prometheus-url as required input

869a45d

ci

b9b7b87

ci: temp add prometheus-* vars

0b4f8b5

ci: temp add loki-* vars

120eb23

ci: remove loki-* vars and use different runner for v0.0.12

f0827b4

ci: tmp. disable comparison

786a06d

Elvis339 added 7 commits September 22, 2025 20:31

ci: tmp. isolate cache for v0.0.12

2b01334

ci: revet cache clearing

93f074c

ci(c-chain-reexecution): use nix shell for setting task env

10ff7f0

clear go cache

f2a0089

clear go cache

393b28e

clear go cache after nix installation

8bf557f

export run env

15ef329

maru-ava reviewed Sep 22, 2025

.github/actions/c-chain-reexecution-benchmark/action.yml (outdated)

maru-ava reviewed Sep 22, 2025

.github/actions/run-monitored-tmpnet-cmd/action.yml (outdated)

Elvis339 added 14 commits September 23, 2025 00:38

Merge branch 'master' into c-chain-reexecution-firewood-ab-test

1810016

merge master

0ebaafb

ci: use Firewood compatible current-state-dir-src

680e44e

ci: add --cap-add=SYS_ADMIN to container allowing Firewood to use `io_uring`

e0c46be

ci: add --priviledged flag

9e4a31e

ci: temp

5acd1db

ci(firewood-ab): bump rust version to 1.89

ff53e9c

ci(firewood-ab): bump rust version to 1.89

c9d0f76

ci(firewood-ab): bump rust version to 1.89

da931b8

ci(firewood-ab)

ccfcba8

ci: verify Firewood setup

d6ed035

ci: update current-state-dir-src

7a65c62

ci: debug

c666012

ci: debug

732b0a2

joshua-kim moved this to Ready 🚦 in avalanchego Sep 29, 2025
