Skip to content

refactor(e2e): remove SnapshotManager from e2e tests#19509

Merged
spalladino merged 1 commit intonextfrom
palla/remove-snapshot-manager-2
Jan 13, 2026
Merged

refactor(e2e): remove SnapshotManager from e2e tests#19509
spalladino merged 1 commit intonextfrom
palla/remove-snapshot-manager-2

Conversation

@spalladino
Copy link
Contributor

@spalladino spalladino commented Jan 12, 2026

Summary

  • Remove SnapshotManager class and its snapshot/restore infrastructure from e2e tests
  • Tests now call setupFromFresh directly instead of going through the snapshot manager abstraction
  • Rename applyFooSnapshot methods to applyFoo (remove Snapshot suffix)

Changes

  • Delete ISnapshotManager, SnapshotManager, MockSnapshotManager classes
  • Delete createSnapshotManager factory and setupFromState function
  • Keep setupFromFresh, teardown, deployAccounts, publicDeployAccounts as free functions
  • Update all 9 test base classes and their associated test files to use direct setup pattern

Next steps

  • Unify the lingering setupFromFresh with the existing setup method
  • Create a base EndToEnd class
  • Categorize and reorganize tests based on environment (single-sequencer with auto-mine, single-sequencer with prover with real time, multi-sequencer with mock p2p, multi-sequencer with real p2p)

🤖 Generated with Claude Code

Part of A-219

@spalladino spalladino added the ci-no-fail-fast Sets NO_FAIL_FAST in the CI so the run is not aborted on the first failure label Jan 12, 2026
@spalladino spalladino force-pushed the palla/remove-snapshot-manager-2 branch 5 times, most recently from 9ae1df8 to a96c518 Compare January 12, 2026 21:41
@spalladino spalladino changed the title refactor: Remove SnapshotManager from e2e tests refactor(e2e): remove SnapshotManager from e2e tests Jan 12, 2026
@AztecBot
Copy link
Collaborator

AztecBot commented Jan 12, 2026

Flakey Tests

🤖 says: This CI run detected 2 tests that failed, but were tolerated due to a .test_patterns.yml entry.

\033FLAKED\033 (8;;http://ci.aztec-labs.com/dbc1d0e3e50d10bf�dbc1d0e3e50d10bf8;;�): yarn-project/end-to-end/scripts/run_test.sh web3signer src/composed/web3signer/e2e_multi_validator_node_key_store.test.ts (37s) (code: 1) (\033Santiago Palladino\033: refactor(e2e): remove SnapshotManager from e2e tests (#19509))
\033FLAKED\033 (8;;http://ci.aztec-labs.com/3f4adb8383f3c5c5�3f4adb8383f3c5c58;;�):  yarn-project/end-to-end/scripts/run_test.sh simple src/e2e_p2p/gossip_network.test.ts (448s) (code: 1) group:e2e-p2p-epoch-flakes (\033Santiago Palladino\033: refactor(e2e): remove SnapshotManager from e2e tests (#19509))

@spalladino spalladino added this pull request to the merge queue Jan 13, 2026
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks Jan 13, 2026
@spalladino
Copy link
Contributor Author

Prover full is failing. Will review tomorrow.

@spalladino spalladino force-pushed the palla/remove-snapshot-manager-2 branch from a96c518 to 3c7ccaf Compare January 13, 2026 13:18
@spalladino spalladino added this pull request to the merge queue Jan 13, 2026
Merged via the queue into next with commit a708814 Jan 13, 2026
16 checks passed
@spalladino spalladino deleted the palla/remove-snapshot-manager-2 branch January 13, 2026 14:44
ludamad added a commit that referenced this pull request Feb 23, 2026
Slide 19 (§4 insights · PR correlation): two-column layout showing which
PRs caused each weekly flake spike and which fixes produced each recovery:

Spikes:
- W02 (2,647 flakes): Santiago refactors #19532/#19509/#19564 exposed
  timing races across p2p/epoch simultaneously
- W04 (935 flakes): PhilWindle #19982 added cross-chain mbps tests
  without pre-deflaking — valid_epoch_pruned_slash 0→346 events
- W06 (850 flakes): three high-risk PRs merged same day (#20047 peer
  scoring, #20241 max checkpoints→32, #20257 hash constants)

Fixes:
- W03 recovery: Santiago #19914 — checkpointed chain tip for PXE
  (root fix; PXE was using latest not checkpointed block)
- W05 recovery: Santiago #20088 slasher multi-block fix + #20140
  discv5 deflake + GCP step-down (−6 testbed namespaces)
- W07 improvement: Santiago #20351 mbps fix (p2p_client 311→0),
  #20462 remove hardcoded 10s timeout, ludamad #20613 CI parallelism

Also: correct three factual errors spotted during full review —
- Summary: next P50 is growing (+10% in 3 weeks), not stable
- Flake trend W07 note: e2e-p2p-epoch-flakes dropped 373×, not just
  "251 flakes lowest since December"
- Gaps slide: replaced stale "ci_phases broken" card with GCP egress
  costs gap (bc→awk fix is deployed; egress attribution is the gap now)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ci-no-fail-fast Sets NO_FAIL_FAST in the CI so the run is not aborted on the first failure

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants