fix(validator): remove hardcoded 10s timeout for previous block syncing #20462

Merged
spalladino merged 1 commit into merge-train/spartan from palla/fix-hardcoded-timeout on Feb 12, 2026

Conversation

@spalladino
Contributor

The validator now times out syncing the previous block for checkpoint validation at the beginning of the next slot, by which point the checkpoint has been either dropped or checkpointed, instead of after a hardcoded 10 seconds.
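
A minimal sketch of how a sync deadline can be tied to the start of the next slot instead of a fixed constant. This is not the code from this PR: `SlotTiming`, `nextSlotStartMs`, and `syncTimeoutMs` are hypothetical names, and the genesis-plus-slot-duration timing model is an assumption made for illustration.

```typescript
// Hypothetical sketch: none of these names come from the actual validator code.
interface SlotTiming {
  genesisTimeMs: number; // assumed timestamp of the start of slot 0, in ms
  slotDurationMs: number; // assumed fixed length of one slot, in ms
}

/** Returns the timestamp (ms) at which the slot after `slot` begins. */
function nextSlotStartMs(slot: number, timing: SlotTiming): number {
  return timing.genesisTimeMs + (slot + 1) * timing.slotDurationMs;
}

/**
 * Time budget (ms) for syncing the previous block while validating a
 * checkpoint proposed in `slot`. The budget ends when the next slot starts
 * and is clamped to zero if that moment has already passed.
 */
function syncTimeoutMs(slot: number, nowMs: number, timing: SlotTiming): number {
  return Math.max(0, nextSlotStartMs(slot, timing) - nowMs);
}
```

Under these assumptions the wait scales with the configured slot length rather than with a constant that may be too short or too long for a given network.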

@spalladino spalladino enabled auto-merge (squash) February 12, 2026 22:00
@AztecBot
Collaborator

Flakey Tests

🤖 says: This CI run detected 1 tests that failed, but were tolerated due to a .test_patterns.yml entry.

FLAKED (http://ci.aztec-labs.com/e3d9750f176969ef): yarn-project/end-to-end/scripts/run_test.sh simple src/e2e_epochs/epochs_mbps.parallel.test.ts "builds multiple blocks per slot with L2 to L1 messages" (185s) (code: 0) group:e2e-p2p-epoch-flakes

@spalladino spalladino merged commit f689858 into merge-train/spartan Feb 12, 2026
11 of 12 checks passed
@spalladino spalladino deleted the palla/fix-hardcoded-timeout branch February 12, 2026 22:22
github-merge-queue bot pushed a commit that referenced this pull request Feb 13, 2026
BEGIN_COMMIT_OVERRIDE
chore(spartan): bots sync to the proposed chain in mbps net (#20432)
fix(grind-test): pass NAME_POSTFIX (#20437)
chore(e2e): replace link to slack conversation with summary (#20438)
feat: add cross-chain message bot (#20408)
chore(bot): fix tsconfig (#20464)
fix(prover): store simulation duration as ms (#20460)
fix(validator): remove hardcoded 10s timeout for previous block syncing
(#20462)
END_COMMIT_OVERRIDE
ludamad added a commit that referenced this pull request Feb 23, 2026
Slide 19 (§4 insights · PR correlation): two-column layout showing which
PRs caused each weekly flake spike and which fixes produced each recovery:

Spikes:
- W02 (2,647 flakes): Santiago refactors #19532/#19509/#19564 exposed
  timing races across p2p/epoch simultaneously
- W04 (935 flakes): PhilWindle #19982 added cross-chain mbps tests
  without pre-deflaking — valid_epoch_pruned_slash 0→346 events
- W06 (850 flakes): three high-risk PRs merged same day (#20047 peer
  scoring, #20241 max checkpoints→32, #20257 hash constants)

Fixes:
- W03 recovery: Santiago #19914 — checkpointed chain tip for PXE
  (root fix; PXE was using latest not checkpointed block)
- W05 recovery: Santiago #20088 slasher multi-block fix + #20140
  discv5 deflake + GCP step-down (−6 testbed namespaces)
- W07 improvement: Santiago #20351 mbps fix (p2p_client 311→0),
  #20462 remove hardcoded 10s timeout, ludamad #20613 CI parallelism

Also: correct three factual errors spotted during full review —
- Summary: next P50 is growing (+10% in 3 weeks), not stable
- Flake trend W07 note: e2e-p2p-epoch-flakes dropped 373×, not just
  "251 flakes lowest since December"
- Gaps slide: replaced stale "ci_phases broken" card with GCP egress
  costs gap (bc→awk fix is deployed; egress attribution is the gap now)

3 participants