Skip to content

test: flag e2e_ha_full as flake under HA pipelining#23541

Merged
PhilWindle merged 2 commits into
merge-train/spartanfrom
cb/d394ef6145e7
May 24, 2026
Merged

test: flag e2e_ha_full as flake under HA pipelining#23541
PhilWindle merged 2 commits into
merge-train/spartanfrom
cb/d394ef6145e7

Conversation

@AztecBot
Copy link
Copy Markdown
Collaborator

Dequeued from merge-train/spartan again: http://ci.aztec-labs.com/136431da99834194.

The HA full suite keeps failing under proposer pipelining with shifting symptoms. In this run the dashboard log shows recurring validator:proposal-handler Timed out waiting for block with archive matching checkpoint proposal warnings (slot 98, 115, …) and an Error building checkpoint at slot 127: already proposed block for slot 127 index 0 on HA-4 — i.e. the 5 HA peers race on the same proposal. The bundled #23539 (parallel peer teardown) and #23524 (afterAll hook timeout) entries did not catch this run because jest's per-test summary was not reached within the dashboard log capture.

This PR adds a broad regex-only entry under .test_patterns.yml to flag any failure of yarn-project/end-to-end/scripts/run_test.sh ha src/composed/ha/e2e_ha_full.test.ts as a flake. Owner: @PaLLa, matching the existing pipelining-flavoured entries for this suite.

The intent is to unblock the merge queue while the HA pipelining stabilisation work continues; narrow the regex (or add a real fix) once the failure modes settle down.


Created by claudebox · group: slackbot

http://ci.aztec-labs.com/136431da99834194 dequeued PR #23344 again with
the HA full suite failing under proposer pipelining. The visible
failure signals in the dashboard log are recurring
"Timed out waiting for block with archive matching checkpoint proposal"
warnings across slots and an
"Error building checkpoint at slot 127: already proposed block for slot
127 index 0" on HA-4, so multiple peers race on the same proposal under
pipelining. The existing per-assertion / afterAll-hook flake entries
for this suite did not match because jest's per-test banner is not
reached within the dashboard log capture.

Add a broad regex-only entry to flag any failure of
yarn-project/end-to-end/scripts/run_test.sh ha src/composed/ha/e2e_ha_full.test.ts
as a flake until the HA pipelining work stabilises.
@AztecBot AztecBot added the claudebox Owned by claudebox. it can push to this PR. label May 24, 2026
@PhilWindle PhilWindle marked this pull request as ready for review May 24, 2026 14:21
@PhilWindle PhilWindle enabled auto-merge (squash) May 24, 2026 14:21
@PhilWindle PhilWindle merged commit 482ca36 into merge-train/spartan May 24, 2026
22 checks passed
@PhilWindle PhilWindle deleted the cb/d394ef6145e7 branch May 24, 2026 14:26
PhilWindle pushed a commit that referenced this pull request May 24, 2026
## Summary

PR #23344 was dequeued again at
https://github.com/AztecProtocol/aztec-packages/actions/runs/26364977301.
The flake-flag added in #23541 wasn't enough — ci3 retries once and the
HA full suite fails both attempts. Multiple distinct failure modes have
been hit recently:

- `✕ should coordinate governance voting across HA nodes`
- `✕ should distribute work across multiple HA nodes`
- afterAll teardown hangs → `Exceeded timeout of [0-9]+ ms for a hook`
- 5-peer races on checkpoint proposals (`Timed out waiting for block
with archive matching checkpoint proposal`, `already proposed block for
slot N index 0`)

This change collapses the four overlapping `e2e_ha_full` entries in
`.test_patterns.yml` into a single `skip: true` entry so the suite is
outright skipped on `merge-train/spartan` until HA pipelining
stabilises.

## Notes

- Targets `merge-train/spartan` per yarn-project default base.
- Adjacent `src/e2e_slashing/attested_invalid_proposal.test.ts` entry
left untouched.
- Owner kept as @spyros (current owner of the suite-level entry).

Requested in Slack by @PhilWindle.

---
*Created by
[claudebox](https://claudebox.work/v2/sessions/a745321353cb58ff) ·
group: `slackbot`*
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

claudebox Owned by claudebox. it can push to this PR.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants