feat(dispatcher): human-in-the-loop and review-driven resume flow by thejustinwalsh · Pull Request #90 · thejustinwalsh/middle

thejustinwalsh · 2026-05-24T04:00:43Z

Summary

Closes #32

Gives the implementation workflow a park → external-signal → resume spine so an agent can hand control back to a human (asked a question) or a reviewer (PR-ready), and later resume — as a fresh session in the same worktree, branch, and PR — with the answer or the review threads in context. APPROVED (or a clean re-review) ends the loop; a never-satisfied review loop is bounded to a configurable round cap (default 5), after which it parks for a human. middle never merges — the human does.

The Epic's 4 sub-issues were the phases; all land on this one branch / PR.

What changed

packages/dispatcher/src/workflows/implementation.ts — the park/resume spine: a single top-level waitFor, an outcome branch (park vs terminal), and resume-or-finalize that interprets the fired verdict and either finalizes or re-enqueues a continuation execution that reuses the prior round's worktree, re-primed per reason.
packages/adapters/claude/src/classify.ts — classifyStop surfaces the .middle/blocked.json question/context for the asked-question park.
packages/dispatcher/src/poller.ts + poller-gateway.ts + poller-cron.ts — the GitHub poller: fires epic-<n>-answered on a new human reply and epic-<n>-review-resolved on a review transition (CHANGES_REQUESTED/label → resume; APPROVED or 0-actionable re-review → resolved). Idempotent + rate-limit resilient.
packages/dispatcher/src/db/migrations/002_waitfor_fired.sql — fired_at for poller idempotency.
packages/dispatcher/src/main.ts / dispatch.ts — poller + enqueueContinuation seams wired.
packages/skills/implementing-github-issues/SKILL.md (+ synced mirrors) — new Phase 11 — Addressing review feedback procedure the resume brief points at.

Why these changes

The installed bunqueue@2.7.12 Workflow DSL silently drops a waitFor nested inside a branch .path() or loop body (filters to steps only), and the executor has no goto/loop-back. So the spec's idealized nested park/resume graph isn't expressible: park/resume must be a single top-level waitFor, and multi-round looping must be re-enqueue (which the spec itself annotates // loop back via re-enqueue). That one constraint drives the whole shape — each park/resume cycle is one execution; a continuation is a fresh execution carrying resume = { reason, round, worktree, payload } that reuses the worktree (same branch/PR) and re-drives from a rewritten .middle/prompt.md. The full rationale chain is in the 8 inline review comments and planning/issues/32/decisions.md.

Status

Phase 1 (Integrate waitFor signals into the implementation workflow #33): waitFor signal spine in the implementation workflow
Phase 2 (Detect the blocked.json question sentinel in classifyStop #34): blocked.json sentinel detection in classifyStop
Phase 3 (Build the GitHub poller for human replies and PR review state #35): GitHub poller for human replies and PR review state
Phase 4 (Implement agent resume for answered-question and review-changes #36): agent resume for answered-question and review-changes

Acceptance evidence (Epic #32)

Criterion	Status	Evidence
All sub-issues closed	✅ Met	#33, #34, #35, #36 — all closed
Agent writes `blocked.json` → shown "asked question", session ends to free the slot, workflow parks on a wait signal; a human reply resumes it as a fresh session with the answer in context	✅ Met	classifyStop → `asked-question` (adapter.test.ts); park arms `epic-<n>-answered` + ends session (implementation-workflow.test.ts "asked-question park → answer → resume (e2e)"); poller fires on a new non-bot reply (poller.test.ts); resume injects the reply into `.middle/prompt.md` (same e2e test asserts the brief contains the reply body + author)
At PR-ready the workflow parks on a review-resolved signal; a `CHANGES_REQUESTED` review (CodeRabbit, human, or `changes-requested` label) resumes the agent with the PR review threads in context — addresses them, pushes, replies in-thread, re-requests review	✅ Met	done → parks on `epic-<n>-review-resolved` + poller `classifyReviewOutcome` (poller.test.ts); resume writes the "address review" brief that instructs the agent to pull the PR threads via `gh` and follow the procedure (implementation-workflow.test.ts "done park → review-changes → resume (e2e)" asserts the brief); the per-round procedure (batch → resolve class-wide → internal review loop → push once → reply in-thread → re-request → re-park) is codified in SKILL.md Phase 11
`APPROVED` ends the loop (terminal — middle never merges); a never-satisfied review loop is bounded to 5 rounds, after which it parks in `waiting-human`	✅ Met	`APPROVED` → terminal `completed`, no further round (implementation-workflow.test.ts review e2e); round counter increments per `CHANGES_REQUESTED` pass and parks `waiting-human` at the cap with no re-arm / no continuation (same file, "review-round cap" test, run with cap=2)

Verification

All gates green at c3c86a3: bun run typecheck clean; bun test → 237 pass / 0 fail across 27 files.

Phase 1 (Integrate waitFor signals into the implementation workflow #33): implementation-workflow.test.ts — terminal fall-through, park/arm, compensation.
Phase 2 (Detect the blocked.json question sentinel in classifyStop #34): adapter.test.ts — classifyStop surfaces blocked.json question/context; tolerant of malformed sentinel.
Phase 3 (Build the GitHub poller for human replies and PR review state #35): poller.test.ts — 15 tests: newest-non-bot-reply, review classifier, idempotent fire, per-workflow error isolation.
Phase 4 (Implement agent resume for answered-question and review-changes #36): implementation-workflow.test.ts — asked-question e2e (reply injected into continuation brief), review-changes e2e (address-review brief + APPROVED ends loop), round-cap boundary (waiting-human at the cap, no further round). 8 workflow tests pass.

Stumbling points

bunqueue's DSL is narrower than the spec's example. The biggest surprise — waitFor nested in a branch/loop body compiles without error and is silently dropped at runtime. Discovered by a spike against the real embedded engine (Integrate waitFor signals into the implementation workflow #33) before building the production workflow; it reshaped the whole design (top-level waitFor + re-enqueue).
"Parked" and "capped" are both waiting-human. A test initially raced — it asserted the cap outcome while the workflow was still in its initial park (same row state). Fixed by waiting on the bunqueue execution settling (the definitive barrier), not the row state.
Handed-off round state. SQLite can't ALTER a CHECK constraint in place, so a dedicated superseded state would need a table rebuild — disproportionate today. Reused completed; documented as a Phase 8/9 refinement (decision log + inline review comment).

Suggested CLAUDE.md updates

The repo's state-issue contract section is solid. One candidate addition for the dispatcher area: a short note that the installed bunqueue Workflow DSL filters branch/loop bodies to steps only — waitFor/branch nested inside are dropped; park/resume is a top-level waitFor + re-enqueue. It cost a spike to discover and will bite the next workflow author. Deferred to the maintainer's judgment rather than edited unilaterally.

Architectural forks

None — the bunqueue DSL constraint (verified by spike) decided the design; no genuinely-tied options to A/B.

Follow-up issues

None filed. The one discovery — a dedicated superseded workflows.state for handed-off continuation rounds (vs reusing completed) — is naturally Phase 8 work (which routes dispatches through the persistent engine and revisits the row lifecycle) and is captured in decisions.md + an inline review comment, not as a standalone issue.

Out of scope (per the Epic)

Cross-process resume hosting (Phase 8). The poller + engine.signal + enqueueContinuation seams are wired, and the resume logic is fully built and tested against a live embedded engine. But dispatchEpic's engine drains when a workflow parks (waitForSettle returns on waiting), so a parked execution is not yet hosted on the long-lived main.ts engine to be resumed in production. Routing dispatches through the persistent engine + durable bunqueue + recover() is the Phase 8 auto-dispatch integration — explicitly out of scope for Phase 5.
Mechanical verification gates (Phase 6), auto-dispatch/slot enforcement (Phase 8), dashboard surfaces (Phase 9).

Summary by CodeRabbit

New Features
- Human-in-the-loop resume: workflows can park for questions or reviews and resume with preserved context and reason; continuation handoff supports multi-round resumes with a cap.
- GitHub poller: automatic polling detects human replies and PR review outcomes to trigger resumes.
- Sentinel parsing: parked questions now surface parsed question/context when available.
Tests
- Expanded unit and integration tests covering poller logic, resume flows, and multi-round behavior.
Chores
- DB migration added to track fired resume signals.
Documentation
- Added design and planning docs describing the park→poll→resume system.

coderabbitai · 2026-05-24T04:00:48Z

No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro Plus

Run ID: 79eb6244-bbdf-4e19-a3e7-2b6a471e9e1f

📥 Commits

Reviewing files that changed from the base of the PR and between 0e48630 and fb2e47e.

📒 Files selected for processing (2)

packages/dispatcher/src/poller-gateway.ts
packages/dispatcher/src/workflows/implementation.ts

📝 Walkthrough

Walkthrough

Adds a durable park→external-signal→resume spine: parse .middle/blocked.json sentinels, add fired_at for idempotent wait signals, implement a GitHub poller (gh gateway + cron + classifiers), refactor implementation workflow to park/resume with continuations and round caps, and wire poller/continuation startup and tests.

Changes

Park → Signal → Resume Workflow with GitHub Poller

Layer / File(s)	Summary
Blocked sentinel type and parsing `packages/adapters/claude/src/classify.ts`, `packages/core/src/adapter.ts`, `packages/core/src/index.ts`, `packages/adapters/claude/test/adapter.test.ts`	Adds `BlockedSentinel` type; Claude adapter reads/parses `.middle/blocked.json` into `sentinel: BlockedSentinel
DB migration: fired_at on waitfor_signals `packages/dispatcher/src/db/migrations/002_waitfor_fired.sql`	Adds `fired_at` column and records schema version 2 for idempotent poller firing.
Workflow record APIs for poller `packages/dispatcher/src/workflow-record.ts`, `packages/dispatcher/test/db.test.ts`	Expose `PollableWait`/`ArmedSignal` types and functions to load pollable waits, mark signals fired, get and consume armed signals; tests updated for schema v2.
GitHub gateway (gh CLI) `packages/dispatcher/src/poller-gateway.ts`	Subprocess `gh` wrapper to fetch issue comments and PR reviews, normalize fields, and detect bot authors.
Poller core: classification and runPoller `packages/dispatcher/src/poller.ts`, `packages/dispatcher/test/poller.test.ts`	Add reason decoding, classifyNewHumanReply, classifyReviewOutcome (including "0 actionable" handling), and runPoller to iterate parked waits, fire typed resume signals, and mark waits fired with per-workflow error isolation; tests added.
Cron wrapper for poller `packages/dispatcher/src/poller-cron.ts`, `packages/dispatcher/test/poller.test.ts`	Runs runPoller on a 60s cadence inside an embedded Bunqueue with resilient error handling and a stop function for teardown.
Implementation workflow park → resume orchestration `packages/dispatcher/src/workflows/implementation.ts`, `packages/dispatcher/test/implementation-workflow.test.ts`	Introduce `ResumeInput`/`ResumeReason`, `RESUME_EVENT`, `signalNameFor`; refactor prepare/drive/cleanup into park → waitFor(RESUME_EVENT) → resumeOrFinalize; add writeResumeBrief, driveOnce, parkForResume, recordTerminal (pre-seed), finalize, and resumeOrFinalize (enqueueContinuation with round caps). Tests refactored to drive continuations.
Dispatcher wiring: poller startup and continuation callback `packages/dispatcher/src/main.ts`, `packages/dispatcher/src/dispatch.ts`	Start poller in dispatcher with `ghPollGateway` and `fireSignal` → `engine.signal`, add `enqueueContinuation` that calls `engine.start("implementation", input)`, and include poller stop in shutdown.
Tests and harness updates `packages/dispatcher/test/*`, `packages/adapters/claude/test/adapter.test.ts`	Add poller unit/integration tests, refactor implementation-workflow tests for continuation/resume flows, and update DB/test expectations for schema v2.
Planning and design documentation `planning/issues/32/plan.md`, `planning/issues/32/decisions.md`	Add planning and decisions docs describing architecture, poller idempotency, continuation semantics, and review-driven resume flow.

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~60 minutes

Possibly related issues

Human-in-the-loop and review-driven resume flow #32: Parent tracking issue for human-in-the-loop + review-driven resume; this PR implements the described park→signal→resume flow.
Detect the blocked.json question sentinel in classifyStop #34: Adds tolerant reading/parsing of .middle/blocked.json and surfaces question/context in classifyStop, matching that issue's objective.
Build the GitHub poller for human replies and PR review state #35: Implements the GitHub poller and gateway to fire resume signals for human replies and PR review outcomes, aligning with that issue's intent.

Possibly related PRs

thejustinwalsh/middle#73: Earlier Claude classifyStop sentinel handling; this PR extends it to parse and surface sentinel contents.
thejustinwalsh/middle#81: Related dispatcher state/Phase 3 integration work that overlaps with dispatcher wiring and state handling.

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 77.78% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title 'feat(dispatcher): human-in-the-loop and review-driven resume flow' clearly and concisely describes the main feature addition — implementing a park → resume spine for human-in-the-loop and review-driven workflows.
Linked Issues check	✅ Passed	All coding requirements from issue `#32` are met: the park → resume spine is implemented with waitFor logic, asked-question and review-changes flows are supported, APPROVED terminates the loop, and review rounds are capped at 5.
Out of Scope Changes check	✅ Passed	All code changes directly support the implementation of `#32`'s requirements: the poller, workflow spine, adapter changes, migration, and tests are all aligned with the park → resume objective.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

Warning

There were issues while running some tools. Please review the errors and either fix the tool's configuration or disable the tool if it's a critical failure.

🔧 ESLint

If the error stems from missing dependencies, add them to the package.json file. For unrecoverable errors (e.g., due to private dependencies), disable the tool in the CodeRabbit configuration.

ESLint skipped: no ESLint configuration detected in root package.json. To enable, add eslint to devDependencies.

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

Restructure createImplementationWorkflow into a park -> external-signal -> resume spine: launch-and-drive -> branch(park|terminal) -> top-level waitFor -> resume-or-finalize. asked-question and done stops arm a durable waitfor_signals row (epic-<n>-answered / epic-<n>-review-resolved), end the session, set waiting-human, and park; terminal stops pre-seed the signal so the single top-level waitFor falls through. Resume consumes the row and re-drives a fresh session re-primed per reason. bunqueue 2.7.12 filters branch/loop bodies to steps only and signal() targets one execution, so waitFor is a single top-level node (the spec's nested graph isn't expressible); additional review rounds loop via re-enqueue (#36). Adds getWaitForSignal/consumeWaitForSignal to workflow-record. Closes #33

…fyStop classifyStop now reads the .middle/blocked.json question sentinel and carries its parsed contents on the asked-question classification (new BlockedSentinel type in core). Parsing is tolerant — a missing/malformed/contentless sentinel yields sentinel: null while still classifying asked-question (presence is the signal). The workflow's postQuestion seam receives the question + context so the parked Epic can show the human what's being asked. Closes #34

Adds runPoller: a pure pass over parked workflows (waiting-human + armed waitfor_signals) behind an injected GitHubPollGateway. For epic-<n>-answered it fires on a new non-bot reply; for epic-<n>-review-resolved it classifies the PR verdict — CHANGES_REQUESTED (review or changes-requested label) -> changes-requested, APPROVED or a fresh 0-actionable re-review -> resolved (so a bot reviewer that never flips its verdict doesn't hang the loop). Detect-and-fire only; #36 interprets the payload. Idempotent via a new waitfor_signals.fired_at column (migration 002): a fired wait is skipped until the workflow resumes and re-parks. Per-workflow failures (rate limits, transient errors) are isolated and retried next pass. Production gh gateway in poller-gateway.ts; 60s cron wired into main.ts alongside the watchdog (full cross-process resume hosting is Phase 8). Closes #35

…n and review-changes The Phase 5 spine (#33) parked and did a single inline resume drive. #36 replaces that with the re-enqueue continuation model the bunqueue reality forces: a park happens once per execution (no loop-back; loop bodies can't hold a waitFor), so every resume is a fresh execution re-primed from a ResumeInput that reuses the prior round's worktree — same branch, same PR. - resume-or-finalize interprets the fired signal: terminal stop / review resolved (APPROVED or clean re-review) -> finalize; answered-question or CHANGES_REQUESTED under the cap -> enqueueContinuation; a CHANGES_REQUESTED that would exceed the round cap (default 5) -> park waiting-human, no re-arm, no continuation (bounded loop). - prepare-worktree reuses input.resume.worktree and writes the reason-specific resume brief to .middle/prompt.md (answer injected, or the address-review brief pointing at the skill procedure). launch-and-drive picks answer/resume framing from the reason. - enqueueContinuation injected; wired in dispatch.ts to engine.start (the long-lived-engine hosting of parked execs is the Phase 8 integration). - Tests: asked-question e2e (reply injected into continuation brief), review e2e (address-review brief + APPROVED ends the loop), round-cap boundary (waiting-human at the cap, no further round).

…nting-github-issues Adds Phase 11 (the review-changes loop the #36 resume brief points at) so the autonomous daemon loop and a hand-driven agent follow the same per-round procedure: batch the whole pass -> resolve class-wide with a test per fix -> internal clean-eyes review loop over the batched diff -> push once -> reply in-thread -> re-request review -> stop and wait for the next verdict. APPROVED ends the loop; the agent never merges. Also expands 'You may be resumed mid-workstream' under the headless-dispatch section: fresh session in the same worktree, the two resume reasons (answer injected / address-review brief), and the bounded review loop (round cap 5). Synced to the bootstrap-assets mirror and the dogfood .claude/.codex copies.

…ecision

thejustinwalsh

Decision-log distillation (Phase 8 of implementing-github-issues). Each inline comment is the why behind a load-bearing choice; full entries live in planning/issues/32/decisions.md.

thejustinwalsh · 2026-05-24T04:59:58Z

+      // Single top-level `waitFor`: parks park-worthy stops until the poller
+      // fires RESUME_EVENT; terminal stops pre-seeded the signal and fall
+      // through. Same timeout budget as the drive step.
+      .waitFor(RESUME_EVENT, { timeout: WAITFOR_TIMEOUT_MS })


Why a single top-level waitFor, not the spec's nested graph. bunqueue@2.7.12's Workflow builder filters branch .path() and loop bodies to type === 'step' only — a waitFor nested inside is silently dropped (workflow.js:39-51). The executor also has no goto/loop-back. So park/resume is expressible only as a top-level waitFor node, and multi-round looping is re-enqueue (which the spec itself annotates // loop back via re-enqueue).

thejustinwalsh · 2026-05-24T04:59:59Z

      });
-    } else if (finalState === "completed") {
+    }
+    (ctx.signals as Record<string, unknown>)[RESUME_EVENT] = { terminal: true };


Conditional parking via pre-seeding ctx.signals. A top-level waitFor always executes (no skip primitive). buildContext returns signals: exec.signals by reference (runner.js:178) and runWaitFor advances when exec.signals[event] !== undefined (executor.js:150). So terminal stops mutate ctx.signals[RESUME_EVENT] = { terminal: true } here to fall straight through, while park-worthy stops leave it unset to genuinely park. resume-or-finalize reads the same terminal marker to tell a pre-seeded fall-through from a poller-fired verdict.

thejustinwalsh · 2026-05-24T04:59:59Z

+ * the epic-scoped, reason-scoped name lives in the durable `waitfor_signals`
+ * row (see `signalNameFor`), which is what the poller and dashboard read.
+ */
+export const RESUME_EVENT = "resume";


One static engine event name; epic-scoped naming lives in waitfor_signals. waitFor(event) takes a static string in this bunqueue version, and engine.signal(execId, event, payload) already targets one execution — so the event name needn't be parameterized to avoid cross-execution collisions. The durable, poller-facing name (epic-<n>-answered / epic-<n>-review-resolved, via signalNameFor) is the waitfor_signals.signal_name; the reason + data ride in the payload.

thejustinwalsh · 2026-05-24T04:59:59Z

+-- re-fire the same wait before the workflow has resumed and consumed the row.
+-- A fresh park (next review round) deletes-and-reinserts the row, clearing it.
+
+ALTER TABLE waitfor_signals ADD COLUMN fired_at INTEGER;


Poller idempotency via fired_at. The poller is a pure detect-and-fire pass; it marks a wait fired so a later pass won't re-fire before the workflow resumes and consumes the row. A fresh park (next round) deletes-and-reinserts the row, clearing fired_at. Note the row is keyed by signal_name (epic-scoped) — rounds of the same Epic never overlap because each round consumes its wait before the next round arms (sequential handoff), so the shared key doesn't collide.

thejustinwalsh · 2026-05-24T04:59:59Z

+  // parked execution. NOTE: routing dispatches through this long-lived engine
+  // (so parked executions live here to be resumed) is the Phase 8 auto-dispatch
+  // integration; the poller + signal seam are in place ahead of it.
+  const stopPoller = await startPoller({


Poller + signal seam wired; cross-process resume hosting is Phase 8. startPoller runs as a 60s cron on the long-lived engine, with fireSignal = engine.signal(id, RESUME_EVENT, …). Today dispatches run on dispatchEpic's throwaway engine, which drains when the workflow parks (waitForSettle returns on waiting), so a parked execution doesn't yet live on this engine to be resumed. Routing dispatches through the persistent engine is the Phase 8 auto-dispatch integration (explicitly out of scope for the Epic). The seam is in place ahead of it; a fire for a not-yet-hosted execution is caught by the poller's per-workflow guard and retried.

thejustinwalsh · 2026-05-24T04:59:59Z

+    // RATE_LIMITED before handing off.
+    markAvailableOnSuccess(deps.db, ctx.input.adapter);
+    // Hand control to a fresh continuation that reuses this worktree.
+    await deps.enqueueContinuation({


Multi-round resume = re-enqueue (one round = one execution). A single execution can park only once. The review loop needs up to cap real parks (each frees the session for a reviewer who may take days), so the only expressible loop is re-enqueue. waitfor_signals.workflow_id must equal the bunqueue execution id for engine.signal to target the parked execution, so each round is necessarily a fresh execution (and workflows row), keyed by the same epic_number; the live one is the latest non-terminal row. The continuation reuses the prior round's worktree via input.resume.worktree — same branch, same PR.

thejustinwalsh · 2026-05-24T04:59:59Z

+ * This unconditionally overwrites (unlike `ensurePromptFile`, which preserves an
+ * operator brief on the *initial* dispatch) — a resume's brief is the live one.
+ */
+function writeResumeBrief(


The agent fetches review threads; the dispatcher only writes the brief. On a review-changes continuation this overwrites .middle/prompt.md with an "address review" brief (round, decision, the skill's per-round procedure) and the agent pulls the PR's live threads itself via gh. Having a full Claude session fetch live threads is more robust than embedding a stale snapshot, and keeps the dispatcher GitHub-read-light. Codifying the procedure in the skill's new "Addressing review feedback" (Phase 11) section is what makes the autonomous daemon loop and a hand-driven agent behave identically — the #36 acceptance's explicit requirement.

thejustinwalsh · 2026-05-24T04:59:59Z

+    });
+    // This round handed off — terminal in the bunqueue sense. The worktree is
+    // NOT torn down; the continuation reuses it.
+    updateWorkflow(deps.db, ctx.executionId, { state: "completed" });


A handed-off round ends completed, not a new superseded state. The row must be terminal so findActiveWorkflowBySession / loadPollableWaits ignore it in favour of the live continuation. Adding superseded means rebuilding the table (SQLite can't ALTER a CHECK constraint) — disproportionate for a today-cosmetic distinction whose only consumer is the Phase 9 dashboard (out of scope). markAvailableOnSuccess firing on a handoff is correct (the round's drive ran a working adapter). If the dashboard later needs honest per-round accounting, Phase 8 (which revisits the workflows lifecycle) is the place to add superseded.

thejustinwalsh · 2026-05-24T05:01:55Z

Reviewer's brief — Epic #32 (PR #90)

What this is. A park → external-signal → resume spine for the implementation workflow: an agent can hand control back to a human (asked a question) or a reviewer (PR-ready) and later resume — fresh session, same worktree / branch / PR — with the answer or review threads in context. APPROVED ends the loop; a never-satisfied review loop is bounded (default 5 rounds → waiting-human). middle never merges.

How to run it

bun run typecheck        # tsc --noEmit — clean
bun test                 # 237 pass / 0 fail across 27 files
# the load-bearing suites for this PR:
bun test packages/dispatcher/test/implementation-workflow.test.ts   # 8 pass — the park/resume spine + re-enqueue loop
bun test packages/dispatcher/test/poller.test.ts                    # 15 pass — the GitHub poller / review classifier
bun test packages/adapters/claude/test/adapter.test.ts              # classifyStop sentinel surfacing

What to verify (and what "correct" looks like)

The bunqueue constraint is real and load-bearing. packages/dispatcher/src/workflows/implementation.ts uses a single top-level waitFor + re-enqueue, not the spec's nested graph. The justification (the DSL silently drops a nested waitFor) is in the inline review comment on the .waitFor(...) line and in node_modules/.bun/bunqueue@2.7.12*/dist/client/workflow/workflow.js:39-51 if you want to confirm at the source. This is the decision most worth a second pair of eyes — if it's wrong, the shape is wrong.
Re-enqueue continuation reuses the worktree. resume-or-finalize → enqueueContinuation with resume = { reason, round, worktree, payload }; the continuation's prepare-worktree skips createWorktree and reuses the handle (no new branch, no new PR). Verified by the e2e tests asserting getWorkflow(continuation).worktreePath === getWorkflow(initial).worktreePath and listWorktrees() staying at 1 until the final teardown.
The round cap is per-pass and consumes-then-caps. resume-or-finalize consumes the wait before the cap check, then parks waiting-human with no re-arm and no continuation — that's what stops auto-resume. The cap test runs with reviewRoundCap: 2 and asserts no third round is enqueued.
Dispatcher writes the brief; the agent fetches the threads. writeResumeBrief overwrites .middle/prompt.md with an "address review" brief (round, decision, procedure pointer); the agent pulls live PR threads itself per SKILL.md → Phase 11 "Addressing review feedback". Read Phase 11 — it's what makes the daemon loop and a hand-driven agent behave identically.
Acceptance-evidence table in the PR description maps each Epic Human-in-the-loop and review-driven resume flow #32 criterion to a test/file. Each is met.

How to review it

Start with the 8 inline review comments — they're the why behind each load-bearing choice (full text in planning/issues/32/decisions.md).
Then read implementation.ts top-to-bottom: types/ResumeInput → writeResumeBrief → prepareWorktree/launchAndDrive (continuation branches) → resume-or-finalize (the interpreter) → the workflow graph at the bottom.
Cross-check the e2e tests against the acceptance table.

Fragile / needs extra eyes

waitfor_signals.signal_name is epic-scoped, not execution-scoped (PK = signal_name). It works today because rounds never overlap (each consumes its wait before the next arms). If a future change ever parks two rounds of the same Epic concurrently, the PK collides — worth a comment if you see a path to that.
Cross-process resume hosting is Phase 8, not this PR. The poller + engine.signal + enqueueContinuation seams are wired and the resume logic is fully tested against a live embedded engine, but dispatchEpic's engine drains at park, so a parked execution isn't yet hosted on the long-lived main.ts engine in production. This is the Epic's stated Phase 5/8 boundary, not a scope cut — but confirm you agree with where the line is drawn.
Handed-off rounds terminate as completed (no superseded state — SQLite can't ALTER a CHECK in place). Cosmetic until the Phase 9 dashboard; flagged in the decision log + an inline comment.

Terminal state: all 4 sub-issues closed, all gates green, PR marked ready. The human reviews and merges — the workflow does not.

coderabbitai

Actionable comments posted: 5

🧹 Nitpick comments (2)

packages/dispatcher/src/workflows/implementation.ts (2)
662-665: 💤 Low value

Consider reducing timeout for resume-or-finalize step.

The step only performs quick operations (finalize worktree destruction or enqueueContinuation), but it inherits the same ~4-hour timeout budget as launch-and-drive. A much shorter timeout (e.g., 60 seconds) would be more appropriate and would surface hangs faster.
     .step("resume-or-finalize", resumeOrFinalize, {
       retry: 1,
-      timeout: launchTimeout + stopTimeout + 60_000,
+      timeout: 60_000,
     })
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@packages/dispatcher/src/workflows/implementation.ts` around lines 662 - 665,
The "resume-or-finalize" workflow step currently inherits the large timeout
computed as launchTimeout + stopTimeout + 60_000 (same as "launch-and-drive")
but it only does quick operations; change the timeout for the
.step("resume-or-finalize", resumeOrFinalize, { timeout: ... }) call to a much
shorter value (e.g., 60_000 ms) so hangs surface faster while leaving retry
behavior unchanged; update the timeout expression in the step configuration
where "resume-or-finalize" and the variables launchTimeout/stopTimeout are
referenced.
586-606: ⚡ Quick win

Reassess null-deref risk: bunqueue timeout should not reach resumeOrFinalize

resumeOrFinalize assumes signal is present (const payload = signal as ResumeSignalPayload; then payload.reason), but this step is only entered after the top-level .waitFor(RESUME_EVENT, { timeout: WAITFOR_TIMEOUT_MS }) resolves; when the timeout elapses without the signal, bunqueue fails the execution (emits signal:timeout), so it shouldn’t proceed with signal === undefined.

Optional: add an explicit invariant guard before casting/accessing payload.reason (e.g., throw for !signal) to make the assumption resilient to future engine/flow changes.
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@packages/dispatcher/src/workflows/implementation.ts` around lines 586 - 606,
The code assumes a non-null signal in resumeOrFinalize when it does "const
payload = signal as ResumeSignalPayload" and then accesses payload.reason; add
an explicit invariant guard just after computing signal (e.g., if (!signal)
throw new Error(`Missing resume signal after WAITFOR_TIMEOUT_MS for execution
${ctx.executionId}`)) so the function never proceeds with a null/undefined
payload; keep the existing calls to consumeWaitForSignal(deps.db,
ctx.executionId) and finalize(ctx, handle, ...) intact and only cast/access
payload after the guard, referencing RESUME_EVENT, WAITFOR_TIMEOUT_MS,
resumeOrFinalize, consumeWaitForSignal, and finalize to locate the change.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@packages/dispatcher/src/poller-gateway.ts`:
- Around line 28-33: The gh CLI output with --paginate can emit one JSON value
per page, so JSON.parse(out) in listIssueComments (and where reviewsOut is
parsed in findPrForEpic) will fail for multi-page results; update both places to
either call gh without --paginate and use --slurp, or (preferably) treat out as
newline-separated JSON pages: split out into separate JSON blobs, parse each
blob, and concatenate array results into a single array before proceeding (look
for function listIssueComments and variable reviewsOut / function findPrForEpic
to locate the two spots).

In `@packages/dispatcher/src/poller.ts`:
- Around line 130-136: The fallback that treats snapshot.reviewDecision ===
"CHANGES_REQUESTED" as a fresh trigger is firing on stale standing decisions;
change the logic in the poller (where snapshot.reviewDecision and
snapshot.labels are evaluated) to only emit "changes-requested" when the
standing decision is recent enough — i.e. check the review decision timestamp
(e.g. snapshot.reviewDecisionTimestamp or snapshot.reviewUpdatedAt) against the
arming boundary (sinceMs) before returning { outcome: "changes-requested", ...
}; keep the existing labels check but also ensure it respects the same
sinceMs/timebound so stale states do not re-trigger.
- Around line 121-128: The current logic in poller.ts uses ACTIONABLE_RE on
latest.body and may return resolved when the regex shows "0" even if
latest.state === "CHANGES_REQUESTED"; change the branching in the function that
evaluates latest (uses ACTIONABLE_RE, latest.body, latest.state,
snapshot.reviewDecision) so that you first check for an explicit latest.state
=== "CHANGES_REQUESTED" and return outcome "changes-requested" (with decision
"CHANGES_REQUESTED") before treating the ACTIONABLE_RE === 0 case as resolved;
ensure the check for Number(m[1]) === 0 only runs if latest.state is not
"CHANGES_REQUESTED" so explicit state wins over regex-derived actionable count.

In `@packages/dispatcher/test/implementation-workflow.test.ts`:
- Around line 451-456: The loop waiting for the cap-path execution to settle
(using engine.getExecution(id2)?.state and Bun.sleep) can time out silently;
after the waiting loop in implementation-workflow.test.ts, assert/fail fast if
the execution did not reach "completed" or "failed": retrieve the final state
via engine.getExecution(id2)?.state and throw an explicit error or call expect
to fail (including id2 and the observed state) so the test stops immediately and
reports the unsettled execution instead of continuing to subsequent DB
assertions.

In `@planning/issues/32/plan.md`:
- Line 10: Update the sentence that currently reads as a hard limit ("5") to
state the cap is configurable with a default of 5 and that the review loop still
terminates early on the "APPROVED" state; mention the configuration setting that
controls this (e.g. max_review_rounds / review_round_cap) so readers know where
to change it and keep the doc aligned with runtime behavior.

---

Nitpick comments:
In `@packages/dispatcher/src/workflows/implementation.ts`:
- Around line 662-665: The "resume-or-finalize" workflow step currently inherits
the large timeout computed as launchTimeout + stopTimeout + 60_000 (same as
"launch-and-drive") but it only does quick operations; change the timeout for
the .step("resume-or-finalize", resumeOrFinalize, { timeout: ... }) call to a
much shorter value (e.g., 60_000 ms) so hangs surface faster while leaving retry
behavior unchanged; update the timeout expression in the step configuration
where "resume-or-finalize" and the variables launchTimeout/stopTimeout are
referenced.
- Around line 586-606: The code assumes a non-null signal in resumeOrFinalize
when it does "const payload = signal as ResumeSignalPayload" and then accesses
payload.reason; add an explicit invariant guard just after computing signal
(e.g., if (!signal) throw new Error(`Missing resume signal after
WAITFOR_TIMEOUT_MS for execution ${ctx.executionId}`)) so the function never
proceeds with a null/undefined payload; keep the existing calls to
consumeWaitForSignal(deps.db, ctx.executionId) and finalize(ctx, handle, ...)
intact and only cast/access payload after the guard, referencing RESUME_EVENT,
WAITFOR_TIMEOUT_MS, resumeOrFinalize, consumeWaitForSignal, and finalize to
locate the change.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro Plus

Run ID: 61c9c87a-df01-4360-8ab6-f50d03a153da

📥 Commits

Reviewing files that changed from the base of the PR and between cac3529 and c3c86a3.

📒 Files selected for processing (21)

.claude/skills/implementing-github-issues/SKILL.md
.codex/skills/implementing-github-issues/SKILL.md
packages/adapters/claude/src/classify.ts
packages/adapters/claude/test/adapter.test.ts
packages/cli/src/bootstrap-assets/skills/implementing-github-issues/SKILL.md
packages/core/src/adapter.ts
packages/core/src/index.ts
packages/dispatcher/src/db/migrations/002_waitfor_fired.sql
packages/dispatcher/src/dispatch.ts
packages/dispatcher/src/main.ts
packages/dispatcher/src/poller-cron.ts
packages/dispatcher/src/poller-gateway.ts
packages/dispatcher/src/poller.ts
packages/dispatcher/src/workflow-record.ts
packages/dispatcher/src/workflows/implementation.ts
packages/dispatcher/test/db.test.ts
packages/dispatcher/test/implementation-workflow.test.ts
packages/dispatcher/test/poller.test.ts
packages/skills/implementing-github-issues/SKILL.md
planning/issues/32/decisions.md
planning/issues/32/plan.md

…hase 4 gates + #80 Phase 5 (#32) was built against a pre-Phase-4 base; main since landed the skill-enforcement gates (#86), the positive-done-signal nudge loop (#80), the plan-comment guard, and the review-handling skill codification (#87/#88/#89). Integration (Phase 5 as the base, main's additions layered on): - driveOnce resolves a bare-stop through the #80 nudge loop (session stays alive during nudges); a ready, non-draft Epic PR resolves to `done`, which Phase 5 then parks on review-resolved. nudge-exhausted parks in waiting-human. - The plan-comment guard runs in driveOnce: an unposted `done` is demoted to `failed` so it never enters the review park. - DriveResult carries a DriveOutcome (StopClassification | nudge-exhausted); finalStateFor handles nudge-exhausted; the drive-step timeout widened for nudges. - Skill (4 copies) resolved to main's codified version; the #80 nudge + plan-comment tests rewritten park-aware. Verified: typecheck clean, full suite 289 pass / 0 fail, skills mirror in sync.

coderabbitai

Actionable comments posted: 1

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@packages/dispatcher/src/workflows/implementation.ts`:
- Around line 658-660: The code always calls
deps.worktree.destroyWorktree(handle) even when the job resolves to a human
handoff; change the teardown to skip destroying the worktree when the workflow
final state is "waiting-human" by checking finalStateFor(settled) (or the
computed finalState variable) and only calling
deps.worktree.destroyWorktree(handle) when finalState !== "waiting-human"; apply
the same conditional logic to the other destroyWorktree call in the nearby block
around the second occurrence (the block handling settled kinds around lines
667-673) so handoff context is preserved for waiting-human outcomes.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro Plus

Run ID: 61b72c65-9b8a-477f-aa84-03933e982666

📥 Commits

Reviewing files that changed from the base of the PR and between c3c86a3 and 87e209c.

📒 Files selected for processing (5)

packages/adapters/claude/test/adapter.test.ts
packages/core/src/index.ts
packages/dispatcher/src/dispatch.ts
packages/dispatcher/src/workflows/implementation.ts
packages/dispatcher/test/implementation-workflow.test.ts

… handoff teardown Address CodeRabbit review on PR #90: - poller-gateway: slurp + flatten paginated `gh api --paginate` output so multi-page issue-comment / PR-review reads don't break JSON.parse past page 1. - poller: drop the stale standing `reviewDecision === CHANGES_REQUESTED` resume fallback. A bot reviewer never flips its standing verdict, so it would re-dispatch the agent every pass with no new feedback; only a fresh review or an explicit `changes-requested` label is a trustworthy resume signal. The 0-actionable body still wins over an un-flipped state for a fresh review. - implementation: preserve the worktree on a `waiting-human` finalize so the human can inspect / resume the in-progress state. - test: fail fast if the cap-path execution never settles; add a regression test for the stale-standing-decision fix. - plan: note the review-round cap is configurable (default 5), not fixed.

thejustinwalsh · 2026-05-24T05:58:21Z

@coderabbitai review

Addressed the previous round in 0e48630: fixed paginated gh output parsing, narrowed the stale standing-decision resume fallback, preserved the worktree on waiting-human handoff, hardened the cap-path test, and clarified the plan doc. One finding (prioritizing explicit CHANGES_REQUESTED over the 0-actionable body) was intentionally not applied — reasoning is on that thread.

coderabbitai · 2026-05-24T06:00:51Z

Tip

For best results, initiate chat on the files or code changes.

🧩 Analysis chain

🏁 Script executed:

#!/bin/bash
# Fetch the key files for review

echo "=== packages/dispatcher/src/poller.ts ==="
cat packages/dispatcher/src/poller.ts

echo ""
echo "=== packages/dispatcher/src/poller-gateway.ts ==="
cat packages/dispatcher/src/poller-gateway.ts

echo ""
echo "=== packages/dispatcher/src/poller-cron.ts ==="
cat packages/dispatcher/src/poller-cron.ts

echo ""
echo "=== packages/dispatcher/src/workflows/implementation.ts ==="
cat packages/dispatcher/src/workflows/implementation.ts

echo ""
echo "=== packages/adapters/claude/src/classify.ts ==="
cat packages/adapters/claude/src/classify.ts

echo ""
echo "=== packages/dispatcher/src/workflow-record.ts ==="
cat packages/dispatcher/src/workflow-record.ts

echo ""
echo "=== packages/dispatcher/src/dispatch.ts ==="
cat packages/dispatcher/src/dispatch.ts

echo ""
echo "=== packages/dispatcher/src/main.ts ==="
cat packages/dispatcher/src/main.ts

echo ""
echo "=== packages/dispatcher/src/db/migrations/002_waitfor_fired.sql ==="
cat packages/dispatcher/src/db/migrations/002_waitfor_fired.sql

echo ""
echo "=== packages/core/src/adapter.ts ==="
cat packages/core/src/adapter.ts

echo ""
echo "=== packages/dispatcher/test/poller.test.ts ==="
cat packages/dispatcher/test/poller.test.ts

echo ""
echo "=== packages/dispatcher/test/implementation-workflow.test.ts ==="
cat packages/dispatcher/test/implementation-workflow.test.ts

echo ""
echo "=== planning/issues/32/decisions.md ==="
cat planning/issues/32/decisions.md

Length of output: 127168

@thejustinwalsh: Solid work on this round — the paginated --slurp fix, narrowed fallback, worktree preservation on cap, and hardened cap-path test all look correct. Three findings below; two are minor and one is a nit.

1. `poller-gateway.ts` — Epic number substring collision in PR search (medium)

"--search", `in:body Closes #${epicNumber}`,

GitHub's full-text search for Closes #3 will match any PR body that contains `Closes `#3 as a substring — including bodies with Closes #30, `Closes `#31, Closes #300``, etc. In a project where epic #3 exists alongside `#30`+, the search can surface the wrong PR. `prs[0]?.number` picks the first result, and in practice there's likely only one open PR per epic, but the behaviour becomes non-deterministic once the issue space grows.

A tighter pattern avoids the ambiguity — match the full reference token, not just the prefix:

"--search", `in:body "Closes #${epicNumber}"`,
// or, if you prefer a numeric boundary guard client-side:
const prs = (JSON.parse(listOut) as Array<{ number: number }>)
  .filter(/* narrow by body if gh returns the body field */ ...);

The safest fix with the current gh pr list call is a client-side guard after parsing the result from a wider query (e.g., include body in --json and regex-filter on \bCloses\s+#${epicNumber}\b), though that increases response size.

2. `implementation.ts` — `setRateLimited` called twice for `rate-limited` stops (minor)

recordTerminal sets the rate-limit state to pre-seed ctx.signals:

// packages/dispatcher/src/workflows/implementation.ts
if (outcome.kind === "rate-limited") {
  setRateLimited(deps.db, { ... });  // ← first call
}
(ctx.signals as Record<string, unknown>)[RESUME_EVENT] = { terminal: true };

Then finalize calls it again:

if (settled.kind === "rate-limited") {
  setRateLimited(deps.db, { ... });  // ← duplicate
}

It's idempotent, so this is correct as-is, but the bookkeeping clearly belongs in one place. The recordTerminal call can be removed — finalize is the authoritative terminal handler — or vice versa with a comment explaining the placement.

3. `implementation.ts` `resumeOrFinalize` — `markAvailableOnSuccess` before `enqueueContinuation` (minor)

markAvailableOnSuccess(deps.db, ctx.input.adapter);    // ← clears RATE_LIMITED
await deps.enqueueContinuation({ ..., resume: { ... } }); // ← may throw
updateWorkflow(deps.db, ctx.executionId, { state: "completed" });

If enqueueContinuation throws (even transiently against the embedded engine), the rate-limit state has already been cleared but no continuation was enqueued and the row stays waiting-human — inconsistent state. In the embedded-engine path this window is near-zero, but it is a real window.

Flipping the order closes it cleanly:

await deps.enqueueContinuation({ ..., resume: { ... } });
markAvailableOnSuccess(deps.db, ctx.input.adapter);
updateWorkflow(deps.db, ctx.executionId, { state: "completed" });

If the enqueue fails, neither the rate-limit state nor the row state changes, and the poller can retry on the next pass.

Everything else looks correct

--slurp + .flat() in poller-gateway.ts fixes the paginated parse correctly. ✅
The removal of the standing CHANGES_REQUESTED decision fallback in classifyReviewOutcome (and the accompanying test for the stale-decision case) is the right call — the comment explains the reasoning clearly. ✅
Worktree preservation on waiting-human (if (finalState !== "waiting-human")) covers both the cap path and nudge-exhausted. ✅
The cap-path test now correctly waits for the bunqueue execution to settle before asserting waiting-human (avoiding the race between park-time state and cap-path state). ✅
decisions.md accurately documents all seven design choices, including the Phase 8 boundary and the superseded-state deferral. ✅
waitfor_signals PK collision note in the reviewer brief is the right fragile-point call-out for the single-concurrent-park assumption. ✅

The PR is in good shape; findings 1 and 3 are worth fixing before merge; finding 2 is cleanup-level.

coderabbitai

🧹 Nitpick comments (1)

packages/dispatcher/src/workflows/implementation.ts (1)
659-664: ⚡ Quick win

Add a regression assertion for the nudge-exhausted keep-worktree path.

This branch is only exercised when finalize() sees waiting-human (for example, the bare-stop → nudge-exhausted flow). The current tests only assert worktree preservation on the review-cap path, which bypasses finalize(). Please add a listWorktrees(...) assertion to the nudge-exhausted test so this fix cannot regress silently.
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@packages/dispatcher/src/workflows/implementation.ts` around lines 659 - 664,
Add an assertion in the existing nudge-exhausted test to ensure the worktree is
preserved when finalize() leaves the job in the "waiting-human" path: after
driving the scenario that produces nudge-exhausted (the bare-stop →
nudge-exhausted flow) and after any call that triggers finalize(), call
deps.worktree.listWorktrees(...) and assert the returned list still contains the
worktree handle used in the test; this complements the existing review-cap
assertion and prevents regressions to the finalize() branch which calls
deps.worktree.destroyWorktree(handle).

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Nitpick comments:
In `@packages/dispatcher/src/workflows/implementation.ts`:
- Around line 659-664: Add an assertion in the existing nudge-exhausted test to
ensure the worktree is preserved when finalize() leaves the job in the
"waiting-human" path: after driving the scenario that produces nudge-exhausted
(the bare-stop → nudge-exhausted flow) and after any call that triggers
finalize(), call deps.worktree.listWorktrees(...) and assert the returned list
still contains the worktree handle used in the test; this complements the
existing review-cap assertion and prevents regressions to the finalize() branch
which calls deps.worktree.destroyWorktree(handle).

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro Plus

Run ID: f8f98424-d87e-4cf6-9377-c1ed21f0d8c8

📥 Commits

Reviewing files that changed from the base of the PR and between 87e209c and 0e48630.

📒 Files selected for processing (6)

packages/dispatcher/src/poller-gateway.ts
packages/dispatcher/src/poller.ts
packages/dispatcher/src/workflows/implementation.ts
packages/dispatcher/test/implementation-workflow.test.ts
packages/dispatcher/test/poller.test.ts
planning/issues/32/plan.md

✅ Files skipped from review due to trivial changes (1)

planning/issues/32/plan.md

…eeping Address CodeRabbit's post-approval batch on PR #90: - poller-gateway: the `in:body Closes #N` search is a prefix match, so `#3` surfaced `#30`/`#300`. Re-confirm the exact closing reference client-side with a non-digit-boundary regex over the returned PR bodies, making epic→PR resolution deterministic as the issue space grows. - implementation: drop the duplicate `setRateLimited` in `recordTerminal`; `finalize` is the single authoritative terminal handler and always runs after the pre-seed falls the `waitFor` through. - implementation: in `resumeOrFinalize`, enqueue the continuation before clearing RATE_LIMITED so a throwing enqueue leaves both the rate-limit state and the row untouched for a clean poller retry.

thejustinwalsh · 2026-05-24T06:13:52Z

@coderabbitai Addressed all three in fb2e47e.

Epic-number substring collision — kept the server-side in:body Closes #N as a pre-filter and added a client-side guard: --json number,body then .find on /\bcloses\s+#${epicNumber}(?!\d)/i. The negative-lookahead boundary stops #3 from matching #30/#300, so epic→PR resolution is deterministic regardless of issue-space size.
Duplicate setRateLimited — removed it from recordTerminal; finalize is now the single authoritative terminal handler (it always runs after the pre-seed falls the waitFor through). Updated the docstring to match.
markAvailableOnSuccess ordering — flipped it after enqueueContinuation, so a throwing enqueue leaves both the rate-limit state and the row untouched and the poller retries cleanly.

Typecheck clean, 290 pass / 0 fail.

coderabbitai · 2026-05-24T06:14:26Z

Tip

For best results, initiate chat on the files or code changes.

🧩 Analysis chain

🏁 Script executed:

#!/bin/bash
# Verify the three fixes in fb2e47e

echo "=== poller-gateway.ts — PR search fix ==="
grep -n -A 20 "findPrForEpic" packages/dispatcher/src/poller-gateway.ts | head -60

echo ""
echo "=== implementation.ts — recordTerminal and resumeOrFinalize ==="
grep -n -A 10 "recordTerminal\|setRateLimited\|markAvailableOnSuccess\|enqueueContinuation" packages/dispatcher/src/workflows/implementation.ts | head -80

Length of output: 4926

@thejustinwalsh: All three confirmed correct in fb2e47e:

Epic-number collision — the closesRe negative-lookahead (?!\d) is correctly anchored with \b on the left, making epic→PR resolution deterministic regardless of issue-space size. The server-side pre-filter + client-side .find() combo is the right layering. ✅
Duplicate setRateLimited — recordTerminal now only pre-seeds ctx.signals; the updated docstring ("Rate-limit bookkeeping lives solely in finalize") makes the invariant clear for future readers. ✅
markAvailableOnSuccess ordering — enqueueContinuation now commits first; markAvailableOnSuccess and updateWorkflow only run once the hand-off is confirmed. The narrow failure window is closed. ✅

290 pass / 0 fail is a healthy signal too. The PR looks good to merge.

🐇✨

docs(issue-32): plan + decisions for human-in-loop & review resume

134b97d

thejustinwalsh mentioned this pull request May 24, 2026

Integrate waitFor signals into the implementation workflow #33

Closed

6 tasks

thejustinwalsh mentioned this pull request May 24, 2026

Detect the blocked.json question sentinel in classifyStop #34

Closed

4 tasks

thejustinwalsh mentioned this pull request May 24, 2026

Build the GitHub poller for human replies and PR review state #35

Closed

6 tasks

thejustinwalsh added 3 commits May 24, 2026 00:56

docs(issue-32): record the completed-vs-superseded continuation-row d…

c3c86a3

…ecision

thejustinwalsh mentioned this pull request May 24, 2026

Implement agent resume for answered-question and review-changes #36

Closed

7 tasks

thejustinwalsh commented May 24, 2026

View reviewed changes

thejustinwalsh marked this pull request as ready for review May 24, 2026 05:01

thejustinwalsh added the ready-for-review All phases done and verified — PR ready for final human review and merge label May 24, 2026

thejustinwalsh mentioned this pull request May 24, 2026

Human-in-the-loop and review-driven resume flow #32

Closed

4 tasks

coderabbitai Bot requested changes May 24, 2026

View reviewed changes

Comment thread packages/dispatcher/src/poller-gateway.ts Outdated

Comment thread packages/dispatcher/src/poller.ts

Comment thread packages/dispatcher/src/poller.ts

Comment thread packages/dispatcher/test/implementation-workflow.test.ts

Comment thread planning/issues/32/plan.md Outdated

coderabbitai Bot requested changes May 24, 2026

View reviewed changes

Comment thread packages/dispatcher/src/workflows/implementation.ts

coderabbitai Bot reviewed May 24, 2026

View reviewed changes

coderabbitai Bot approved these changes May 24, 2026

View reviewed changes

thejustinwalsh merged commit 96e098b into main May 24, 2026
1 check passed

thejustinwalsh mentioned this pull request May 24, 2026

docs(skill): codify merge-resolution rules and clean-mergeable-PR requirement #100

Merged

thejustinwalsh mentioned this pull request May 29, 2026

Make the CodexAdapter functionally dispatchable against live codex 0.133.0 #177

Closed

9 tasks

This was referenced Jun 4, 2026

feat(adapters): CopilotAdapter — the third agent adapter #207

Merged

fix(dispatcher): park arms the actual-reason signal even when blocked: is pre-armed #235

Merged

feat(verify): Phase-12 live-smoke verification harness #230

Open

Conversation

thejustinwalsh commented May 24, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

What changed

Why these changes

Status

Acceptance evidence (Epic #32)

Verification

Stumbling points

Suggested CLAUDE.md updates

Architectural forks

Follow-up issues

Out of scope (per the Epic)

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented May 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

Possibly related issues

Possibly related PRs

❌ Failed checks (1 warning)

Uh oh!

thejustinwalsh left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

thejustinwalsh commented May 24, 2026

Reviewer's brief — Epic #32 (PR #90)

How to run it

What to verify (and what "correct" looks like)

How to review it

Fragile / needs extra eyes

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

thejustinwalsh commented May 24, 2026

Uh oh!

coderabbitai Bot commented May 24, 2026

1. poller-gateway.ts — Epic number substring collision in PR search (medium)

2. implementation.ts — setRateLimited called twice for rate-limited stops (minor)

3. implementation.ts resumeOrFinalize — markAvailableOnSuccess before enqueueContinuation (minor)

Everything else looks correct

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

thejustinwalsh commented May 24, 2026

Uh oh!

coderabbitai Bot commented May 24, 2026

Uh oh!

Uh oh!

Reviewers

thejustinwalsh commented May 24, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented May 24, 2026 •

edited

Loading

1. `poller-gateway.ts` — Epic number substring collision in PR search (medium)

2. `implementation.ts` — `setRateLimited` called twice for `rate-limited` stops (minor)

3. `implementation.ts` `resumeOrFinalize` — `markAvailableOnSuccess` before `enqueueContinuation` (minor)