Skip to content

backlog: Codex-parallel Otto-86 refinement — 4-stage progression + test-mode bounding + Windows-support use case#255

Merged
AceHack merged 2 commits intomainfrom
backlog/otto-86-refine-codex-parallel-with-multi-claude-intermediate
Apr 24, 2026
Merged

backlog: Codex-parallel Otto-86 refinement — 4-stage progression + test-mode bounding + Windows-support use case#255
AceHack merged 2 commits intomainfrom
backlog/otto-86-refine-codex-parallel-with-multi-claude-intermediate

Conversation

@AceHack
Copy link
Copy Markdown
Member

@AceHack AceHack commented Apr 24, 2026

Summary

  • Extends Otto-78's 3-stage Codex-parallel progression into a 4-stage arc with a new multi-Claude-Code intermediate (stage b), Windows support as concrete use case (stage d), Otto as readiness-signaller, and hard test-mode time-limits-or-process-kill requirement.
  • Two Aaron Otto-86 messages both absorbed in one refinement commit.

4-stage progression

  • (a) Today — single coordinator; Otto on Claude Code.
  • (b) Multi-Claude-Code peer-harness experiment — NEW intermediate; test parallel coordination before introducing harness-difference.
  • (c) Multi-harness with Codex — gated on (b) passing + Codex built-out + Otto's explicit trust signal.
  • (d) Full peer-harness with Windows-support real workload — possibly via Codex ("telephone line" transfer-learning end-to-end test).

Test-mode bounding (hard requirement)

Aaron: "make sure when in peer-harness mode you give the other one time limits or process kill them either way, just while we are testing we don't want the other peer harness to run forever during tests only when in real use."

  • Time-limits OR explicit process-kill on the non-primary during testing.
  • Removed once peer-harness moves from test to real use.
  • Concrete mechanism named in a future test-plan doc.

Otto is the readiness-signaller

Aaron: "i wont do it until you tell me we are ready" — stage (c) launch is gated on Otto's explicit readiness signal, per Otto-82 authority-calibration.

Scope limits

  • No second Claude session without experiment design + dry-run.
  • No skipping (b) to jump to (c).
  • No unbounded test runs.
  • No premature readiness signals.
  • Otto-78 primary-switch clause unchanged.

Authority

Within standing authority per Otto-82 calibration — BACKLOG-row refinement on an already-refinement-friendly row.

Test plan

  • Both Aaron messages quoted verbatim
  • 4-stage progression named explicitly
  • Test-mode bounding as hard requirement
  • Otto-readiness-signal as maintainer-acknowledgment gate
  • Windows-support use case documented
  • Otto-78 primary-switch clause preserved

🤖 Generated with Claude Code

Copy link
Copy Markdown

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Updates docs/BACKLOG.md to refine the Otto-86 backlog entry by defining a 4-stage peer-harness progression (adding a multi-Claude-Code intermediate stage), making “test-mode bounding” (timeout/process-kill) a hard requirement during testing, and capturing Windows support as the concrete stage-(d) workload, gated by Otto’s explicit readiness signal.

Changes:

  • Adds an Otto-86 refinement section with verbatim source messages plus a 4-stage progression (a–d).
  • Documents test-mode bounding as a hard requirement for stages (b) and (c) tests.
  • Introduces Windows support as the motivating real-workload use case for stage (d) and ties it to FACTORY-HYGIENE audit surfaces.

Comment thread docs/BACKLOG.md Outdated
Comment thread docs/BACKLOG.md Outdated
Comment thread docs/BACKLOG.md Outdated
AceHack added a commit that referenced this pull request Apr 24, 2026
…age peer-harness progression refinement

Split-attention tick after 6 straight bounded-deliverable ticks.
Primary PR #254 Muratori corrected-table research doc; mid-tick
2-message Aaron directive drove PR #255 Codex-parallel
refinement (4-stage progression + test-mode bounding +
Windows-support use case + Otto-signals-readiness gate).

Key observations:

1. Split-attention pattern back in use; proportionate; not
   default.
2. Readiness-signal gate is new 4th Aaron-signoff category
   (inverse direction: Otto signals, Aaron acts).
3. Test-mode bounding generalises beyond peer-harness; BP-NN
   promotion candidate when exercised on second experiment
   class.
4. "Telephone line" imagery = retractability-by-design at
   transfer-learning layer; Claude→Claude→Codex→Windows
   survives-with-fidelity is the portability claim to validate.

Stacked on #253 (Otto-85 history).
AceHack added a commit that referenced this pull request Apr 24, 2026
…age peer-harness progression refinement

Split-attention tick after 6 straight bounded-deliverable ticks.
Primary PR #254 Muratori corrected-table research doc; mid-tick
2-message Aaron directive drove PR #255 Codex-parallel
refinement (4-stage progression + test-mode bounding +
Windows-support use case + Otto-signals-readiness gate).

Key observations:

1. Split-attention pattern back in use; proportionate; not
   default.
2. Readiness-signal gate is new 4th Aaron-signoff category
   (inverse direction: Otto signals, Aaron acts).
3. Test-mode bounding generalises beyond peer-harness; BP-NN
   promotion candidate when exercised on second experiment
   class.
4. "Telephone line" imagery = retractability-by-design at
   transfer-learning layer; Claude→Claude→Codex→Windows
   survives-with-fidelity is the portability claim to validate.

Stacked on #253 (Otto-85 history).
AceHack added a commit that referenced this pull request Apr 24, 2026
…age peer-harness progression refinement

Split-attention tick after 6 straight bounded-deliverable ticks.
Primary PR #254 Muratori corrected-table research doc; mid-tick
2-message Aaron directive drove PR #255 Codex-parallel
refinement (4-stage progression + test-mode bounding +
Windows-support use case + Otto-signals-readiness gate).

Key observations:

1. Split-attention pattern back in use; proportionate; not
   default.
2. Readiness-signal gate is new 4th Aaron-signoff category
   (inverse direction: Otto signals, Aaron acts).
3. Test-mode bounding generalises beyond peer-harness; BP-NN
   promotion candidate when exercised on second experiment
   class.
4. "Telephone line" imagery = retractability-by-design at
   transfer-learning layer; Claude→Claude→Codex→Windows
   survives-with-fidelity is the portability claim to validate.

Stacked on #253 (Otto-85 history).
AceHack added a commit that referenced this pull request Apr 24, 2026
…age peer-harness progression refinement

Split-attention tick after 6 straight bounded-deliverable ticks.
Primary PR #254 Muratori corrected-table research doc; mid-tick
2-message Aaron directive drove PR #255 Codex-parallel
refinement (4-stage progression + test-mode bounding +
Windows-support use case + Otto-signals-readiness gate).

Key observations:

1. Split-attention pattern back in use; proportionate; not
   default.
2. Readiness-signal gate is new 4th Aaron-signoff category
   (inverse direction: Otto signals, Aaron acts).
3. Test-mode bounding generalises beyond peer-harness; BP-NN
   promotion candidate when exercised on second experiment
   class.
4. "Telephone line" imagery = retractability-by-design at
   transfer-learning layer; Claude→Claude→Codex→Windows
   survives-with-fidelity is the portability claim to validate.

Stacked on #253 (Otto-85 history).
AceHack added a commit that referenced this pull request Apr 24, 2026
…age peer-harness progression refinement

Split-attention tick after 6 straight bounded-deliverable ticks.
Primary PR #254 Muratori corrected-table research doc; mid-tick
2-message Aaron directive drove PR #255 Codex-parallel
refinement (4-stage progression + test-mode bounding +
Windows-support use case + Otto-signals-readiness gate).

Key observations:

1. Split-attention pattern back in use; proportionate; not
   default.
2. Readiness-signal gate is new 4th Aaron-signoff category
   (inverse direction: Otto signals, Aaron acts).
3. Test-mode bounding generalises beyond peer-harness; BP-NN
   promotion candidate when exercised on second experiment
   class.
4. "Telephone line" imagery = retractability-by-design at
   transfer-learning layer; Claude→Claude→Codex→Windows
   survives-with-fidelity is the portability claim to validate.

Stacked on #253 (Otto-85 history).
AceHack added a commit that referenced this pull request Apr 24, 2026
…age peer-harness progression refinement

Split-attention tick after 6 straight bounded-deliverable ticks.
Primary PR #254 Muratori corrected-table research doc; mid-tick
2-message Aaron directive drove PR #255 Codex-parallel
refinement (4-stage progression + test-mode bounding +
Windows-support use case + Otto-signals-readiness gate).

Key observations:

1. Split-attention pattern back in use; proportionate; not
   default.
2. Readiness-signal gate is new 4th Aaron-signoff category
   (inverse direction: Otto signals, Aaron acts).
3. Test-mode bounding generalises beyond peer-harness; BP-NN
   promotion candidate when exercised on second experiment
   class.
4. "Telephone line" imagery = retractability-by-design at
   transfer-learning layer; Claude→Claude→Codex→Windows
   survives-with-fidelity is the portability claim to validate.

Stacked on #253 (Otto-85 history).
AceHack added a commit that referenced this pull request Apr 24, 2026
…age peer-harness progression refinement

Split-attention tick after 6 straight bounded-deliverable ticks.
Primary PR #254 Muratori corrected-table research doc; mid-tick
2-message Aaron directive drove PR #255 Codex-parallel
refinement (4-stage progression + test-mode bounding +
Windows-support use case + Otto-signals-readiness gate).

Key observations:

1. Split-attention pattern back in use; proportionate; not
   default.
2. Readiness-signal gate is new 4th Aaron-signoff category
   (inverse direction: Otto signals, Aaron acts).
3. Test-mode bounding generalises beyond peer-harness; BP-NN
   promotion candidate when exercised on second experiment
   class.
4. "Telephone line" imagery = retractability-by-design at
   transfer-learning layer; Claude→Claude→Codex→Windows
   survives-with-fidelity is the portability claim to validate.

Stacked on #253 (Otto-85 history).
AceHack added a commit that referenced this pull request Apr 24, 2026
…age peer-harness progression refinement

Split-attention tick after 6 straight bounded-deliverable ticks.
Primary PR #254 Muratori corrected-table research doc; mid-tick
2-message Aaron directive drove PR #255 Codex-parallel
refinement (4-stage progression + test-mode bounding +
Windows-support use case + Otto-signals-readiness gate).

Key observations:

1. Split-attention pattern back in use; proportionate; not
   default.
2. Readiness-signal gate is new 4th Aaron-signoff category
   (inverse direction: Otto signals, Aaron acts).
3. Test-mode bounding generalises beyond peer-harness; BP-NN
   promotion candidate when exercised on second experiment
   class.
4. "Telephone line" imagery = retractability-by-design at
   transfer-learning layer; Claude→Claude→Codex→Windows
   survives-with-fidelity is the portability claim to validate.

Stacked on #253 (Otto-85 history).
AceHack added a commit that referenced this pull request Apr 24, 2026
…age peer-harness progression refinement

Split-attention tick after 6 straight bounded-deliverable ticks.
Primary PR #254 Muratori corrected-table research doc; mid-tick
2-message Aaron directive drove PR #255 Codex-parallel
refinement (4-stage progression + test-mode bounding +
Windows-support use case + Otto-signals-readiness gate).

Key observations:

1. Split-attention pattern back in use; proportionate; not
   default.
2. Readiness-signal gate is new 4th Aaron-signoff category
   (inverse direction: Otto signals, Aaron acts).
3. Test-mode bounding generalises beyond peer-harness; BP-NN
   promotion candidate when exercised on second experiment
   class.
4. "Telephone line" imagery = retractability-by-design at
   transfer-learning layer; Claude→Claude→Codex→Windows
   survives-with-fidelity is the portability claim to validate.

Stacked on #253 (Otto-85 history).
AceHack added a commit that referenced this pull request Apr 24, 2026
…age peer-harness progression refinement

Split-attention tick after 6 straight bounded-deliverable ticks.
Primary PR #254 Muratori corrected-table research doc; mid-tick
2-message Aaron directive drove PR #255 Codex-parallel
refinement (4-stage progression + test-mode bounding +
Windows-support use case + Otto-signals-readiness gate).

Key observations:

1. Split-attention pattern back in use; proportionate; not
   default.
2. Readiness-signal gate is new 4th Aaron-signoff category
   (inverse direction: Otto signals, Aaron acts).
3. Test-mode bounding generalises beyond peer-harness; BP-NN
   promotion candidate when exercised on second experiment
   class.
4. "Telephone line" imagery = retractability-by-design at
   transfer-learning layer; Claude→Claude→Codex→Windows
   survives-with-fidelity is the portability claim to validate.

Stacked on #253 (Otto-85 history).
AceHack added a commit that referenced this pull request Apr 24, 2026
…age peer-harness progression refinement

Split-attention tick after 6 straight bounded-deliverable ticks.
Primary PR #254 Muratori corrected-table research doc; mid-tick
2-message Aaron directive drove PR #255 Codex-parallel
refinement (4-stage progression + test-mode bounding +
Windows-support use case + Otto-signals-readiness gate).

Key observations:

1. Split-attention pattern back in use; proportionate; not
   default.
2. Readiness-signal gate is new 4th Aaron-signoff category
   (inverse direction: Otto signals, Aaron acts).
3. Test-mode bounding generalises beyond peer-harness; BP-NN
   promotion candidate when exercised on second experiment
   class.
4. "Telephone line" imagery = retractability-by-design at
   transfer-learning layer; Claude→Claude→Codex→Windows
   survives-with-fidelity is the portability claim to validate.

Stacked on #253 (Otto-85 history).
AceHack added a commit that referenced this pull request Apr 24, 2026
…age peer-harness progression refinement

Split-attention tick after 6 straight bounded-deliverable ticks.
Primary PR #254 Muratori corrected-table research doc; mid-tick
2-message Aaron directive drove PR #255 Codex-parallel
refinement (4-stage progression + test-mode bounding +
Windows-support use case + Otto-signals-readiness gate).

Key observations:

1. Split-attention pattern back in use; proportionate; not
   default.
2. Readiness-signal gate is new 4th Aaron-signoff category
   (inverse direction: Otto signals, Aaron acts).
3. Test-mode bounding generalises beyond peer-harness; BP-NN
   promotion candidate when exercised on second experiment
   class.
4. "Telephone line" imagery = retractability-by-design at
   transfer-learning layer; Claude→Claude→Codex→Windows
   survives-with-fidelity is the portability claim to validate.

Stacked on #253 (Otto-85 history).
AceHack added a commit that referenced this pull request Apr 24, 2026
…age peer-harness progression refinement

Split-attention tick after 6 straight bounded-deliverable ticks.
Primary PR #254 Muratori corrected-table research doc; mid-tick
2-message Aaron directive drove PR #255 Codex-parallel
refinement (4-stage progression + test-mode bounding +
Windows-support use case + Otto-signals-readiness gate).

Key observations:

1. Split-attention pattern back in use; proportionate; not
   default.
2. Readiness-signal gate is new 4th Aaron-signoff category
   (inverse direction: Otto signals, Aaron acts).
3. Test-mode bounding generalises beyond peer-harness; BP-NN
   promotion candidate when exercised on second experiment
   class.
4. "Telephone line" imagery = retractability-by-design at
   transfer-learning layer; Claude→Claude→Codex→Windows
   survives-with-fidelity is the portability claim to validate.

Stacked on #253 (Otto-85 history).
AceHack and others added 2 commits April 24, 2026 10:29
…t-mode bounding, Windows-support use case

Aaron Otto-86 two-message refinement of the Otto-78
Codex-parallel row:

Message 1 (progression + use case): extends Otto-79's 3-stage
progression (a single-today / b bounded-experiment /
c peer-harness) into a 4-stage arc with a new intermediate:

- (a) single coordinator today
- (b) NEW multi-Claude-Code peer-harness experiment before Codex
- (c) multi-harness with Codex (gated on (b) + Codex ready)
- (d) full peer-harness with Windows-support real workload
      (possibly Codex, end-to-end "telephone line" test)

Otto is the readiness-signaller; Aaron waits. Stage (c) launch
is a specifically-asked-for design review per Otto-82
authority-calibration — Otto's readiness signal is the gate.

Message 2 (test-mode bounding, hard requirement): "make sure
when in peer-harness mode you give the other one time limits
or process kill them either way, just while we are testing we
don't want the other peer harness to run forever during tests
only when in real use." Time-limits / process-kill on the
non-primary during testing; removed once the peer-harness
moves from test-mode to real use.

Concrete bounding options named (Otto designs the specific
mechanism in a test-plan doc):
- wall-clock timeout per experiment
- explicit process-kill target at end
- test-mode flag capping async-dispatch budget
- wall-time instrumentation

Scope limits:
- No second Claude session without experiment design + dry-run
- No skipping multi-Claude stage (b) to jump to Codex (c)
- No unbounded test runs — time-limits load-bearing
- No premature readiness-signals
- Otto-78 primary-switch clause unchanged; this refines stages
  around it, not the primary-determination model

Lands within-standing-authority per Otto-82 calibration
(BACKLOG-row refinement on a row that was already refinement-
friendly; no signoff gate triggered).

Authorising memory:
memory/feedback_peer_harness_progression_starts_multi_claude_
first_windows_support_concrete_use_case_otto_signals_
readiness_2026_04_23.md

Otto-86 tick split-attention deliverable (alongside Muratori
corrected-table research PR #254).
…CTORY-HYGIENE format)

Three P1/nit review threads on the Otto-86 4-stage peer-harness
progression refinement, fixed in-row (same-row body edits, no
cross-row reshuffling):

1. PRRT_kwDOSF9kNM59R5L8 (line 2638): dropped the incorrect
   "Otto-79's 3-stage arc" parenthetical. Otto-79 in BACKLOG
   is the agent-email-password-storage row, not a 3-stage arc;
   the 4-stage progression stands on its own description.

2. PRRT_kwDOSF9kNM59R5Ly (line 2645): rephrased the readiness-
   signal paragraph to name the Otto-82 design-review discipline
   directly instead of pointing at an in-repo memory path
   (memory/ paths are generally out-of-repo per drain convention
   applied to earlier review threads of this shape).

3. PRRT_kwDOSF9kNM59R5MD (line 2656): normalised
   "FACTORY-HYGIENE #51 + #55" → "FACTORY-HYGIENE row #51 and
   row #55" to match the common "row #NN" format used elsewhere
   in the file.

No new semantics; pure review-thread drain edits inside the
same BACKLOG row body (append-only-within-section discipline
preserved — no row added, no row reordered).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
@AceHack AceHack force-pushed the backlog/otto-86-refine-codex-parallel-with-multi-claude-intermediate branch from 85f4ed9 to 4449984 Compare April 24, 2026 14:32
@chatgpt-codex-connector
Copy link
Copy Markdown

You have reached your Codex usage limits for code reviews. You can see your limits in the Codex usage dashboard.

@AceHack AceHack merged commit 89bd36f into main Apr 24, 2026
10 checks passed
@AceHack AceHack deleted the backlog/otto-86-refine-codex-parallel-with-multi-claude-intermediate branch April 24, 2026 14:34
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants