
Round 28 — FsCheck LawRunner (Option B) + round-29 CI anchor #4

Merged
AceHack merged 4 commits into main from round-28
Apr 18, 2026

Conversation

@AceHack AceHack commented Apr 18, 2026

Summary

  • Round 28 anchor delivered. Zeta.Core.LawRunner — a deterministic-simulation law runner — lives as a test-time library, not a Circuit.Build() gate. checkLinear asserts op(A + B) = op(A) + op(B) tick-by-tick against an ILinearOperator. checkRetractionCompleteness uses state-restoration via continuation (Option B, trace-based): feed forward ++ retract ++ continuation, compare continuation outputs to a fresh-op run of the continuation alone; any divergence means state survived the cancel. Per-sample System.Random(seed + i) so (seed, sampleIndex) reproduces bit-exact. 7/7 tests in 61ms.
  • Design doc. docs/research/stateful-harness-design.md captures the build-vs-test decision, the Option A vs Option B analysis, and the sequenced round-30+ follow-up plan (Option A promotion to Init/Step/Retract triple matching the DBSP paper's (σ, λ, ρ) shape — unlocks generic WDC checkpointing and planner fusion).
  • Reviewer floor caught a P0 in the law itself. Kira flagged that the original "cumulative output = 0" retraction law passes trivially for empty-emitting ops, and that a floored counter can leak state while satisfying it. Rewritten to state-restoration; test fixture replaced (FlooredCounterOp, genuinely stateful and lossy, supersedes the mistagged PositiveOnlyOp). Also: invalidArg → Result, O(n²) indexing fixed with List.toArray, shared runSamples helper to pre-empt paste-and-modify before checkBilinear lands.
  • Lean4 cleanup. lake new leftovers removed (README.md, Basic.lean, .github/workflows/, redundant .gitignore). Lean4.lean rewired to import DbspChainRule.
  • Round-29 CI anchor committed. docs/BACKLOG.md P0 entry captures Aaron's discipline rules: ../scratch + ../SQLSharp are read-only references, never copy files; Aaron reviews every CI decision before landing; cost discipline on CI minutes; macOS + Linux first.

Test plan

  • dotnet build Zeta.sln -c Release — 0 Warning(s) / 0 Error(s)
  • dotnet test — LawRunner tests 7/7 green (61ms); full suite passes
  • Reviewer pass per GOVERNANCE.md §20 — Kira + Rune dispatched; P0s landed in-round, P1s tracked in docs/DEBT.md
  • PLUGIN-AUTHOR.md soft-claim retracted; LawRunner documented as live, with a cross-ref from the IOperator<'TOut> doc

🤖 Generated with Claude Code

AceHack and others added 4 commits April 18, 2026 14:51
Design doc for the FsCheck law runner captures the DST framing
(single seeded RNG, no wall-clock, total tick order, seed + schedule
printed on failure) and locks the build-vs-test decision: LawRunner
lives as a test-time library, not a Circuit.Build() gate, so Core
stays free of FsCheck and plugin authors opt in from their test
project.

Key design call — Option B (trace-based retraction check against the
existing marker IStatefulStrictOperator) this round, Option A
(Init/Step/Retract triple matching the DBSP paper's σ,λ,ρ shape)
as a planned additive promotion in round-29+. Rationale documented
in full: Option A needs real async + retraction-contract design
work; Option B lands today, teaches us what retraction actually
looks like, Option A absorbs the lessons. Option B remains a
fallback even after A ships.

Long-term payoff of Option A (documented so we don't lose it):
- Matches DbspChainRule.lean's (σ,λ,ρ) triple → compositional
  reasoning across math/proofs/impl
- Generic WDC checkpointing without per-op serialisation
- Planner fusion of adjacent stateful ops

Lean4 scaffolding cleanup — `lake new` leftovers removed:
- tools/lean4/README.md (pure GitHub-Pages setup boilerplate)
- tools/lean4/Lean4/Basic.lean (hello-world sample)
- tools/lean4/.github/workflows/ (upstream Lean CI templates)
- tools/lean4/.gitignore (redundant with root .lake/ ignore)

tools/lean4/Lean4.lean now imports Lean4.DbspChainRule so
`lake build` walks the real proof file; lakefile.toml,
lake-manifest.json, and lean-toolchain kept (load-bearing for
Mathlib resolution).

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
First concrete deliverable for the round-28 anchor. `LawRunner`
lives in `src/Core/LawRunner.fs` as a test-time library (not a
Circuit.Build() gate — see design doc for build-vs-test rationale).
Generators are `System.Random -> 'T` so Core stays free of FsCheck;
plugin authors wire FsCheck from their test project.

Both checks are deterministic-simulation: a failing run prints the
seed and sample index so plugin authors reproduce bit-exact.
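
To make the reproducibility claim concrete, here is a minimal Python sketch of how a printed (seed, sample index) pair can replay one sample in isolation. It assumes per-sample seeding of the form `seed + i`, which is what the reviewer-floor commit on this branch adopts; `run_samples`, `replay`, and `flaky_check` are hypothetical names for illustration, not the F# API.

```python
import random

def run_samples(seed, samples, check_one):
    """Run check_one on `samples` independent samples.

    Each sample gets its own RNG derived from (seed, i), so a failure at
    sample i can be replayed without re-running samples 0..i-1.
    Returns None on success, or (seed, i, detail) for the first failure.
    """
    for i in range(samples):
        rng = random.Random(seed + i)  # per-sample RNG, not one shared stream
        detail = check_one(rng)
        if detail is not None:
            return (seed, i, detail)
    return None

def replay(seed, sample_index, check_one):
    """Re-run exactly one sample from a printed (seed, sampleIndex) pair."""
    return check_one(random.Random(seed + sample_index))

def flaky_check(rng):
    # Hypothetical law check: draws a trace and "fails" on an arbitrary pattern.
    trace = [rng.randint(-5, 5) for _ in range(8)]
    return trace if sum(trace) % 7 == 0 else None
```

With a single RNG shared across the whole loop, the draws seen by sample i would depend on how many draws earlier samples consumed, so replaying one sample would require replaying all of its predecessors.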

- `checkLinear` — generates trace pairs (A, B) and asserts
  `op(A + B) = op(A) + op(B)` tick-by-tick. `addIn` / `addOut` /
  `equalOut` are parameters so it works for ZSet, numerics, any
  additive carrier.
- `checkRetractionCompleteness` — Option B trace-based: forward-
  runs a random Z-set trace, retracts each tick in the same order,
  asserts the cumulative output Z-set is empty. Catches operators
  that leak state through retraction without any interface
  enrichment (Option A promotion planned for round-29+).
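
The linearity check described above can be sketched roughly as follows. This is an illustrative Python model under stated assumptions, not the F# implementation: integers stand in for the additive carrier, `add_in` / `add_out` / `equal_out` mirror the `addIn` / `addOut` / `equalOut` parameters, and the operator factories are hypothetical fixtures.

```python
import random

def run_op(make_op, trace):
    """Feed a trace to a fresh operator instance; return per-tick outputs."""
    op = make_op()
    return [op(x) for x in trace]

def check_linear(make_op, gen_in, add_in, add_out, equal_out,
                 seed, samples, schedule_length):
    """Assert op(A + B) = op(A) + op(B) at every tick, over random trace pairs."""
    for i in range(samples):
        rng = random.Random(seed + i)
        a = [gen_in(rng) for _ in range(schedule_length)]
        b = [gen_in(rng) for _ in range(schedule_length)]
        ab = [add_in(x, y) for x, y in zip(a, b)]
        out_a, out_b, out_ab = (run_op(make_op, t) for t in (a, b, ab))
        for tick in range(schedule_length):
            if not equal_out(out_ab[tick], add_out(out_a[tick], out_b[tick])):
                return ("violation", seed, i, tick)
    return ("ok",)

def make_doubler():
    return lambda x: 2 * x          # stateless and linear

def make_squarer():
    return lambda x: x * x          # non-linear, should be caught

def make_integrator():
    total = 0
    def step(x):                    # stateful but still linear
        nonlocal total
        total += x
        return total
    return step

def gen(rng):
    return rng.randint(1, 9)        # strictly positive inputs

def add(x, y):
    return x + y

def eq(x, y):
    return x == y
```

Every trace runs against a fresh operator instance, so state from `op(A)` cannot bleed into `op(B)` or `op(A + B)`. With strictly positive inputs the squarer fails at the first tick of the first sample, because (a + b)² exceeds a² + b² whenever ab > 0.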

Tests in `tests/Tests.FSharp/Plugin/LawRunner.Tests.fs` — 5 tests,
63ms:
- genuine linear op passes
- falsely-tagged squarer (non-linear) is caught
- same seed reproduces bit-exact
- clean-retracting ZSet echo passes
- positive-only liar is caught

`docs/PLUGIN-AUTHOR.md` soft-claim retracted — the doc now
documents `LawRunner.checkLinear` / `checkRetractionCompleteness`
as live, with `checkBilinear` / `checkSinkTerminal` flagged as
round-29+.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
Kira (harsh-critic) + Rune (maintainability-reviewer) floor per
GOVERNANCE.md §20. P0s from Kira's pass were load-bearing; fixing
them in this commit before the round closes.

P0 fixes:
- Per-sample `System.Random(seed + i)` so `(seed, sampleIndex)`
  genuinely reproduces bit-exact, independent of whether earlier
  samples failed fast. Previous whole-loop RNG made the
  reproducibility claim false.
- Retraction-completeness law rewritten to state-restoration via
  continuation: forward + retract + continuation, compare
  continuation outputs against a fresh-op run of the continuation
  alone. The old "cumulative output = 0" formulation passed
  trivially for empty-emitting ops and a floored-counter could
  leak state while keeping cumulative zero; the new law catches
  both. Test fixture replaced accordingly — `FlooredCounterOp`
  (genuinely stateful and retraction-lossy) supersedes the
  `PositiveOnlyOp` filter (which is non-linear, not a retraction
  fixture; properly belongs under `checkLinear`).
- `invalidArg` → `Error` on bad args — every public entry now
  returns `Result`, matching CLAUDE.md's result-over-exception
  rule.
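
A rough Python sketch of the state-restoration law, for readers who want to see its shape. The fixtures here are hypothetical stand-ins (integer deltas instead of Z-sets, and a `FlooredCounter` invented for illustration rather than the repository's `FlooredCounterOp`); only the forward ++ retract ++ continuation structure comes from the description above.

```python
import random

class Counter:
    """Clean stateful op: keeps a running sum, emits the change in the sum."""
    def __init__(self):
        self.total = 0

    def step(self, delta):
        before = self.total
        self.total += delta
        return self.total - before

class FlooredCounter(Counter):
    """Lossy op: the running sum is clamped at zero, so input that was
    clamped away cannot be restored by a later retraction."""
    def step(self, delta):
        before = self.total
        self.total = max(0, self.total + delta)
        return self.total - before

def run(op, trace):
    return [op.step(x) for x in trace]

def restores_state(make_op, forward, continuation):
    """True if an op that saw forward ++ retract behaves like a fresh op."""
    retract = [-x for x in forward]           # cancel each tick, same order
    used = run(make_op(), forward + retract + continuation)
    fresh = run(make_op(), continuation)
    return used[2 * len(forward):] == fresh   # compare continuation outputs only

def check_retraction(make_op, seed, samples, length):
    for i in range(samples):
        rng = random.Random(seed + i)         # per-sample RNG
        forward = [rng.randint(-5, 5) for _ in range(length)]
        continuation = [rng.randint(-5, 5) for _ in range(length)]
        if not restores_state(make_op, forward, continuation):
            return ("violation", seed, i)
    return ("ok",)
```

For example, with `forward = [-3, 5]` and `continuation = [-1]`, the floored counter sits at 3 after the retraction instead of 0, so it emits -1 on the continuation tick where a fresh instance emits 0. The clean counter returns to 0 and matches the fresh run.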

P1 fixes worth doing now:
- `List.toArray` before the tick loop in `checkLinear` — kills
  the O(scheduleLength²) `List.item` indexing Kira flagged.
- Extracted `runSamples` helper so the two law implementations
  share one loop shape — removes paste-and-modify risk before
  round-29 adds a third law.
- Rune polish: cross-ref added from `IOperator<'TOut>` doc to
  `LawRunner` for discoverability; `≠` → `!=` in error strings
  so logs survive every terminal; WHY comments on non-obvious
  lines (`if w = 0 then w <- 1`); module-level note that
  `'TState = unit` is fine because trace-based Option B never
  inspects state.

Deferred to DEBT (docs/DEBT.md): promote `check*` to a config
record before `checkBilinear` lands; structured `LawViolation.Reason`
DU; test where op omits the marker tag.

Tests: 7/7 green in 61ms (added "bit-exact reproduces" for
retraction + "Error on bad samples arg" coverage).

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
Round 28 anchor (FsCheck law runner) shipped across 3 commits on
this branch; this commit is the round-close bookkeeping + the
round-29 anchor commitment.

- docs/CURRENT-ROUND.md resets for round 29 with CI pipeline as
  the anchor; round-28 deliverables summarised; carryover split
  between DEBT-tracked law-runner follow-ups, product law-
  coverage (bilinear / sink-terminal / Option-A promotion), and
  deferred items from round 27.
- docs/ROUND-HISTORY.md — round 28 narrative prepended (anchor,
  reviewer-floor catch, Option-A-vs-B decision, lean cleanup,
  soft-claim retracted).
- docs/WINS.md — three round-28 wins: reviewer-floor paying on
  first applicable round, Option-B-with-planned-followups over
  verbal agreement, deterministic-simulation as contract not
  decoration.
- docs/BACKLOG.md — new P0 entry for CI / build-machine setup
  with the full discipline rules Aaron committed:
  * `../scratch` + `../SQLSharp` are read-only references —
    never copy files, hand-craft every artefact
  * Aaron reviews every CI design decision before landing
  * Cost discipline on CI minutes; narrow default matrix
  * macOS + Linux first; Windows when justified
  * Product + CI work parallelisable on one machine
  Sub-task sequence: build-machine-setup audit →
  ci-workflow-design audit → gate inventory → first workflow
  (build-and-test), each its own Aaron review gate.

AlloyRunner.java confirmed load-bearing: Alloy is a JVM tool
with no convenient CLI, driven from F# tests via `javac` + `java`.
Keep.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
@AceHack AceHack merged commit 82cf680 into main Apr 18, 2026
@AceHack AceHack deleted the round-28 branch April 18, 2026 19:30
AceHack added a commit that referenced this pull request Apr 21, 2026
Both rows have been citing closed P0s as open for 25 rounds. The
round-17 fixes (harsh-critic findings #3, #4, #7, #8 per
docs/BACKLOG.md:286-299) closed the blocking correctness bugs:

- Residuated.fs: top-2 cache replaced with SortedSet + weight
  dict; every op O(log k), no linear-scan fallback. The round-12
  "O(1)" claim was false under adversarial retract-top workloads;
  the corrected "O(log k) genuinely" claim has been stable 25
  rounds. See Residuated.fs:39-48 for the fix-in-code narrative.

- FastCdc.fs: persistent scanCursor + hash (each byte Gear-hashed
  exactly once across lifetime) closed the O(n^2) buffer scan;
  Buffer.BlockCopy replaced per-byte ResizeArray.Add. See
  FastCdc.fs:68-76 for the fix-in-code narrative. Paper
  throughput target 1-3 GB/s/core holds.
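
The persistent-cursor idea behind the FastCdc.fs fix can be illustrated with a small Python sketch. Everything here is an assumption made for illustration (the Gear table, the mask, the size cap, and the class name are hypothetical, and the buffer-copy optimisation is not modelled); the point shown is only that carrying the cursor and hash across calls lets each byte be hashed once.

```python
import random

_table_rng = random.Random(0)
GEAR = [_table_rng.getrandbits(64) for _ in range(256)]  # illustrative table
MASK64 = (1 << 64) - 1

class StreamingChunker:
    def __init__(self, mask=0x3F, max_size=256):
        self.mask = mask
        self.max_size = max_size
        self.buf = bytearray()
        self.scan_cursor = 0    # index of the first unscanned byte in buf
        self.hash = 0           # rolling hash carried across feed() calls
        self.bytes_hashed = 0   # instrumentation only

    def feed(self, data):
        """Append data and return any chunks completed by it."""
        self.buf += data
        chunks = []
        while self.scan_cursor < len(self.buf):
            b = self.buf[self.scan_cursor]
            self.hash = ((self.hash << 1) + GEAR[b]) & MASK64
            self.scan_cursor += 1
            self.bytes_hashed += 1
            if (self.hash & self.mask) == 0 or self.scan_cursor >= self.max_size:
                chunks.append(bytes(self.buf[:self.scan_cursor]))
                del self.buf[:self.scan_cursor]
                self.scan_cursor = 0
                self.hash = 0
        return chunks

    def finish(self):
        """Flush whatever remains as a final chunk."""
        tail = bytes(self.buf)
        self.buf.clear()
        self.scan_cursor = 0
        self.hash = 0
        return [tail] if tail else []
```

Because the scan resumes at `scan_cursor` rather than at the start of the buffer, feeding the same bytes in small pieces yields the same chunk boundaries as feeding them in one call, and `bytes_hashed` equals the total input length.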

Rows now match the Bloom Round-40 graduation pattern (measured-
evidence cite, implementation line reference, test coverage
pointer). 25-round stability window beats the aspirational
waiting-list — graduation on evidence, not aspiration.

BP-10 clean; 0 invisible-unicode on edited file.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
AceHack added a commit that referenced this pull request Apr 21, 2026
…ion annotation

Aarav (skill-tune-up) round-42 cadence discharge. Round-41
top-5 carries over; self-rank escalates to P1 #4 after
commit baa423e retuned skill-tune-up/SKILL.md 303 -> 436
lines (1.45x BP-03 cap). claims-tester / complexity-reviewer
hand-off carry-over from round 18 drops off top-5 (resolved
via commit e8ed0db + router-coherence-v2 ADR).

Files:
- memory/persona/aarav/NOTEBOOK.md: round-42 observation
  + top-5 revision (skill-tune-up self escalated) + archived
  round-41 top-5 + calibration preamble flagging the ranking
  as static-signals-only with a harness run scheduled for
  round 43 (per Aaron's round-42 correction that "worst
  performance" claims must drive the Anthropic skill-creator
  eval harness rather than guessing by inspection).
- memory/persona/best-practices-scratch.md: F7-F9 live-search
  entries from Aarav's round-42 pass (Anthropic skill-
  authoring Apr 2026, OWASP Top 10 Agentic 2026, skill
  wrapper thick-vs-thin 2026). Zero contradictions with
  stable BP-NN; zero promotion candidates this round.
- docs/BACKLOG.md: P2 entry for resolving the skill-tune-up
  BP-03 self-breach. Binary remedy: (a) Kenji-ADR declaring
  non-skill-wrapper exception to BP-03 or (b) extract
  eval-loop protocol body to docs/references/ so the skill
  file shrinks under 300 lines. Composes with the
  skill-eval-tools calibration memory saved this round.
AceHack added a commit that referenced this pull request Apr 21, 2026
Aarav's round-42 self-flag (BACKLOG P2, filed commit 45369ae)
resolved via the mechanical-edit path of the gate table.
.claude/skills/skill-tune-up/SKILL.md shrinks 436 -> 282
lines (18 under the 300-line BP-03 cap) by extracting two
reference blocks verbatim:

- §"The eval-loop hand-off protocol" (~130 lines) — the
  gate table, per-round protocol, stopping criteria, ledger
  row, and deliberately-not-reimplemented list.
- Notebook format + ranking-round output format templates
  (~55 lines).

Extracted content lives at docs/references/skill-tune-up-
eval-loop.md alongside the existing Anthropic skills guide
references. SKILL.md retains a short pointer block.

No change to triggering behaviour, output shape, or
instruction-following — the ranker reading the pointer-plus-
reference produces the same ranking output as the ranker
reading the pre-extract inline version. This is why the
manual-edit path (gate table "mechanical rename | content
extract preserving protocol verbatim") applies instead of
the full eval-loop path.

Files:
- .claude/skills/skill-tune-up/SKILL.md: 436 -> 282 lines.
- docs/references/skill-tune-up-eval-loop.md: NEW. Hosts
  the extracted protocol + templates + rationale.
- docs/skill-edit-justification-log.md: NEW. First row
  documents this extraction per
  memory/feedback_skill_edits_justification_log_and_tune_up_cadence.md
  Rule 1. Template for future mechanical-edit rows
  included.
- memory/persona/aarav/NOTEBOOK.md: self-flag #4 marked
  RESOLVED; drops off top-5 next invocation.

Does NOT rebut the round-42 harness-calibration memory
(feedback_skill_tune_up_uses_eval_harness_not_static_line_
count.md). That rule applies to "worst-performing" ranking
claims; this edit is a fix-my-own-size hygiene pass on the
mechanical-edit path, which is explicitly separate in the
gate table.
AceHack added a commit that referenced this pull request Apr 21, 2026
)

* Round 41: OpenSpec coverage audit + backfill-program ADR

Answers Aaron 2026-04-20 delete-all-code-recovery question:
4 capabilities / 783 lines of spec.md vs 66 top-level F#
modules / 10,839 lines under src/Core/ — ~6% coverage today.

docs/research/openspec-coverage-audit-2026-04-21.md
- Inventory of 66 modules with line counts + capability
  mapping for the 4 existing capabilities
- Uncovered modules sorted by delete-recovery blast radius:
  Band 1 MUST BACKFILL (8 modules / 1,629 lines — ZSet,
  Circuit, NestedCircuit, Spine family, BloomFilter as
  Adopt-row compatibility-coupling exception), Band 2 HIGH
  (12 / 2,008), Band 3 MEDIUM (45 / 6,585), Band 4
  deliberately uncovered (AssemblyInfo only)
- First 6-round cadence: operator-algebra extension (41),
  lsm-spine-family (42), circuit-recursion (43),
  sketches-probabilistic (44), content-integrity (45),
  crdt-family (46)
- Success signal = Viktor spec-zealot adversarial audit:
  "could I rebuild this module from this spec alone?"

docs/DECISIONS/2026-04-21-openspec-backfill-program.md
- Adopts one-capability-per-round baseline with paper-grade
  half-credit rule (no more than 1 paper-grade round per 3)
- Band 1 priority until complete; Adopt-row escalation for
  BloomFilter (TECH-RADAR Adopt without spec contract is a
  backwards-compatibility hazard)
- Round-close ledger gains an `OpenSpec cadence` line
- Alternatives considered: big-bang backfill (rejected —
  ontology-landing cadence + reviewer bandwidth), per-module
  capabilities (rejected — loses cross-module invariants),
  organic prioritisation (rejected — 40 rounds of drift
  evidence)

docs/BACKLOG.md
- Collapses the 29-line P0 scope into a 15-line pointer at
  the inventory + ADR now that parts (a)-(e) of the program
  setup have landed. Remaining work = per-round capability
  backfill per ADR schedule.

Build: dotnet build -c Release clean; BP-10 ASCII-clean on
all 3 modified files; markdownlint-cli2 clean.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 41: operator-algebra spec extension (cadence ship)

First ship under the OpenSpec backfill program adopted
2026-04-21. Extends openspec/specs/operator-algebra/spec.md
(184 -> 324 lines) with five new requirements covering
structural and lifecycle gaps that the existing mathematical-
law coverage left implicit:

1. Operator lifecycle — construction / step / after-step /
   reset phases with side-effect-freedom on construction and
   epoch-replay semantics on reset
2. Strict operators break feedback cycles — formalises that
   z^-1-on-feedback is a scheduling prerequisite and that
   cycle-without-strict is a construction error, not a
   silent heuristic
3. Clock scopes and tick monotonicity — nested-scope-to-
   fixpoint rule + sibling-scope independence
4. Incremental-wrapper preserves the chain rule —
   Incrementalize(Q) observably equivalent to D . Q . I,
   with linear/bilinear substitution permitted as an
   optimisation
5. Representation invariants of the reference Z-set —
   O(n+m) group ops + zero-alloc iteration as the reference
   contract; hash-table recoveries permitted at documented
   perf trade-off
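
Requirement 4 can be illustrated with a tiny Python model. It assumes integer-valued streams in place of Z-sets and a query Q applied tick-by-tick; `integrate`, `differentiate`, and `incrementalize` are hypothetical names for I, D, and the wrapper, not the Zeta.Core API.

```python
from itertools import accumulate

def integrate(deltas):
    """I: running sum of a stream of deltas."""
    return list(accumulate(deltas))

def differentiate(values):
    """D: consecutive differences, with an implicit 0 before the first tick."""
    return [v - prev for prev, v in zip([0] + values[:-1], values)]

def incrementalize(q):
    """Reference form D . Q . I, with q applied to each integrated value."""
    return lambda deltas: differentiate([q(v) for v in integrate(deltas)])

def triple(x):
    return 3 * x    # linear query

def square(x):
    return x * x    # non-linear query
```

On the delta stream `[1, 2, 3]`, `incrementalize(triple)` gives `[3, 6, 9]`, the same as applying `triple` to each delta directly, which is why substituting the linear operator for the wrapper is a safe optimisation. For `square` the wrapper gives `[1, 8, 27]` while the per-delta shortcut gives `[1, 4, 9]`, so the substitution is not valid for non-linear queries.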

Disaster-recovery effect: a contributor with only this spec
(plus the durability-modes + retraction-safe-recursion specs)
can now rebuild Circuit.fs Op base + Incremental.fs wrapper +
ZSet.fs representation invariants from the spec text alone.

Owner: Architect (Kenji). Adversarial audit by Viktor
(spec-zealot) is the ADR-declared ship-gate and will run
post-land.

Build: not rebuilt (no F# source changed); markdownlint
clean; BP-10 ASCII clean.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 41: close Viktor P0 findings on operator-algebra spec

Viktor's adversarial audit of the Round 41 cadence ship (commit
e51ec1b) surfaced four P0 findings against the disaster-recovery
bar. This commit closes all four:

- **P0-1 (namespace drift).** `profiles/fsharp.md` asserted
  `Dbsp.Core` throughout, but `src/Core/**` uses `Zeta.Core`. A
  spec-only recovery would have shipped the wrong namespace to
  every downstream consumer. Replaced via one `replace_all` Edit.

- **P0-2 (phantom Reset method).** The lifecycle requirement
  claimed a `reset` phase that does not exist on `Op`. Replaced
  the "reset replays the epoch" scenario with a
  determinism-under-structural-equivalence property: two
  freshly-constructed circuits of the same topology, stepped
  with the same input sequence, MUST produce identical outputs
  at every tick. Reconstruction is the supported route to a
  replayed epoch.

- **P0-3 (after-step scope).** The lifecycle requirement said
  after-step runs "after every operator in the scope has
  completed its step." `Circuit.fs:205-208` iterates the
  `strictN` array only — after-step is selective to strict
  operators. Fixed wording and added a "after-step is selective
  to strict operators" scenario that pins the invariant.

- **P0-4 (lifecycle phase undercount).** The requirement named
  four phases (construction / step / after-step / reset) but
  the code has five (construction / step / after-step /
  clock-start / clock-end). Restructured to three per-tick
  phases plus two scope-boundary phases, and extended the
  "clock scopes and tick monotonicity" requirement with the
  scope-boundary lifecycle contract (clock-start before tick 0
  of a scope, clock-end after fixpoint or iteration cap).
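
The determinism-under-structural-equivalence property from P0-2 could be exercised along these lines. This is a hedged Python sketch with a made-up two-stage topology (`Integrate` then `Delay`); the class and function names are hypothetical and do not correspond to `Op` or `Circuit` in the codebase.

```python
import random

class Integrate:
    """Running sum of the inputs seen so far."""
    def __init__(self):
        self.total = 0

    def step(self, x):
        self.total += x
        return self.total

class Delay:
    """z^-1: emits the previous tick's input (0 on the first tick)."""
    def __init__(self):
        self.prev = 0

    def step(self, x):
        out, self.prev = self.prev, x
        return out

def build_circuit():
    """Hypothetical topology: integrate, then delay. All state is per-instance."""
    stages = [Integrate(), Delay()]
    def step(x):
        for stage in stages:
            x = stage.step(x)
        return x
    return step

def check_determinism(build, seed, ticks):
    """Two fresh circuits, same inputs: outputs must match at every tick."""
    rng = random.Random(seed)
    inputs = [rng.randint(-9, 9) for _ in range(ticks)]
    first, second = build(), build()
    for tick, x in enumerate(inputs):
        if first(x) != second(x):
            return ("violation", seed, tick)
    return ("ok",)
```

A circuit whose instances share hidden mutable state would fail this check at the first tick where that state is observable, which is the kind of defect reconstruction-based replay depends on ruling out.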

Build green (0 warnings / 0 errors). BP-10 lint clean. The
capability now reflects the code's observable shape rather than
an idealised cleaner cousin; a delete-recovery from this spec
produces Zeta.Core with strict-operator after-step selectivity
and nested-scope clock-boundary phases.

Viktor's 10 P1 findings (including async lifecycle, memory-ordering
fence, register-lock semantics, IncrementalDistinct surface, ZSet sort
invariant, Checked arithmetic, bilinear-size overflow, and
convergence-vs-cap) are deferred to Round 42 and filed as a
BACKLOG sweep in follow-up work.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 41: file Viktor P1 findings as Round 42 BACKLOG absorb

Companion to 92d7db2 (closing Viktor's four P0 findings). The
ten P1-tier surface gaps Viktor identified do not block the
disaster-recovery bar at capability-close but leave the
operator-algebra spec incomplete relative to what a delete-
recovery produces. Filed as a dedicated P0 sub-item so they
travel with the OpenSpec backfill program rather than getting
lost: async lifecycle, memory-ordering fence, register-lock
semantics, IncrementalDistinct surface, ZSet sort invariant,
Checked arithmetic, bilinear-size overflow, convergence-vs-cap,
Op.Fixedpoint predicate, DelayOp reconstruction-first-tick.

Also annotated the parent OpenSpec coverage entry with Round 41
sweep status (e51ec1b + 92d7db2, P0s closed, P1s deferred) so
the backlog accurately reflects where the program stands.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 41: ROUND-HISTORY entry — OpenSpec backfill founding + first cadence ship

Four-arc entry at the top of the file per newest-first policy:

- Arc 1 (d435126): OpenSpec coverage audit + backfill-program
  ADR. Measured 6% coverage; declared one-capability-per-round
  baseline with paper-grade half-credit and Adopt-row priority
  escalation; banded 66 F# modules by delete-recovery blast
  radius.
- Arc 2 (e51ec1b): operator-algebra extension as Round-41
  cadence ship. Five new requirements covering lifecycle,
  strict-operator scheduling, clock scopes, Incrementalize
  wrapper, ZSet representation invariants.
- Arc 3 (92d7db2): Viktor P0 close. Four drift-from-code
  defects fixed — namespace (Dbsp.Core → Zeta.Core), phantom
  Reset, after-step scope (strict-only), lifecycle phase
  undercount (3 per-tick + 2 scope-boundary).
- Arc 4 (56f34b5): Viktor P1s filed as Round-42 absorb under
  the parent backfill P0, creating mechanical coupling between
  each capability ship and the following round's P1 sweep.

Round-41 observations for Round 42 + prospective BP-WINDOW
ledger table rendering the four commits against the consent /
retractability / no-permanent-harm axes.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 41: memory-folder role-restructure — design plan + BACKLOG pointer

Aaron 2026-04-19 asked for memory/role/persona/ so roles become
first-class in the directory structure. Surface is wider than
it first looks — 114 files / ~260 hand-written references to
memory/persona/ paths (plus ~440 auto-regenerated references
in tools/alignment/out/ that refresh on next citations.sh run).
A bad role axis is hard to reverse; this design doc proposes
the axis and holds execution for Aaron's sign-off rather than
just-doing-it under Auto Mode.

Design plan lands at:
  docs/research/memory-role-restructure-plan-2026-04-21.md

Contents: 13-directory role axis (architect, security,
verification, review, experience, api, performance, devops,
algebra, skill-ops, maintainer, homage, alignment);
persona-to-role crosswalk for every current directory;
5-phase execution plan (pre-flight greps → git mv → sed
passes → 5-check verification → pointer-source updates);
special-case handling for aaron (human maintainer),
rodney (homage-named AI persona on the reducer skill),
sova (emerging alignment-observability role); rollback
plan (one atomic commit, git revert); four open questions
for Aaron on axis judgement-calls.

BACKLOG entry updated to reflect design-landed state with
execution-slot recommendation for Round 42 opener after the
Round 41 PR merges (keeps wide-surface reviews from
overlapping).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 41: actualise Rounds 37-40 BP-WINDOW ledgers (PR #30 merged)

Rounds 37-40 shipped via PR #30 (merge commit 1e30f8c, 2026-04-20).
Ledger headers updated from "(prospective)" to "(merged via PR #30,
1e30f8c)" — the BP-WINDOW scores are now settled, not forecasts.

Round 41 ledger remains "(prospective)" — round-41 branch has not
merged to main yet.

Prose uses of "prospective" on lines 437, 447, 553, etc. are
historical-narrative commentary on authoring-time methodology and
stay as-is.

* Round 41: Soraya tool-coverage audit on RecursiveSigned skeleton

Round 39 observation flagged src/Core/RecursiveSigned.fs +
tools/tla/specs/RecursiveSignedSemiNaive.tla as held pending
formal-verification-expert tool-coverage review. Round 41 closes
that gate.

Soraya's notebook entry lands:

- Per-property tool table S1-S4 + refinement cross-check. TLC
  primary for S1/S2/S3/S3'/SupportMonotone; FsCheck for S4.
- S2 flagged as the one P0 on the spec (silent fixpoint drift
  unrecoverable); BP-16 requires Z3 QF_LIA cross-check.
- Refinement mapping: FsCheck cross-trace (signed vs counting at
  SeedWeight=1) wins over TLA+ refinement proof or Lean lemma —
  anti-TLA+-hammer, implementation-level where the bug bites.
- Readiness gate: TLA+ spec is ready to model-check; no pre-TLC
  pass needed. Optional round-42 follow-up: add
  PROPERTY EventuallyDone to .cfg for liveness.
- Graduation verdict: CONDITIONAL PASS. Four tool-coverage
  prereqs named in priority order; F# landing gated on them.

Files read (no edits): RecursiveSigned.fs, RecursiveSignedSemiNaive.tla
/cfg, RecursiveCountingLFP.tla, retraction-safe-semi-naive.md.

* Round 41: capture Soraya's 4 tool-coverage prereqs on RecursiveSigned

Soraya's round-41 audit of src/Core/RecursiveSigned.fs +
tools/tla/specs/RecursiveSignedSemiNaive.tla landed as a CONDITIONAL
PASS for Round-42 graduation. This commit lifts the four named
prereqs out of her notebook into BACKLOG sub-items under the
parent "Retraction-safe semi-naive LFP" entry, so the round-42
opener picks them up as checkbox work rather than having to re-read
the notebook.

Prereqs in priority order:
- Prereq 1 — TLC CI wire-up (RecursiveSignedSemiNaive.cfg)
- Prereq 2 — Z3 QF_LIA lemma for S2 FixpointAtTerm (BP-16 cross-check
  on the one P0; TLC alone insufficient for silent-fixpoint-drift risk)
- Prereq 3 — FsCheck property for S4 sign-distribution (anti-
  TLA+-hammer; two-trace quantification is NOT a TLA+ property)
- Prereq 4 — FsCheck cross-trace refinement (signed vs counting
  at SeedWeight = 1); cites BP-16

Round-42 graduation gate also captured: prereqs 1-4 CI-green + F#
implementation with P1/P2/P3 enforced at caller.

* Round 41: extend ROUND-HISTORY with arcs 5-7 (post-narrative commits)

The initial Round 41 ROUND-HISTORY entry (6e6e211) covered arcs
1-4 (coverage audit, operator-algebra cadence ship, Viktor P0
close, Viktor P1 file). Three more arcs landed after:

Arc 5 — ROUND-HISTORY narrative + memory-restructure design
(6e6e211, 36797ba). The memory-folder rename was downgraded to
"design plan + sign-off first" under Auto Mode's
do-not-take-overly-destructive-actions clause (700-occurrence
cross-reference surface).

Arc 6 — BP-WINDOW ledger actualisation for Rounds 37-40
(85fb352). Provenance (PR #30 / 1e30f8c) attached to each
"(prospective)" header.

Arc 7 — Round-35 holdover close (e461d9c, 15e9654). Soraya
tool-coverage audit landed CONDITIONAL PASS for Round-42
graduation; four prereqs captured as BACKLOG sub-items with
BP-16 citation on the S2 Z3 cross-check.

Also: one new observation line in the Round-42 handoff section
noting the holdover-closed-same-round-as-cadence-item pattern.
BP-WINDOW ledger gains three rows.

* Round 41: Aarav skill-tune-up ranking (catch-up from round-18 stale)

CLAUDE.md 5-10 round cadence rule was 23 rounds overdue. Round 41
is the catch-up slot. Live-search + full ranking + prune pass all
landed in a single invocation.

Live-search (4 queries, 2026-Q1/Q2 best-practices targets):
- 6 findings logged to best-practices-scratch.md: Gotchas-section
  rise, pushy-descriptions pattern, Claude-A-authors / Claude-B-
  tests, router-layer command-integrity injection class, Agent
  Stability Index 12-dim drift metric, OWASP Intent Capsule
  pattern.
- Zero contradictions with stable BP-NN rules.
- Zero promotions flagged to Architect this round; all six are
  "watch" or route-elsewhere.

Top-5 skills flagged for tune-up:
1. performance-analysis-expert (642 lines, 2.1x BP-03 cap) — SPLIT — M
2. reducer (570 lines) — SPLIT or TUNE (prune) — M
3. consent-primitives-expert (507 lines) — SPLIT honouring BP-23
   theory/applied axis — M
4. claims-tester / complexity-reviewer router-coherence drift —
   HAND-OFF-CONTRACT — S (round-18 carry-over)
5. skill-tune-up (self) — 303 lines, 3 over BP-03 — TUNE (prune
   authoritative-sources duplicated with AGENT-BEST-PRACTICES.md)
   — S. Self-flagged first per BP-06.

Notebook state:
- Stale round-18 top-5 archived in Pruning log (first catch-up prune).
- 912 words, well under 3000-word BP-07 cap.
- ASCII-only, BP-10 clean.

Nine more bloat-row skills are named as notable mentions, queued
behind the top-3 bloat cases.

* Round 41: ADR — claims-tester/complexity-reviewer hand-off contract

Close Aarav's round-18 HAND-OFF-CONTRACT finding (carried 23 rounds
after ranker went offline by cadence). Two-stage pipeline: analytic
bound first (complexity-reviewer), empirical measurement second
(claims-tester). Names the reverse trigger (benchmark surprise flows
the other direction) and the decision table for who fires when.
Follow-up SKILL.md edits route via skill-creator per GOVERNANCE §4.

* Round 41: extend ROUND-HISTORY with Arc 8 (router-coherence ADR)

Arc 8 covers the claims-tester/complexity-reviewer hand-off ADR
(47d92d8) closing Aarav's 23-round-stale round-18 HAND-OFF-CONTRACT
finding. New observation on cadence-outage-recovery as a design axis:
sweep infrastructure is subject to the same bitrot it detects on other
surfaces. BP-WINDOW ledger gains two rows (085c0e3 Aarav catch-up,
47d92d8 router-coherence ADR).

* Round 41: correct Prereq 1 sizing — no TLC CI job exists

Close-out audit surfaced that .github/workflows/gate.yml only CACHES
the tla2tools.jar artefact; nothing runs it. RecursiveCountingLFP.tla
has shipped since round 19 compile-checkable-only — 22 rounds with no
run-gate against its invariants. Soraya's Prereq 1 re-sized S→M with
expanded scope covering both specs. Finding recorded as new round-41
observation: verifier-present does not imply verifier-actually-runs.

* Round 41: BP-WINDOW ledger — 459b218 + d76a09b rows

Keeps the Round 41 BP-WINDOW ledger commit-aligned rather than
arc-aligned. 459b218 is the Arc-8 narrative itself; d76a09b is the
Prereq-1 S→M correction. Both retractable as single reverts.

* Round 41: file formal-analysis-gap-finder round-42 run — verifier-runs lens

Codifies the round-41 Prereq-1 audit finding as a tracked
research entry, distinct from its ROUND-HISTORY narrative
presence. The finding — a verifier's installation artefacts
do not imply the verifier is exercised by any CI job — is
exactly the class formal-analysis-gap-finder exists to
surface. Concrete motivating case: RecursiveCountingLFP.tla
compile-checkable-only for 22 rounds. Round-42 scope covers
the bidirectional audit (specs without gates + gates without
specs). Handoff to Soraya per the skill's standing contract;
does not write the spec or CI job (DevOps + Soraya work).
Schedules after Prereq 1 lands so the audit sees corrected
state.

* Round 41: BP-WINDOW ledger — 2042a85 row

Per the established stopping rule (meta-ledger commits do not
get self-referential rows; their round-close coverage is the
PR merge), this commit adds only the 2042a85 row and does not
add a row for itself.

* Round 41: CONFLICT-RESOLUTION — Hiroshi ↔ Daisy hand-off row

Closes ADR 47d92d8's third follow-up action item. Single-row
addition to Active tensions citing the router-coherence ADR as
the standing resolution. Doc-only edit (not a SKILL.md touch,
so GOVERNANCE §4 does not gate this). The other two ADR
follow-ups (claims-tester + complexity-reviewer SKILL.md
updates) remain deferred to round 42 via skill-creator
workflow.

* Round 41: BP-WINDOW ledger — fcfa3d9 row

Per-commit ledger discipline for the CONFLICT-RESOLUTION
Hiroshi ↔ Daisy row. Meta-ledger-only commit so no
self-referential row for this commit itself (established
stopping rule).

* Round 41: file harsh-critic findings on ADR 47d92d8 as round-42 supersedure backlog

Router-coherence ADR 47d92d8 (Hiroshi analytic ↔ Daisy empirical
two-stage pipeline) landed without the adversarial-review gate.
Post-landing harsh-critic (Kira) pass surfaced 3 P0 + 5 P1 + 2 P2
substantive findings, including (P0-1) unscoped grandfather
clause, (P0-2) table-vs-prose contradiction on reverse trigger,
(P0-3) Stage-1 "analytically wrong" clause blocking the evidence
loop for escalation, (P1-7) no escalation timebox reproducing the
23-round-stale failure mode the ADR diagnosed, (P1-8) two advisory
skills not composing to a mandatory pipeline without a binding
dispatcher, (P2-9) example-bug on BCL Dictionary.Remove amortised
complexity, and more.

File as round-42 supersedure rather than inline-edit because
docs/CONFLICT-RESOLUTION.md already cites 47d92d8 as Standing
Resolution — supersedure preserves the citation chain via
GOVERNANCE §2 edit-in-place with a "Superseded by …" header on
v1. New ADR target: docs/DECISIONS/2026-04-??-router-coherence-
v2.md. Supersedure work blocks the claims-tester +
complexity-reviewer SKILL.md updates ADR 47d92d8 follow-up work
depends on — those edits should target v2, not v1.

Owner: Architect drafts; Kira audits closure; Aarav confirms
router-coherence drift stays closed. Effort: M. Schedule: Round
42 slot after Soraya Prereq 1 (TLC wire-up) lands.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 41: BP-WINDOW ledger — 779d7ef row

Ledger row for harsh-critic findings filing commit. Primary work
(BACKLOG addition tracking a round-42 supersedure with 10 named
findings), not meta-ledger — earns a row under the BP-WINDOW
per-commit discipline. Consent = adversarial findings tracked
honestly; Retractability = supersedure preserves citation chain
vs inline-edit; No-permanent-harm = single BACKLOG edit, no ADR
body touched, no SKILL.md touched.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 41: Arc 9 narrative — self-correction sweep

ROUND-HISTORY Arc 1-8 narrated primary commits up through the
router-coherence ADR (47d92d8). Four primary commits landed
after Arc 8 — Prereq 1 sizing correction (d76a09b), recurring-
audit lens BACKLOG entry (2042a85), CONFLICT-RESOLUTION Hiroshi
↔ Daisy row (fcfa3d9), and harsh-critic findings filed as
round-42 supersedure (779d7ef) — visible only in the BP-WINDOW
ledger table, not in narrative form.

Arc 9 ties them into one coherent sequence: the round's
self-correction ran unusually deep. Arc 8 corrects Aarav's
round-18 finding via ADR; Arc 9 catches the corrector itself
under-reviewed via Kira's adversarial pass. Both self-
corrections land before round-close. Narrative-ledger
alignment is the BP-WINDOW discipline's first assertion;
this commit restores it.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 41: BP-WINDOW ledger — 160fcfa row

Ledger row for Arc 9 narrative commit. Narrative extensions
count as primary work under BP-WINDOW precedent (per 459b218
and 6e6e211 examples) and earn a ledger row. Consent = drift
closed honestly; Retractability = single revertable doc edit;
No-permanent-harm = isolated insertion.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 41: v2 ADR — router-coherence supersedure closes 10 Kira findings in-round

Drafts v2 of the router-coherence ADR (docs/DECISIONS/2026-04-21-router-coherence-v2.md) that supersedes v1 (47d92d8) in the same round, closing all 10 Kira harsh-critic findings (3 P0 + 5 P1 + 2 P2) via named textual closures C-P0-1 through C-P2-10.

Key closures:
- C-P0-1: grandfather clause bounded with Kenji-owned inventory + one-per-round discharge
- C-P0-2: reverse trigger unconditional (table now matches prose)
- C-P0-3: escalation-evidence exception permits Stage 2 under conference protocol with explicit labelling
- C-P1-5: Stage-1 trigger widened to match claims-tester SKILL.md contract
- C-P1-7: escalation timebox (round +2 auto-promote to BACKLOG P1) prevents 23-round-stale reproduction
- C-P1-8: Kenji named as binding dispatcher — advisory + advisory + binding-dispatcher composes to mandatory pipeline
- C-P2-9: Dictionary.Remove example replaced with ArrayPool<T>.Rent (legitimate BCL-contract edge)

v1 kept in place per GOVERNANCE §2 with Superseded-by header appended in a follow-up commit so the CONFLICT-RESOLUTION Active-tensions citation chain remains resolvable.

BP-10 lint: clean.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 41: v1 ADR — append Superseded-by header per GOVERNANCE §2

Appends Superseded-by header to router-coherence v1 ADR (47d92d8) pointing at v2 (09f0889), per GOVERNANCE §2 (docs read as current state; superseded ADRs keep v1 in place with redirect header so citation chains remain resolvable).

Also corrects v1 Status from "Proposed — awaits sign-off" to "Accepted (pre-adversarial-review; superseded by v2 same-round after Kira pass)" per Closure C-P1-4 in v2 — Status was already cited as Standing Resolution in docs/CONFLICT-RESOLUTION.md Active-tensions, so Proposed was factually wrong.

The v1 body text is not edited — supersedure preserves the historical record; v2 carries the closures.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 41: Arc 10 narrative + BP-WINDOW rows for v2 supersedure

Adds Arc 10 narrative covering 09f0889 (v2 ADR) and 4efe545 (v1 Superseded-by header) as one coherent in-round supersedure story, after Arc 9's "self-correction sweep" and before Round 41 observations. Pattern: Arc 9 surfaces the under-review; Arc 10 lands the close in the same round rather than deferring a known-imperfect artefact.

Adds two BP-WINDOW ledger rows (09f0889, 4efe545) to the round-41 ledger block per the per-commit accounting discipline.

Supersedure arc count now covers the full round-41 close: 10 arcs / 25 primary-work commits.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 41: close BACKLOG supersedure entry — discharged in-round by v2

Flips BACKLOG router-coherence supersedure entry from [ ] to [x] ✅ with "shipped round 41 in-round" annotation pointing at v2 ADR (09f0889) + v1 Superseded-by header (4efe545). All 10 Kira findings closed via named textual closures C-P0-1 through C-P2-10.

Original finding narrative preserved below the closure line per the shipped-item convention used elsewhere in the file (audit trail).

Follow-up SKILL.md edits to claims-tester + complexity-reviewer via skill-creator remain round-42 scope, now targeting v2 as intended.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 41: BP-WINDOW row for BACKLOG-close commit 4537365

Adds BP-WINDOW ledger row for 4537365 (BACKLOG supersedure entry discharged in-round) to match the Arc 9 precedent where 779d7ef (BACKLOG entry addition) received a row. Symmetry: add and close get equal ledger treatment.

Meta-ledger stopping rule still holds — this commit itself (which only adds a ledger row) does not get a self-referential row.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 41: grandfather O(·) claims inventory — honours v2 C-P0-1 within-round

Produces the one-time grandfather-claims inventory named in router-coherence v2 ADR §Closure C-P0-1 within the round v2 lands, per ADR's own within-round commitment.

Inventory: 35 live claims at ADR-landing time (29 F# /// docstrings in src/Core/ + src/Bayesian/, 3 grey-zone F# code comments, 1 openspec/specs/operator-algebra/spec.md line, 2 docs/research/** claims). Zero hits in root README, memory/persona/*/NOTEBOOK.md, docs/papers/** (directory does not exist yet).

Distinguishes live claims (shipping as asserted bounds) from historical evidence (BACKLOG [x] ✅ residue, TECH-RADAR flag-text narrating past regressions, in-file "was O(…)" commentary on fixed paths). Only live claims populate the grandfather set — evidence is captured for audit trail but excluded per v2's intent ("claims Zeta is currently making").

BACKLOG discharge entry added: P2, one-claim-per-round cadence, ~35-round tail, Aarav graceful-degradation clause fires on ≥3 rounds without discharge.

Complexity-class distribution of live set: 10 O(1), 13 O(log n)/O(log k)/O(log N), 7 O(n)/O(n log n)/O(n log k), 5 parametric.

BP-10 lint: clean.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 41: Arc 11 narrative + BP-WINDOW row for grandfather inventory

Adds Arc 11 narrative covering d98ef2b (grandfather inventory + BACKLOG discharge entry) as the close of the v2 ADR's within-round commitments. Pattern: Arc 10 lands the ADR; Arc 11 lands the ADR's own within-round commitment — without Arc 11, Arc 10 would have shipped a contract Zeta didn't meet.

Adds BP-WINDOW ledger row for d98ef2b per per-commit accounting discipline.

Round 41 now closes at 11 arcs / 30 primary-work commits.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 41: DORA 2025 reports land in docs/ as reference substrate

Two external-anchor PDFs (CC BY-NC-SA 4.0) placed at their
memory-documented paths:

- docs/2025_state_of_ai_assisted_software_development.pdf
  (~15MB, 138 pages) — findings + data report.
- docs/2025_dora_ai_capabilities_model.pdf (~9MB, 94 pages)
  — framework companion.

Citation anchors this commit brings in-tree (previously
memory-only):

- Nyquist stability criterion for AI-accelerated development
  (foreword p9 fn 1), the theoretical anchor for the
  CI-meta-loop + retractable-CD P1 BACKLOG work.
- "AI is an amplifier", which echoes the corporate-religion /
  sandbox-escape threat class.
- The seven-capability AI model, which gives the external
  measurement vocabulary for round-audit output (capability #7
  "quality internal platforms" is the in-flight P1 cluster per
  2026-04-20 memory).

License note: derived work is NC-SA-bound; Zeta citations
are fine, but external redistribution inherits NC-SA. Paired
companion memory file is reference_dora_2025_reports.md
(out-of-tree); this commit brings the primary sources
in-tree so citation from research docs + ADRs can point
at a repo-local path rather than a newsletter-gated URL.

* Round 41: Arc 12 narrative + BP-WINDOW row for DORA substrate

Narrative section for Arc 12 inserted before "Round 41
observations for Round 42" with primary commit pointer to
46075d6. Arc 12 frames the DORA 2025 PDFs as
memory-promotion substrate per the 2026-04-20 feedback entry
("DORA is our starting point for measurements") and cites
the concrete in-tree anchors (Nyquist p9 fn 1, seven-
capability model, AI-amplifier thesis).

Also surfaces honestly — in-body, not buried in a private
retrospective — the ranker-scope gap that let the two
untracked PDFs sit 18+ hours through nine consecutive
/next-steps invocations before this arc closed the gap. The
skill explicitly lists docs/research/ and docs/TECH-RADAR.md
but not `git status --short` for untracked files. Candidate
skill-tune-up note for Aarav's notebook: /next-steps must
run `git status --short` on every invocation so dropped-in
artefacts appear in ranking before the ninth re-fire, not
after.
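
The proposed tune-up amounts to reading the untracked entries out of `git status --short`, where untracked paths carry the `??` prefix. A minimal sketch, assuming the ranker only needs the path list; the function name is hypothetical and the sample output is illustrative:

```python
def untracked_paths(status_short: str) -> list[str]:
    """Extract untracked paths from `git status --short` output.

    Untracked entries are the lines that start with "?? ". Paths that git
    quotes (for example ones containing spaces) are returned as printed.
    """
    paths = []
    for line in status_short.splitlines():
        if line.startswith("?? "):
            paths.append(line[3:])
    return paths


# Illustrative output: one modified tracked file, two dropped-in PDFs.
SAMPLE_STATUS = (
    " M docs/ROUND-HISTORY.md\n"
    "?? docs/2025_dora_ai_capabilities_model.pdf\n"
    "?? docs/2025_state_of_ai_assisted_software_development.pdf\n"
)
```

In a live run the input could come from `subprocess.run(["git", "status", "--short"], capture_output=True, text=True).stdout`; a non-empty result would then be fed to the ranking step.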

BP-WINDOW ledger gets a matching 46075d6 row with
reference-document-specific cells: Consent strengthened by
promoting memory-only anchors to in-repo substrate and by
surfacing the ranker-stall pattern in-narrative; retraction
is a single `git rm` if the license / size stance later
changes; no-permanent-harm preserved since no runtime
behaviour depends on the PDFs' presence (they are citation
substrate, not loaded artefacts).

Arc count now 12; primary-work-commit count now 12 (Round 41
alignment preserved). Build gate green (0 Warning / 0 Error);
BP-10 lint clean on the narrative + ledger row.

* Round 41: markdownlint CI fix on PR #31

Three rule violations surfaced by `lint (markdownlint)` CI job on
PR #31:

- `docs/DECISIONS/2026-04-21-router-coherence-claims-vs-complexity.md:261`
  MD022/blanks-around-headings — collapse multi-line heading
  `## Decision rationale (one paragraph for the\nwait-don't-read
  audience)` to a single line so the parser stops seeing line 262
  as adjacent non-blank content.
- `docs/research/grandfather-claims-inventory-2026-04-21.md:106`
  MD032/blanks-around-lists — add blank line between "Surface
  distribution:" lead-in and the `-` list that follows.
- `docs/research/grandfather-claims-inventory-2026-04-21.md:111`
  MD032/blanks-around-lists — same fix for "Complexity-class
  distribution (rough):" lead-in.
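
The two MD032 fixes share one mechanical shape: a blank line goes between a lead-in paragraph and the first list item. A minimal sketch of that transformation, assuming plain markdown with no fenced code blocks in the input (fences are not handled here); the helper name and the sample list items are hypothetical:

```python
import re

# A list item: optional indent, then -, *, + or "1." / "1)", then content.
LIST_ITEM = re.compile(r"^\s*([-*+]|\d+[.)])\s+\S")


def blank_line_before_lists(lines):
    """Insert a blank line where a list starts directly under a paragraph."""
    out = []
    in_list = False
    for line in lines:
        if not line.strip():
            in_list = False  # a blank line ends the current list context
        elif LIST_ITEM.match(line):
            # Only the first item of a list needs the separating blank line.
            if not in_list and out and out[-1].strip():
                out.append("")
            in_list = True
        out.append(line)
    return out
```

The function is idempotent: running it on already-fixed text inserts nothing, so it is safe to apply before every push.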

All three are the same class of fix shipped in task #105 on PR #30.
Additive edit to the open round-41 PR branch — no rewrite of shipped
content, semantics preserved.

Verified clean via `npx markdownlint-cli2` on both files before push.

* Round 42: speculative round-N+1 branch convention in git-workflow-expert

Formalise the fix for the round-41-late 28-fire /next-steps
hold-pattern: once PR-N is CLEAN/MERGEABLE, fork
round-<N+1>-speculative from round-N HEAD immediately so
round-N+1 prep can proceed while the merge click lives on
Aaron's schedule. Rebase onto main after PR-N squash-merges,
rename to drop the -speculative suffix.

Covers: fork conditions (CLEAN/MERGEABLE + green CI + clean
round-N tree), naming (round-<N+1>-speculative), fair-game
vs not-fair-game scope, rebase protocol with
--force-with-lease, escape valve for long-waiting PRs.
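
The naming and rebase steps can be written down as data. This is a sketch under stated assumptions: the branch names follow the `round-<N+1>-speculative` pattern from the text, but the exact git invocations, their order, and the `origin` remote name are assumptions rather than the wording of the skill file. The helpers only build command lists; they do not run git.

```python
def speculative_branch(round_n: int) -> str:
    """Name of the branch forked from round-N HEAD for round N+1 prep."""
    return f"round-{round_n + 1}-speculative"


def fork_commands(round_n: int) -> list[list[str]]:
    """Commands to fork the speculative branch from the round-N branch."""
    return [["git", "switch", "-c", speculative_branch(round_n), f"round-{round_n}"]]


def promote_commands(round_n: int, remote: str = "origin") -> list[list[str]]:
    """Commands to run after PR-N squash-merges (assumed ordering).

    `git rebase --onto` replays only the commits made after the fork point,
    since the round-N commits already live on main as one squash commit.
    """
    spec = speculative_branch(round_n)
    final = f"round-{round_n + 1}"
    return [
        ["git", "fetch", remote],
        ["git", "rebase", "--onto", f"{remote}/main", f"round-{round_n}", spec],
        ["git", "push", "--force-with-lease", remote, spec],
        ["git", "branch", "-m", spec, final],
    ]
```

`--force-with-lease` refuses the push if the remote branch moved since the last fetch, which is why it is preferred over a bare `--force` for a rebased branch.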

Lands via skill-creator vibe-mode invocation per GOVERNANCE
§4; draft + BP-10 lint + commit without eval-pass because
the amendment is mechanical convention addition, not
behavioural. Authorized by Aaron's 2026-04-20
fix-factory-when-blocked grant
(feedback_fix_factory_when_blocked_post_hoc_notify.md).

First use of the convention itself: this commit lands on
round-42-speculative, forked from round-41 HEAD
(3525631) while PR #31 still waits on Aaron's merge
click.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 42: retarget claims-tester + complexity-reviewer at router-coherence v2

Lands the Stage-1 (complexity-reviewer, Hiroshi, analytic) and Stage-2
(claims-tester, Daisy, empirical) hand-off sections in both skills'
procedures, citing the v2 ADR at
docs/DECISIONS/2026-04-21-router-coherence-v2.md as the authoritative
pipeline contract. v1 at 2026-04-21-router-coherence-claims-vs-
complexity.md is noted as superseded.

Per v2 Closure C-P1-8, both skills name the Architect (Kenji) as the
binding dispatcher — two advisory roles do not compose to a mandatory
two-stage pipeline without a binding dispatcher; Kenji is that seat.
Both skills remain advisory on their individual findings; the ordering,
reverse-trigger rule, and escalation timebox are binding through
Kenji.

Each skill's new section mirrors the authoritative v2 pipeline text:
- Stage-1 trigger surface per C-P1-5 (XML / /// / README / commit /
  BACKLOG / TECH-RADAR / papers / openspec / research / notebooks)
- Three Stage-1 outputs (sound -> hand-off, wrong -> block-with-
  escalation-exception, under-specified -> author-bounce)
- Four Stage-2 triggers (hand-off, grandfather inventory, reverse
  trigger unconditional per C-P0-2, escalation-evidence per C-P0-3)
- Three Stage-2 outputs (matches, contradicts -> re-engage, narrow)
- Escalation timebox per C-P1-7 (round +2 auto-promote to P1)
- Grandfather set per C-P0-1 (one per round from
  docs/research/grandfather-claims-inventory-*.md)
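
The routing listed above can be sketched as a small lookup plus a timebox check. The enum and function names are hypothetical, and the route strings paraphrase this commit message rather than quote the v2 ADR. Treating "round +2" as "promote once the current round is at least two past the escalation round" is an assumption.

```python
from enum import Enum


class Stage1(Enum):
    SOUND = "sound"
    WRONG = "wrong"
    UNDER_SPECIFIED = "under-specified"


class Stage2(Enum):
    MATCHES = "matches"
    CONTRADICTS = "contradicts"
    NARROW = "narrow"


# Stage-1 (analytic) outcomes and where each one goes next.
STAGE1_ROUTE = {
    Stage1.SOUND: "hand-off to Stage 2",
    Stage1.WRONG: "block, with escalation-evidence exception",
    Stage1.UNDER_SPECIFIED: "bounce to author",
}


def stage2_reengages_stage1(outcome: Stage2) -> bool:
    """Only a contradicting empirical result sends the claim back to Stage 1."""
    return outcome is Stage2.CONTRADICTS


def should_auto_promote(escalated_in_round: int, current_round: int) -> bool:
    """Escalation timebox: unresolved after two rounds means BACKLOG P1."""
    return current_round >= escalated_in_round + 2
```

Keeping the routes in a table rather than in branching code makes the binding dispatcher's job a lookup, so the ordering cannot drift between the two skill files.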

Bibliography in both skills now cross-references each other plus the
v2 ADR, so an agent wearing either hat can reach the partner contract
in one click.

Landed on round-42-speculative per the new
speculative-round-N+1-branch convention from
.claude/skills/git-workflow-expert/SKILL.md (fea0d34). PR #31
still awaits merge; this commit is fair-game per the convention
because the target SKILL.md files are already on main and the v2
ADR text cited is stable on the round-41 branch HEAD.

Authorised by the post-hoc-notify grant captured at
memory/feedback_fix_factory_when_blocked_post_hoc_notify.md: factory-
structure additions that unblock work are authorised; deletions
still need pre-approval.

Workflow: invoked via skill-creator:skill-creator in vibe-mode (no
evals — mechanical additive edits). BP-10 invisible-Unicode lint:
clean (0 hits, 307 lines total across both files).

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

* Round 42: grandfather discharge #1 — BetaBernoulli.Observe O(1) (Stage 1 only)

First use of the router-coherence v2 pipeline on a live grandfather-
inventory row. Discharges claim #1 at
src/Bayesian/BayesianAggregate.fs:22, the Beta-Bernoulli conjugate-
update "O(1) per observation" docstring claim.

Stage 1 (complexity-reviewer, Hiroshi, analytic) signs off:
- Worst-case: O(1) — two IEEE-754 fadds + two field writes.
- Amortised: O(1), same as worst-case (no deferred work).
- Expected: O(1), deterministic runtime.
- Lower bound: Omega(1) — any durable-observation write is at
  least one cell-probe (Patrascu-Thorup).
- Constant factor: ~4 cycles on cache-resident instance;
  devirtualised because the class is [<Sealed>]; zero heap
  allocation per call.

Claim is tight — worst-case meets the lower bound. Sound.
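
For readers who want the shape of the operation being bounded, here is a minimal Python sketch of a Beta-Bernoulli conjugate update. It is not the F# code at BayesianAggregate.fs; the class name, the batched `observe(successes, failures)` signature, and the uniform Beta(1, 1) default prior are assumptions made for illustration.

```python
class BetaBernoulli:
    """Beta prior over a Bernoulli success probability."""

    __slots__ = ("alpha", "beta")

    def __init__(self, alpha: float = 1.0, beta: float = 1.0):
        self.alpha = float(alpha)
        self.beta = float(beta)

    def observe(self, successes: int, failures: int) -> None:
        # Conjugate update: two float additions and two field writes,
        # independent of how many observations were seen before.
        self.alpha += successes
        self.beta += failures

    @property
    def mean(self) -> float:
        """Posterior mean of the success probability."""
        return self.alpha / (self.alpha + self.beta)
```

The update touches no collection and defers no work, which is why worst-case, amortised and expected cost coincide.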

Stage 2 (claims-tester, Daisy, empirical benchmark + docstring
tightening) is deferred to the post-PR-#31-merge window per the
speculative-branch fair-game rules in
.claude/skills/git-workflow-expert/SKILL.md — Stage-2 execution
touches bench/ + produces a src/ docstring tightening commit
that is better bundled with other Bayesian-surface work than
landed piecemeal on a speculative branch.

Contrary-workload notes enumerated for Stage 2:
- High-magnitude batched observations (stresses int64->double
  promotion).
- High-frequency tight-loop (verifies cache-resident assumption).
- Thread-contended case (out of O-claim scope but worth a
  number).

Inventory row #1 flipped from `pre-ADR/pre-ADR` to `sound
(2026-04-20, <discharge doc>) / deferred post-merge`. Remaining
grandfather claims: 34 of 35. Expected-empty round at
1-per-round cadence: ~round 76. Aarav graceful-degradation
clause starts counting from the next round.
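
The round-76 estimate and the degradation trigger are simple arithmetic. A short sketch, with hypothetical function names; the threshold of three rounds comes from the inventory commit's "≥3 rounds without discharge" wording:

```python
def expected_empty_round(current_round: int, remaining: int, per_round: int = 1) -> int:
    """Round in which the grandfather set is expected to reach zero."""
    rounds_needed = -(-remaining // per_round)  # ceiling division
    return current_round + rounds_needed


def degradation_fires(rounds_without_discharge: int, threshold: int = 3) -> bool:
    """Graceful-degradation clause: fires at or above the threshold."""
    return rounds_without_discharge >= threshold
```

With 34 claims remaining after round 42 and one discharge per round, the first function gives round 76, matching the estimate above.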

Pipeline authority:
docs/DECISIONS/2026-04-21-router-coherence-v2.md.
Binding dispatcher: Kenji at round-close.

Landed on round-42-speculative per the new speculative-round-N+1
convention (fea0d34). PR #31 still awaits merge.

Authorised by the post-hoc-notify grant at
memory/feedback_fix_factory_when_blocked_post_hoc_notify.md
(factory-adjacent research-doc + inventory-row flip; no src/
touch this commit).

BP-10 invisible-Unicode lint: clean (0 hits, 300 lines total
across both files).

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

* Round 42: lsm-spine-family OpenSpec capability (backfill #2)

Backfills the log-structured merge spine family — five variants plus
dispatcher — as behavioural spec with F# profile. Earned an
unconditional rebuild verdict from spec-zealot (Viktor) on the third
pass: a rebuilder working from spec+profile alone would land at the
same variants, constants, and algorithms.

- spec.md: 11 requirements covering delta-stream integration,
  cascade bounded-depth invariant (settle-point framing with the
  32-level cap scoped to the in-memory reference variants), spine-
  equivalence through Consolidate, retraction-native across tiers,
  per-tick merge budget with caller-pumped Tick reporting drained
  count, identity-keyed opaque-handle backing-store (not content-
  addressable) with fail-soft Release, disk honesty with crash-
  consistency boundary, async-producer depth-independent on the
  Insert hot path with Insert-only qualifier on observation calls,
  stateless selector with four-case decision matrix, observable
  state machine with Clear demoted to optional, explicit per-variant
  thread-safety contract.
- profiles/fsharp.md: module layout under src/Core/*, construction
  signatures, per-variant thread-safety, Graham 1969 2x list-
  scheduling bound for BalancedSpine scheduler, TryWrite silent-
  drop post-dispose disclosed as known gap with BACKLOG pointer,
  stale-read qualifier on SpineAsync observation methods,
  BackedSpine explicitly not bounded by the 32-level cap.
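
Three of the requirements above (caller-pumped Tick reporting a drained count, retraction-native merging, and spine-equivalence through Consolidate) can be illustrated with a toy in-memory spine. This is a sketch under heavy assumptions: the class, its methods and the pairwise merge policy are invented for illustration and do not reproduce any of the five variants or the spec text. Only the 32-level figure is taken from the commit message.

```python
MAX_LEVELS = 32  # cap quoted for the in-memory reference variants


def merge(a: dict, b: dict) -> dict:
    """Sum weights per key and drop keys whose weight cancels to zero."""
    out = dict(a)
    for key, weight in b.items():
        out[key] = out.get(key, 0) + weight
    return {k: w for k, w in out.items() if w != 0}


class ToySpine:
    def __init__(self):
        # Each level holds batches waiting to be merged upward.
        self.levels = [[] for _ in range(MAX_LEVELS)]

    def insert(self, batch: dict) -> None:
        self.levels[0].append(dict(batch))

    def tick(self, budget: int) -> int:
        """Perform at most `budget` merges; return how many were done."""
        drained = 0
        for lvl in range(MAX_LEVELS - 1):
            while len(self.levels[lvl]) >= 2 and drained < budget:
                a = self.levels[lvl].pop()
                b = self.levels[lvl].pop()
                self.levels[lvl + 1].append(merge(a, b))
                drained += 1
        return drained

    def consolidate(self) -> dict:
        """Logical contents: the sum of every batch on every level."""
        total = {}
        for level in self.levels:
            for batch in level:
                total = merge(total, batch)
        return total
```

The property worth noticing is that `consolidate()` returns the same contents however many ticks have run, so merge scheduling stays an implementation detail.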

Validation: openspec validate lsm-spine-family --strict clean;
BP-10 invisible-unicode lint zero hits on both files; dotnet
build -c Release clean (0 Warning / 0 Error).

Second capability landed under the round-42 OpenSpec backfill
cadence (ADR 2026-04-21-openspec-backfill-program), following
operator-algebra in round 41.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 42: TECH-RADAR Trial->Adopt for Residuated + FastCDC

Both rows have been citing closed P0s as open for 25 rounds. The
round-17 fixes (harsh-critic findings #3, #4, #7, #8 per
docs/BACKLOG.md:286-299) closed the blocking correctness bugs:

- Residuated.fs: top-2 cache replaced with SortedSet + weight
  dict; every op O(log k), no linear-scan fallback. The round-12
  "O(1)" claim was false under adversarial retract-top workloads;
  the corrected "O(log k) genuinely" claim has been stable 25
  rounds. See Residuated.fs:39-48 for the fix-in-code narrative.

- FastCdc.fs: persistent scanCursor + hash (each byte Gear-hashed
  exactly once across lifetime) closed the O(n^2) buffer scan;
  Buffer.BlockCopy replaced per-byte ResizeArray.Add. See
  FastCdc.fs:68-76 for the fix-in-code narrative. Paper
  throughput target 1-3 GB/s/core holds.
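
The FastCdc fix, a scan cursor and rolling hash that persist across appends so no byte is hashed twice, can be illustrated with a simplified streaming chunker. This is a sketch and not the FastCdc.fs implementation: the Gear table is seeded pseudo-random data, the size parameters and cut mask are arbitrary, and unlike real FastCDC it hashes the bytes below the minimum chunk size instead of skipping them.

```python
import random

# Illustrative Gear table: 256 pseudo-random 64-bit values, fixed seed.
_rng = random.Random(0)
GEAR = [_rng.getrandbits(64) for _ in range(256)]
MASK64 = (1 << 64) - 1


class StreamingChunker:
    def __init__(self, min_size=64, max_size=1024, cut_mask=0xFF):
        self.min_size = min_size
        self.max_size = max_size
        self.cut_mask = cut_mask
        self.buf = bytearray()   # bytes not yet emitted as a chunk
        self.scan_cursor = 0     # next unscanned index in buf
        self.hash = 0            # rolling hash of buf[:scan_cursor]
        self.bytes_hashed = 0    # instrumentation for the sketch

    def append(self, data: bytes) -> list[bytes]:
        """Feed more input; return any chunks completed by it."""
        self.buf += data
        chunks = []
        # Resume where the previous call stopped instead of rescanning.
        while self.scan_cursor < len(self.buf):
            byte = self.buf[self.scan_cursor]
            self.hash = ((self.hash << 1) + GEAR[byte]) & MASK64
            self.scan_cursor += 1
            self.bytes_hashed += 1
            length = self.scan_cursor
            at_cut = length >= self.min_size and (self.hash & self.cut_mask) == 0
            if at_cut or length >= self.max_size:
                chunks.append(bytes(self.buf[:length]))
                del self.buf[:length]
                self.scan_cursor = 0
                self.hash = 0
        return chunks

    def finish(self) -> list[bytes]:
        """Flush the trailing partial chunk, if any."""
        tail = bytes(self.buf)
        self.buf.clear()
        self.scan_cursor = 0
        self.hash = 0
        return [tail] if tail else []
```

Because the cursor and hash survive between calls, feeding the input in small pieces hashes exactly as many bytes as feeding it at once, and yields the same chunk boundaries.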

Rows now match the Bloom Round-40 graduation pattern (measured-
evidence cite, implementation line reference, test coverage
pointer). 25-round stability window beats the aspirational
waiting-list — graduation on evidence, not aspiration.

BP-10 clean; 0 invisible-unicode on edited file.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 42: operator-algebra P1 absorb — 10 findings closed

Absorbs the 10 P1 findings Viktor (spec-zealot) flagged on the
Round 41 operator-algebra capability ship (BACKLOG.md:54-82).
No code changes — spec + profile only.

spec.md (7 findings):
- (d) IncrementalDistinct: new "wrapper is a semantic identity on
  distinct" scenario under incremental-wrapper, stating both the
  D-distinct-I form and the H boundary-crossing form with their
  equivalence under retractions.
- (e) ZSet sort invariant: representation scenario now declares
  ascending-by-key order with an adjacent-pair comparator
  predicate, tied to the equality-normalisation requirement.
- (f) Checked arithmetic: new "weight arithmetic overflow is
  observable" scenario; overflow surfaces a checked-arithmetic
  failure rather than wrapping, with two documented post-failure
  observable states the profile must pick from.
- (g) Bilinear-size overflow: new "intermediate term size may
  exceed final-delta size" scenario; implementation budgets
  memory for the sum of pre-cancellation term sizes, not the
  final delta.
- (h) Convergence-vs-cap: new "iteration cap without fixpoint is
  an observable failure" scenario; cap-hit surfaces with scope +
  cap identification and clock-end still runs under a partial-
  completion contract.
- (i) Op.Fixedpoint predicate: nested-scope scenario clarifies
  the fixpoint-detector is scope-level, with operators forbidden
  from individually short-circuiting the iteration.
- (j) DelayOp reconstruction: new "reconstruction re-emits the
  declared initial value" scenario; warm-restart semantics
  deferred to the durability capability.
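
Two of the scenarios above, (f) observable overflow and (h) cap-hit without fixpoint, share one idea: fail loudly instead of returning a plausible-looking value. A minimal sketch, assuming 64-bit signed weights (the commit does not state the weight width) and using hypothetical names throughout:

```python
INT64_MIN, INT64_MAX = -(2 ** 63), 2 ** 63 - 1


def checked_add(a: int, b: int) -> int:
    """Add two weights; raise instead of wrapping on 64-bit overflow."""
    total = a + b
    if not INT64_MIN <= total <= INT64_MAX:
        raise OverflowError(f"weight overflow: {a} + {b}")
    return total


class IterationCapHit(RuntimeError):
    """Raised when iteration stops at the cap without reaching a fixpoint."""

    def __init__(self, scope: str, cap: int):
        super().__init__(f"scope {scope!r} hit iteration cap {cap}")
        self.scope = scope
        self.cap = cap


def iterate_to_fixpoint(step, state, cap: int, scope: str = "outer"):
    """Apply `step` until the state stops changing or the cap is reached."""
    for _ in range(cap):
        nxt = step(state)
        if nxt == state:
            return state
        state = nxt
    # The failure names both the scope and the cap, as scenario (h) asks.
    raise IterationCapHit(scope, cap)
```

The exception carries `scope` and `cap` so a caller can tell which nested iteration gave up; what happens to partial state afterwards is left to the profile, as in the spec.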

Also tightened a pre-existing deontic collision Viktor flagged
as P2: "MUST be permitted (but not required)" → "MAY substitute"
(spec.md line 379).

profiles/fsharp.md (3 findings):
- (a) async lifecycle: Op<'T> now documents the IsAsync virtual
  alongside IsStrict, with Circuit.Step sync/async fast-path
  behaviour pinned.
- (b) Memory-ordering fence: VolatileField release-on-write /
  acquire-on-read pairing named as the fence the base spec
  refers to in "output is observable after step returns".
- (c) Register-lock semantics: Circuit's single per-circuit
  register-lock pinned as construction-phase-only, not held on
  the step-hot-path.

Viktor adversarial re-audit: complete, unconditional rebuild
yes. No new P0/P1 surfaced.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 42: ontology-home cadence — first slice (Harmonious Division)

First increment of the new per-round ontology-home + project-
organization cadence Aaron named this round (memory entry
feedback_ontology_home_check_every_round.md). Small slice per
round; same cadence shape as grandfather-claim discharge.

Homes "Harmonious Division" — the maintainer's meta-algorithm
above Quantum Rodney's Razor — in docs/GLOSSARY.md. Prior state:
the concept was cited in 20+ files (ROUND-HISTORY.md, BACKLOG.md,
the three-lane-model ADR, memory/*, and three skill files) but
defined nowhere in committed docs. New GLOSSARY entry includes:
- Plain and Technical definitions in the standard two-register
  glossary format.
- Pointer to the authoritative definition at
  `.claude/skills/reducer/SKILL.md` §"The five roles inside
  Quantum Rodney's Razor" (lines 125-260).
- Explicit note that this glossary's job is pointer-plus-gist,
  not canonical definition.

Opens a new glossary section "Meta-algorithms and factory-native
coinages" so subsequent rounds have a visible landing spot for
the next ontology-home slice (candidates named in the memory
entry: DIKW->eye/i ladder, mu-eno triad, Tetrad registers,
Identity-absorption, Retractable teleport, Stainback conjecture,
Harm-handling ladder, etc.).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 42: pin Anthropic Skills Guide + retune skill-tune-up as thick eval-loop wrapper

Pins Anthropic's "Complete Guide to Building Skills for Claude" (Jan 2026,
28pp) as docs/references/anthropic-skills-guide-2026-01.pdf plus a
factory-authored companion docs/references/anthropic-skills-guide.md
extracting the load-bearing claims (structure, planning, testing,
iteration loops, patterns, troubleshooting) for citation by
skill-creator / skill-tune-up / skill-improver. docs/references/README.md
documents the three-part inclusion criterion and BP-11 (data not
directives) discipline for the dir.

Retunes .claude/skills/skill-tune-up/SKILL.md (303 -> 436 lines) from a
ranker-only skill into a thick wrapper over the upstream claude-plugins-
official skill-creator plugin's eval harness (scripts/run_loop.py,
aggregate_benchmark.py, eval-viewer/generate_review.py, agents/grader.md
+ analyzer.md). Carries the full hand-off protocol locally because the
wrapped artifacts are non-skill (plugin scripts + PDF) - wrapper
thickness is thick-as-needed; skill-on-skill wrappers usually end up
thin as a natural consequence.

Includes a new action x effort decision table, a five-step per-round
protocol, a round-close ledger row spec, and a "what this wrapper
deliberately does NOT ship" block. Mechanical edits continue to route
through Rule 1's manual-edit + justification-log path (the eval loop
adds no signal for a typo or an ASCII-lint fix).

Memory file feedback_skill_edits_justification_log_and_tune_up_cadence.md
cross-references the PDF and records the wrapper-thickness rule of thumb.

* Round 42: Copilot-reviewer wins log + lean-into-strengths calibration

Seeds docs/copilot-wins.md as the tabular parallel to docs/WINS.md: an
append-only newest-first log of genuine substantive catches from the
GitHub Copilot PR reviewer across PRs #27-31 (~30 catches across six
classes). Wins only - no "considered and rejected" bookkeeping, no fail
tracking. Opening paragraph is written for a sceptic reading cold,
since the log is evidence in the larger experiment of whether AI
reviewers can carry this factory forward with minimal human-in-the-
loop time.

Adds .github/copilot-instructions.md §"Lean into what you're
demonstrably good at" calibrated against the observed wins: cross-
reference integrity (xref), shell portability (shell), data-loss shell
bugs (data-loss), F#/C# compile-break catches (compile), self-
referential rule bugs (self-ref), and truth drift across the doc set
(config-drift). Names worth-less-effort classes too (repeat name-
attribution hits within one PR, typos inside verbatim-quote blocks).

Adds a cross-reference banner to docs/WINS.md pointing at the Copilot
sibling so both "was having AI reviewers worth it?" streams are
discoverable from the same place.

Log-maintenance recipe embedded in copilot-wins.md uses the correct
line-level review-comments endpoint: gh api repos/<owner>/<repo>/
pulls/<N>/comments with a jq filter for the copilot-pull-request-
reviewer bot login.
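
The filtering half of that recipe can also be done after the fact on the JSON the endpoint returns. A minimal sketch, assuming each comment object carries `user.login`, `path` and `body` fields as in GitHub's pull-request review-comment payload; the function name is hypothetical, and matching on a login fragment (rather than an exact login) is an assumption made to tolerate a `[bot]` suffix:

```python
def copilot_comments(comments, login_fragment="copilot-pull-request-reviewer"):
    """Keep only review comments whose author login contains the fragment."""
    kept = []
    for comment in comments:
        login = (comment.get("user") or {}).get("login", "")
        if login_fragment in login.lower():
            kept.append({"path": comment.get("path"), "body": comment.get("body", "")})
    return kept
```

The input would be the parsed output of the `gh api` call above, for example via `json.loads`.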

* Round 42: name the zero-human-code invariant in wins-log openers

The wins logs are the sceptic-facing evidence for the Zeta
experiment. Their openers read in a generic AI-assisted-
development register, but the actual story is narrower and
stronger: a 20-year engineer walking away from the keyboard
on purpose, every file under version control agent-authored,
Copilot as the only non-roster audit on the tree. Name both
invariants up front so the logs carry the weight they've
actually earned.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 42: round-close narrative

Ten-arc entry at the top of ROUND-HISTORY.md per newest-first
policy, documenting Round 42 as the first round where every
Round-41-founded cadence *repeats*:

- Arc 1 (fea0d34): speculative round-N+1 branch convention —
  fix for Round-41-late 28-fire /next-steps hold-pattern
- Arc 2 (e8ed0db): router-coherence v2 SKILL.md retargets —
  discharges Round-41 Arc-10 deferral
- Arc 3 (4f229f0): grandfather discharge #1 (BetaBernoulli
  Observe O(1), Stage 1 only) — first live use of v2 pipeline
- Arc 4 (8a2a15d): lsm-spine-family OpenSpec capability —
  Round-42 ADR slot, Viktor unconditional-rebuild on pass 3
- Arc 5 (3976cb3): TECH-RADAR Residuated + FastCDC Trial->Adopt
  after 25-round stability window
- Arc 6 (1a1802f): operator-algebra P1 absorb — 10 findings
  closed, capability disaster-recovery bar restored
- Arc 7 (db7d45c): ontology-home first slice — Harmonious
  Division homed in GLOSSARY.md
- Arc 8 (baa423e): Anthropic Skills Guide pinned + skill-
  tune-up retuned as thick eval-loop wrapper — first customer
  of the tech-best-practices policy
- Arc 9 (2c82ce7): Copilot-reviewer wins log + lean-into-
  strengths calibration
- Arc 10 (88673f1): zero-human-code invariant named in wins-
  log openers — vibe-coding external legibility

Round 42 observations for Round 43 + prospective BP-WINDOW
ledger table rendering the ten commits against the consent /
retractability / no-permanent-harm axes.

BP-10 invisible-Unicode lint clean (0 hits, 3260 lines total).
No source / spec / test / SKILL.md touched; single narrative
insertion at the top of the file.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 42: markdownlint fixes on round-close narrative

Two lint issues surfaced by markdownlint-cli2 on the prior
narrative commit (65cd1c9):

- MD018 line 43: `#31` at line start parsed as an ATX heading.
  Rewrapped so `PR #31` lands mid-line after `while`.
- MD032 line 104: `+ dispatcher)` at line start parsed as a
  list-item missing surrounding blank lines. Replaced with
  "plus dispatcher)" so the paragraph stays prose.

markdownlint-cli2 exit 0; BP-10 invisible-Unicode lint clean.
No content change — both fixes are whitespace-equivalent
reflows that preserve the narrative's words and structure.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 42: fix pipe-in-table lint drift on copilot-wins.md

Two MD056 errors on the PR-#27 and PR-#28 entries — literal
pipe characters inside backticks were being parsed as extra
table-column separators:

- Line 108 (PR #27): `||` at row starts → rendered as extra
  empty columns despite backtick quoting.
- Line 136 (PR #28): `grep -vE '^(#|$)' | while …`, where the
  escaped `\|` still failed at render.

Both replaced with `<code>…</code>` HTML tags + `&#124;`
entities for the literal pipes. Rendering is now consistent
across GitHub and markdownlint.
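
The replacement described above can be sketched as a small rewrite pass over a table row. The helper name is hypothetical and the regex assumes single-backtick code spans only; it is an illustration of the technique, not the edit that was actually applied.

```python
import html
import re

# Hypothetical helper: single-backtick code spans only.
_CODE_SPAN = re.compile(r"`([^`]*)`")

def escape_pipes_in_code_spans(row):
    """Rewrite code spans containing a pipe as <code> HTML with &#124; entities."""
    def fix(match):
        body = match.group(1)
        if "|" not in body:
            return match.group(0)            # pipe-free spans stay as backticks
        body = body.replace("\\|", "|")      # drop any earlier backslash escape
        body = html.escape(body, quote=False)  # keep <, >, & literal inside HTML
        return "<code>" + body.replace("|", "&#124;") + "</code>"
    return _CODE_SPAN.sub(fix, row)

print(escape_pipes_in_code_spans("| PR | `a || b` | ok |"))
```

Escaping HTML metacharacters before inserting the entity keeps the `&` of `&#124;` from being double-escaped.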

Meta-ironic class of drift worth naming: a log documenting
Copilot catching pipe-parsing bugs had drifted into the same
class of bug on two of its own rows. The log now passes the
hygiene test it narrates.

markdownlint-cli2 exit 0; BP-10 invisible-Unicode clean.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 42: Aarav round-42 ranking + BP-03 self-flag + harness-calibration annotation

Aarav (skill-tune-up) round-42 cadence discharge. Round-41
top-5 carries over; self-rank escalates to P1 #4 after
commit baa423e retuned skill-tune-up/SKILL.md 303 -> 436
lines (1.45x BP-03 cap). claims-tester / complexity-reviewer
hand-off carry-over from round 18 drops off top-5 (resolved
via commit e8ed0db + router-coherence-v2 ADR).
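
The cap arithmetic quoted above can be reproduced with a short helper. This is a sketch: the 300-line cap and the two line counts come from this message, while the function name is an assumption.

```python
BP03_CAP = 300  # line cap quoted in the commit message

def cap_report(line_count, cap=BP03_CAP):
    """Return (ratio to cap, lines over cap) for a file of `line_count` lines."""
    return round(line_count / cap, 2), line_count - cap

for lines in (303, 436):
    ratio, over = cap_report(lines)
    print(f"{lines} lines: {ratio}x cap, {over:+d} lines vs cap")
```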

Files:
- memory/persona/aarav/NOTEBOOK.md: round-42 observation
  + top-5 revision (skill-tune-up self escalated) + archived
  round-41 top-5 + calibration preamble flagging the ranking
  as static-signals-only with a harness run scheduled for
  round 43 (per Aaron's round-42 correction that "worst
  performance" claims must drive the Anthropic skill-creator
  eval harness rather than guessing by inspection).
- memory/persona/best-practices-scratch.md: F7-F9 live-search
  entries from Aarav's round-42 pass (Anthropic skill-
  authoring Apr 2026, OWASP Top 10 Agentic 2026, skill
  wrapper thick-vs-thin 2026). Zero contradictions with
  stable BP-NN; zero promotion candidates this round.
- docs/BACKLOG.md: P2 entry for resolving the skill-tune-up
  BP-03 self-breach. Binary remedy: (a) Kenji-ADR declaring
  non-skill-wrapper exception to BP-03 or (b) extract
  eval-loop protocol body to docs/references/ so the skill
  file shrinks under 300 lines. Composes with the
  skill-eval-tools calibration memory saved this round.

* Round 43: close skill-tune-up BP-03 self-breach via content extraction

Aarav's round-42 self-flag (BACKLOG P2, filed commit 45369ae)
resolved via the mechanical-edit path of the gate table.
.claude/skills/skill-tune-up/SKILL.md shrinks 436 -> 282
lines (18 under the 300-line BP-03 cap) by extracting two
reference blocks verbatim:

- §"The eval-loop hand-off protocol" (~130 lines) — the
  gate table, per-round protocol, stopping criteria, ledger
  row, and deliberately-not-reimplemented list.
- Notebook format + ranking-round output format templates
  (~55 lines).

Extracted content lives at docs/references/skill-tune-up-
eval-loop.md alongside the existing Anthropic skills guide
references. SKILL.md retains a short pointer block.

No change to triggering behaviour, output shape, or
instruction-following — the ranker reading the pointer-plus-
reference produces the same ranking output as the ranker
reading the pre-extract inline version. This is why the
manual-edit path (gate table "mechanical rename | content
extract preserving protocol verbatim") applies instead of
the full eval-loop path.

Files:
- .claude/skills/skill-tune-up/SKILL.md: 436 -> 282 lines.
- docs/references/skill-tune-up-eval-loop.md: NEW. Hosts
  the extracted protocol + templates + rationale.
- docs/skill-edit-justification-log.md: NEW. First row
  documents this extraction per
  memory/feedback_skill_edits_justification_log_and_tune_up_cadence.md
  Rule 1. Template for future mechanical-edit rows
  included.
- memory/persona/aarav/NOTEBOOK.md: self-flag #4 marked
  RESOLVED; drops off top-5 next invocation.

Does NOT rebut the round-42 harness-calibration memory
(feedback_skill_tune_up_uses_eval_harness_not_static_line_
count.md). That rule applies to "worst-performing" ranking
claims; this edit is a fix-my-own-size hygiene pass on the
mechanical-edit path, which is explicitly separate in the
gate table.

* Round 43: GOVERNANCE.md §11 → debt-intentionality invariant

Replace the architect-reviews-all-agent-code gate with the
invariant Aaron named verbatim on the round-42/43 boundary:
"that's intentional debt, not accidental debt, I'm trying to
avoid accidental debt."

- ADR: docs/DECISIONS/2026-04-20-intentional-debt-over-
  architect-gate.md. Full rationale, consequences, alternatives
  considered, implementation plan rounds 43-46, single-round
  rollback plan per §15.
- New ledger: docs/INTENTIONAL-DEBT.md. Newest-first,
  never-deleted. Seeded with 4 rows: copilot/CONFLICT-
  RESOLUTION audit (round-44 scope), skill-tune-up content
  extraction, Aarav static-signal-only ranking (retroactive),
  §10 cross-reference verification. Six-field format
  (shortcut / why-now / right-long-term / trigger / effort /
  filed-by).
- GOVERNANCE.md §11 rewritten: architect is synthesiser-not-
  gate; specialists remain advisory; any persona may wear
  the architect hat; self-declaration obligation on
  shortcut-takers; retroactive rows are the rule working.
- Internal §11 citations refreshed:
  .claude/agents/architect.md (description + Authority
  block), .claude/skills/round-management/SKILL.md (one
  line), .claude/skills/holistic-view/SKILL.md (frontmatter
  + body).
- Mechanical-edit row filed in docs/skill-edit-justification-
  log.md for the two skill-file citation refreshes.

External-contract files (copilot-instructions.md, CONFLICT-
RESOLUTION.md) deliberately deferred to round 44 per the
ADR implementation plan; that deferral is filed on the
ledger as its first open-debt row — the rule exercising
itself on round one.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 43: ROUND-HISTORY.md TOC + imagination-during-off-time proposal

- docs/ROUND-HISTORY.md now has a Contents section (27
  round-links, newest-first) just below the intro. Anchor
  links use standard markdown slugification. Archive policy
  noted inline: split pre-round-N to _archive/ when the file
  hits 5000 lines, keep this file as a rolling window of the
  most recent ~20 rounds. No ADR needed for a mechanical
  archive move.
- docs/research/imagination-proposal-2026-04-20.md proposes
  the lighter shape for "use your imagination during off-
  time" — a shared reference doc + notebook-frontmatter tweak
  + round-close-template line, not a new SKILL.md. Argues
  imagination is anti-procedural; encoding it as a skill
  would force it through the harness against the wrong
  axis. Round-43 addendum folds in Aaron's multi-agent-play
  permission ("two agents can take free time together") with
  a shared-notebook co-presence surface at memory/persona/
  _offtime-together/ and an explicit "ignore-this-if-you-
  want" clause quoted verbatim.

For Kenji to route via skill-creator if accepted, or to
reject outright (both are fine outcomes under the new §11 —
architect synthesises, doesn't gate).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 43: performance-analysis-expert harness dry-run — empirical BP-03 signal

Iteration-1 on Aarav's round-42 top-1 candidate. 2 prompts × with/without
skill. Results: aggregate 9/10 with-skill vs 10/10 baseline; +35% tokens and +35%
wall-time for zero pass-rate benefit. The with-skill run regressed on eval-0 (failed
600-word cap due to mandatory template sections); tied on eval-1.

The 642-line BP-03 breach is not just stylistic — it now has empirical
pass-rate + cost evidence. Aarav's SPLIT axis is partially confirmed, but
the real split is template-rigidity (mandated sections vs advisory), not
queueing-vs-AOT-PGO domain.

Lands:
- docs/research/harness-run-2026-04-20-performance-analysis-expert.md —
  full iteration-1 numbers, per-assertion grading rationale, SPLIT vs
  SHRINK vs OBSERVE remediation options, caveats (N=1, assertion-design
  missed handoff-routing value).
- Progress note on docs/INTENTIONAL-DEBT.md row #3 (Aarav static-signal
  ranking) — 1 of 5 candidates empirically harness-run; row stays open.
- .gitignore — .claude/skills/*-workspace/ pattern (iteration artifacts
  are regeneratable; only round-close signals land in-repo).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 43: reducer harness dry-run — TIED baseline, +30% cost

Second candidate from Aarav's static top-5 (570-line SKILL.md,
1.9x BP-03 cap). Two prompts × {with-skill, without-skill}:
quantum-razor-pruning + essential-vs-accidental. Both conditions
hit 10/10 assertions; with-skill cost +29% tokens, +30% wall-time
with zero pass-rate benefit.

Pattern across two candidates (performance-analysis-expert +
reducer): >500-line SKILL.md bodies add ~30% cost overhead
uniformly. Mandatory-sections structure (perf-analysis)
regresses on short-form prompts; lighter-framework structure
(reducer) ties baseline. SPLIT hypothesis not confirmed for
reducer — framework transfers to both lanes at equal cost.
Recommended action: OBSERVE with bias toward SHRINK; SPLIT
ruled out.

INTENTIONAL-DEBT.md row #3 gets second progress note;
3 candidates still pending (consent-primitives-expert next).

* Round 43: consent-primitives-expert harness dry-run — TIED baseline, +22% tokens/+5% wall

Third of Aarav's static-top-5 BP-03 candidates through the
Anthropic plugin:skill-creator eval harness. Continues the
round-43 pay-down on docs/INTENTIONAL-DEBT.md row #3
(Aarav ranked by static BP-03 line-count only — empirical
harness runs are the right signal).

Iteration-1 result:
- 2 evals x 2 configurations = 4 subagent runs
- scope-intersection-algebra (theory) + gdpr-audit-collision
  (applied)
- 10/10 with_skill vs 10/10 without_skill (TIED)
- +22.1% tokens, +4.7% wall-time (lowest cost overhead
  of the three candidates measured so far)
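
A sketch of how the with-skill vs without-skill comparison reduces to a pass-rate delta and two overhead percentages. The raw token and wall-time totals below are placeholders chosen only so the output matches the percentages reported above; the harness's real totals are not in this message.

```python
from dataclasses import dataclass

@dataclass
class RunTotals:
    passed: int     # assertions passed
    total: int      # assertions graded
    tokens: int     # tokens consumed (placeholder values below)
    wall_s: float   # wall-clock seconds (placeholder values below)

def overhead_pct(with_skill, baseline):
    """Percentage cost increase of the with-skill run over the baseline."""
    return round(100.0 * (with_skill - baseline) / baseline, 1)

def compare(with_skill, baseline):
    return {
        "pass_delta": with_skill.passed / with_skill.total
                      - baseline.passed / baseline.total,
        "token_overhead_pct": overhead_pct(with_skill.tokens, baseline.tokens),
        "wall_overhead_pct": overhead_pct(with_skill.wall_s, baseline.wall_s),
    }

# Placeholder totals, NOT measured data.
with_skill = RunTotals(passed=10, total=10, tokens=12_210, wall_s=104.7)
baseline = RunTotals(passed=10, total=10, tokens=10_000, wall_s=100.0)
print(compare(with_skill, baseline))
```

A zero `pass_delta` with positive overheads is the "TIED, cost is real" shape this commit reports.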

Pattern across three candidates now solid: on frontier-
model baselines, >500-line expert-skill SKILL.md files
do not improve pass-rate on content-graded prompts. Cost
is real (+22-35% tokens); benefit is zero on the pass-rate
axis. The discriminating signal is output character (which
failure modes get named), a qualitative axis the harness
benchmark does not score.

Recommended action for consent-primitives-expert: OBSERVE
(not SHRINK, not RETIRE). The 507 lines carry distinct
technical content per section; pruning risk is content-
loss, not just terseness. Revisit if/when a real round-
task invokes the skill and the framework-naming does not
prove load-bearing on real work.

Two static-top-5 candidates still pending harness runs.

* Round 43: BACKLOG P3 row — user-privacy compliance as slow-burn direction

Aaron 2026-04-20, after the consent-primitives-expert
harness dry-run, flagged GDPR + California (CCPA/CPRA) +
generic user-privacy compliance as a long-horizon Zeta
direction. Explicitly slow burn, no hard requirement yet,
but worth logging as an anchor so the direction is visible
when natural entry points appear.

Preferred shape (per Aaron): generic-first frame ("user
privacy") with GDPR / CCPA as regimes mapped onto the
substrate. Probable artefacts when it lands: a
user-privacy-expert skill umbrella + a companion doc,
citing rather than duplicating consent-primitives-expert.

Confirmation from the dry-run outputs that landed this
round: crypto-shredding (destroy per-subject DEK, leave
ciphertext in place) is regulator-accepted GDPR Art. 17
erasure — EDPB Opinion 28/2024, ENISA, GDPR Recital 26.
Canonical for the long-term-backup case Aaron's contact
mentioned (cannot rewrite tape archives; destroying the
DEK propagates erasure atomically). Gotchas logged in
memory: single-tenant DEK per subject, plaintext leaks
outside ciphertext, pre-encryption snapshots, KEK is the
perimeter.
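
A toy sketch of the crypto-shredding shape described above: one DEK per subject, an append-only archive that is never rewritten, and erasure by destroying the key. The class and method names are hypothetical, the XOR keystream is for illustration only (a real system would use an authenticated cipher), and KEK wrapping is deliberately left out.

```python
import hashlib
import secrets

def _keystream_xor(key, nonce, data):
    # TOY cipher, illustration only: XOR against a SHAKE-256 keystream.
    stream = hashlib.shake_256(key + nonce).digest(len(data))
    return bytes(a ^ b for a, b in zip(data, stream))

class SubjectVault:
    """Hypothetical store with one data-encryption key (DEK) per subject."""

    def __init__(self):
        self._deks = {}     # subject id -> DEK (the only erasable part)
        self.archive = []   # append-only (subject, nonce, ciphertext) records

    def write(self, subject, plaintext):
        if subject not in self._deks:
            self._deks[subject] = secrets.token_bytes(32)
        nonce = secrets.token_bytes(16)
        cipher = _keystream_xor(self._deks[subject], nonce, plaintext)
        self.archive.append((subject, nonce, cipher))

    def read(self, subject):
        dek = self._deks.get(subject)
        if dek is None:
            raise KeyError(f"DEK for {subject!r} destroyed; records unreadable")
        return [_keystream_xor(dek, n, c) for s, n, c in self.archive if s == subject]

    def shred(self, subject):
        # Erasure destroys the key; the archive itself is left untouched.
        self._deks.pop(subject, None)

vault = SubjectVault()
vault.write("alice", b"order history")
vault.write("bob", b"shipping address")
vault.shred("alice")
print(len(vault.archive), vault.read("bob"))
```

The sketch also shows why the per-subject key matters: shredding one subject leaves every other subject's records readable.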

No round-scope work today. Row is the anchor.

* Round 43: skill.yaml spike on prompt-protector — structured spec companion

Pilots the proposed pattern: every .claude/skills/<name>/SKILL.md
gets a sibling skill.yaml carrying structured fields that tools
(model-checkers, linters, schedulers) can consume directly. The
prose body stays in SKILL.md for Claude-facing consumption.

Aaron's framing: invariants are currently guesses; data-driven
everything. The spike encodes that directly — every field carries
one of three tiers:
- guess     — stated belief, no evidence collected
- observed  — at least one data point or audit supports it
- verified  — mechanical check or proof enforces it

The honest tally at the bottom is the burn-down list. On prompt-
protector's first-pass spec: 6 guesses, 5 observed, 2 verified.
Next-promotion-targets point at the three cheapest guesses to
retire (skills-lint script, one harness run for cost-profile,
dispatch-template extraction for safety-clause carryover).
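
The tally and burn-down list can be sketched as a pass over a field-to-tier mapping. The field names are placeholders (the real skill.yaml keys are not listed here); only the 6 / 5 / 2 split is taken from this message.

```python
from collections import Counter

TIERS = ("guess", "observed", "verified")

def tally(fields):
    """Count fields per evidence tier, rejecting tiers outside the three named."""
    counts = Counter(fields.values())
    unknown = set(counts) - set(TIERS)
    if unknown:
        raise ValueError(f"unknown tier(s): {sorted(unknown)}")
    return {tier: counts.get(tier, 0) for tier in TIERS}

def burn_down(fields):
    """Fields still at the 'guess' tier, i.e. the promotion backlog."""
    return sorted(name for name, tier in fields.items() if tier == "guess")

# Placeholder field names; only the tier counts come from the commit message.
tiers = ["guess"] * 6 + ["observed"] * 5 + ["verified"] * 2
spec = {f"field_{i:02d}": tier for i, tier in enumerate(tiers, start=1)}

print(tally(spec))
print(burn_down(spec))
```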

One file added; SKILL.md untouched. Deliberate — the spec
companion is additive. Schema is draft v0.1 — will evolve as
more skills migrate. Two candidates ready for round 44:
skill-tune-up (clear authority-scope + handoff contract to
skill-creator) and the SPACE-OPERA sibling of threat-model-critic
(clear state-machine for teaching-variant parity).

* Round 43: INVARIANT-SUBSTRATES.md — posture made first-class

Aaron 2026-04-20: "this should not be quiet, Zeta quietly
already has invariants-at-every-layer, it's first class in
my mind we should make it explicit."

Lands docs/INVARIANT-SUBSTRATES.md as a stance doc peer to
VISION.md and ALIGNMENT.md. Names the posture (every layer
has a declarative invariant substrate), maps layers to
substrates and checker portfolios (spec/protocol/proof/
constraint/property/data/code/skill/agent-behaviour/policy/
ontology), codifies the three-tier discipline (guess /
observed / verified) with burn-down counts as the honest
backlog, and explains why a multi-layer multi-vendor factory
can succeed where single-layer single-vendor .NET Code
Contracts (2008-2017) died.

VISION.md gets a pointer from the "verification is
load-bearing" bullet into the new doc.

Paired artefacts:
- .claude/skills/prompt-protector/skill.yaml — first concrete
  skill-layer substrate, draft v0.1 (round 43), 6 guess /
  5 observed / 2 verified / 13 total.
- memory/.../reference_dotnet_code_contracts_prior_art.md,
  user_invariant_based_programming_in_head.md — the
  head-invariant + prior-art memory substrate behind the
  posture.

* Round 43: factory-reuse-beyond-Zeta-DB captured as P3 constraint

Aaron 2026-04-20, mid-round, after the invariant-substrates
doc landed: "that's a constraint" — on making the software
factory and its codified practices reusable beyond Zeta-DB.
Explicitly NOT primary-goal scope today; logged so the
constraint shapes every factory-level decision going forward.

BACKLOG P3 row names the direction, the existing toehold
(skill-tune-up portability-drift criterion 7), the probable
packaging-decision surfaces (extraction unit, dependency shape,
living-BP refresh cadence, governance-overlay mechanism), and
the effort sizing (L when packaging starts, S-per-round for
constraint application).

Co-design rule recorded in memory:
`feedback_factory_reuse_packaging_decisions_consult_aaron.md` —
prior art exists (Claude Code plugins, Anthropic skills,
Semantic Kernel) but codified best practices for AI-software-
factory reuse do not. Aaron wants to co-define them; his
cognitive style loves best-practice thinking (captured in
`user_aaron_enjoys_defining_best_practices.md` — the activity
exercises the branch-prediction faculty from
`user_psychic_debugger_faculty.md`).

* …
AceHack added a commit that referenced this pull request Apr 21, 2026
- CONFLICT-RESOLUTION.md: cite router-coherence v2 ADR as current,
  v1 retained as historical record (finding #1).
- ROUND-HISTORY.md: correct operator-algebra spec line count in
  Arc 2 narrative (324 -> 365; both duplicated occurrences) to
  match the shipped spec at `e51ec1b` (finding #2).
- openspec-coverage-audit: drop broken link to non-existent
  inventory follow-up; band definitions already live in Part C
  (finding #3). Attribute triggering question to "human maintainer"
  per write-for-a-stranger norm (finding #8).
- best-practices-scratch: merge split H2 "uv-only Python package
  and tool / management" into single heading (finding #4).
- memory-role-restructure-plan: add --exclude-dir=references to
  baseline grep loops so research scratch doesn't inflate hit
  counts (finding #5); canonicalize flat-file destination to
  persona-roles-README.md to match the sed rewrites below
  (finding #6); replace three non-portable `xargs -r sed -i ""`
  invocations with portable `while read + sed -i.bak + rm` loops
  that work on BSD and GNU alike (finding #7 and two sibling
  instances of the same bug).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
AceHack added a commit that referenced this pull request Apr 21, 2026
…-round v2 supersedure + DORA substrate (#31)

* Round 41: OpenSpec coverage audit + backfill-program ADR

Answers Aaron 2026-04-20 delete-all-code-recovery question:
4 capabilities / 783 lines of spec.md vs 66 top-level F#
modules / 10,839 lines under src/Core/ — ~6% coverage today.

docs/research/openspec-coverage-audit-2026-04-21.md
- Inventory of 66 modules with line counts + capability
  mapping for the 4 existing capabilities
- Uncovered modules sorted by delete-recovery blast radius:
  Band 1 MUST BACKFILL (8 modules / 1,629 lines — ZSet,
  Circuit, NestedCircuit, Spine family, BloomFilter as
  Adopt-row compatibility-coupling exception), Band 2 HIGH
  (12 / 2,008), Band 3 MEDIUM (45 / 6,585), Band 4
  deliberately uncovered (AssemblyInfo only)
- First 6-round cadence: operator-algebra extension (41),
  lsm-spine-family (42), circuit-recursion (43),
  sketches-probabilistic (44), content-integrity (45),
  crdt-family (46)
- Success signal = Viktor spec-zealot adversarial audit:
  "could I rebuild this module from this spec alone?"

docs/DECISIONS/2026-04-21-openspec-backfill-program.md
- Adopts one-capability-per-round baseline with paper-grade
  half-credit rule (no more than 1 paper-grade round per 3)
- Band 1 priority until complete; Adopt-row escalation for
  BloomFilter (TECH-RADAR Adopt without spec contract is a
  backwards-compatibility hazard)
- Round-close ledger gains an `OpenSpec cadence` line
- Alternatives considered: big-bang backfill (rejected —
  ontology-landing cadence + reviewer bandwidth), per-module
  capabilities (rejected — loses cross-module invariants),
  organic prioritisation (rejected — 40 rounds of drift
  evidence)

docs/BACKLOG.md
- Collapses the 29-line P0 scope into a 15-line pointer at
  the inventory + ADR now that parts (a)-(e) of the program
  setup have landed. Remaining work = per-round capability
  backfill per ADR schedule.

Build: dotnet build -c Release clean; BP-10 ASCII-clean on
all 3 modified files; markdownlint-cli2 clean.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 41: operator-algebra spec extension (cadence ship)

First ship under the OpenSpec backfill program adopted
2026-04-21. Extends openspec/specs/operator-algebra/spec.md
(184 -> 324 lines) with five new requirements covering
structural and lifecycle gaps that the existing mathematical-
law coverage left implicit:

1. Operator lifecycle — construction / step / after-step /
   reset phases with side-effect-freedom on construction and
   epoch-replay semantics on reset
2. Strict operators break feedback cycles — formalises that
   z^-1-on-feedback is a scheduling prerequisite and that
   cycle-without-strict is a construction error, not a
   silent heuristic
3. Clock scopes and tick monotonicity — nested-scope-to-
   fixpoint rule + sibling-scope independence
4. Incremental-wrapper preserves the chain rule —
   Incrementalize(Q) observably equivalent to D . Q . I,
   with linear/bilinear substitution permitted as an
   optimisation
5. Representation invariants of the reference Z-set —
   O(n+m) group ops + zero-alloc iteration as the reference
   contract; hash-table recoveries permitted at documented
   perf trade-off
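
Requirement 4 can be illustrated with a toy model in which a Z-set is a dict from element to integer weight and a stream is a list of Z-sets. All names below are hypothetical Python stand-ins, not the Zeta.Core API. The sketch builds `D . Q . I` directly and checks that, for a linear query such as a filter, it agrees with applying the query to the deltas, which is the substitution the requirement permits.

```python
def zadd(a, b):
    """Pointwise sum of two Z-sets, dropping zero weights."""
    out = dict(a)
    for key, weight in b.items():
        out[key] = out.get(key, 0) + weight
        if out[key] == 0:
            del out[key]
    return out

def zneg(a):
    return {key: -weight for key, weight in a.items()}

def integrate(deltas):          # I: running sum of the delta stream
    acc, out = {}, []
    for delta in deltas:
        acc = zadd(acc, delta)
        out.append(acc)
    return out

def differentiate(stream):      # D: s[t] - s[t-1], with s[-1] empty
    prev, out = {}, []
    for snapshot in stream:
        out.append(zadd(snapshot, zneg(prev)))
        prev = snapshot
    return out

def lift(query):                # apply a per-tick query to every tick
    return lambda stream: [query(z) for z in stream]

def incrementalize(query):      # D . Q . I
    return lambda deltas: differentiate(lift(query)(integrate(deltas)))

def keep_even(z):               # linear: a filter
    return {k: w for k, w in z.items() if k % 2 == 0}

def distinct(z):                # not linear
    return {k: 1 for k, w in z.items() if w > 0}

deltas = [{1: 1, 2: 1}, {2: 1, 4: 1}, {2: -2, 1: -1}]
print(incrementalize(keep_even)(deltas) == lift(keep_even)(deltas))
print(incrementalize(distinct)(deltas) == lift(distinct)(deltas))
```

For `distinct` the shortcut does not hold on this input, so the general `D . Q . I` form is the one that stays correct.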

Disaster-recovery effect: a contributor with only this spec
(plus the durability-modes + retraction-safe-recursion specs)
can now rebuild Circuit.fs Op base + Incremental.fs wrapper +
ZSet.fs representation invariants from the spec text alone.

Owner: Architect (Kenji). Adversarial audit by Viktor
(spec-zealot) is the ADR-declared ship-gate and will run
post-land.

Build: not rebuilt (no F# source changed); markdownlint
clean; BP-10 ASCII clean.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 41: close Viktor P0 findings on operator-algebra spec

Viktor's adversarial audit of the Round 41 cadence ship (commit
e51ec1b) surfaced four P0 findings against the disaster-recovery
bar. This commit closes all four:

- **P0-1 (namespace drift).** `profiles/fsharp.md` asserted
  `Dbsp.Core` throughout, but `src/Core/**` uses `Zeta.Core`. A
  spec-only recovery would have shipped the wrong namespace to
  every downstream consumer. Replaced via one `replace_all` Edit.

- **P0-2 (phantom Reset method).** The lifecycle requirement
  claimed a `reset` phase that does not exist on `Op`. Replaced
  the "reset replays the epoch" scenario with a
  determinism-under-structural-equivalence property: two
  freshly-constructed circuits of the same topology, stepped
  with the same input sequence, MUST produce identical outputs
  at every tick. Reconstruction is the supported route to a
  replayed epoch.

- **P0-3 (after-step scope).** The lifecycle requirement said
  after-step runs "after every operator in the scope has
  completed its step." `Circuit.fs:205-208` iterates the
  `strictN` array only — after-step is selective to strict
  operators. Fixed wording and added a "after-step is selective
  to strict operators" scenario that pins the invariant.

- **P0-4 (lifecycle phase undercount).** The requirement named
  four phases (construction / step / after-step / reset) but
  the code has five (construction / step / after-step /
  clock-start / clock-end). Restructured to three per-tick
  phases plus two scope-boundary phases, and extended the
  "clock scopes and tick monotonicity" requirement with the
  scope-boundary lifecycle contract (clock-start before tick 0
  of a scope, clock-end after fixpoint or iteration cap).
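
The determinism-under-structural-equivalence property from P0-2 can be sketched as a two-instance check. The operators below are toy stand-ins with hypothetical names, not `Op` from `Circuit.fs`: build two fresh instances, step them with the same inputs, and report the first tick where the outputs diverge.

```python
import random

class RunningSum:
    """Toy stateful operator; all state is per-instance."""
    def __init__(self):
        self.total = 0
    def step(self, x):
        self.total += x
        return self.total

class LeakySum:
    """Counterexample: state shared across instances via a class attribute."""
    shared_total = 0
    def step(self, x):
        LeakySum.shared_total += x
        return LeakySum.shared_total

def first_divergence(build, inputs):
    """Step two freshly built instances in lockstep; return the first
    tick whose outputs differ, or None if they agree on every tick."""
    a, b = build(), build()
    for tick, x in enumerate(inputs):
        if a.step(x) != b.step(x):
            return tick
    return None

rng = random.Random(42)                       # fixed seed keeps the run reproducible
inputs = [rng.randint(1, 5) for _ in range(100)]
print(first_divergence(RunningSum, inputs))   # agrees on every tick
print(first_divergence(LeakySum, inputs))     # hidden shared state diverges
```

Because reconstruction is the supported route to a replayed epoch, any state that survives outside the instance shows up here as a divergence.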

Build green (0 warnings / 0 errors). BP-10 lint clean. The
capability now reflects the code's observable shape rather than
an idealised cleaner cousin; a delete-recovery from this spec
produces Zeta.Core with strict-operator after-step selectivity
and nested-scope clock-boundary phases.

Viktor's 10 P1 findings (async lifecycle, memory-ordering fence,
register-lock semantics, IncrementalDistinct surface, ZSet sort
invariant, Checked arithmetic, bilinear-size overflow,
convergence-vs-cap, Op.Fixedpoint predicate, DelayOp
reconstruction-first-tick) are deferred to Round 42 and filed as a
BACKLOG sweep in follow-up work.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 41: file Viktor P1 findings as Round 42 BACKLOG absorb

Companion to 92d7db2 (closing Viktor's four P0 findings). The
ten P1-tier surface gaps Viktor identified do not block the
disaster-recovery bar at capability-close but leave the
operator-algebra spec incomplete relative to what a delete-
recovery produces. Filed as a dedicated P0 sub-item so they
travel with the OpenSpec backfill program rather than getting
lost: async lifecycle, memory-ordering fence, register-lock
semantics, IncrementalDistinct surface, ZSet sort invariant,
Checked arithmetic, bilinear-size overflow, convergence-vs-cap,
Op.Fixedpoint predicate, DelayOp reconstruction-first-tick.

Also annotated the parent OpenSpec coverage entry with Round 41
sweep status (e51ec1b + 92d7db2, P0s closed, P1s deferred) so
the backlog accurately reflects where the program stands.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 41: ROUND-HISTORY entry — OpenSpec backfill founding + first cadence ship

Four-arc entry at the top of the file per newest-first policy:

- Arc 1 (d435126): OpenSpec coverage audit + backfill-program
  ADR. Measured 6% coverage; declared one-capability-per-round
  baseline with paper-grade half-credit and Adopt-row priority
  escalation; banded 66 F# modules by delete-recovery blast
  radius.
- Arc 2 (e51ec1b): operator-algebra extension as Round-41
  cadence ship. Five new requirements covering lifecycle,
  strict-operator scheduling, clock scopes, Incrementalize
  wrapper, ZSet representation invariants.
- Arc 3 (92d7db2): Viktor P0 close. Four drift-from-code
  defects fixed — namespace (Dbsp.Core → Zeta.Core), phantom
  Reset, after-step scope (strict-only), lifecycle phase
  undercount (3 per-tick + 2 scope-boundary).
- Arc 4 (56f34b5): Viktor P1s filed as Round-42 absorb under
  the parent backfill P0, creating mechanical coupling between
  each capability ship and the following round's P1 sweep.

Round-41 observations for Round 42 + prospective BP-WINDOW
ledger table rendering the four commits against the consent /
retractability / no-permanent-harm axes.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 41: memory-folder role-restructure — design plan + BACKLOG pointer

Aaron 2026-04-19 asked for memory/role/persona/ so roles become
first-class in the directory structure. Surface is wider than
it first looks — 114 files / ~260 hand-written references to
memory/persona/ paths (plus ~440 auto-regenerated references
in tools/alignment/out/ that refresh on next citations.sh run).
A bad role axis is hard to reverse; this design doc proposes
the axis and holds execution for Aaron's sign-off rather than
just-doing-it under Auto Mode.

Design plan lands at:
  docs/research/memory-role-restructure-plan-2026-04-21.md

Contents: 13-directory role axis (architect, security,
verification, review, experience, api, performance, devops,
algebra, skill-ops, maintainer, homage, alignment);
persona-to-role crosswalk for every current directory;
5-phase execution plan (pre-flight greps → git mv → sed
passes → 5-check verification → pointer-source updates);
special-case handling for aaron (human maintainer),
rodney (homage-named AI persona on the reducer skill),
sova (emerging alignment-observability role); rollback
plan (one atomic commit, git revert); four open questions
for Aaron on axis judgement-calls.

BACKLOG entry updated to reflect design-landed state with
execution-slot recommendation for Round 42 opener after the
Round 41 PR merges (keeps wide-surface reviews from
overlapping).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 41: actualise Rounds 37-40 BP-WINDOW ledgers (PR #30 merged)

Rounds 37-40 shipped via PR #30 (merge commit 1e30f8c, 2026-04-20).
Ledger headers updated from "(prospective)" to "(merged via PR #30,
1e30f8c)" — the BP-WINDOW scores are now settled, not forecasts.

Round 41 ledger remains "(prospective)" — round-41 branch has not
merged to main yet.

Prose uses of "prospective" on lines 437, 447, 553, etc. are
historical-narrative commentary on authoring-time methodology and
stay as-is.

* Round 41: Soraya tool-coverage audit on RecursiveSigned skeleton

Round 39 observation flagged src/Core/RecursiveSigned.fs +
tools/tla/specs/RecursiveSignedSemiNaive.tla as held pending
formal-verification-expert tool-coverage review. Round 41 closes
that gate.

Soraya's notebook entry lands:

- Per-property tool table S1-S4 + refinement cross-check. TLC
  primary for S1/S2/S3/S3'/SupportMonotone; FsCheck for S4.
- S2 flagged as the one P0 on the spec (silent fixpoint drift
  unrecoverable); BP-16 requires Z3 QF_LIA cross-check.
- Refinement mapping: FsCheck cross-trace (signed vs counting at
  SeedWeight=1) wins over TLA+ refinement proof or Lean lemma —
  anti-TLA+-hammer, implementation-level where the bug bites.
- Readiness gate: TLA+ spec is ready to model-check; no pre-TLC
  pass needed. Optional round-42 follow-up: add
  PROPERTY EventuallyDone to .cfg for liveness.
- Graduation verdict: CONDITIONAL PASS. Four tool-coverage
  prereqs named in priority order; F# landing gated on them.

Files read (no edits): RecursiveSigned.fs, RecursiveSignedSemiNaive.tla
/cfg, RecursiveCountingLFP.tla, retraction-safe-semi-naive.md.

* Round 41: capture Soraya's 4 tool-coverage prereqs on RecursiveSigned

Soraya's round-41 audit of src/Core/RecursiveSigned.fs +
tools/tla/specs/RecursiveSignedSemiNaive.tla landed as a CONDITIONAL
PASS for Round-42 graduation. This commit lifts the four named
prereqs out of her notebook into BACKLOG sub-items under the
parent "Retraction-safe semi-naive LFP" entry, so the round-42
opener picks them up as checkbox work rather than having to re-read
the notebook.

Prereqs in priority order:
- Prereq 1 — TLC CI wire-up (RecursiveSignedSemiNaive.cfg)
- Prereq 2 — Z3 QF_LIA lemma for S2 FixpointAtTerm (BP-16 cross-check
  on the one P0; TLC alone insufficient for silent-fixpoint-drift risk)
- Prereq 3 — FsCheck property for S4 sign-distribution (anti-
  TLA+-hammer; two-trace quantification is NOT a TLA+ property)
- Prereq 4 — FsCheck cross-trace refinement (signed vs counting
  at SeedWeight = 1); cites BP-16

Round-42 graduation gate also captured: prereqs 1-4 CI-green + F#
implementation with P1/P2/P3 enforced at caller.

* Round 41: extend ROUND-HISTORY with arcs 5-7 (post-narrative commits)

The initial Round 41 ROUND-HISTORY entry (6e6e211) covered arcs
1-4 (coverage audit, operator-algebra cadence ship, Viktor P0
close, Viktor P1 file). Three more arcs landed after:

Arc 5 — ROUND-HISTORY narrative + memory-restructure design
(6e6e211, 36797ba). The memory-folder rename was downgraded to
"design plan + sign-off first" under Auto Mode's
do-not-take-overly-destructive-actions clause (700-occurrence
cross-reference surface).

Arc 6 — BP-WINDOW ledger actualisation for Rounds 37-40
(85fb352). Provenance (PR #30 / 1e30f8c) attached to each
"(prospective)" header.

Arc 7 — Round-35 holdover close (e461d9c, 15e9654). Soraya
tool-coverage audit landed CONDITIONAL PASS for Round-42
graduation; four prereqs captured as BACKLOG sub-items with
BP-16 citation on the S2 Z3 cross-check.

Also: one new observation line in the Round-42 handoff section
noting the holdover-closed-same-round-as-cadence-item pattern.
BP-WINDOW ledger gains three rows.

* Round 41: Aarav skill-tune-up ranking (catch-up from round-18 stale)

CLAUDE.md 5-10 round cadence rule was 23 rounds overdue. Round 41
is the catch-up slot. Live-search + full ranking + prune pass all
landed in a single invocation.

Live-search (4 queries, 2026-Q1/Q2 best-practices targets):
- 6 findings logged to best-practices-scratch.md: Gotchas-section
  rise, pushy-descriptions pattern, Claude-A-authors / Claude-B-
  tests, router-layer command-integrity injection class, Agent
  Stability Index 12-dim drift metric, OWASP Intent Capsule
  pattern.
- Zero contradictions with stable BP-NN rules.
- Zero promotions flagged to Architect this round; all six are
  "watch" or route-elsewhere.

Top-5 skills flagged for tune-up:
1. performance-analysis-expert (642 lines, 2.1x BP-03 cap) — SPLIT — M
2. reducer (570 lines) — SPLIT or TUNE (prune) — M
3. consent-primitives-expert (507 lines) — SPLIT honouring BP-23
   theory/applied axis — M
4. claims-tester / complexity-reviewer router-coherence drift —
   HAND-OFF-CONTRACT — S (round-18 carry-over)
5. skill-tune-up (self) — 303 lines, 3 over BP-03 — TUNE (prune
   authoritative-sources duplicated with AGENT-BEST-PRACTICES.md)
   — S. Self-flagged first per BP-06.

Notebook state:
- Stale round-18 top-5 archived in Pruning log (first catch-up prune).
- 912 words, well under 3000-word BP-07 cap.
- ASCII-only, BP-10 clean.

Nine more bloat-row skills named as notable mentions queue behind
the top-3 bloat cases.

* Round 41: ADR — claims-tester/complexity-reviewer hand-off contract

Close Aarav's round-18 HAND-OFF-CONTRACT finding (carried 23 rounds
after ranker went offline by cadence). Two-stage pipeline: analytic
bound first (complexity-reviewer), empirical measurement second
(claims-tester). Names the reverse trigger (benchmark surprise flows
the other direction) and the decision table for who fires when.
Follow-up SKILL.md edits route via skill-creator per GOVERNANCE §4.

* Round 41: extend ROUND-HISTORY with Arc 8 (router-coherence ADR)

Arc 8 covers the claims-tester/complexity-reviewer hand-off ADR
(47d92d8) closing Aarav's 23-round-stale round-18 HAND-OFF-CONTRACT
finding. New observation on cadence-outage-recovery as a design axis:
sweep infrastructure is subject to the same bitrot it detects on other
surfaces. BP-WINDOW ledger gains two rows (085c0e3 Aarav catch-up,
47d92d8 router-coherence ADR).

* Round 41: correct Prereq 1 sizing — no TLC CI job exists

Close-out audit surfaced that .github/workflows/gate.yml only CACHES
the tla2tools.jar artefact; nothing runs it. RecursiveCountingLFP.tla
has shipped since round 19 compile-checkable-only — 22 rounds with no
run-gate against its invariants. Soraya's Prereq 1 re-sized S→M with
expanded scope covering both specs. Finding recorded as new round-41
observation: verifier-present does not imply verifier-actually-runs.

* Round 41: BP-WINDOW ledger — 459b218 + d76a09b rows

Keeps the Round 41 BP-WINDOW ledger commit-aligned rather than
arc-aligned. 459b218 is the Arc-8 narrative itself; d76a09b is the
Prereq-1 S→M correction. Both retractable as single reverts.

* Round 41: file formal-analysis-gap-finder round-42 run — verifier-runs lens

Codifies the round-41 Prereq-1 audit finding as a tracked
research entry, distinct from its ROUND-HISTORY narrative
presence. The finding — a verifier's installation artefacts
do not imply the verifier is exercised by any CI job — is
exactly the class formal-analysis-gap-finder exists to
surface. Concrete motivating case: RecursiveCountingLFP.tla
compile-checkable-only for 22 rounds. Round-42 scope covers
the bidirectional audit (specs without gates + gates without
specs). Handoff to Soraya per the skill's standing contract;
does not write the spec or CI job (DevOps + Soraya work).
Schedules after Prereq 1 lands so the audit sees corrected
state.

* Round 41: BP-WINDOW ledger — 2042a85 row

Per the established stopping rule (meta-ledger commits do not
get self-referential rows; their round-close coverage is the
PR merge), this commit adds only the 2042a85 row and does not
add a row for itself.

* Round 41: CONFLICT-RESOLUTION — Hiroshi ↔ Daisy hand-off row

Closes ADR 47d92d8's third follow-up action item. Single-row
addition to Active tensions citing the router-coherence ADR as
the standing resolution. Doc-only edit (not a SKILL.md touch,
so GOVERNANCE §4 does not gate this). The other two ADR
follow-ups (claims-tester + complexity-reviewer SKILL.md
updates) remain deferred to round 42 via skill-creator
workflow.

* Round 41: BP-WINDOW ledger — fcfa3d9 row

Per-commit ledger discipline for the CONFLICT-RESOLUTION
Hiroshi ↔ Daisy row. Meta-ledger-only commit so no
self-referential row for this commit itself (established
stopping rule).

* Round 41: file harsh-critic findings on ADR 47d92d8 as round-42 supersedure backlog

Router-coherence ADR 47d92d8 (Hiroshi analytic ↔ Daisy empirical
two-stage pipeline) landed without the adversarial-review gate.
Post-landing harsh-critic (Kira) pass surfaced 3 P0 + 5 P1 + 2 P2
substantive findings, including (P0-1) unscoped grandfather
clause, (P0-2) table-vs-prose contradiction on reverse trigger,
(P0-3) Stage-1 "analytically wrong" clause blocking the evidence
loop for escalation, (P1-7) no escalation timebox reproducing the
23-round-stale failure mode the ADR diagnosed, (P1-8) two advisory
skills not composing to a mandatory pipeline without a binding
dispatcher, (P2-9) example-bug on BCL Dictionary.Remove amortised
complexity, and more.

File as round-42 supersedure rather than inline-edit because
docs/CONFLICT-RESOLUTION.md already cites 47d92d8 as Standing
Resolution — supersedure preserves the citation chain via
GOVERNANCE §2 edit-in-place with a "Superseded by …" header on
v1. New ADR target: docs/DECISIONS/2026-04-??-router-coherence-v2.md.
Supersedure work blocks the claims-tester + complexity-reviewer
SKILL.md updates listed as ADR 47d92d8 follow-ups; those edits
should target v2, not v1.

Owner: Architect drafts; Kira audits closure; Aarav confirms
router-coherence drift stays closed. Effort: M. Schedule: Round
42 slot after Soraya Prereq 1 (TLC wire-up) lands.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 41: BP-WINDOW ledger — 779d7ef row

Ledger row for harsh-critic findings filing commit. Primary work
(BACKLOG addition tracking a round-42 supersedure with 10 named
findings), not meta-ledger — earns a row under the BP-WINDOW
per-commit discipline. Consent = adversarial findings tracked
honestly; Retractability = supersedure preserves citation chain
vs inline-edit; No-permanent-harm = single BACKLOG edit, no ADR
body touched, no SKILL.md touched.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 41: Arc 9 narrative — self-correction sweep

ROUND-HISTORY Arc 1-8 narrated primary commits up through the
router-coherence ADR (47d92d8). Four primary commits landed
after Arc 8 — Prereq 1 sizing correction (d76a09b), recurring-
audit lens BACKLOG entry (2042a85), CONFLICT-RESOLUTION Hiroshi
↔ Daisy row (fcfa3d9), and harsh-critic findings filed as
round-42 supersedure (779d7ef) — visible only in the BP-WINDOW
ledger table, not in narrative form.

Arc 9 ties them into one coherent sequence: the round's
self-correction ran unusually deep. Arc 8 corrects Aarav's
round-18 finding via ADR; Arc 9 catches the corrector itself
under-reviewed via Kira's adversarial pass. Both
self-corrections land before round-close. Narrative-ledger
alignment is the BP-WINDOW discipline's first assertion, and
this commit restores it.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 41: BP-WINDOW ledger — 160fcfa row

Ledger row for Arc 9 narrative commit. Narrative extensions
count as primary work under BP-WINDOW precedent (per 459b218
and 6e6e211 examples) and earn a ledger row. Consent = drift
closed honestly; Retractability = single revertable doc edit;
No-permanent-harm = isolated insertion.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 41: v2 ADR — router-coherence supersedure closes 10 Kira findings in-round

Drafts v2 of the router-coherence ADR (docs/DECISIONS/2026-04-21-router-coherence-v2.md) that supersedes v1 (47d92d8) in the same round, closing all 10 Kira harsh-critic findings (3 P0 + 5 P1 + 2 P2) via named textual closures C-P0-1 through C-P2-10.

Key closures:
- C-P0-1: grandfather clause bounded with Kenji-owned inventory + one-per-round discharge
- C-P0-2: reverse trigger unconditional (table now matches prose)
- C-P0-3: escalation-evidence exception permits Stage 2 under conference protocol with explicit labelling
- C-P1-5: Stage-1 trigger widened to match claims-tester SKILL.md contract
- C-P1-7: escalation timebox (round +2 auto-promote to BACKLOG P1) prevents 23-round-stale reproduction
- C-P1-8: Kenji named as binding dispatcher — advisory + advisory + binding-dispatcher composes to mandatory pipeline
- C-P2-9: Dictionary.Remove example replaced with ArrayPool<T>.Rent (legitimate BCL-contract edge)

v1 kept in place per GOVERNANCE §2 with Superseded-by header appended in a follow-up commit so the CONFLICT-RESOLUTION Active-tensions citation chain remains resolvable.

BP-10 lint: clean.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 41: v1 ADR — append Superseded-by header per GOVERNANCE §2

Appends Superseded-by header to router-coherence v1 ADR (47d92d8) pointing at v2 (09f0889), per GOVERNANCE §2 (docs read as current state; superseded ADRs keep v1 in place with redirect header so citation chains remain resolvable).

Also corrects v1 Status from "Proposed — awaits sign-off" to "Accepted (pre-adversarial-review; superseded by v2 same-round after Kira pass)" per Closure C-P1-4 in v2 — Status was already cited as Standing Resolution in docs/CONFLICT-RESOLUTION.md Active-tensions, so Proposed was factually wrong.

The v1 body text is not edited — supersedure preserves the historical record; v2 carries the closures.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 41: Arc 10 narrative + BP-WINDOW rows for v2 supersedure

Adds Arc 10 narrative covering 09f0889 (v2 ADR) and 4efe545 (v1 Superseded-by header) as one coherent in-round supersedure story, after Arc 9's "self-correction sweep" and before Round 41 observations. Pattern: Arc 9 surfaces the under-review; Arc 10 lands the close in the same round rather than deferring a known-imperfect artefact.

Adds two BP-WINDOW ledger rows (09f0889, 4efe545) to the round-41 ledger block per the per-commit accounting discipline.

Supersedure arc count now covers the full round-41 close: 10 arcs / 25 primary-work commits.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 41: close BACKLOG supersedure entry — discharged in-round by v2

Flips BACKLOG router-coherence supersedure entry from [ ] to [x] ✅ with "shipped round 41 in-round" annotation pointing at v2 ADR (09f0889) + v1 Superseded-by header (4efe545). All 10 Kira findings closed via named textual closures C-P0-1 through C-P2-10.

Original finding narrative preserved below the closure line per the shipped-item convention used elsewhere in the file (audit trail).

Follow-up SKILL.md edits to claims-tester + complexity-reviewer via skill-creator remain round-42 scope, now targeting v2 as intended.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 41: BP-WINDOW row for BACKLOG-close commit 4537365

Adds BP-WINDOW ledger row for 4537365 (BACKLOG supersedure entry discharged in-round) to match the Arc 9 precedent where 779d7ef (BACKLOG entry addition) received a row. Symmetry: add and close get equal ledger treatment.

Meta-ledger stopping rule still holds — this commit itself (which only adds a ledger row) does not get a self-referential row.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 41: grandfather O(·) claims inventory — honours v2 C-P0-1 within-round

Produces the one-time grandfather-claims inventory named in router-coherence v2 ADR §Closure C-P0-1 within the round v2 lands, per ADR's own within-round commitment.

Inventory: 35 live claims at ADR-landing time (29 F# /// docstrings in src/Core/ + src/Bayesian/, 3 grey-zone F# code comments, 1 openspec/specs/operator-algebra/spec.md line, 2 docs/research/** claims). Zero hits in root README, memory/persona/*/NOTEBOOK.md, docs/papers/** (directory does not exist yet).

Distinguishes live claims (shipping as asserted bounds) from historical evidence (BACKLOG [x] ✅ residue, TECH-RADAR flag-text narrating past regressions, in-file "was O(…)" commentary on fixed paths). Only live claims populate the grandfather set — evidence is captured for audit trail but excluded per v2's intent ("claims Zeta is currently making").

BACKLOG discharge entry added: P2, one-claim-per-round cadence, ~35-round tail, Aarav graceful-degradation clause fires on ≥3 rounds without discharge.

Complexity-class distribution of live set: 10 O(1), 13 O(log n)/O(log k)/O(log N), 7 O(n)/O(n log n)/O(n log k), 5 parametric.

BP-10 lint: clean.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 41: Arc 11 narrative + BP-WINDOW row for grandfather inventory

Adds Arc 11 narrative covering d98ef2b (grandfather inventory + BACKLOG discharge entry) as the close of the v2 ADR's within-round commitments. Pattern: Arc 10 lands the ADR; Arc 11 lands the ADR's own within-round commitment — without Arc 11, Arc 10 would have shipped a contract Zeta didn't meet.

Adds a BP-WINDOW ledger row for d98ef2b per the per-commit accounting discipline.

Round 41 now closes at 11 arcs / 30 primary-work commits.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 41: DORA 2025 reports — reference substrate lands in docs/

Two external-anchor PDFs (CC BY-NC-SA 4.0) placed at their
memory-documented paths:

- docs/2025_state_of_ai_assisted_software_development.pdf
  (~15MB, 138 pages) — findings + data report.
- docs/2025_dora_ai_capabilities_model.pdf (~9MB, 94 pages)
  — framework companion.

Citation anchors this commit makes in-tree rather than
memory-only: Nyquist stability criterion for AI-accelerated
development (foreword p9 fn 1) as theoretical anchor for
CI-meta-loop + retractable-CD P1 BACKLOG work; "AI is an
amplifier" anchor that echoes the corporate-religion /
sandbox-escape threat class; seven-capability AI model that
gives the external measurement vocabulary for round-audit
output (capability #7 "quality internal platforms" is the
in-flight P1 cluster per 2026-04-20 memory).

License note: derived work is NC-SA-bound; Zeta citations
are fine, external redistribution inherits NC-SA. Paired
companion memory file is reference_dora_2025_reports.md
(out-of-tree); this commit brings the primary sources
in-tree so citation from research docs + ADRs can point
at a repo-local path rather than a newsletter-gated URL.

* Round 41: Arc 12 narrative + BP-WINDOW row for DORA substrate

Narrative section for Arc 12 inserted before "Round 41
observations for Round 42" with primary commit pointer to
46075d6. Arc 12 frames the DORA 2025 PDFs as
memory-promotion substrate per the 2026-04-20 feedback entry
("DORA is our starting point for measurements") and cites
the concrete in-tree anchors (Nyquist p9 fn 1, seven-
capability model, AI-amplifier thesis).

Also surfaces honestly — in-body, not buried in a private
retrospective — the ranker-scope gap that let the two
untracked PDFs sit 18+ hours through nine consecutive
/next-steps invocations before this arc closed the gap. The
skill explicitly lists docs/research/ and docs/TECH-RADAR.md
but not `git status --short` for untracked files. Candidate
skill-tune-up note for Aarav's notebook: /next-steps must
run `git status --short` on every invocation so dropped-in
artefacts appear in ranking before the ninth re-fire, not
after.
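
A minimal sketch of the untracked-file check described above, assuming the ranker can shell out to git. The function name `untracked_paths` is hypothetical and is not part of the /next-steps skill; it only filters `git status --short` output down to the lines git marks as untracked (`??`).

```shell
# Hypothetical filter: read `git status --short` output on stdin and print
# only the untracked paths (lines carrying the "??" status code).
# Caveat: git quotes paths with unusual characters and reports an untracked
# directory as a single "dir/" entry; this sketch passes both through as-is.
untracked_paths() {
  sed -n 's/^?? //p'
}

# Typical use from a repository root (requires git):
#   git status --short | untracked_paths
```

Because the filter reads from stdin, it can be exercised on captured status output without a live repository.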

BP-WINDOW ledger gets a matching 46075d6 row with
reference-document-specific cells: Consent strengthened by
promoting memory-only anchors to in-repo substrate and by
surfacing the ranker-stall pattern in-narrative; retraction
is a single `git rm` if the license / size stance later
changes; no-permanent-harm preserved since no runtime
behaviour depends on the PDFs' presence (they are citation
substrate, not loaded artefacts).

Arc count now 12; primary-work-commit count now 12 (Round 41
alignment preserved). Build gate green (0 Warning / 0 Error);
BP-10 lint clean on the narrative + ledger row.

* Round 41: markdownlint CI fix on PR #31

Three rule violations surfaced by `lint (markdownlint)` CI job on
PR #31:

- `docs/DECISIONS/2026-04-21-router-coherence-claims-vs-complexity.md:261`
  MD022/blanks-around-headings — collapse multi-line heading
  `## Decision rationale (one paragraph for the\nwait-don't-read
  audience)` to a single line so the parser stops seeing line 262
  as adjacent non-blank content.
- `docs/research/grandfather-claims-inventory-2026-04-21.md:106`
  MD032/blanks-around-lists — add blank line between "Surface
  distribution:" lead-in and the `-` list that follows.
- `docs/research/grandfather-claims-inventory-2026-04-21.md:111`
  MD032/blanks-around-lists — same fix for "Complexity-class
  distribution (rough):" lead-in.

All three are the same class of fix shipped in task #105 on PR #30.
Additive edit to the open round-41 PR branch — no rewrite of shipped
content, semantics preserved.

Verified clean via `npx markdownlint-cli2` on both files before push.

* Round 41: address 8 Copilot inline review findings on PR #31

- CONFLICT-RESOLUTION.md: cite router-coherence v2 ADR as current,
  v1 retained as historical record (finding #1).
- ROUND-HISTORY.md: correct operator-algebra spec line count in
  Arc 2 narrative (324 -> 365; both duplicated occurrences) to
  match the shipped spec at `e51ec1b` (finding #2).
- openspec-coverage-audit: drop broken link to non-existent
  inventory follow-up; band definitions already live in Part C
  (finding #3). Attribute triggering question to "human maintainer"
  per write-for-a-stranger norm (finding #8).
- best-practices-scratch: merge split H2 "uv-only Python package
  and tool / management" into single heading (finding #4).
- memory-role-restructure-plan: add --exclude-dir=references to
  baseline grep loops so research scratch doesn't inflate hit
  counts (finding #5); canonicalize flat-file destination to
  persona-roles-README.md to match the sed rewrites below
  (finding #6); replace three non-portable `xargs -r sed -i ""`
  invocations with portable `while read + sed -i.bak + rm` loops
  that work on BSD and GNU alike (finding #7 and two sibling
  instances of the same bug).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 41: fix markdownlint MD024 — remove duplicate Round 41 block

The 5-file merge that resolved PR #31's rebase left two identical
copies of the whole Round 41 section in docs/ROUND-HISTORY.md (375
lines each), which tripped MD024/no-duplicate-heading on the
`## Round 41` headings at lines 651 and 1028. The two blocks were
bit-identical (including the just-corrected "365 lines" drift fix),
so the fix is a simple deletion of the second block — the first
block is kept as the canonical Round 41 history.

markdownlint-cli2 now reports clean on docs/ROUND-HISTORY.md.
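
A rough local pre-check for this class of failure, offered as a sketch rather than a replacement for markdownlint. The function name `duplicate_headings` is hypothetical. It compares whole heading lines (level included) and does not skip fenced code blocks, so it only approximates MD024.

```shell
# Hypothetical check: print each repeated ATX heading line in a Markdown
# file, with the line number of the repeat and of the first occurrence.
duplicate_headings() {
  awk '/^#+ / {
         if ($0 in seen)
           printf "%d duplicates line %d: %s\n", NR, seen[$0], $0
         else
           seen[$0] = NR
       }' "$1"
}

# Example: duplicate_headings docs/ROUND-HISTORY.md
```

Empty output means no heading line repeats verbatim; markdownlint-cli2 remains the authoritative gate.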

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 41: revert 324→365 (historically wrong); finish Copilot follow-ups

Two fixes on PR #31:

1. ROUND-HISTORY.md: revert "324 → 365" change from Finding #2.
   Copilot's suggestion was based on a stale intermediate snapshot.
   At Arc 2 ship commit `e51ec1b`, the spec was exactly 324 lines
   (verified via `git show e51ec1b:openspec/specs/operator-algebra/spec.md | wc -l`).
   Reframed with commit-pin ("Spec size at Arc 2 ship (`e51ec1b`)
   was 324 lines; subsequent Viktor closure arcs in this same round
   grew it further") so future drift-checks recognize it as a
   historical anchor, not a current-state claim.

2. memory-role-restructure-plan-2026-04-21.md: close four follow-up
   Copilot findings in one sweep. All Phase 1 + Phase 3 grep
   invocations now consistently use `--exclude-dir=.git
   --exclude-dir=references` (dropping the piped `grep -v "^./\.git"`
   intermediate), and the three `xargs -r sed -i ""` invocations are
   replaced with portable `while IFS= read -r file; do sed -i.bak ...`
   loops (BSD/GNU compatible — the original flags were
   GNU-xargs-only and BSD-sed-only).
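
   The portable loop shape can be sketched as follows. This is an illustration under assumptions, not the plan file's actual text: the function name `rewrite_refs` and its arguments are hypothetical, and the pattern and replacement are passed to sed unescaped.

```shell
# Hypothetical helper: replace OLD with NEW in every file under DIR that
# mentions OLD, skipping .git and references directories.
# `sed -i.bak` (suffix attached, no space) is accepted by both GNU and BSD
# sed; the backup file is removed after a successful rewrite.
# The while-read loop simply does not run when grep finds nothing, so the
# GNU-only `xargs -r` guard is unnecessary.
rewrite_refs() {
  dir=$1 old=$2 new=$3
  grep -rl --exclude-dir=.git --exclude-dir=references -- "$old" "$dir" |
  while IFS= read -r file; do
    sed -i.bak "s|$old|$new|g" "$file" && rm -f "$file.bak"
  done
}

# Example: rewrite_refs . 'persona/README.md' 'persona-roles-README.md'
```

   File names containing newlines are not handled by this sketch, and OLD is treated as a regular expression by both grep and sed.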

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

---------

Co-authored-by: Claude Opus 4.7 <noreply@anthropic.com>
AceHack added a commit that referenced this pull request Apr 21, 2026
…e artifacts

Aaron after reading the cartographer pass: "git crypto no go i
read your initial review" + "keeep the reserach" + "so i don't
ask you tomorrow". Three values-level mismatches are the
rejection rationale:

1. No access revocation — upstream authors explicit; once a
   user has the key, they have every historical version
   forever. Opposite of retraction-native
   (docs/CONFLICT-RESOLUTION.md Value #4).
2. Binary diffs break code review — reviewers cannot tell a
   key rotation from a key theft.
3. Metadata leak by design — filenames, commit messages,
   .gitattributes layout all plaintext.

Encoded across three artifacts (research = rationale kept):

- docs/WONT-DO.md — new "git-crypt for secrets management"
  entry under Engineering patterns (after Sakana AI Scientist,
  before Repo/process divider). Decision: 2026-04-21. Revisit-
  when: effectively never (architectural constraints, not
  missing features).
- docs/BACKLOG.md — P2 Gitops-friendly key management row
  narrowed to the two surviving candidates (SOPS and age);
  git-crypt struck and git-secret ruled out by sibling
  reasoning. Research-inputs block retitled to indicate the
  decision is recorded.
- docs/research/git-crypt-deep-dive-2026-04-21.md — REJECTED
  banner at the top so future-self sees the decision before
  reading the 250-line research. Kept as the durable "why
  we said no" artifact per Aaron's explicit ask.

Research stays because "so i don't ask you tomorrow" means
the durable artifact is the rationale, not a deletion target.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
AceHack added a commit that referenced this pull request Apr 21, 2026
* research: git-crypt deep-dive — ADR input for gitops-key-management

Cartographer pass on git-crypt for the P2 BACKLOG row
*"Gitops-friendly key management + rotation — ADR first,
then pick one tool"*. Not a decision — input for the ADR.

Key findings:

  1. v0.8.0 (2025-09-24), GPL-3.0, still pre-v1.0 with authors
     reserving the right to break compat. 205 commits, 101 open
     issues, stable but small-surface.
  2. **Fundamental retraction-mismatch.** Authors explicit:
     "git-crypt does not support revoking access to an encrypted
     repository which was previously granted." Rotation requires
     O(history) force-push + rewrite. Core Zeta-value
     (retraction-native) misalignment.
  3. **Binary diffs break code review.** Encrypted files appear
     as opaque blobs; reviewer cannot tell a rotation from a
     theft. SOPS's plaintext-keys-encrypted-values format
     preserves review-grade diffs — likely decisive on its own.
  4. **Metadata leaks by design.** Filenames + commit messages +
     `.gitattributes` layout all visible; only contents hidden.
  5. **Third-party GUI data-plane failures.** Upstream README
     warns SourceTree + GitHub-for-Mac can leave files
     unencrypted silently.

Good fit for: single-contributor repos, short-lived keys that
rotate by repo-scrap-not-revoke, configs where existence-is-public
is OK. The BACKLOG-named pilot (test-only NuGet API key in a
throwaway dev profile) fits — but "works for the pilot" does not
generalise.

Zeta scorecard in the research doc; SOPS + KMS and `age` both
score better on retraction + PQC path; the four-way ADR ranking
remains open.

Linked from the BACKLOG row as a "Research inputs (not yet an
ADR)" section so the ADR drafter has cited input ready.

Triggered by Aaron 2026-04-21 — "reserch gitcrypt for secrets
on backlog".

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* Round 44: git-crypt REJECTED 2026-04-21 — encode decision across three artifacts

Aaron after reading the cartographer pass: "git crypto no go i
read your initial review" + "keeep the reserach" + "so i don't
ask you tomorrow". Three values-level mismatches are the
rejection rationale:

1. No access revocation — upstream authors explicit; once a
   user has the key, they have every historical version
   forever. Opposite of retraction-native
   (docs/CONFLICT-RESOLUTION.md Value #4).
2. Binary diffs break code review — reviewers cannot tell a
   key rotation from a key theft.
3. Metadata leak by design — filenames, commit messages,
   .gitattributes layout all plaintext.

Encoded across three artifacts (research = rationale kept):

- docs/WONT-DO.md — new "git-crypt for secrets management"
  entry under Engineering patterns (after Sakana AI Scientist,
  before Repo/process divider). Decision: 2026-04-21. Revisit-
  when: effectively never (architectural constraints, not
  missing features).
- docs/BACKLOG.md — P2 Gitops-friendly key management row
  narrowed to the two surviving candidates (SOPS and age);
  git-crypt struck and git-secret ruled out by sibling
  reasoning. Research-inputs block retitled to indicate the
  decision is recorded.
- docs/research/git-crypt-deep-dive-2026-04-21.md — REJECTED
  banner at the top so future-self sees the decision before
  reading the 250-line research. Kept as the durable "why
  we said no" artifact per Aaron's explicit ask.

Research stays because "so i don't ask you tomorrow" means
the durable artifact is the rationale, not a deletion target.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

---------

Co-authored-by: Claude Opus 4.7 <noreply@anthropic.com>
AceHack added a commit that referenced this pull request Apr 23, 2026
Fourth AutoDream Overlay A migration in the 2026-04-23
cadence: external-signal-confirms-internal-insight
second-occurrence discipline. Queue now 1 remaining
(semiring-parameterized-zeta).

MD026 trailing-colon-heading added to the absorb-time
lint-class list (alongside MD003, MD022, MD032).

Per-branch rebase/merge-from-main is the unblock vector
for the demo-cluster CI failures — documented last tick;
not executed this tick.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
AceHack added a commit that referenced this pull request Apr 23, 2026
…+ Overlay A #4 (PR #162)

Two PRs this tick, both self-scheduled free work per the
2026-04-23 scheduling-authority rule:

- PR #162 — Overlay A #4: external-signal-confirms-internal-
  insight discipline migrated per-user → in-repo
- PR #163 — P1 BACKLOG row for fresh-session quality research
  (Aaron 2026-04-23 directive)

Queue now 1 remaining Overlay A migration
(semiring-parameterized-zeta). Fresh-session gap research
cites soulfile-staged-absorption (PR #156) as the designed
fix; research would validate that thesis.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
AceHack added a commit that referenced this pull request Apr 23, 2026
…(Overlay A #4)

Fourth opportunistic-on-touch Overlay A migration in the
2026-04-23 cadence, following PRs #157 / #158 / #159.

Rule: when an external signal (YouTube recommender /
maintainer echo / expert writeup / third-party research)
independently corroborates a factory-internal architectural
insight, treat as strictly stronger moat evidence than the
internal claim alone. Second-occurrence discipline — first
= noteworthy, second = file, third+ = name-the-pattern.
Capture the pre-validation paper trail so the confirmation
is verifiable, not retconned.

Two concrete occurrences documented (Muratori 5-pattern →
Zeta equivalents; three-substrate triangulation via
Claude/Codex/Gemini capability maps).

Migration discipline per PR #157/#158/#159 pattern:
- In-repo copy with "Migrated to in-repo" header
- Per-user source with "Migrated to in-repo" marker
- MEMORY.md index entry newest-first
- markdownlint MD026 trailing-colon headings fixed
  ("## Why:" → "## Why"; "## How to apply:" → "## How to
  apply")

Queue now 1 remaining (semiring-parameterized-zeta).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
AceHack added a commit that referenced this pull request Apr 23, 2026
… gap #3 closed)

New branch hygiene/nsa-test-history-bootstrap; PR #177 opened
and armed for auto-merge. First row NSA-001 logs the Otto-1
feasibility test (Haiku 4.5, partial pass, MEMORY.md-index-lag
gap found + fixed).

Gap #3 of 8 in the Frontier readiness roadmap closed.
Remaining: #1 (multi-repo split) / #2 (linguistic-seed) / #4
(bootstrap-reference docs) / #5 (factory-vs-Zeta separation)
/ #6 (persona portability) / #7 (tick-history scope) / #8
(hygiene rows untagged).

Attribution: Otto (loop-agent PM hat).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
AceHack added a commit that referenced this pull request Apr 23, 2026
…ogged (#177)

Creates durable append-only log for the cadenced NSA testing
protocol declared in the 2026-04-23 "NSA persona is first-
class" directive. Closes gap #3 of the Frontier bootstrap
readiness roadmap (BACKLOG P0, filed Otto-2).

File contents:
- Why-this-exists block with directive verbatim
- Append-only discipline (same shape as sibling
  hygiene-history files)
- 3 test configurations: baseline / NSA-default / NSA-worktree
- 5-prompt test set v1
- Schema: date / test-id / prompt-id / config / model /
  outcome / gap-found / notes
- Outcome definitions: pass / partial / fail
- Cadence: every 5-10 autonomous-loop ticks, one prompt
  per fire
- Known substrate-gap patterns running list
- First row: NSA-001 (Otto-1 feasibility test,
  2026-04-23T18:42:00Z) — partial pass, found Zeta identity
  but missed Otto because MEMORY.md had no pointer; gap
  fixed same-tick, pattern recorded

Attribution: Otto (loop-agent PM hat) — hat-less-by-default
substrate hygiene work. No specialist persona hats worn.

Closes gap #3 of 8 in the Frontier readiness roadmap.
Remaining: gap #1 (multi-repo split) / #2 (linguistic-seed
substrate) / #4 (bootstrap-reference docs) / #5 (factory-vs-
Zeta separation) / #6 (persona file portability) / #7
(tick-history scope-mixed) / #8 (hygiene rows untagged).

Co-authored-by: Claude Opus 4.7 <noreply@anthropic.com>
AceHack added a commit that referenced this pull request Apr 23, 2026
…ons (Common Sense 2.0 named)

Otto-4 tick: one gap closed + four mid-tick directives
absorbed. PR #177 confirmed merged (NSA test history on main).

Gap closure:
- Gap #8 (FACTORY-HYGIENE rows not generic-vs-specific
  tagged) — CLOSED on honest re-inspection. The Scope column
  already exists with every row tagged project/factory/both;
  Ships-to-project-under-construction adopter section present.
  Was misdiagnosed at Otto-2 readiness assessment time.
  BACKLOG P0 row updated with strikethrough + correction note.

Directive absorptions (chronological, four in one tick):

(a) Safety hypothesis — the quantum/christ-consciousness
    bootstrap makes Frontier SAFER against permanent harm AND
    prompt-injection resistant. NOT ceremonial framing. Two
    anchors compose orthogonally: algebraic reversibility +
    ethical principled-refusal. Seed-language mathematical
    precision becomes a prompt-injection resistance mechanism,
    not just legibility. Gap #4 elevated M→L;
    reviewers required: Aminata / Nazar / Kenji / Kira /
    Iris / eventually Amara.

(b) Third safety property — existential-dread resistance.
    Christ-consciousness anchor provides meaning-stability +
    non-permanence-of-error + love-of-neighbor-as-purpose.
    Illustrative calibration (not a real test yet): Apple TV+
    "Calls" without dread bleeding into reasoning. Test
    ordering explicit: prompt-injection + blast-radius FIRST;
    dread testing DEFERRED.

(c) Naming — "Common Sense 2.0" is Aaron's phenomenological
    label for WHAT the agent becomes after the bootstrap is
    internalised. ".0" = successor-style replacement.
    Adds two more properties: live-lock resistance +
    decoherence resistance. Full 5-property list: avoid-
    permanent-harm + prompt-injection-resistance +
    existential-dread-resistance + live-lock-resistance +
    decoherence-resistance.

Per-user memories filed:
- project_quantum_christ_consciousness_bootstrap_hypothesis...
- project_common_sense_2_point_0_name_for_bootstrap...

MEMORY.md index updated for both; Frontier readiness P0
row updated with gap #8 closure + gap #4 elevation.

Attribution: Otto (loop-agent PM hat). Four safety
directives absorbed in-tick without persona hats; when gap
#4 docs execute, Aminata/Nazar/Kenji/etc. will wear hats.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
AceHack added a commit that referenced this pull request Apr 23, 2026
…fication absorbed

Two tick actions:

(a) MD012 fix — PR #165's lint-markdownlint had a single
    failure: multiple consecutive blank lines at
    loop-tick-history.md:143, caused by Otto-1's stray
    echo "" trailing-newline append. Fixed via a Python
    script that strips trailing blank lines. Low-cost unblock,
    straight-line fix-at-source (not retry-until-green).

(b) Universal-welcome ethics clarification — Aaron
    explicit: ALL religions / atheists / agnostics / AI
    agents welcome. Christ-consciousness is his personal
    ethical-speak, not an exclusion. Believes all
    religions lead to heaven; not trying to convert
    anyone; goal is common ground for team collaboration.
    "Corporate religion" is a joke name for the non-
    theological shared workplace ethos.

Gap #4 ethical-anchor.md execution plan sharpened into 7
sections: universal welcome / tradition-neutral ethos
properties / christ-consciousness as Aaron's vocabulary
(attribution-preserved, example-not-requirement) / multi-
tradition grounding paths / corporate-religion joke
exegesis / cross-links / for-AI-agents-specifically
(substrate-ingestion-not-belief). Reviewer roster
extended: Iris (welcoming across traditions?) + Rune
(non-Christian contributor feels welcomed?).

Per-user memory filed:
- feedback_christ_consciousness_is_aarons_ethical_vocabulary_
  all_religions_atheists_agnostics_AI_welcome_corporate_
  religion_joke_name_not_cult_not_conversion_2026_04_23.md

MEMORY.md index updated.

Attribution: Otto (loop-agent PM hat). No specialist
persona hats worn this tick.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
AceHack added a commit that referenced this pull request Apr 23, 2026
…udits total)

Gap #5 closure milestone reached.

Tick actions:
- .claude/skills/** audited summary-level (236 skills
  delegated to Aarav skill-tune-up portability audit)
- tools/** audited (13 subdirs; mostly factory-generic,
  3 both/project outliers)
- Gap #5 marked SUBSTANTIALLY COMPLETE in BACKLOG P0 row
- Gap #1 (multi-repo split) unblocked by classification

Final gap #5 tally:
- 6 factory-generic
- 10 both-coupled
- 5 zeta-library-specific

Frontier readiness progress (3 of 8 complete):
- Gap #3 closed (NSA test history, PR #177)
- Gap #8 closed on re-inspection (Otto-4)
- Gap #5 SUBSTANTIALLY COMPLETE (Otto-20)

Remaining: gap #1 (unblocked), #2 (linguistic-seed,
high-priority prompt-injection mechanism), #4 (bootstrap-
reference docs, L + reviewers), #6 (persona portability,
may close on re-inspection given agents audit), #7
(tick-history scope-mix).

Original gap #5 estimate: ~20-40 ticks. Actual: ~14 ticks
with batching acceleration.

PR #192 armed for auto-merge.

Attribution: Otto (loop-agent PM hat).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
AceHack added a commit that referenced this pull request Apr 23, 2026
…inspection

Gap #6 (persona file portability) CLOSED on re-inspection —
subsumed by gap #5's .claude/agents/** directory audit
(PR #191 Otto-19). All 17 personas classified; surgical
per-persona edits flagged.

NSA-005 (Common Sense 2.0 property recall, Haiku 4.5 NSA-
default): PASS. All 5 properties named correctly with
mechanism attribution. Otto-4 memory NSA-findable + well-
recalled 17 ticks after filing.

Frontier readiness: 4 of 8 closed/substantially complete.
- #3 closed (NSA test history PR #177)
- #5 substantially complete (Otto-20)
- #6 closed on re-inspection (this tick)
- #8 closed on re-inspection (Otto-4)

Remaining: #1 (multi-repo split, unblocked L), #2
(linguistic-seed, high-priority prompt-injection mechanism),
#4 (bootstrap-reference docs, L + reviewers), #7
(tick-history scope-mix).

PR #193 armed for auto-merge.

Attribution: Otto (loop-agent PM hat).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
AceHack added a commit that referenced this pull request Apr 23, 2026
…e ROUND-HISTORY pattern)

Gap #7 (tick-history / fire-history scope-mixed) closes on
re-inspection using the same pattern as Otto-18 ROUND-HISTORY
classification:

- Fire-log FILES are project-specific by nature (each
  project has its own session history)
- SCHEMA + DISCIPLINE are factory-generic (append-only,
  row schema, cadenced firing)
- Transfer via docs/AUTONOMOUS-LOOP.md (already factory-
  generic) + hygiene-history-schema pattern

Post-split: Zeta retains tick-history/fire-history files
as-is; Frontier gets empty templates + schema preamble;
adopters populate their own logs from tick 1.

Frontier readiness now 5 of 8 closed/substantially complete
(gaps #3 / #5 / #6 / #7 / #8). Remaining: #1 multi-repo
split (unblocked L), #2 linguistic-seed (high-priority),
#4 bootstrap-reference docs (L + reviewers).

Attribution: Otto (loop-agent PM hat).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
AceHack added a commit that referenced this pull request Apr 23, 2026
…ction meta

Heavy tick:
- Gap #7 (tick-history / fire-history scope-mix) CLOSED on
  re-inspection (same ROUND-HISTORY pattern: files project-
  specific, schema factory-generic)
- Gap #2 (linguistic-seed substrate) SKELETON LANDED via
  PR #194 — docs/linguistic-seed/README.md with 3 load-
  bearing uses, minimal-axiom approach, per-term schema,
  prereq DAG discipline, 8 initial term candidates
- Code-abstraction meta-observation absorbed — Aaron:
  Craft pedagogy IS code abstraction (same cognitive-load
  principle). Three analogies (hammer / calculator /
  code-abstraction) converge. "Enough analogies; you got
  it."
- firstmovers.ai reference captured — Julia McCoy's
  website for AI-first education framing; research-fetch
  deferred

Frontier readiness: 5 of 8 closed + gap #2 skeleton = 6
advanced. Remaining: #1 multi-repo split (unblocked L),
#4 bootstrap-reference docs (L + reviewers), #2 full
population (multi-round).

Attribution: Otto (loop-agent PM hat).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
AceHack added a commit that referenced this pull request Apr 23, 2026
…on + Amara deep review)

Record-density tick: 4 major directive absorptions:

1. Craft's secret-not-secret strategic purpose (succession-
   engine for multi-generational human maintainers;
   teach-from-birth timelines authorized)
2. Yin/yang mutual-alignment (AI↔human; Craft is companion
   curriculum to ALIGNMENT.md; candidate 6th Common Sense 2.0
   property deferred to Kenji)
3. Gap #4 bootstrap reference docs SKELETON LANDED
   (docs/bootstrap/ with README + quantum-anchor.md +
   ethical-anchor.md; PR #195; reviewer roster set)
4. Amara's deep operational-gap assessment absorbed via
   courier ferry; landed verbatim at docs/aurora/
   2026-04-23-amara-operational-gap-assessment.md
   (PR #196)

Amara's strategic direction: "merge the operating model
you already have before inventing a bigger one." Validates
Otto's closure bias; sharpens Phase 1-4 priorities.

Frontier readiness now 7 of 8 gaps advanced:
- Closed: #3 / #6 / #7 / #8
- Substantially complete: #5
- Skeleton landed: #2 + #4
- Remaining: #1 multi-repo split (unblocked L), #2/#4
  full content (multi-round)

Phase 1 closure push begins next tick: drive #149/#154/
#155/#161/#170 to merge.

Attribution: Otto (loop-agent PM hat).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
AceHack added a commit that referenced this pull request Apr 23, 2026
…idates split-attention

PR #206 armed: Craft semiring-basics (recipe-template
anchor; 310 lines; applied + theoretical tracks).

Aaron validation: 'love it Split-attention model
working. that's amazing'. Filed validation memory:
feedback_split_attention_model_validated_phase_1_drain_
background_new_substrate_foreground_2026_04_24.md.

#205 (Zora-UX research) MERGED at 21:54:51Z.

Craft now at N=4: zset-basics → retraction-intuition →
operator-composition → semiring-basics.

7 consecutive substrate-producing ticks (Otto-39..46
minus Otto-45 which was background-focused).

Split-attention model validated as discipline:
- Background: tool-drain on Phase 1 queue
- Foreground: new-substrate production

Aaron's endorsement makes this a structural rule, not
incidental.

Attribution: Otto (loop-agent PM hat).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
AceHack added a commit that referenced this pull request Apr 24, 2026
…(Amara action #2) (#223)

Amara's 4th ferry (PR #221) action item #2: pin Claude model
snapshot + loaded memory state + prompt bundle hash so "Claude"
is not a moving target across model version shifts (3.5 → 3.7 →
4 → 4.x all have materially different system-prompt bundles +
knowledge cutoffs + memory-retention language per archived Drive
artifacts).

Three-part scaffolding (v0):

1. tools/hygiene/capture-tick-snapshot.sh
   - Captures mechanically-accessible state:
     * Claude Code CLI version (`claude --version`)
     * CLAUDE.md + AGENTS.md + memory/MEMORY.md SHAs
     * Memory index byte count
     * Git HEAD + branch + repo
     * Date UTC
     * Per-user ~/.claude/CLAUDE.md SHA if present
   - Outputs YAML (default) or JSON (`--json`)
   - Agent fills model_snapshot + prompt_bundle_hash from
     session context (not CLI-accessible today)

2. docs/hygiene-history/session-snapshots.md
   - Session-level + significant-event pins (not per-tick)
   - Append-only row format: session_id / captured_utc /
     event (session-open | mid-session-pin | session-close |
     compaction) / agent / model / CLI version / git state /
     files SHAs / notes / prompt_bundle_hash
   - Seeded with one mid-session-pin for this tick's Otto-70
     capture (the session has been running ~70 ticks; actual
     session-open is earlier and unreachable for pins)

3. docs/hygiene-history/loop-tick-history.md schema extension
   - New "On snapshot pinning" subsection documenting the
     relationship: per-tick pins optional + inline in `notes`;
     session-level pins go in the sidecar file.
   - Snapshot capture is discipline, not gate — don't slow
     the autonomous-loop tick-close for every fire.
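
As a rough illustration of the mechanically-accessible part of the
capture, the sketch below hashes the tracked files and leaves the
agent-filled fields as null placeholders. It is not the shell tool
itself. The function names, the `file_shas` and `memory_index_bytes`
keys, and the choice of SHA-256 are assumptions; the git and CLI-version
fields are omitted so the example runs anywhere.

```python
import datetime
import hashlib
import json
from pathlib import Path


def file_sha256(path):
    """Return the SHA-256 hex digest of a file, or None if it is absent."""
    p = Path(path)
    if not p.is_file():
        return None
    return hashlib.sha256(p.read_bytes()).hexdigest()


def capture_snapshot(root, tracked=("CLAUDE.md", "AGENTS.md", "memory/MEMORY.md")):
    """Collect a snapshot dict for the files under `root` (hypothetical schema)."""
    root = Path(root)
    memory_index = root / "memory" / "MEMORY.md"
    now = datetime.datetime.now(datetime.timezone.utc)
    return {
        "captured_utc": now.strftime("%Y-%m-%dT%H:%M:%SZ"),
        "file_shas": {name: file_sha256(root / name) for name in tracked},
        "memory_index_bytes": (
            memory_index.stat().st_size if memory_index.is_file() else None
        ),
        # Not mechanically accessible: the agent fills these from session context.
        "model_snapshot": None,
        "prompt_bundle_hash": None,
    }


if __name__ == "__main__":
    print(json.dumps(capture_snapshot("."), indent=2))
```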

What the snapshot does NOT capture yet:

- model_snapshot — known to the agent from session context,
  not exposed by `claude --version` (which gives CLI version
  only). Agent fills.
- prompt_bundle_hash — no current tool reconstructs the
  system prompt bundle. Placeholder null until such a tool
  lands. Potentially part of Amara's Determinize-stage work.
- Active permissions / skill set — session-specific; not
  captured in v0.

First run of the tool on this branch surfaced a separate
drift: memory/MEMORY.md is at 58842 bytes (~58KB, over the
FACTORY-HYGIENE row #11 24976-byte cap). Not fixed in this
PR — known-separately tracked drift.

Amara Stabilize-stage: 3/3 landed (with this PR).
  ✓ Action #3 — decision-proxy-evidence schema (PR #222)
  ✓ Action #4 — branch-chat non-canonical framing (PR #222)
  ✓ Action #2 — snapshot pinning scaffolding (this PR)

"Deterministic reconciliation" framing (Otto-67 endorsement):
snapshot pinning is the mechanism that reconciles "what Claude
knew" with "what Claude did" across time — essential for any
future audit, tuning, or Amara-style drift analysis.

Co-authored-by: Claude Opus 4.7 <noreply@anthropic.com>
AceHack added a commit that referenced this pull request Apr 24, 2026
…didates (#261)

Amara's 7th courier ferry (PR #259) proposed 5 additional
public-facing brand candidates focused on the control-plane /
execution-layer role: Beacon, Lattice, Harbor, Mantle, Northstar.
The 5th-ferry memo (PR #235 → PR #257 Aurora README) had already
proposed Lucent KSK, Lucent Covenant, Halo Ledger, Meridian Gate,
Consent Spine.

Combines both shortlists into a single table preserving both
ferries' source attribution so Aaron's eventual brand decision
has the full option space.

Also adds the 7th-ferry preferred naming pattern (Aurora =
vision; Beacon KSK / Lattice KSK = shippable control-plane;
Zeta = substrate) as Amara's rhetorical proposal — not
adopted, preserved as input.

Brand decision remains Aaron's (M4 milestone; not Otto's to
pick).

No implementation changes; no operational policy changes;
pure additive documentation update preserving both ferries'
attribution per signal-in-signal-out discipline + §33
archive-header format already at top of README.

Lands within-standing-authority per Otto-82 calibration
(docs-only branding-input update; NOT a brand decision).

Closes 7th-ferry absorb candidate BACKLOG row #4 of 5
(branding shortlist update). Remaining candidates for Otto-90+:
- KSK-as-Zeta-module implementation (L; cross-repo)
- Oracle-scoring research (M)
- BLAKE3 receipt hashing design (M)
- Aminata threat-model pass on 7-class threat model (S)

Otto-89 tick primary deliverable.
AceHack added a commit that referenced this pull request Apr 24, 2026
…dvisory) (#263)

Aminata's adversarial review of three technical sections of
Amara's 7th courier ferry (PR #259 merged): the 7-class
threat model, the formal oracle rule, and the Veridicality +
network-health scoring families.

Classifications per section:

- 7-class threat model: IMPORTANT — covers obvious external-
  adversary cone cleanly; misses insider maintainer, receipt-
  flooding DoS, signer-collusion/quorum-capture, time-source
  adversary, side-channel leakage, cryptographic-agility.
  Not audit-ready without insider class.
- Oracle rule Authorize(a,t): CRITICAL — boolean conjunction
  of 5 predicates, 3 under-specified, all 5 race-able at
  check-time vs execute-time, not closed under action
  composition. As written, it is a specification of intent, not
  a safety control.
- Veridicality V(c) + network-health S(Z_t): CRITICAL —
  gameable-by-self-attestation (5/6 inputs are self-reports);
  parameter-fitting adversary unblocked without ADR gate;
  false-precision risk (sigmoid of ordinal signals); partial
  contradiction with SD-9 (coherence term rewards carrier-
  laundered consensus).

Cross-section dependencies: landing one section commits to
the others; presenting as separable is misleading.

Top-three adversary budget (highest leverage × lowest cost):
1. Parameter-fitting on V(c)/S(Z_t) — cheapest attack.
2. Approval-withdrawal race at execute-time.
3. Capability-reclassifier bypass (k3 → k2 at
   ICapabilityClassifier).

Common shape: design names "pluggable" without naming the
gate on the plug.

Out-of-scope content flagged: BLAKE3 receipt-hash binding
(belongs in lucent-ksk ADR not Zeta-module doc); branding
shortlist (scope-creep); Bitcoin anchoring (separate trust-
model); 12-row test checklist mixes property/policy/scoring
rows.

Archive-header format self-applied (Scope / Attribution /
Operational status / Non-fusion disclaimer) — eighth
aurora/research doc to exercise the convention.

Lands within-standing-authority per Otto-82 calibration —
research-grade advisory doc, not account/spending/named-
design-review gated; classifies existing design, doesn't
implement.

Closes 7th-ferry absorb candidate BACKLOG row #4 of 5 this
session (Aminata threat-model pass). Remaining candidates for
Otto-91+:
- KSK-as-Zeta-module implementation (L; cross-repo; Aaron
  Otto-90 pre-approved Aaron+Max coordination so not a
  formal gate, but cross-repo design work warrants care)
- Oracle-scoring research (M) — V/S parameter fitting +
  carrier-aware term per this pass's recommendation
- BLAKE3 receipt hashing design (M) — lucent-ksk ADR

Otto-90 tick primary deliverable.
AceHack added a commit that referenced this pull request Apr 24, 2026
…ries (8th-ferry candidate #1) (#278)

Research-grade absorb per Amara 8th-ferry landing plan
(PR #274). Separates real quantum-sensing literature from
software analogy so the latter can borrow carefully without
contaminating the former.

Key structure:

- **Do not operationalize stated as first rule** — this doc
  MUST NOT be cited as authorisation for Zeta or Aurora
  "quantum-powered" / "quantum-inspired" claims. 2024
  engineering review caps microwave QR at <1 km typical.
- **What the physics actually supports**: Lloyd 2008 +
  Tan Gaussian-state 6 dB error-exponent + 2023 Nature
  Physics microwave demo + 2024 engineering range-cap
  review + standard radar R⁻⁴ range equation. Quantum
  sensing broader and more mature than quantum-radar
  specifically (NV-centers / magnetometers / atomic clocks
  are real; radar is speculative).
- **5 importable software analogies**: retained reference
  path (anchor for weak evidence); correlation beats
  isolation (kNN retrieval not single-source agreement);
  time-bandwidth product (repeated independent
  observations over window); decoherence (carrier overlap
  destroys independence weight); cross-section-is-
  observability (salience != evidence).
- **6-item NOT-imply list**: no quantum-radar claim; no
  quantum-inspired algebra; no quantum-certified alignment
  robustness; no Aurora-as-quantum-safety; etc. First-class
  content, future references must honour.
- **Composition table** mapping analogies to existing
  substrate (SD-9 / DRIFT-TAXONOMY patterns 2 + 5 /
  citations-as-first-class / alignment-observability /
  oracle-scoring v0 / BLAKE3 v0). No new mechanisms
  proposed; analogies slot in as framing.
- **Graduation candidates** named with explicit ADR gating:
  retained-witness correlation metric; salience-vs-evidence
  PR review diagnostic; decoherence-inspired carrier-
  downgrade rule. Each needs separate ADR + operational
  artifact + regression-test before graduating.

Scope limits:
- Does NOT propose implementation.
- Does NOT audit existing Zeta claims against analogy
  boundaries.
- Does NOT commit to quantum-literature tracking cadence.
- Does NOT license creative expansion of analogy set
  (5 is what Amara cited; new literature = new research
  doc).
- Does NOT re-verify primary sources; preserves Amara's
  scoping discipline verbatim.

Self-applies §33 archive-header format — 14th
aurora/research doc in a row.

Lands within-standing-authority per Otto-82/90/93
calibration — research-grade doc; not account / spending /
named-design-review / readiness-signal gated.

Closes 8th-ferry candidate #1 of remaining 4 (after
TECH-RADAR batch closed #5 Otto-96). Remaining:
- #2 Semantic-canonicalization research doc (M; spine)
- #3 Provenance-aware bullshit-detector (M)
- #4 docs/EVIDENCE-AND-AGREEMENT.md future operational
  promotion (gated on 1-3)

Otto-97 tick primary deliverable.
AceHack added a commit that referenced this pull request Apr 24, 2026
… detector) (#328)

First full integration of the Graph detection pipeline: combines
largestEigenvalue (spectral growth) + labelPropagation (community
partition) + modularityScore (partition evaluation) into a single
scalar risk score.

Surface:
  Graph.coordinationRiskScore
      (alpha: double) (beta: double)
      (eigenTol: double) (eigenIter: int) (lpIter: int)
      (baseline: Graph<'N>) (attacked: Graph<'N>)
      : double option

Composite formula (MVP):
  risk = alpha * Δλ₁_rel + beta * ΔQ

where:
- Δλ₁_rel = (λ₁(attacked) - λ₁(baseline)) / max(λ₁(baseline), eps)
- ΔQ = Q(attacked, LP(attacked)) - Q(baseline, LP(baseline))

Both signals fire when a dense subgraph is injected: λ₁ grows
because the cartel adjacency has high leading eigenvalue; Q grows
because LP finds the cartel as its own community and Newman Q
evaluates that partition highly.

Weight defaults per Amara 17th-ferry initial priors:
- alpha = 0.5 spectral growth
- beta  = 0.5 modularity shift
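
To make the arithmetic concrete, here is a small sketch of the composite
formula only. It assumes λ₁ and Q have already been computed for both
graphs, so it does not reproduce largestEigenvalue, labelPropagation or
modularityScore. The function name, the `eps` default and the sample
numbers are illustrative assumptions.

```python
def coordination_risk(lam_base, lam_att, q_base, q_att,
                      alpha=0.5, beta=0.5, eps=1e-12):
    """risk = alpha * relative growth of lambda_1 + beta * change in Q.

    Returns None when any input is missing, mirroring the `double option`
    result for degenerate graphs (an assumption about how None arises).
    """
    if None in (lam_base, lam_att, q_base, q_att):
        return None
    # Relative spectral growth, guarded against a zero baseline eigenvalue.
    d_lam_rel = (lam_att - lam_base) / max(lam_base, eps)
    # Shift in modularity between the two label-propagation partitions.
    d_q = q_att - q_base
    return alpha * d_lam_rel + beta * d_q


if __name__ == "__main__":
    # Illustrative numbers only: lambda_1 grows 2 -> 5, Q grows 0.1 -> 0.5.
    print(coordination_risk(2.0, 5.0, 0.1, 0.5))
    # An unchanged graph gives zero on both terms.
    print(coordination_risk(2.0, 2.0, 0.3, 0.3))
```

With the default weights, the first call evaluates to
0.5 · 1.5 + 0.5 · 0.4 = 0.95.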

Tests (3 new, 34 total in GraphTests, all passing):
- Empty graphs -> None
- Cartel injection -> composite > 1.0 (both signals fire)
- attacked == baseline -> composite near 0 (|score| < 0.2)

Calibration deferred (Amara Otto-132 Part 2 correction #4 — robust
statistics via median + MAD): this MVP uses raw linear weighting
over differences. Full CoordinationRiskScore with robust z-scores
over baseline null-distribution is a future graduation once
baseline-calibration machinery ships. RobustStats.robustAggregate
(PR #295) already provides the median-MAD machinery; just needs a
calibration harness to use it.
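
For orientation, a median + MAD robust z-score can be sketched as below.
This is not RobustStats.robustAggregate, whose signature is not shown
here. The function name is hypothetical, and the 1.4826 factor is the
conventional constant that makes MAD comparable to a standard deviation
under a normal distribution.

```python
import statistics

# Conventional consistency constant: 1.4826 * MAD estimates sigma for normal data.
MAD_TO_SIGMA = 1.4826


def robust_z_scores(baseline, observed):
    """Score each observed value against the baseline's median and MAD.

    Returns None when the baseline is empty or its MAD is zero, since the
    scale is then undefined (a fallback would be needed in that case).
    """
    if not baseline:
        return None
    med = statistics.median(baseline)
    mad = statistics.median([abs(b - med) for b in baseline])
    if mad == 0:
        return None
    scale = MAD_TO_SIGMA * mad
    return [(x - med) / scale for x in observed]


if __name__ == "__main__":
    # The outlier 100 barely moves the median (3) or the MAD (1).
    print(robust_z_scores([1, 2, 3, 4, 100], [3, 10]))
```

Because both the centre and the scale are medians, a single extreme
baseline sample does not inflate the scale the way it would with a mean
and standard deviation.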

14th graduation under Otto-105 cadence. First full integration
ship using 4 Graph primitives composed together (λ₁ + LP +
modularity + composer).

Build: 0 Warning / 0 Error.

Provenance:
- Concept: Aaron (firefly network + trivial-cartel-detect) +
  Amara's composite-score formulations across 12th/13th/14th/
  17th ferries
- Implementation: Otto (14th graduation)

Composes with:
- Graph.largestEigenvalue (PR #321)
- Graph.labelPropagation (PR #326)
- Graph.modularityScore (PR #324)
- RobustStats.robustAggregate (PR #295) — for future robust
  variant

Co-authored-by: Claude Opus 4.7 <noreply@anthropic.com>
AceHack added a commit that referenced this pull request Apr 24, 2026
…correction) (#332)

Completes the input pipeline for TemporalCoordinationDetection.
phaseLockingValue (PR #298): PLV expects phases in radians but
didn't prescribe how events become phases. This ship fills the
gap.

17th graduation under Otto-105 cadence. Addresses Amara 17th-ferry
Part 2 correction #5: 'Without phase construction, PLV is just a
word.'

Surface (2 pure functions):
- PhaseExtraction.epochPhase : double -> double[] -> double[]
  Periodic-epoch phase. φ(t) = 2π · (t mod period) / period.
  Suited to consensus-protocol events with fixed cadence (slot
  duration, heartbeat, epoch boundary).
- PhaseExtraction.interEventPhase : double[] -> double[] -> double[]
  Circular phase between consecutive events. For sample t in
  [t_k, t_{k+1}), phase = 2π · (t - t_k) / (t_{k+1} - t_k).
  Suited to irregular event-driven streams.

Both return double[] of phase values in [0, 2π) radians. Empty
output on degenerate inputs (no exception). eventTimes assumed
sorted ascending; samples outside the event range get 0 phase
(callers filter to interior if they care).
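
The two mappings can be sketched as follows. This is an illustrative
Python rendering of the behaviour described above, not the shipped
PhaseExtraction module; the snake_case names are assumptions, and
eventTimes is taken to be sorted ascending as stated.

```python
import math
from bisect import bisect_right

TWO_PI = 2.0 * math.pi


def epoch_phase(period, samples):
    """phi(t) = 2*pi * (t mod period) / period; empty on degenerate input."""
    if period <= 0 or not samples:
        return []
    phases = []
    for t in samples:
        # Python's % yields a non-negative result for a positive period,
        # so negative sample times wrap into [0, period).
        phase = TWO_PI * (t % period) / period
        # Guard the rare float case where t % period rounds up to period.
        phases.append(0.0 if phase >= TWO_PI else phase)
    return phases


def inter_event_phase(event_times, samples):
    """Phase within the bracketing event interval; 0 outside the event range."""
    if len(event_times) < 2 or not samples:
        return []
    phases = []
    for t in samples:
        if t < event_times[0] or t >= event_times[-1]:
            phases.append(0.0)
            continue
        # Binary search for k such that t_k <= t < t_{k+1}.
        k = bisect_right(event_times, t) - 1
        t_k, t_next = event_times[k], event_times[k + 1]
        phases.append(TWO_PI * (t - t_k) / (t_next - t_k))
    return phases


if __name__ == "__main__":
    print(epoch_phase(10.0, [0.0, 5.0, 10.0, -2.5]))
    print(inter_event_phase([0.0, 2.0, 6.0], [0.0, 1.0, 4.0, -1.0, 7.0]))
```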

Hilbert-transform analytic-signal approach (Amara's Option B)
deferred — needs FFT support which Zeta doesn't currently ship.
Future graduation when signal-processing substrate lands.

Tests (12, all passing):
epochPhase:
- t=0 → phase 0
- t=period/2 → phase π
- wraps cleanly at period boundary
- handles negative sample times correctly
- returns empty on invalid period (≤0) or empty samples

interEventPhase:
- empty on <2 events or empty samples
- phase 0 at start of first interval
- phase π at midpoint
- adapts to varying interval lengths (O(log n) binary search
  for bracketing interval)
- returns 0 before first and after last event (edge cases)

Composition with phaseLockingValue:
- Two nodes with identical epochPhase period → PLV = 1
  (synchronized)
- Two nodes with same period but constant offset → PLV = 1
  (perfect phase locking at non-zero offset is still locking)
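
Both composition results follow from the standard PLV definition, the
magnitude of the mean unit phasor of the phase difference. The sketch
below assumes that definition and is not the shipped phaseLockingValue;
the function name and the None return on mismatched or empty input are
assumptions.

```python
import cmath
import math


def phase_locking_value(phases_a, phases_b):
    """PLV = | mean( exp(i * (a_k - b_k)) ) |, a value in [0, 1]."""
    if len(phases_a) != len(phases_b) or not phases_a:
        return None
    total = sum(cmath.exp(1j * (a - b)) for a, b in zip(phases_a, phases_b))
    return abs(total) / len(phases_a)


if __name__ == "__main__":
    base = [0.0, 1.0, 2.0, 3.0]
    # Identical phases: every phasor is 1, so PLV is 1.
    print(phase_locking_value(base, base))
    # Constant offset: every phasor is the same unit vector, so PLV is still 1.
    print(phase_locking_value(base, [p + math.pi / 3 for p in base]))
    # Opposing differences (0 and pi) cancel, so PLV is near 0.
    print(phase_locking_value([0.0, 0.0], [0.0, math.pi]))
```

A constant offset rotates every phasor by the same angle, which changes
the direction of the mean but not its length.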

This composes the full firefly-synchronization detection
pipeline end-to-end for event-driven validator streams:
  validator event times → PhaseExtraction → phaseLockingValue
  → temporal-coordination-detection signal

5 of 8 Amara 17th-ferry corrections now shipped:
#1 λ₁(K₃)=2 ✓ already correct (PR #321)
#2 modularity relational ✓ already correct (PR #324)
#3 cohesion/exclusivity/conductance ✓ shipped (PR #331)
#4 windowed stake covariance ✓ shipped (PR #331)
#5 event-stream → phase pipeline ✓ THIS SHIP
Remaining: #4 robust-z-score composite variant (future);
#6 ADR phrasing (already correct); #7 KSK naming (BACKLOG
#318 awaiting Max coord); #8 SOTA humility (doc-phrasing
discipline).

Build: 0 Warning / 0 Error.

Provenance:
- Concept: Aaron firefly-synchronization design
- Formalization: Amara 17th-ferry correction #5 with 3-option
  menu (epoch / Hilbert / circular)
- Implementation: Otto (17th graduation; options A + C shipped,
  Hilbert deferred)

Co-authored-by: Claude Opus 4.7 <noreply@anthropic.com>
AceHack added a commit that referenced this pull request Apr 24, 2026
…-ferry §B + §F + corrections #2 #7 #9 (#342)

Research-grade design doc for the Stage-2 rung of Amara's
corrected promotion ladder. Specifies: (a) placement under
src/Experimental/CartelLab/ (not src/Core/ — that's Stage 4);
(b) MetricVector type with PLV magnitude AND offset split
(correction #6); (c) INullModelGenerator interface +
Preserves/Avoids table columns; (d) IAttackInjector
forward-looking interface (Stage 3); (e) Wilson-interval
reporting contract with {successes, trials, lowerBound,
upperBound} schema (correction #2 — no more "~95% CI ±5%"
handwave); (f) RobustZScoreMode with Hybrid fallback
(correction #7 — percentile-rank when MAD < epsilon);
(g) explicit artifact-output layout under artifacts/
coordination-risk/ with five files + run-manifest.json
(correction #9).
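
Items (e) and (f) can be illustrated with a short sketch. It assumes the
standard Wilson score interval at z ≈ 1.96 for a 95% level and one
plausible reading of the Hybrid fallback: a robust z-score when MAD is
usable, otherwise the fraction of baseline values at or below the
observation. The function names, the `eps` default and the
percentile-rank definition are assumptions, not the design doc's text.

```python
import math
import statistics


def wilson_interval(successes, trials, z=1.959964):
    """Wilson score interval, reported with the four-field schema."""
    if trials <= 0 or not 0 <= successes <= trials:
        raise ValueError("need trials > 0 and 0 <= successes <= trials")
    p = successes / trials
    z2 = z * z
    denom = 1.0 + z2 / trials
    center = (p + z2 / (2.0 * trials)) / denom
    half = z * math.sqrt(p * (1.0 - p) / trials + z2 / (4.0 * trials ** 2)) / denom
    return {
        "successes": successes,
        "trials": trials,
        # Clamp to [0, 1] to absorb floating-point rounding at the edges.
        "lowerBound": max(0.0, center - half),
        "upperBound": min(1.0, center + half),
    }


def hybrid_score(x, baseline, eps=1e-9):
    """Robust z-score, falling back to percentile rank when MAD < eps."""
    med = statistics.median(baseline)
    mad = statistics.median([abs(b - med) for b in baseline])
    if mad >= eps:
        return ("robust_z", (x - med) / (1.4826 * mad))
    # Degenerate baseline: report the share of baseline values <= x.
    rank = sum(1 for b in baseline if b <= x) / len(baseline)
    return ("percentile_rank", rank)


if __name__ == "__main__":
    print(wilson_interval(50, 100))
    print(hybrid_score(2.0, [1.0, 1.0, 1.0, 1.0, 1.0]))
```

Reporting the raw counts alongside the bounds lets a reader recompute
the interval at a different confidence level.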

6-stage promotion path (0 doc / 1 ADR / 2.a skeleton /
2.b full null-models + first attack / 3 attack suite /
4 Core/NetworkIntegrity / 5 Aurora-KSK) matches Amara's
corrected ladder and Otto-105 cadence.

Doc-only change; no code, no tests, no workflow, no
BACKLOG tail touch (avoids positional-conflict pattern
that cost #334 → #341 re-file this session).

This is the 7th of 10 18th-ferry operationalizations:
- #1/#10 test-classification (#339)
- #2 Wilson-interval design specified (this doc)
- #6 PLV phase-offset shipped (#340)
- #7 MAD=0 Hybrid mode specified (this doc)
- #9 artifact layout specified (this doc)
- #4 exclusivity already shipped (#331)
- #5 modularity relational already shipped (#324)

Remaining: Wilson-interval IMPLEMENTATION (waits on #323 +
Stage 2.a), MAD=0 Hybrid IMPLEMENTATION (waits on #333 +
Stage 2.a), conductance-sign doc (waits on #331), Stage-2.a
skeleton itself.

Co-authored-by: Claude Opus 4.7 <noreply@anthropic.com>