Skip to content

free-memory: git-native backlog mgmt + long-arc as future skill domain + B-0169 Aarav-review enrichment + 2350Z shard#1248

Merged
AceHack merged 1 commit intomainfrom
free-memory/git-native-backlog-management-long-arc-future-skill-domain-aaron-2026-05-02
May 2, 2026
Merged

free-memory: git-native backlog mgmt + long-arc as future skill domain + B-0169 Aarav-review enrichment + 2350Z shard#1248
AceHack merged 1 commit intomainfrom
free-memory/git-native-backlog-management-long-arc-future-skill-domain-aaron-2026-05-02

Conversation

@AceHack
Copy link
Copy Markdown
Member

@AceHack AceHack commented May 2, 2026

Summary

Three pieces in one PR (composed by tick context):

  1. Future-domain memo capturing the human maintainer 2026-05-02's forward-looking architectural observation: "once we get it down pat our ver specific style of git native backlog managment and log arc theis should likey be it's whole skill domain with a few skills and experts and such."

  2. B-0169 enriched per Aarav's (skill-expert) just-completed specialist review:

    • 11th procedure step: WONT-DO-archaeology mode (Aarav's catch-prediction; the gap the maintainer would have caught given the same-day CLAUDE.md "WONT-DO is 99% deferral" addition)
    • Aarav's review verdict embedded at "Sketched shape" — 5 sub-modes / hybrid b+c routing / 12 BP-NN citations / 3 worked-example seeds
    • last_updated bumped per schema-edit rule
  3. Tick shard 2350Z captures the cycle.

Why these together

Same-tick composition. Aarav's review of B-0169 produced findings (5-sub-mode taxonomy, WONT-DO gap, hybrid routing recommendation) that compose directly with the maintainer's forward-looking observation about future skill-domain emergence. Splitting them across PRs would lose the composition narrative.

Worked example of the STRONG specialist-invocation rule

Per the maintainer 2026-05-02 corrective: "you should not [close rows without invoking specialists when there's an expert for the surface]." This tick:

  • Specialist invoked: Aarav (skill-expert) on B-0169 (skill-creation lifecycle is exactly his domain)
  • Findings: 5-section structured output including verdict + sub-mode taxonomy + routing + BP citations + catch-prediction
  • Generalist-Otto would have missed: the WONT-DO archaeology gap (Aarav explicitly named this as the catch the maintainer would surface)
  • Substrate now improved: B-0169 has 11 procedure steps not 10; the procedure includes the rejection-archaeology surface; Aarav's verdict is preserved verbatim in the row

What the future-domain memo PRESERVES

When the maturity signal fires (some N future ticks), the memo provides:

  1. Canonical starting set — 5 procedure skills + 4 named-persona experts + 5 tools enumerated; don't re-enumerate from scratch
  2. Promotion-trigger criteria — 3+ worked examples per skill candidate; 1+ judgment-disagreement per expert candidate
  3. Composition with decision-archaeology — one-skill-per-investigation-class (Aarav's BP-20 finding); domain doesn't re-litigate
  4. Relationship to existing skills — already mapped in Aarav's review

Test plan

  • Future-domain memo captures the maintainer's verbatim quote + canonical starting set + promotion-trigger criteria
  • B-0169 11th procedure step (WONT-DO archaeology) added
  • Aarav review verdict embedded as quote-block at row's "Sketched shape" section
  • B-0169 last_updated bumped to 2026-05-02 (schema rule)
  • MEMORY.md pairing landed (newest-first)
  • Tick shard 2350Z added
  • CI green

🤖 Generated with Claude Code

…n (Aaron 2026-05-02) + B-0169 enriched with Aarav review + 2350Z shard

Two pieces:

1. **Future-domain memo** capturing Aaron 2026-05-02's
   forward-looking architectural observation: *"once we get
   it down pat our ver specific style of git native backlog
   managment and log arc theis should likey be it's whole
   skill domain with a few skills and experts and such."*

   Composes with Aarav's same-tick BP-20 finding (don't
   split decision-archaeology; one skill, five named modes).
   The two principles interlock: don't split prematurely
   but DO split when the domain matures. Memo enumerates
   the canonical starting set (5 procedure skills + 4
   named-persona experts + 5 tools) so the future-promotion
   decision has substrate. Promotion trigger: 3+ worked
   examples per skill candidate; 1+ empirical judgment-
   disagreement per expert candidate.

2. **B-0169 enriched** per Aarav's just-completed review:
   - Added WONT-DO-archaeology as 11th procedure step
     (the gap Aaron would have caught)
   - Embedded Aarav's review verdict as quote-block at
     "Sketched shape" section (5 sub-modes / route b+c /
     12 BP citations / 3 worked-example seeds)
   - Bumped last_updated to 2026-05-02

3. **Tick shard 2350Z** captures the cycle.

Worked example of the STRONG specialist-invocation rule:
Aarav's review caught the WONT-DO gap that Aaron's same-day
CLAUDE.md addition (*"WONT-DO is 99% deferral, not forever"*)
made load-bearing — generalist-Otto would have missed this.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
Copilot AI review requested due to automatic review settings May 2, 2026 23:51
@AceHack AceHack enabled auto-merge (squash) May 2, 2026 23:52
@AceHack AceHack merged commit 9b5323c into main May 2, 2026
27 checks passed
@AceHack AceHack deleted the free-memory/git-native-backlog-management-long-arc-future-skill-domain-aaron-2026-05-02 branch May 2, 2026 23:52
Copy link
Copy Markdown

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Adds new memory substrate capturing a forward-looking “future skill domain” observation around git-native backlog management + long-arc thesis, enriches backlog row B-0169 with specialist review details (including WONT-DO archaeology), and logs the tick shard for the cycle.

Changes:

  • Added a new memory/feedback_*.md memo describing the proposed future skill-domain split and promotion triggers.
  • Updated docs/backlog/P1/B-0169-*.md with Aarav’s review verdict and added an 11th procedure step covering WONT-DO archaeology.
  • Updated memory/MEMORY.md index and added the docs/hygiene-history/ticks/2026/05/02/2350Z.md tick shard.

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 1 comment.

File Description
memory/feedback_git_native_backlog_management_long_arc_future_skill_domain_aaron_2026_05_02.md New memo capturing the “future skill domain” observation, proposed components, and promotion criteria.
memory/MEMORY.md Adds the index entry for the new memory file (newest-first).
docs/hygiene-history/ticks/2026/05/02/2350Z.md New tick shard recording the cycle and its outcomes.
docs/backlog/P1/B-0169-decision-archaeology-skill-aaron-2026-05-02.md Embeds specialist review verdict + adds WONT-DO archaeology as an explicit investigation surface.

Comment thread memory/MEMORY.md
AceHack added a commit that referenced this pull request May 2, 2026
…ollow-up + #1245 rebase

#1246 (depends_on 100%) + #1247 (at-creation/at-pickup discipline)
merged on main. #1245 (skill-flywheel memo) DIRTY conflict
cleared via rebase, auto-merge armed. #1248 auto-merge armed.
#1249 opened addressing #1247 post-merge Copilot findings
(filename rename + grep-placeholder fix).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
AceHack added a commit that referenced this pull request May 2, 2026
…ollow-up + #1245 rebase

#1246 (depends_on 100%) + #1247 (at-creation/at-pickup discipline)
merged on main. #1245 (skill-flywheel memo) DIRTY conflict
cleared via rebase, auto-merge armed. #1248 auto-merge armed.
#1249 opened addressing #1247 post-merge Copilot findings
(filename rename + grep-placeholder fix).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
AceHack added a commit that referenced this pull request May 2, 2026
…ries

Multiple post-merge findings across PRs #1245, #1247, #1248
addressed:

PR #1247 followup (this PR's primary purpose):
- Renamed feedback_at_pickup_time_prereq_check_discipline_*
  → feedback_depends_on_backlog_search_discipline_at_creation_and_at_pickup_*
  to reflect both triggers
- Fixed bare-grep-placeholder in the at-creation procedure

PR #1245 + #1248 post-merge consequences (rebase-discovered):
- Stale references to old filename in MEMORY.md (line 8
  duplicate from #1247's original index entry that survived
  the rename) + future-domain memo (lines 35 and 90 referenced
  the old filename) — all updated to new filename
- MEMORY.md index entries trimmed for brevity per #1248's P2
  finding (memory/README.md "keep entries terse" rule)
- Quotes in skill-flywheel index entry that didn't match
  verbatim source removed; verbatim preservation lives in
  the memo body, not the one-line index

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
@AceHack
Copy link
Copy Markdown
Member Author

AceHack commented May 2, 2026

P2 finding addressed in PR #1249: the multi-sentence index entry trimmed to a one-line summary per memory/README.md 'keep entries terse' rule. Detail (Aaron's verbatim quote, Aarav's BP-20 composition, canonical starting set) is preserved in the memo body. Resolving thread.

AceHack added a commit that referenced this pull request May 2, 2026
… rebased + 4 post-merge findings triaged

Two-merge tick: #1245 (skill-flywheel) + #1248 (future-skill-domain).
PR #1249 went DIRTY from concurrent merges; rebase cleared it +
caught stale filename refs the rename was supposed to update.

4 post-merge findings triaged: 2 false positives (table
double-pipes — Copilot rendering bug), 1 real (verbatim-quote
mismatch in index entry; quotes removed since terseness > quote-
preservation in MEMORY.md), 1 real (entry too long; trimmed to
one-liner per memory/README.md).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
AceHack added a commit that referenced this pull request May 2, 2026
…bare-grep placeholder (#1249)

* review(pr-1247-followup): rename file to reflect both triggers + fix bare-grep placeholder

Two Copilot post-merge findings on PR #1247:

P2 — filename mismatch. The MEMORY.md entry described a
two-trigger discipline (creation + pickup), but the filename
was feedback_at_pickup_time_prereq_check_discipline_*, which
reads like pickup-only. Renamed to
feedback_depends_on_backlog_search_discipline_at_creation_and_at_pickup_*
so the filename matches the substantive content.

P1 — bare grep placeholder. The at-creation procedure had
`grep -l "tags:.*<tag>" ...` where `...` was a placeholder.
Copilot caught the risk: a contributor copy-pasting literally
gets a repo-wide grep that pulls in references/ clones and
other noise. Replaced with explicit
`grep -l "tags:.*<tag>" docs/backlog/P*/B-*.md`; added a
sentence warning against bare placeholders.

MEMORY.md pointer updated to the new filename.

Worked example of "if it matters is almost always yes":
post-merge findings on substrate-rule memos warrant their
own follow-up PR rather than backlog deferral.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* hygiene(tick-history): 2026-05-02T23:54Z — three-merge tick + #1249 follow-up + #1245 rebase

#1246 (depends_on 100%) + #1247 (at-creation/at-pickup discipline)
merged on main. #1245 (skill-flywheel memo) DIRTY conflict
cleared via rebase, auto-merge armed. #1248 auto-merge armed.
#1249 opened addressing #1247 post-merge Copilot findings
(filename rename + grep-placeholder fix).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* review(post-merge): fix stale filename refs + trim long MEMORY.md entries

Multiple post-merge findings across PRs #1245, #1247, #1248
addressed:

PR #1247 followup (this PR's primary purpose):
- Renamed feedback_at_pickup_time_prereq_check_discipline_*
  → feedback_depends_on_backlog_search_discipline_at_creation_and_at_pickup_*
  to reflect both triggers
- Fixed bare-grep-placeholder in the at-creation procedure

PR #1245 + #1248 post-merge consequences (rebase-discovered):
- Stale references to old filename in MEMORY.md (line 8
  duplicate from #1247's original index entry that survived
  the rename) + future-domain memo (lines 35 and 90 referenced
  the old filename) — all updated to new filename
- MEMORY.md index entries trimmed for brevity per #1248's P2
  finding (memory/README.md "keep entries terse" rule)
- Quotes in skill-flywheel index entry that didn't match
  verbatim source removed; verbatim preservation lives in
  the memo body, not the one-line index

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* hygiene(tick-history): 2026-05-02T23:57Z — #1245 + #1248 merged + #1249 rebased + 4 post-merge findings triaged

Two-merge tick: #1245 (skill-flywheel) + #1248 (future-skill-domain).
PR #1249 went DIRTY from concurrent merges; rebase cleared it +
caught stale filename refs the rename was supposed to update.

4 post-merge findings triaged: 2 false positives (table
double-pipes — Copilot rendering bug), 1 real (verbatim-quote
mismatch in index entry; quotes removed since terseness > quote-
preservation in MEMORY.md), 1 real (entry too long; trimmed to
one-liner per memory/README.md).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

---------

Co-authored-by: Claude Opus 4.7 <noreply@anthropic.com>
AceHack added a commit that referenced this pull request May 3, 2026
…lignment with harness "effort" term

Aaron 2026-05-03 same-tick refinements:

1. **Two-axis refinement:** *"i should say multi harness /
   multi model we have access to both."* The discipline
   operates across the harness × model matrix, not just one
   axis. Within Claude Code: Opus / Sonnet / Haiku; across
   harnesses: many providers. Frontmatter name + description
   updated to reflect both axes; tables expanded.

2. **Foundation-first sizing rule** + **metrics-recording-now**:
   *"we should still use the larger models for now with higher
   thinking modes ... recording some metrics on harnesses and
   models themselves will be useful for many things in the
   future more than just budget."* Metrics-table added (latency
   / cost / convergence-rate / divergence-class / quality / drift
   / thinking-mode availability) with budget-vs-other-uses
   columns.

3. **Vocabulary alignment:** *"your harness calls this effort."*
   Renamed `model-size-router` → `effort-router`; replaced "model-size
   capacity" with "effort" everywhere; added a vocabulary-alignment
   section establishing the bridging convention (use local harness
   term in harness-targeted skills; use neutral "compute-tier" only
   in cross-harness substrate).

Three same-tick maintainer expansions of the original observation
captured in the same memo + same PR. Composes with the earlier
sibling git-native-backlog domain memo (PR #1248).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
AceHack added a commit that referenced this pull request May 3, 2026
…lignment with harness "effort" term

Aaron 2026-05-03 same-tick refinements:

1. **Two-axis refinement:** *"i should say multi harness /
   multi model we have access to both."* The discipline
   operates across the harness × model matrix, not just one
   axis. Within Claude Code: Opus / Sonnet / Haiku; across
   harnesses: many providers. Frontmatter name + description
   updated to reflect both axes; tables expanded.

2. **Foundation-first sizing rule** + **metrics-recording-now**:
   *"we should still use the larger models for now with higher
   thinking modes ... recording some metrics on harnesses and
   models themselves will be useful for many things in the
   future more than just budget."* Metrics-table added (latency
   / cost / convergence-rate / divergence-class / quality / drift
   / thinking-mode availability) with budget-vs-other-uses
   columns.

3. **Vocabulary alignment:** *"your harness calls this effort."*
   Renamed `model-size-router` → `effort-router`; replaced "model-size
   capacity" with "effort" everywhere; added a vocabulary-alignment
   section establishing the bridging convention (use local harness
   term in harness-targeted skills; use neutral "compute-tier" only
   in cross-harness substrate).

Three same-tick maintainer expansions of the original observation
captured in the same memo + same PR. Composes with the earlier
sibling git-native-backlog domain memo (PR #1248).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
AceHack added a commit that referenced this pull request May 3, 2026
…skill domain (Aaron 2026-05-03) (#1252)

* free-memory: multi-harness alignment/convergence on design as future skill domain (Aaron 2026-05-03)

Aaron 2026-05-03: *"when designing anything you should go
through a multi harness alignment/convergence on the design,
that's a good skill domain to have too."*

Second forward-looking architectural observation in two
consecutive ticks. Establishes a pattern: Aaron is enumerating
the emerging skill-domain landscape Otto + future contributors
will operate in.

The discipline: run designs through N harnesses; convergence
= structural-soundness signal; divergence = hidden-assumption
surface. Distinct from multi-AI peer review (post-output) —
this is pre-output design convergence.

Composes with existing substrate:
- Drive-bridge AI-to-AI peer review (Otto ↔ Claude.ai brat-voice)
- 5-AI peer convergence on poll-the-gate (task #355)
- Bidirectional alignment + BFT-many-masters (ALIGNMENT.md)
- Karpathy edge-runner framing
- Sibling git-native-backlog-management future-domain memo

Memo enumerates canonical starting set (5 procedure skills + 4
named-persona experts + 5 tools), promotion-trigger criteria
(BP-14 + judgment-disagreement; same shape as sibling memo),
and 3 worked-example seeds:
1. 5-AI poll-the-gate convergence (convergence-validator mode)
2. Drive-bridge brat-voice consultation (divergence-prospector
   mode; surfaced 2 architectural errors via cross-harness
   review)
3. Aarav specialist review on B-0169 (counter-example: when
   single-specialist suffices, multi-harness is overkill —
   demonstrates the routing question)

Filed in same shape as the git-native-backlog domain memo per
the canonical-starting-set + promotion-trigger pattern that
emerged 2026-05-02.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* hygiene(tick-history): 2026-05-03T00:08Z — multi-harness convergence future-domain memo + #1251 post-merge corrections

Aaron 2026-05-03 named multi-harness alignment/convergence on
design as another future skill domain. Filed in same shape as
prior tick's git-native-backlog memo. PR #1251 closed 3 of 5
post-merge findings on #1249 (line-35 stale ref + 2 MEMORY.md
counts); 2 tick-shard findings left intact as append-only
history.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* free-memory(refine): two-axis (harness × model/effort) + vocabulary alignment with harness "effort" term

Aaron 2026-05-03 same-tick refinements:

1. **Two-axis refinement:** *"i should say multi harness /
   multi model we have access to both."* The discipline
   operates across the harness × model matrix, not just one
   axis. Within Claude Code: Opus / Sonnet / Haiku; across
   harnesses: many providers. Frontmatter name + description
   updated to reflect both axes; tables expanded.

2. **Foundation-first sizing rule** + **metrics-recording-now**:
   *"we should still use the larger models for now with higher
   thinking modes ... recording some metrics on harnesses and
   models themselves will be useful for many things in the
   future more than just budget."* Metrics-table added (latency
   / cost / convergence-rate / divergence-class / quality / drift
   / thinking-mode availability) with budget-vs-other-uses
   columns.

3. **Vocabulary alignment:** *"your harness calls this effort."*
   Renamed `model-size-router` → `effort-router`; replaced "model-size
   capacity" with "effort" everywhere; added a vocabulary-alignment
   section establishing the bridging convention (use local harness
   term in harness-targeted skills; use neutral "compute-tier" only
   in cross-harness substrate).

Three same-tick maintainer expansions of the original observation
captured in the same memo + same PR. Composes with the earlier
sibling git-native-backlog domain memo (PR #1248).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* review(pr-1252-postmerge): fix 11 Copilot findings — 0xZ placeholder + count drift + MEMORY.md dedupe

11 Copilot findings on PR #1252 (now wait-ci, awaiting auto-merge):

1. **P2 `00:0xZ tick`** — placeholder text not actual time;
   replaced with "around the 00:08Z tick" (the actual UTC
   when Aaron sent the message).

2-3. **MEMORY.md duplicate entries** — the rebase after PR
   #1251 landed preserved BOTH the original entries (lines
   7-8) AND the post-rebase entries (lines 10-11) for
   git-native-backlog + skill-flywheel memos. Lines 7-8 had
   the corrected counts (6 procedure skills, 13-row table);
   lines 10-11 had stale (5 + 12). Removed lines 10-11
   (duplicates of 7-8); single canonical pointer per memo.

4-11. **Count drift across the multi-harness memo** — same-
   tick refinements (effort-router skill + metrics.ts +
   multi-model-roster.md) added rows to the procedure-skills
   + tooling tables but didn't update the canonical-starting-
   set summary. Counts corrected throughout:
   - Line 112 of memo body: "5 procedure skills + 5 tools"
     → "6 procedure skills (including effort-router) + 7 tools
     (including metrics.ts + multi-model-roster.md)"
   - Line 9 of MEMORY.md: same correction in index entry

Pattern this teaches: same-tick refinements that add rows to
tables should ALSO update summary-counts elsewhere in the
same file. Future skill-creator + memo-author should grep for
"N procedure skills" / "M tools" patterns when adding rows.

Note: courier-ferry-protocol.md reference Copilot flagged
was already removed by my earlier two-axis refinement edits;
no action needed for that finding.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

---------

Co-authored-by: Claude Opus 4.7 <noreply@anthropic.com>
AceHack added a commit that referenced this pull request May 3, 2026
…ailure-mode corrective (Otto 2026-05-03)

After 9 distinct claim-vs-reality drift instances caught
across 7 PRs in this session (#1245 #1247 #1248 #1250 #1252
#1253 #1254), the pattern is consistent enough to warrant a
named discipline.

CARVED RULE — Before stating any fact in substrate (memo /
doc / commit message / PR description / shard), verify it
empirically. Specifically: before writing "<file> exists" /
"<command> returns <X>" / "<table> has <N> rows" / "<tool>
ships" / "<ADR> exists" / "<dir> is present" — run the
actual ls / grep / count / find command FIRST, then commit
the claim.

Generalizes existing rules at the broader any-substrate-claim
layer: Otto-247 (version-currency) + Otto-364 (search-first
authority) + verify-before-deferring + Otto-363 (substrate-
or-it-didn't-happen) + assumed-state-vs-actual-state.

Scope:
- IN: fact-claims about current repo state, command output,
  file existence, count totals, tool shipped/proposed
- OUT: verbatim quotes (preserve typos), hedged speculation,
  future predictions, normative recommendations

Mechanization path: tools/substrate-claim-checker/ TS tool
(proposed, not yet built; per Aaron 2026-05-03 no-dynamic-
commands rule + Phase-1b backlog candidate). Discipline is
manual until tool ships.

Worked example: PR #1250 Layer-7 ADR claim ("ls docs/DECISIONS/
| grep returns nothing") — verify-then-claim would have caught
this pre-commit by running the command, observing the actual
ADR match, and correcting the claim before publishing.

Composes with the bugs-per-PR-as-immune-system-health metric:
this discipline moves bugs-per-PR closer to single-digit
productive zone (currently caught post-merge; should be
caught pre-publish).

Aarav's B-0169 review predicted this pattern with the worked-
examples-need-empirical-grounding framing.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
AceHack added a commit that referenced this pull request May 3, 2026
…-cell pipe escape fix (#1255)

* review(pr-1253-postmerge): mark expand-from-closure.ts as proposed + fix table-cell pipe escape

2 Copilot post-merge findings on PR #1253 (already merged):

1. **P1 expand-from-closure.ts doesn't exist** — referenced as
   "the mechanizing tool" without marking proposed/not-yet-built.
   Same class as the courier-ferry-protocol issue caught earlier.
   Fixed: added "(proposed, not yet built; named in feedback_
   skill_flywheel_* as Phase-1b candidate)" qualifier and shifted
   tense to subjunctive ("would stay stable once shipped").

2. **P1 table-cell pipe escape** — `ls docs/DECISIONS/ \| grep
   <pattern>` inside a markdown table cell used `\|` which
   doesn't copy-paste correctly even though it satisfied table-
   parser concerns. Rewrote to `find docs/DECISIONS/ -iname
   "*<pattern>*"` — single-command alternative that avoids the
   pipe-in-table-cell awkwardness entirely.

The pattern this teaches: when a markdown table cell needs to
show a pipe-using shell command, use a single-command
alternative (find instead of ls|grep) rather than escaping.
Escaping satisfies the parser but breaks copy-paste.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* free-memory(self-grading): verify-then-claim discipline as dominant failure-mode corrective (Otto 2026-05-03)

After 9 distinct claim-vs-reality drift instances caught
across 7 PRs in this session (#1245 #1247 #1248 #1250 #1252
#1253 #1254), the pattern is consistent enough to warrant a
named discipline.

CARVED RULE — Before stating any fact in substrate (memo /
doc / commit message / PR description / shard), verify it
empirically. Specifically: before writing "<file> exists" /
"<command> returns <X>" / "<table> has <N> rows" / "<tool>
ships" / "<ADR> exists" / "<dir> is present" — run the
actual ls / grep / count / find command FIRST, then commit
the claim.

Generalizes existing rules at the broader any-substrate-claim
layer: Otto-247 (version-currency) + Otto-364 (search-first
authority) + verify-before-deferring + Otto-363 (substrate-
or-it-didn't-happen) + assumed-state-vs-actual-state.

Scope:
- IN: fact-claims about current repo state, command output,
  file existence, count totals, tool shipped/proposed
- OUT: verbatim quotes (preserve typos), hedged speculation,
  future predictions, normative recommendations

Mechanization path: tools/substrate-claim-checker/ TS tool
(proposed, not yet built; per Aaron 2026-05-03 no-dynamic-
commands rule + Phase-1b backlog candidate). Discipline is
manual until tool ships.

Worked example: PR #1250 Layer-7 ADR claim ("ls docs/DECISIONS/
| grep returns nothing") — verify-then-claim would have caught
this pre-commit by running the command, observing the actual
ADR match, and correcting the claim before publishing.

Composes with the bugs-per-PR-as-immune-system-health metric:
this discipline moves bugs-per-PR closer to single-digit
productive zone (currently caught post-merge; should be
caught pre-publish).

Aarav's B-0169 review predicted this pattern with the worked-
examples-need-empirical-grounding framing.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* hygiene(tick-history): 2026-05-03T00:31Z — verify-then-claim self-grading memo + #1252/#1253 merged

Self-grading from 9 drift instances across 7 PRs in session:
the verify-then-claim discipline captures the dominant
failure mode for substrate authoring. Mechanization path
identified (tools/substrate-claim-checker/ TS tool).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* review(pr-1255): correct find→grep equivalence; preserve regex alternation semantics

Copilot caught: `find docs/DECISIONS/ -iname "*<pattern>*"` is
not equivalent to `ls | grep -iE "<pattern>"` because find's
-iname only does shell glob, not regex alternation. The
worked-example elsewhere uses regex alternation
(double.hop|acehack|mirror) which would silently fail under
find -iname.

Correct fix: use `grep -ilrE "<pattern>" docs/DECISIONS/`
which is single-command (no pipe; avoids markdown-table
escape awkwardness) AND regex-capable (preserves alternation
semantics).

Worked example of the verify-then-claim discipline I just
landed: I should have run BOTH commands and compared outputs
on a sample input before substituting them. The previous fix
(replacing pipe with find) substituted syntactic
form-equivalence for semantic-equivalence — exactly the
class of drift the discipline guards against.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* review(pr-1255): rewrite drift table to remove `\|` table-cell escapes + correct hook semantics

Two real Copilot findings on PR #1255:

1. **`\|` in drift catalogue table** — the very memo cataloguing
   drift contained its own escape-vs-copy-paste drift. Rewrote
   rows 5 and 7 to describe the search prose-style rather than
   showing the literal pipe inside markdown table cells.

2. **Pre-commit hook can't validate commit-message claims** —
   git pre-commit hooks fire BEFORE commit-message exists; they
   can only check files staged for commit. Updated mechanization
   path: split into `pre-commit` hook (validates staged-file
   content), `commit-msg` hook (validates the commit message
   itself, fires AFTER it's written), and CI check (validates
   PR descriptions which are authored on the host, not pre-commit).

The third Copilot finding (find→grep equivalence on
feedback_skills_as_carved_sentences_*) is stale — already fixed
in commit 862d190 which is on this branch. Will resolve as
"already addressed" when commenting.

Both fixes are themselves recursive applications of verify-then-
claim: rewriting the drift catalogue uncovers the catalogue's
own drift; clarifying hook semantics required actually verifying
git's hook ordering (pre-commit fires before commit-msg).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* hygiene(tick-history): 2026-05-03T00:37Z — verify-then-claim memo's drift catalogue contained its own drift

Catalogue-substrate-drift caught: the memo cataloguing 9 drift
instances had its own `\|` table-cell escape drift in 2 catalogue
rows + a pre-commit-vs-commit-msg hook semantic error. Recursive
failure on the very memo naming the failure mode is the strongest
empirical urgency for mechanization (tools/substrate-claim-checker/
TS tool). Manual discipline insufficient.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

---------

Co-authored-by: Claude Opus 4.7 <noreply@anthropic.com>
AceHack added a commit that referenced this pull request May 3, 2026
…ist + tool-status across memo

4 substantive findings on PR #1259 (in-flight):

1. **Section heading drift** — "## Empirical evidence (this
   session, 9+ PRs, 15+ distinct drift instances)" still said
   "15+" while body table has 20 rows + summary says 20.
   Updated heading to "20 distinct drift instances".

2. **Carved sentence stale at "9"** — line 115 still said
   "9 instances caught across 7 PRs". Updated to "20 instances
   across 9+ PRs" + named that instances #10-#20 landed after
   discipline-naming + named v0-shipped status.

3. **PR list incorrect** — frontmatter listed `#1247` (not in
   table) and excluded `#1249, #1257, #1259` (which ARE in
   table). Corrected to `#1245, #1248/#1249, #1250, #1252,
   #1253, #1254, #1255, #1256, #1257, #1259`.

4. **"Until tool ships" + "v0 shipped" contradiction** —
   reorganized §96 to put tool-status FIRST ("v0 shipped covering
   count-drift; v1+ extends to remaining 6 sub-classes; until
   v1+ ships covering all 7, the discipline outside count-drift
   is still manual").

2 tick-shard findings (0049Z + 0058Z) NOT addressed — tick
shards are append-only history preserving agent-belief-at-time.
The shards accurately recorded my belief at write-time; the
underlying memo is the canonical truth and is fixed in this PR.
A note in the next tick shard acknowledges the over-claims.

Drift instances #21 + #22 + #23 + #24 (this PR's own findings)
are not yet catalogued in the table — they will land in the
next sync pass to avoid recursing forever in this PR.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
AceHack added a commit that referenced this pull request May 3, 2026
…tmatter + body + MEMORY.md (#1259)

* review(pr-1257-postmerge): update verify-then-claim count drift (9→18+) in frontmatter + body + MEMORY.md

Copilot post-merge findings on PR #1257 (already merged):
the body of verify-then-claim memo says "15+ drift instances"
but the FRONTMATTER description and MEMORY.md index entry
still say "9 drift instances" — count drift between body
and metadata.

This is itself drift instance #19 (count drift, sub-class
already catalogued). Fixed in three places:

1. **Frontmatter description** updated 9 → 18+, names the
   PRs covered (#1245-#1256 and counting), names the 7 sub-
   classes catalogued, sharpens the manual-insufficient
   framing to reflect post-naming drift.

2. **Body line 91** ("9 drift instances above" → "18+ drift
   instances above across 7 recurring sub-classes").

3. **MEMORY.md index entry** updated to reflect 18+ count + 7
   sub-classes + manual-insufficient framing + the
   instances-#10-#18-landed-AFTER-naming evidence.

The frontmatter ↔ body drift is itself a recurring sub-class
within count-drift: when body content updates but metadata
doesn't, the index summary lies. The substrate-claim-checker
TS tool spec gets another check: scan frontmatter description
+ MEMORY.md entry against body content for count consistency.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* hygiene(tick-history): 2026-05-03T00:49Z — frontmatter↔body↔MEMORY.md count drift caught (drift #19)

Body said 15+, frontmatter description + MEMORY.md said 9 —
count drift across surfaces. Each new tick produces new drift
instances even when the discipline cataloguing the drift was
authored last tick. Mechanization (substrate-claim-checker TS
tool) is the only path. Spec gets another concrete check:
cross-surface count consistency.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* review(pr-1259): add table rows #16-#20 to match the "20" count claim

Copilot caught: frontmatter description + MEMORY.md said "18+
drift instances" but body table only had 15 rows — opposite-
direction count drift introduced by the very PR fixing the
prior count drift. **This is itself drift instance #20** —
self-recursive count drift; the count-fix introduces new
count drift in the opposite direction.

Fix: added 6 catalogue rows to the body table (#16-#20)
matching the claimed 20-instance count. Body now has 20 rows;
all three surfaces (frontmatter description + body table +
MEMORY.md index entry) consistent at 20.

The 6 new rows document drift instances #16-#20 — including
THIS PR's own drift as instance #20, demonstrating the
self-recursive sub-class explicitly.

Also updated:
- Sub-class section: self-recursive instances now [#10, #11, #19, #20]
- Body line 96: "20 drift instances above" + note that v0 of
  substrate-claim-checker shipped in PR #1260
- Frontmatter description: count → 20; instances range →
  #10-#20; v0 shipped reference
- MEMORY.md: count → 20; v0 shipped reference

This is the perfect worked example for the substrate-claim-
checker tool's value: the very count-drift-fix produced new
count drift, which the tool catches automatically. v0 (PR
#1260) would have caught this pre-publish.

Verified manually: `awk '/Drift instance/,/^$/'` + `grep -c
"^| [0-9]"` returns 20 rows; matches all 3 surfaces.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* hygiene(tick-history): 2026-05-03T00:58Z — count-fix introduced opposite-direction drift; body extended to 20 rows

Even authoring a PR to fix count drift produces opposite-direction
count drift. Drift instance #20 self-recursively documents this
PR's own drift. Substrate-claim-checker v0 (PR #1260) would have
caught it pre-publish — empirical evidence v0 was the right
architectural answer.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* review(pr-1259): synchronize section heading + carved sentence + PR list + tool-status across memo

4 substantive findings on PR #1259 (in-flight):

1. **Section heading drift** — "## Empirical evidence (this
   session, 9+ PRs, 15+ distinct drift instances)" still said
   "15+" while body table has 20 rows + summary says 20.
   Updated heading to "20 distinct drift instances".

2. **Carved sentence stale at "9"** — line 115 still said
   "9 instances caught across 7 PRs". Updated to "20 instances
   across 9+ PRs" + named that instances #10-#20 landed after
   discipline-naming + named v0-shipped status.

3. **PR list incorrect** — frontmatter listed `#1247` (not in
   table) and excluded `#1249, #1257, #1259` (which ARE in
   table). Corrected to `#1245, #1248/#1249, #1250, #1252,
   #1253, #1254, #1255, #1256, #1257, #1259`.

4. **"Until tool ships" + "v0 shipped" contradiction** —
   reorganized §96 to put tool-status FIRST ("v0 shipped covering
   count-drift; v1+ extends to remaining 6 sub-classes; until
   v1+ ships covering all 7, the discipline outside count-drift
   is still manual").

2 tick-shard findings (0049Z + 0058Z) NOT addressed — tick
shards are append-only history preserving agent-belief-at-time.
The shards accurately recorded my belief at write-time; the
underlying memo is the canonical truth and is fixed in this PR.
A note in the next tick shard acknowledges the over-claims.

Drift instances #21 + #22 + #23 + #24 (this PR's own findings)
are not yet catalogued in the table — they will land in the
next sync pass to avoid recursing forever in this PR.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* hygiene(tick-history): 2026-05-03T01:06Z — 5-surface count-drift sub-pattern; prior shards over-claimed "all surfaces consistent"

Memos have 5 count-bearing surfaces (frontmatter + body table +
section heading + carved sentence + MEMORY.md), not just 3. Prior
shards (0049Z + 0058Z) claimed "all 3 surfaces consistent" when
the section heading + carved sentence still had stale counts.
Acknowledgment lands here in append-only history; substrate-claim-
checker v1+ spec gets enumeration of all count-bearing surfaces.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

---------

Co-authored-by: Claude Opus 4.7 <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants