Skip to content

frontier-readiness: gap #5 audit — skills + tools (gap SUBSTANTIALLY COMPLETE)#192

Open
AceHack wants to merge 4 commits intomainfrom
frontier-readiness/audit-skills-and-tools-final
Open

frontier-readiness: gap #5 audit — skills + tools (gap SUBSTANTIALLY COMPLETE)#192
AceHack wants to merge 4 commits intomainfrom
frontier-readiness/audit-skills-and-tools-final

Conversation

@AceHack
Copy link
Copy Markdown
Member

@AceHack AceHack commented Apr 23, 2026

Summary\n\nFinal gap #5 audits: skills (~236 delegated to skill-tune-up) + tools/** (13 subdirs; mostly factory-generic).\n\nGap #5 SUBSTANTIALLY COMPLETE. Final tally: 6 factory-generic / 10 both-coupled / 5 zeta-library-specific.\n\nGap #1 (multi-repo split) now UNBLOCKED by classification.\n\nOtto (loop-agent PM hat).

…COMPLETE)

Final gap #5 audits (summary-level):

.claude/skills/** (236 skills): both-coupled at directory
level; per-skill classification delegated to Aarav's
skill-tune-up portability-drift criterion (tool already
exists). Expected majority factory-generic; minority
project: zeta declared. Portability-drift-flagged skills
get split-time remediation. Effort L (volume), mechanical
per-skill after audit runs.

tools/** (13 subdirs): mostly factory-generic. Formal-
verification tooling (alloy / lean4 / tla / Z3Verify)
uniformly factory-generic. Factory hygiene scripts
factory-generic. Zeta-library-specific: invariant-
substrates only. Both-coupled: setup/ (shape generic,
Zeta installs specific). Effort M (10 of 13 move as-is).

Gap #5 SUBSTANTIALLY COMPLETE. Final tally:
- 6 factory-generic
- 10 both-coupled
- 5 zeta-library-specific

Remaining to-do: Aarav runs skill-tune-up portability audit
across 236 skills (one-time pre-split hygiene pass).

Gap #1 (multi-repo split) now UNBLOCKED by classification.

Attribution: Otto (loop-agent PM hat).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
Copilot AI review requested due to automatic review settings April 23, 2026 20:03
@AceHack AceHack enabled auto-merge (squash) April 23, 2026 20:03
AceHack added a commit that referenced this pull request Apr 23, 2026
…udits total)

Gap #5 closure milestone reached.

Tick actions:
- .claude/skills/** audited summary-level (236 skills
  delegated to Aarav skill-tune-up portability audit)
- tools/** audited (13 subdirs; mostly factory-generic,
  3 both/project outliers)
- Gap #5 marked SUBSTANTIALLY COMPLETE in BACKLOG P0 row
- Gap #1 (multi-repo split) unblocked by classification

Final gap #5 tally:
- 6 factory-generic
- 10 both-coupled
- 5 zeta-library-specific

Frontier readiness progress (3 of 8 complete):
- Gap #3 closed (NSA test history, PR #177)
- Gap #8 closed on re-inspection (Otto-4)
- Gap #5 SUBSTANTIALLY COMPLETE (Otto-20)

Remaining: gap #1 (unblocked), #2 (linguistic-seed,
high-priority prompt-injection mechanism), #4 (bootstrap-
reference docs, L + reviewers), #6 (persona portability,
may close on re-inspection given agents audit), #7
(tick-history scope-mix).

Original gap #5 estimate: ~20-40 ticks. Actual: ~14 ticks
with batching acceleration.

PR #192 armed for auto-merge.

Attribution: Otto (loop-agent PM hat).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
Copy link
Copy Markdown

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Updates the Frontier-readiness “factory vs Zeta separation” audit to cover gap #5 surfaces for skills and tooling, aiming to mark the audit as substantially complete and unblock the multi-repo split planning.

Changes:

  • Updates the “Files to audit” list to include batched/summary audit references for .claude/skills/** and tools/**.
  • Adds new audit sections classifying .claude/skills/** and tools/**, with split/refactor notes.
  • Adds/updates the “pattern summary” tally of classified surfaces.

Comment on lines 62 to +75
### Files to audit (not yet classified; add rows as they land)

- `docs/GLOSSARY.md`
- `docs/FACTORY-HYGIENE.md`
- `docs/ROUND-HISTORY.md`
- `docs/TECH-RADAR.md`
- `docs/BACKLOG.md`
- `docs/ROADMAP.md`
- `docs/VISION.md`
- `.claude/skills/*/SKILL.md` (each)
- `.claude/agents/*.md` (each)
- `openspec/**` (structural; library-specific-heavy)
- `tools/**/*` scripts (some factory, some Zeta-build)
- `.github/` workflows + config
- `.claude/skills/*/SKILL.md` (~236 skills, audited summary-level Otto-20 below)
- `tools/**/*` scripts (audited summary-level Otto-20 below)
- `.claude/agents/*.md` (batched Otto-19)
- `openspec/**` (batched Otto-19)
- `.github/` workflows + config (batched Otto-19)
Copy link

Copilot AI Apr 23, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The list is under “Files to audit (not yet classified)”, but .claude/skills/*/SKILL.md and tools/** now have audit sections below with an overall classification. Please either move these entries into “Files audited” (possibly noting “summary-level”) or rename/reword the heading so it’s not claiming they’re unclassified.

Copilot uses AI. Check for mistakes.
Comment on lines +455 to +462
**Scope:** 13 top-level subdirectories under `tools/`:

| Subdirectory | Class | Notes |
|---|---|---|
| `tools/alignment/` | factory-generic | Alignment audit tools |
| `tools/alloy/` | factory-generic | Alloy formal verification (factory-wide capability) |
| `tools/audit-packages.sh` | factory-generic | NuGet package audit — Zeta-specific ecosystem but pattern generic |
| `tools/git/` (push-with-retry.sh) | factory-generic | Generic git wrapper; works in any repo |
Copy link

Copilot AI Apr 23, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This section says “13 top-level subdirectories under tools/”, but the table includes tools/audit-packages.sh and tools/profile.sh, which are files (not directories). Consider rephrasing to “top-level entries” (or split into “directories” vs “top-level scripts”) to keep the inventory precise.

Copilot uses AI. Check for mistakes.
Comment on lines +510 to +512
| factory-generic | 6 | GOVERNANCE, AGENT-BEST-PRACTICES, ALIGNMENT, AUTONOMOUS-LOOP, FACTORY-HYGIENE, `.claude/skills/**` (majority expected), `tools/**` (majority subdirs) |
| both (coupled) | 10 | CLAUDE, AGENTS, CONFLICT-RESOLUTION, WONT-DO, TECH-RADAR, GLOSSARY, `.claude/agents/`, `openspec/`, `.github/`, `tools/setup/` |
| zeta-library-specific | 5 | ROUND-HISTORY, BACKLOG, ROADMAP, VISION, `tools/invariant-substrates/` |
Copy link

Copilot AI Apr 23, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In the “Total tally” table, .claude/skills/** and tools/** are listed under “factory-generic”, but earlier in this doc their “Overall classification” is both (coupled) at the directory level. Please reconcile the tally with the per-section classifications (e.g., move them to “both” or adjust the earlier “Overall classification” wording to match what the tally is counting).

Suggested change
| factory-generic | 6 | GOVERNANCE, AGENT-BEST-PRACTICES, ALIGNMENT, AUTONOMOUS-LOOP, FACTORY-HYGIENE, `.claude/skills/**` (majority expected), `tools/**` (majority subdirs) |
| both (coupled) | 10 | CLAUDE, AGENTS, CONFLICT-RESOLUTION, WONT-DO, TECH-RADAR, GLOSSARY, `.claude/agents/`, `openspec/`, `.github/`, `tools/setup/` |
| zeta-library-specific | 5 | ROUND-HISTORY, BACKLOG, ROADMAP, VISION, `tools/invariant-substrates/` |
| factory-generic | 5 | GOVERNANCE, AGENT-BEST-PRACTICES, ALIGNMENT, AUTONOMOUS-LOOP, FACTORY-HYGIENE |
| both (coupled) | 11 | CLAUDE, AGENTS, CONFLICT-RESOLUTION, WONT-DO, TECH-RADAR, GLOSSARY, `.claude/agents/`, `.claude/skills/**`, `openspec/`, `.github/`, `tools/**` |
| zeta-library-specific | 4 | ROUND-HISTORY, BACKLOG, ROADMAP, VISION |

Copilot uses AI. Check for mistakes.
Copilot AI review requested due to automatic review settings April 23, 2026 22:25
Copy link
Copy Markdown

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Copilot reviewed 1 out of 1 changed files in this pull request and generated 3 comments.

Comment on lines +504 to +523
## Pattern summary after 20 audits — gap #5 essentially complete

Total tally:

| Class | Count | Surfaces |
|---|---|---|
| factory-generic | 6 | GOVERNANCE, AGENT-BEST-PRACTICES, ALIGNMENT, AUTONOMOUS-LOOP, FACTORY-HYGIENE, `.claude/skills/**` (majority expected), `tools/**` (majority subdirs) |
| both (coupled) | 10 | CLAUDE, AGENTS, CONFLICT-RESOLUTION, WONT-DO, TECH-RADAR, GLOSSARY, `.claude/agents/`, `openspec/`, `.github/`, `tools/setup/` |
| zeta-library-specific | 5 | ROUND-HISTORY, BACKLOG, ROADMAP, VISION, `tools/invariant-substrates/` |

Gap #5 audit surface essentially complete. Remaining
next-step: run skill-tune-up portability-drift across
236 skills (Aarav task), produce per-skill split manifest.
That's a one-time pre-split hygiene pass, not per-tick
audit work.

Gap #5 can be marked **SUBSTANTIALLY COMPLETE** pending
Aarav's skill-tune-up portability audit. Gap #1
(multi-repo split) is now unblocked by classification.

Copy link

Copilot AI Apr 23, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In the “Total tally” table, the factory-generic row’s Count is 6 but the Surfaces column lists 7 items (GOVERNANCE, AGENT-BEST-PRACTICES, ALIGNMENT, AUTONOMOUS-LOOP, FACTORY-HYGIENE, .claude/skills/**, tools/**). Also, several surfaces listed here (e.g., FACTORY-HYGIENE, GLOSSARY, TECH-RADAR, ROUND-HISTORY, BACKLOG, ROADMAP, VISION) are still listed earlier under “Files to audit (not yet classified)”. Please reconcile the tally with the earlier status (either update the earlier list, or mark these as “expected” rather than counted as audited/classified).

Suggested change
## Pattern summary after 20 audits — gap #5 essentially complete
Total tally:
| Class | Count | Surfaces |
|---|---|---|
| factory-generic | 6 | GOVERNANCE, AGENT-BEST-PRACTICES, ALIGNMENT, AUTONOMOUS-LOOP, FACTORY-HYGIENE, `.claude/skills/**` (majority expected), `tools/**` (majority subdirs) |
| both (coupled) | 10 | CLAUDE, AGENTS, CONFLICT-RESOLUTION, WONT-DO, TECH-RADAR, GLOSSARY, `.claude/agents/`, `openspec/`, `.github/`, `tools/setup/` |
| zeta-library-specific | 5 | ROUND-HISTORY, BACKLOG, ROADMAP, VISION, `tools/invariant-substrates/` |
Gap #5 audit surface essentially complete. Remaining
next-step: run skill-tune-up portability-drift across
236 skills (Aarav task), produce per-skill split manifest.
That's a one-time pre-split hygiene pass, not per-tick
audit work.
Gap #5 can be marked **SUBSTANTIALLY COMPLETE** pending
Aarav's skill-tune-up portability audit. Gap #1
(multi-repo split) is now unblocked by classification.
## Pattern summary after 15 completed audits
Audited/classified tally so far:
| Class | Count | Surfaces |
|---|---|---|
| factory-generic | 6 | GOVERNANCE, AGENT-BEST-PRACTICES, ALIGNMENT, AUTONOMOUS-LOOP, `.claude/skills/**` (majority expected), `tools/**` (majority subdirs) |
| both (coupled) | 8 | CLAUDE, AGENTS, CONFLICT-RESOLUTION, WONT-DO, `.claude/agents/`, `openspec/`, `.github/`, `tools/setup/` |
| zeta-library-specific | 1 | `tools/invariant-substrates/` |
Expected dominant destination once explicitly audited
(not counted above while still listed earlier as "not yet
classified"): FACTORY-HYGIENE → factory-generic;
TECH-RADAR, GLOSSARY → both; ROUND-HISTORY, BACKLOG,
ROADMAP, VISION → zeta-library-specific.
Gap #5 has enough coverage to guide split planning, but
the audit is not fully complete yet. Remaining next-step:
finish explicit classification of the still-pending
primary surfaces above, then run skill-tune-up
portability-drift across 236 skills (Aarav task) to
produce the per-skill split manifest. That's a one-time
pre-split hygiene pass, not per-tick audit work.
Gap #1 (multi-repo split) now has a usable classification
baseline, but gap #5 should not be marked fully complete
until those pending surfaces are moved out of the "not yet
classified" list.

Copilot uses AI. Check for mistakes.
Comment on lines +71 to +75
- `.claude/skills/*/SKILL.md` (~236 skills, audited summary-level Otto-20 below)
- `tools/**/*` scripts (audited summary-level Otto-20 below)
- `.claude/agents/*.md` (batched Otto-19)
- `openspec/**` (batched Otto-19)
- `.github/` workflows + config (batched Otto-19)
Copy link

Copilot AI Apr 23, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The “Files to audit (not yet classified)” list now includes .claude/skills/*/SKILL.md and tools/**/*, but both are classified later in this document (new audit sections below). Consider moving these entries to the “Files audited” list, or renaming this section to reflect that it now contains already-classified surfaces too. Also, this list references “Otto-20 below” / “Otto-19” but those labels don’t appear anywhere else in the doc, so the cross-reference can’t be followed.

Copilot uses AI. Check for mistakes.
**Overall classification:** **both (coupled)** — mixed
per-subdirectory.

**Scope:** 13 top-level subdirectories under `tools/`:
Copy link

Copilot AI Apr 23, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This line says “13 top-level subdirectories under tools/”, but the list includes at least tools/audit-packages.sh and tools/profile.sh which are files, not subdirectories. Suggest rewording to “13 top-level entries” (or “11 subdirectories + 2 scripts”) to keep the audit accurate.

Suggested change
**Scope:** 13 top-level subdirectories under `tools/`:
**Scope:** 13 top-level entries under `tools/`:

Copilot uses AI. Check for mistakes.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants