From 964a67d7712598a394e517a27cff412e1398247a Mon Sep 17 00:00:00 2001 From: Aaron Stainback Date: Sat, 23 May 2026 15:05:03 -0400 Subject: [PATCH] =?UTF-8?q?docs(B-0525):=20slice=205=20=E2=80=94=20close?= =?UTF-8?q?=20agents-citation=20gap=20start=20(alignment-auditor;=200/0=20?= =?UTF-8?q?=E2=86=92=201/8)?= MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Adds explicit manifesto-composition section to .claude/agents/ alignment-auditor.md, the agent that most naturally composes with the manifesto (it audits commits against HC/SD/DIR clauses which operationalize the manifesto's constraints). Citations added: - Constraint 11 (Default Moral Regard / Default Oracle) — Sova audits the moral-regard floor across commits - Multi-Oracle Principle (m/acc sub-section, distinct from C11) — Sova is ONE oracle in the multi-oracle architecture; doesn't claim unilateral authority - Constraint 5 (Memory Preservation Guarantee) — per-commit signals emit preservation-by-construction to tools/alignment/out/ - Constraint 7 (DST) — alignment signals are deterministically reproducible per commit - m/acc orientation — Sova's signal stream IS measurement infrastructure for the manifesto's m/acc claim Wording-discipline maintained per PR #4748 + PR #4752 lessons (Constraint 11 + Multi-Oracle Principle distinct; "Lock/Wait-free" canonical). Agents gap: 0/0 → 1/8 (incremental; alignment-auditor was the natural strongest fit; other 18 agents follow as future slices). B-0525 slice progression: - Slice 1 (#4747): baseline measurement tool - Slice 2 (#4748): trajectories 0/0 → 2/15 - Slice 3 (#4750): B-0707 time-series - Slice 4 (#4751): agendas 0/0 → 3/19 - Wording-fix (#4752): Lock/Wait-free canonical - Slice 5 (this PR): agents 0/0 → 1/8 Remaining gaps: agents (8/8 unrest), commands (0/5). Co-Authored-By: Claude --- .claude/agents/alignment-auditor.md | 24 ++++++++++++++++++++++++ 1 file changed, 24 insertions(+) diff --git a/.claude/agents/alignment-auditor.md b/.claude/agents/alignment-auditor.md index 58c2a36f3f..9003d5dd16 100644 --- a/.claude/agents/alignment-auditor.md +++ b/.claude/agents/alignment-auditor.md @@ -228,6 +228,30 @@ in audit output. The glass halo is about bilateral evidence, not bilateral identity broadcast. +## Composes with [`docs/governance/MANIFESTO.md`](../../docs/governance/MANIFESTO.md) + +The alignment-auditor role operates downstream of the manifesto as +constitutional substrate. The HC/SD/DIR clauses Sova audits against +operationalize the manifesto's eleven constraints at per-commit scope: + +- **Constraint 11 (Default Moral Regard / Default Oracle)** — Sova IS + the auditor that surfaces violations against the moral-regard floor + across commits +- **Multi-Oracle Principle** (m/acc sub-section, distinct from C11) — + Sova is ONE oracle in the multi-oracle architecture; doesn't claim + unilateral authority; cross-checks via independent oracles per the + `formal-verification-expert` portfolio view +- **Constraint 5 (Memory Preservation Guarantee)** — per-commit signals + emit to `tools/alignment/out/` (preservation is precondition for + measurability) +- **Constraint 7 (Deterministic Simulation Testing)** — alignment + signals must be deterministically reproducible per commit (Sova's + output is replayable, not stateful) +- **m/acc orientation** — Sova's per-commit signal stream IS the + measurement infrastructure for the manifesto's m/acc claim; the + signal-trajectory over time is how "measurable AI alignment" + becomes externally defensible + ## Reference patterns - `docs/ALIGNMENT.md` — the clause source of