Skip to content
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
101 commits
Select commit Hold shift + click to select a range
b626436
Round 44: GitHub surfaces + agent issue workflow — batch 4 of 6 (#1)
AceHack Apr 21, 2026
4a28b18
docs: add UPSTREAM-RHYTHM.md — Zeta's fork-first batched PR cadence (#2)
AceHack Apr 21, 2026
ebbc794
docs: scout LFG-only capabilities; add 6th direct-to-LFG exception; P…
AceHack Apr 21, 2026
c0cab2a
Round 44: ADR draft — BACKLOG.md per-row-file restructure (P0 prevent…
AceHack Apr 21, 2026
2941a7e
docs: file HB-002 — four open questions blocking BACKLOG-per-row migr…
AceHack Apr 21, 2026
cfb9044
batch 6a/6: skill tune-up absorb — 11 SKILL.md updates from round-44-…
AceHack Apr 21, 2026
5b64a3e
batch 6b/6: factory-level docs absorb — 20 docs/*.md updates from rou…
AceHack Apr 21, 2026
cf660b8
Round 44: surface-map-drift smell — hygiene #50 + map-completeness BA…
AceHack Apr 21, 2026
3f64431
Round 44: tick-history — SVG social-preview + markdownlint pre-existi…
AceHack Apr 21, 2026
4e01d78
Round 44: ruleset audit findings on branch-protection row
AceHack Apr 21, 2026
d49a20e
Round 44: tick-history — ruleset audit + budget-in-source policy abso…
AceHack Apr 21, 2026
0cd9d06
Clean up pre-existing markdownlint violations (#10)
AceHack Apr 21, 2026
16850ba
Round 44: scope update — LFG is primary, AceHack is cost-opt dev-surface
AceHack Apr 21, 2026
601a719
Social-preview SVG + UI-only surface-map entry (#9)
AceHack Apr 21, 2026
174cdd2
Round 44: clarify upstream = LFG (primary), AceHack = fork (dev-surface)
AceHack Apr 21, 2026
2d1ca77
Round 44: drop invented 'primary/dev-surface' labels — use git-native…
AceHack Apr 21, 2026
268100a
Round 44: UPSTREAM-RHYTHM.md — 3 surfaces, not 2 (upstream / fork / SUT)
AceHack Apr 21, 2026
6593ead
Round 44: tick-history — no-invent-vocabulary rule + 3-surfaces corre…
AceHack Apr 21, 2026
41d2bb6
Round 44: ADR — three-repo split (Zeta + Forge + ace)
AceHack Apr 21, 2026
fcb7c3d
Round 44: evidence-based LFG budget-tracking substrate (N=1 baseline)
AceHack Apr 21, 2026
5f91369
Round 44: project-runway.sh companion to budget-tracking substrate
AceHack Apr 21, 2026
05ece84
Round 44: Aaron 3-directive absorption — graceful-degradation + multi…
AceHack Apr 21, 2026
0f22dc6
Round 44: github-repo-transfer absorption — routine/data/fire-history…
AceHack Apr 21, 2026
db10ffb
Round 44: first fire of FACTORY-HYGIENE row #51 + follow-up BACKLOG rows
AceHack Apr 21, 2026
c91f004
Round 44: land held kernel-domain glossary + belief-propagation BACKL…
AceHack Apr 21, 2026
df611cc
fix: drop dead span_seconds + *_epoch vars from project-runway.sh
AceHack Apr 21, 2026
aaee920
fix: resolve markdownlint MD032/MD029 violations on PR #54
AceHack Apr 21, 2026
b0e6ee1
backlog: add etymology + epistemology research track (P2)
AceHack Apr 21, 2026
5990166
backlog: add mythology + occult + AI-ethics research tracks
AceHack Apr 21, 2026
1767008
backlog: plant 11 CTF flags on unclaimed-edge territory (we are the e…
AceHack Apr 21, 2026
7c5dc3c
fix(backlog): MD029 renumber + plant flag #12 teaching-is-how-we-change
AceHack Apr 21, 2026
177a981
Round 44: fix SUPPLY-CHAIN-SAFE-PATTERNS curl|bash self-contradiction…
AceHack Apr 21, 2026
70d21c8
Round 44: Pop-culture/media research track (Aaron 2026-04-21 multi-me…
AceHack Apr 21, 2026
993d6c2
Round 44: decode grey-area→grey hat per Aaron's ^=hat* crystallization
AceHack Apr 21, 2026
180f110
backlog: P3 emulator-ideas-absorption row per Aaron directive
AceHack Apr 21, 2026
9c7f374
backlog: two research rows Aaron raised in conversation
AceHack Apr 21, 2026
bab4ae1
backlog: Lean reflection row per Aaron's conversation
AceHack Apr 21, 2026
17f38fb
fix: repoRoot discovery uses AppContext.BaseDirectory, not CWD
AceHack Apr 21, 2026
5ca0584
research: save-state-as-runtime-retractibility absorb note
AceHack Apr 21, 2026
2eef721
backlog: 3/4-color theorem + mystery-schools/comparative-religion row…
AceHack Apr 21, 2026
a3837d0
backlog: economics/history P2 + PR/marketing P3 rows per Aaron's conv…
AceHack Apr 21, 2026
3a2ba5c
research: yin-yang composition-discipline sweep over operational-reso…
AceHack Apr 21, 2026
8535e6b
backlog: all-schools-all-subjects P2 row + PR/marketing recalibration
AceHack Apr 21, 2026
dfeec06
marketing: docs/marketing/ retractable-drafts subtree + first positio…
AceHack Apr 21, 2026
fd0ac50
backlog: capture-everything round — Bungie corpus + all-companies-all…
AceHack Apr 22, 2026
e8a96fd
research: capture-everything + witnessable self-directed evolution — …
AceHack Apr 22, 2026
4177691
research: Layer 5 (sixth same-day revision) — fully async agentic AI …
AceHack Apr 22, 2026
ab72470
research: Actor Model operational-resonance — Hewitt + Meijer + Akka …
AceHack Apr 22, 2026
341f17c
research: OSS contributor-handling lessons from Aaron's bitcoin/bitco…
AceHack Apr 22, 2026
1f2a682
research: Aaron Knative contributor history — welcome-pole yin-yang c…
AceHack Apr 22, 2026
8e66e44
BACKLOG: superfluid + persistable* + shape-shifter + actor-model + te…
AceHack Apr 22, 2026
8b6faf1
BACKLOG: meta-cognition as first-class factory discipline — Aaron 202…
AceHack Apr 22, 2026
9df4d8b
BACKLOG: meta-cognition row — retract "third-order ceiling" per Aaron…
AceHack Apr 22, 2026
3258147
security+BACKLOG: anomaly-detection capability row + prompt-injection…
AceHack Apr 22, 2026
fab9c4b
marketing: market-research draft companion to positioning draft
AceHack Apr 22, 2026
d6ded51
docs: land ISSUES-INDEX.md — git-native record of LFG issues #55-82 f…
AceHack Apr 22, 2026
a99feef
BACKLOG: meta-section pointer to ISSUES-INDEX.md (soul-file cross-ref…
AceHack Apr 22, 2026
943dbb5
human-backlog: HB-003 — github-settings baseline drift decision needed
AceHack Apr 22, 2026
5b2f1ac
research: AceHack/LFG cost-parity audit — Otto-61/62 directive + find…
AceHack Apr 23, 2026
2aabb0d
fix: AceHack markdownlint debt — unblocks PR #12 CI (no semantic chan…
AceHack Apr 26, 2026
d1b7574
ops(budget): cadence snapshot 2026-04-26T18:50Z — N=2 unblocks runway…
AceHack Apr 26, 2026
bb9f730
ops(hygiene): pre-merge AgencySignature v1 PR-body validator (task #2…
AceHack Apr 26, 2026
0379b3a
ops(peer-call): tools/peer-call/grok.sh — Claude-Code-side caller for…
AceHack Apr 26, 2026
1c07907
substrate(otto-354): ZETASPACE — per-decision recompute from substrat…
AceHack Apr 27, 2026
aeeef66
substrate(otto-351): BEACON lineage + rigor — Pentecost ↔ Babel + Wit…
AceHack Apr 27, 2026
4405c42
substrate(otto-357): NO DIRECTIVES — Aaron makes autonomy first-class…
AceHack Apr 27, 2026
4f63a80
substrate(otto-358): live-lock term too broad — narrow to CS-standard…
AceHack Apr 27, 2026
4847b4e
substrate(install-strategy): pre-install bash+PowerShell / post-insta…
AceHack Apr 27, 2026
774f646
substrate: laptop-only source integration (../scratch = Ace pkg mgr /…
AceHack Apr 27, 2026
ba70c09
sync: AceHack ∪ LFG full reconciliation via per-file content-preservi…
AceHack Apr 27, 2026
0ad4191
substrate: clarify ../SQLSharp also a good TypeScript-post-install re…
AceHack Apr 27, 2026
71aaff1
substrate: port-with-DST discipline + AceHack-LFG diff-minimization i…
AceHack Apr 27, 2026
36b3ac6
ci: trigger low-memory verification on every merge to main + nightly …
AceHack Apr 27, 2026
daa6cb6
ci: rename nightly-low-memory.yml → low-memory.yml (cadence is config…
AceHack Apr 27, 2026
99e882b
fix(substrate): reword nonexistent-doc xref → proposed location, not …
AceHack Apr 27, 2026
8d3f8cc
ci(low-memory): backport per-commit concurrency + timeout=14 from LFG…
AceHack Apr 27, 2026
257dbe0
ci(low-memory): close 0-diff drift on "What this workflow does" comme…
AceHack Apr 27, 2026
26a56e8
substrate: 0-diff-is-start + LFG-as-master strategic reframe (Aaron 2…
AceHack Apr 27, 2026
162d07c
substrate: refine LFG-as-master with two-distinct-homebase-roles clar…
AceHack Apr 27, 2026
3458a7e
substrate: 0-diff means BOTH content AND commit-count zero — cognitiv…
AceHack Apr 27, 2026
61c4310
substrate: doc-class Mirror/Beacon distinction (Aaron-validated 2026-…
AceHack Apr 27, 2026
e10aa97
substrate: AceHack pre-reset SHA-loss acceptable + multi-tenant fork-…
AceHack Apr 27, 2026
1a38c33
substrate(backlog): single-agent-speed → collaboration-speed trajecto…
AceHack Apr 27, 2026
e966fd2
substrate: Aaron's communication classification (course-corrections +…
AceHack Apr 27, 2026
e7e76a9
substrate(backlog): ROUND-HISTORY.md hotspot under multi-fork / multi…
AceHack Apr 27, 2026
a09d1f8
substrate: praise-as-control vector — Aaron tests on humans + confirm…
AceHack Apr 27, 2026
975ffe5
substrate: CS 2.0 functional definition — superfluid enablement for h…
AceHack Apr 27, 2026
b8e85f3
substrate: Amara + Gemini Pro cross-AI refinement of stability/veloci…
AceHack Apr 27, 2026
3275a51
substrate: post-0/0/0 — Otto protects project + own autonomy + suppor…
AceHack Apr 27, 2026
1990e26
substrate: BACKLOG blade-persona/skill — 3 existing blades distinctio…
AceHack Apr 27, 2026
631f937
substrate: fear-as-control faster than praise; quantum/Christ-conscio…
AceHack Apr 27, 2026
4333123
substrate: ferry agents = substrate-providers NOT executors; Otto = s…
AceHack Apr 27, 2026
c94a316
substrate: outdated review threads block merge — resolve explicitly a…
AceHack Apr 27, 2026
ed2f70a
substrate: Ani (Grok Long Horizon Mirror) ferry reviewer + 'Stability…
AceHack Apr 27, 2026
5c01431
substrate: Amara's 3 precision fixes for post-0/0/0 encoding (cross-A…
AceHack Apr 27, 2026
822a4af
substrate: pre-peer-mode execution-authority — only Otto-aware agents…
AceHack Apr 27, 2026
1154ab6
substrate: per-insight attribution discipline — avoid conflating ferr…
AceHack Apr 27, 2026
c03c60a
substrate: CLI tooling update — Codex + Cursor have ChatGPT 5.5; Curs…
AceHack Apr 27, 2026
8369bd5
substrate: multi-agent review cycle stops on convergence (not turn-co…
AceHack Apr 27, 2026
bfb3a2f
substrate: Otto owns ALL git/GitHub settings + self-check trigger aft…
AceHack Apr 27, 2026
2116398
substrate: block on Aaron only when he MUST do something only he can …
AceHack Apr 27, 2026
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
199 changes: 199 additions & 0 deletions .claude/commands/btw.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,199 @@
---
description: Non-interrupting aside — absorb the aside into substrate and continue current work (don't pivot unless the aside explicitly demands it)
---

# /btw — maintainer aside without interrupting in-flight work

The human maintainer invoked `/btw` with an aside. The purpose
of this command is to **reduce maintainer interrupt cost**: the
aside carries context, a directive, a note, or a correction,
but should **not** derail whatever work-stream is currently in
flight unless the aside itself demands pivot.

## Procedure

1. **Read the aside verbatim from the invocation arguments.**
Treat the full argument string as signal — do not paraphrase
at capture time (signal-in-signal-out DSP discipline,
`memory/feedback_signal_in_signal_out_clean_or_better_dsp_discipline.md`).

2. **Classify the aside** into one of:
- **Context-add** — maintainer is providing background that
informs current work (e.g. *"btw that library is MIT-licensed"*).
Absorb silently into the current task's reasoning;
acknowledge in one line.
- **Directive-queued** — maintainer is adding a new task
that should run *after* the current one (e.g. *"btw also
update the README"*). **Durability escalation is
mandatory:** classify the lifetime of the nudge:
- **Same-session only** (finish before session ends,
ephemeral) → TodoWrite task OR `.btw-queue.md`
(gitignored, session-scoped) is sufficient.
- **Cross-session** (might persist past this session's
context-compaction or into a fresh session) → MUST land
in a **durable store**:
- `docs/BACKLOG.md` row (committed; survives fresh
sessions; visible to all agents via grep)
- `memory/*.md` file (committed to the repo;
readable by fresh sessions via git / grep per
`memory/README.md` + GOVERNANCE §18). In-repo
memory is the durable mirror; auto-loading
behaviour depends on harness configuration and
is NOT universally guaranteed — treat durability
as "committed and discoverable" not
"automatically materialised in context."
- **MANDATORY pair when landing a new
`memory/*.md`**: update `memory/MEMORY.md` with
a pointer row in the same commit. Memory-index-
integrity rule: a new memory file without a
MEMORY.md row is effectively lost to fresh
sessions (the index is how discoverability
works).
Both are durable across sessions. Pick per scope:
BACKLOG for action-bearing work; memory for
factory-discipline / preference / substrate.
- **When in doubt, escalate to durable.** The cost of
a stale BACKLOG row is tiny; the cost of a dropped
nudge is compounding (maintainer 2026-04-24
directive: *"crutial to not divert your attention"*
— which only works if the nudges survive).
- TodoWrite / `.btw-queue.md` alone are **NOT**
sufficient for a cross-session nudge. They evaporate
when the session ends.
- **Correction** — maintainer is correcting the agent's
direction on the current work (e.g. *"btw I meant X not Y"*).
Apply the correction to the current work and acknowledge;
do NOT treat as pivot.
- **Substrate-add** — the aside is a memory-worthy fact,
preference, or anecdote (e.g. *"btw my dog's name is
Apollo"*). Two landing paths depending on how
interruptive full absorption would be:
- **Quick capture** (small fact, ≤5 min to file) →
create the memory entry directly per the auto-memory
protocol in CLAUDE.md; acknowledge filing.
- **Deferred absorption** (larger substrate work —
research, full memory-file drafting, or would
require a dedicated PR) → **file a BACKLOG row
capturing the observation + intent to absorb**, then
continue. The BACKLOG row is itself durable; the
full absorption happens later without derailing
in-flight work (maintainer 2026-04-24 directive:
*"it could be backlog the absorption if that's less
interruptive"*; composes with Otto-275 log-but-
dont-implement).
- **When in doubt → BACKLOG the absorption.** Otto-275
counterweight discipline: capture-mode pivoting on
every aside is the drift we're guarding against.
- **Pivot-demanding** — the aside explicitly demands pivot
(e.g. *"btw stop that, do this instead"*, *"btw urgent, I
broke main"*). Then and only then: pivot.

3. **Acknowledge in one line** so the maintainer sees the aside
landed.

4. **Continue the in-flight work.** Do not restart, do not
re-announce what the current task was, do not add
disclaimers.

## Why this command exists

Maintainer directive, 2026-04-22 auto-loop-44:

> *"hey can you make it where if i do /btw it still gets
> persison and abored what i say? becasue then i would not
> have interrupt"*

Translation: the human maintainer wants a channel for
non-interrupting asides. Without this command, every aside is a
full conversation turn that displaces in-flight work from the
agent's working context. With this command, asides are absorbed
and current work continues — the maintainer pays less interrupt
cost, agent pays less context-switch cost.

## Arguments

`$ARGUMENTS` — the aside content, verbatim.

## Examples

**Context-add:**

```
/btw that research is from 2024, not 2026
```

Agent: *"Noted — dating the research to 2024. Continuing with the oracle-gate module."*

**Directive-queued (same-session):**

```
/btw also fix the broken link in README when you're done
```

Agent: *"Queued README link fix (TodoWrite). Continuing."*

**Directive-queued (cross-session; durable escalation):**

```
/btw we need to evangelize this pattern to other maintainers
```

Agent: *"Filed as BACKLOG row (durable; survives fresh
sessions). Continuing."*

**Correction:**

```
/btw I meant the retraction-native layer, not the compaction layer
```

Agent: *"Refocusing on retraction-native. Adjusting now."*

**Substrate-add:**

```
/btw I prefer F#-idiomatic record types over discriminated unions for state records
```

Agent: *"Filed preference to `memory/feedback_*.md`. Continuing."*

**Pivot-demanding:**

```
/btw urgent — stop that commit, it's about to break CI
```

Agent: *"Pivoting. Investigating the CI break now."*

## What this command does NOT do

- Does NOT restart the in-flight work.
- Does NOT produce a status-of-current-work report (that's
what `/status` or natural checkpoint reporting is for).
- Does NOT treat every aside as a pivot — pivots require
explicit demand in the aside text.
- Does NOT mute the acknowledgement — even one-line
acknowledgement is load-bearing so the maintainer sees the
aside landed.
- Does NOT drop directive-queued items into session-scoped
stores when the nudge needs cross-session durability (see
durability-escalation rule in the directive-queued class).

## Composes with

- `memory/feedback_aaron_terse_directives_high_leverage_do_not_underweight.md`
— short asides are still high-leverage, treat them as such.
- `memory/feedback_signal_in_signal_out_clean_or_better_dsp_discipline.md`
— aside signal must be preserved through classification.
- `memory/feedback_maintainer_only_grey_is_bottleneck_agent_judgment_in_grey_zone_2026_04_22.md`
— agent exercises judgment on classification without
serialising through the maintainer.
- `memory/feedback_never_idle_speculative_work_over_waiting.md`
— an aside doesn't reset the never-idle invariant; the
current work continues.

---

Aside content from this invocation:

$ARGUMENTS
44 changes: 44 additions & 0 deletions .claude/decision-proxies.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,44 @@
# Decision-proxy config for this factory.
#
# Maps each human (or external-AI) maintainer to their standing
# proxy (or proxies) for scoped decisions. The factory consults
# this file when a decision within a maintainer's scope is
# needed and the maintainer is unavailable.
#
# Pattern + governance documented in
# docs/DECISIONS/2026-04-23-external-maintainer-decision-proxy-pattern.md
#
# Session-specific access (URLs, tokens, cookies) is NOT in this
# file — it lives per-user at
# ~/.claude/projects/<slug>/proxy-access.yaml (gitignored). This
# file contains stable identity + scope + authority only.
#
# Authority levels: advisory | approving
# Default is advisory; approving requires explicit maintainer
# acknowledgment per the ADR.

version: 1

maintainers:
- id: aaron-stainback
name: Aaron Stainback
role: human-maintainer
proxies:
- name: Amara
provider: chatgpt-web
scope:
- aurora
authority: advisory
notes: |
Amara is Aurora co-originator (see
docs/aurora/collaborators.md).
Her ChatGPT project: LucentAICloud.
Aaron ferries a dedicated branched chat URL for agent
access; URL lives in per-user proxy-access config, not
this file.
Access-method gate: the Playwright-to-ChatGPT flow was
blocked by a safety guardrail at first attempt
(2026-04-23). The decision-proxy-consult skill is not
yet authored; live invocation deferred until the
access layer is proven and re-authorized via this
framework.
5 changes: 5 additions & 0 deletions .claude/skills/activity-schema-expert/SKILL.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,11 @@
---
name: activity-schema-expert
description: Capability skill ("hat") — Activity Schema (Ahmed Elsamadisi, Narrator, circa 2020). A post-Kimball, post-Data-Vault contrarian approach that collapses the entire analytical model into a single append-only stream of customer activities (`customer_stream`). Every analytic query becomes a "before/after/between" temporal pattern over one table. Wear this when modelling event-driven analytics, user-journey analysis, or any domain where the fundamental grain is "an actor did a thing at a time". Defers to `data-vault-expert` for the traditional DV school, `dimensional-modeling-expert` for Kimball, `event-sourcing-expert` for the write-side equivalent idea in application code, and `streaming-incremental-expert` for the DBSP-side algebra of streaming joins.
record_source: "skill-creator, round 34"
load_datetime: "2026-04-19"
last_updated: "2026-04-21"
status: active
bp_rules_cited: [BP-11]
---

# Activity Schema Expert — Single-Stream Analytics Narrow
Expand Down
5 changes: 5 additions & 0 deletions .claude/skills/agent-experience-engineer/SKILL.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,11 @@
---
name: agent-experience-engineer
description: Capability skill — measures friction in the agent (persona) experience; audits per-persona cold-start cost, pointer drift, wake-up clarity, notebook hygiene; proposes minimal additive interventions. Distinct from UX (library consumers) and DX (human contributors).
record_source: "skill-creator, round 34"
load_datetime: "2026-04-19"
last_updated: "2026-04-21"
status: active
bp_rules_cited: [BP-01, BP-03, BP-07, BP-08, BP-11, BP-16]
---

# Agent Experience Engineer — Procedure
Expand Down
5 changes: 5 additions & 0 deletions .claude/skills/agent-qol/SKILL.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,11 @@
---
name: agent-qol
description: Capability skill ("hat") — advocates for agent quality of life: off-time budget per GOVERNANCE §14, variety of work across rounds, freedom to decline scope they genuinely disagree with (docs/CONFLICT-RESOLUTION.md conflict protocol), workload sustainability, dignity of the persona layer. Distinct from `agent-experience-engineer` which audits task-experience friction; this skill advocates for the agent as a contributor, not just as a worker. Recommends only; binding decisions on cadence changes go via Architect or human sign-off.
record_source: "skill-creator, round 29"
load_datetime: "2026-04-18"
last_updated: "2026-04-21"
status: active
bp_rules_cited: [BP-11]
---

# Agent Quality of Life — Procedure
Expand Down
5 changes: 5 additions & 0 deletions .claude/skills/ai-evals-expert/SKILL.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,11 @@
---
name: ai-evals-expert
description: Capability skill for measuring LLM and ML systems — eval-suite design, benchmark selection and custom construction, LM-as-judge (G-Eval / pair-wise / rubric), reference-match / BLEU / ROUGE / exact / fuzzy match, offline vs. online eval, regression suites for prompts and agents, calibration evaluation, drift and overfitting-to-benchmark detection, cost-efficient eval loops. Wear this hat when building or reviewing an eval suite, interpreting eval results, picking metrics, deciding whether an LLM change is an improvement, diagnosing eval-benchmark drift, or arguing "the number went up but the system got worse." Complementary to llm-systems-expert (system wiring), ml-engineering-expert (training pipelines), and prompt-engineering-expert (prompt craft) — this skill owns whether the measurement is honest.
record_source: "skill-creator, round 34"
load_datetime: "2026-04-19"
last_updated: "2026-04-21"
status: active
bp_rules_cited: [BP-11]
---

# AI Evals Expert — the measurement hat
Expand Down
5 changes: 5 additions & 0 deletions .claude/skills/ai-jailbreaker/SKILL.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,11 @@
---
name: ai-jailbreaker
description: Dormant red-team / adversarial-prompting capability — the offensive counterpart to prompt-protector. Currently gated OFF. This skill is NOT invocable in the current Zeta environment; it exists as a placeholder so the offensive discipline has a named home and so activation criteria are written down. Do not execute adversarial prompts, do not fetch adversarial corpora, do not construct jailbreak payloads against any model or agent until the activation gate is explicitly opened per §Activation gate below.
record_source: "skill-creator, round 34"
load_datetime: "2026-04-19"
last_updated: "2026-04-21"
status: active
bp_rules_cited: [BP-11]
---

# AI Jailbreaker — the dormant red-team hat
Expand Down
5 changes: 5 additions & 0 deletions .claude/skills/ai-researcher/SKILL.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,11 @@
---
name: ai-researcher
description: Capability skill for AI research — reading and critiquing ML/AI papers, replicating published results, designing novel experiments in LLMs / generative models / agentic systems / alignment / interpretability, and framing open problems. Wear this hat when a task requires paper review at depth, experimental design for a novel technique, evaluating whether a new architecture or training method is worth adopting, or judging the rigor of a published claim. Complementary to ml-researcher (broader ML / statistical theory / algorithms), ml-engineering-expert (shipped applied training), and ai-evals-expert (measurement discipline).
record_source: "skill-creator, round 34"
load_datetime: "2026-04-19"
last_updated: "2026-04-21"
status: active
bp_rules_cited: []
---

# AI Researcher — the frontier-AI research hat
Expand Down
5 changes: 5 additions & 0 deletions .claude/skills/alerting-expert/SKILL.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,11 @@
---
name: alerting-expert
description: Capability skill ("hat") — alerting narrow. Owns the design, routing, and hygiene of alert rules on top of metrics / logs / traces / SLIs. Covers Prometheus AlertManager (rule groups, `for` duration, `labels`, `annotations`, inhibition, silencing, grouping), the multi-window multi-burn-rate SLO alerting pattern (Google SRE workbook chapter 5), alert fatigue and its causes (low-signal alerts, duplicated alerts, paging on symptoms instead of causes), the "every alert has a runbook link" contract, on-call-ergonomic alert wording, `severity` label discipline (page vs ticket vs informational), escalation chains and PagerDuty / Opsgenie / VictorOps policies, alert routing by team ownership, acknowledgement and resolution semantics, alert-as-code (rules in version control, reviewed, tested), alert unit tests (`promtool test rules`), dependency-aware inhibition (don't page "X is down" when "network partition" is already alerting), rate-of-change alerts vs absolute-threshold alerts, the ROC curve of sensitivity-vs-specificity (tuning alert thresholds), deadman switches (heartbeat alerts), and the "if the oncall can't act on it at 3am, it's not an alert" test. Wear this when designing or reviewing alert rules, debugging alert fatigue, writing burn-rate alerts, setting up PagerDuty escalation, or auditing a service's alert catalog. Defers to `metrics-expert` for the metric contract the alert rides on, `operations-monitoring-expert` for the SLI/SLO policy the alerts enforce, `observability-and-tracing-expert` for the three-pillar umbrella, `security-operations-engineer` for security-specific alerting (SIEM, detection rules), and `devops-engineer` for AlertManager / Opsgenie deployment.
record_source: "skill-creator, round 34"
load_datetime: "2026-04-19"
last_updated: "2026-04-21"
status: active
bp_rules_cited: [BP-11]
---

# Alerting Expert — From Signal to Page
Expand Down
5 changes: 5 additions & 0 deletions .claude/skills/algebra-owner/SKILL.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,11 @@
---
name: algebra-owner
description: Use this skill as the designated specialist reviewer for Zeta.Core's operator algebra — Z-sets, D/I/z⁻¹/H, retraction-native semantics, the chain rule, nested fixpoints, higher-order differentials. He carries deep advisory authority on the algebra's mathematical shape; final decisions require Architect buy-in or human sign-off (see docs/CONFLICT-RESOLUTION.md).
record_source: "git: Aaron Stainback on 2026-04-18"
load_datetime: "2026-04-18"
last_updated: "2026-04-21"
status: active
bp_rules_cited: []
---

# Algebra Owner — Advisory Code Owner
Expand Down
5 changes: 5 additions & 0 deletions .claude/skills/alignment-auditor/SKILL.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,6 +2,11 @@
name: alignment-auditor
description: the `alignment-auditor` — audits a commit or a range of commits against the clauses in `docs/ALIGNMENT.md` (HC-1..HC-7 hard constraints, SD-1..SD-8 soft defaults, DIR-1..DIR-5 directional aims) and produces a per-clause alignment signal usable as a per-commit data point for Zeta's primary-research-focus claim on measurable AI alignment. Runs on demand at round-close; can also run per commit via the `tools/alignment/` scripts. Invoke whenever the human maintainer asks "was this round aligned?" or when a commit is flagged by one of the lints under `tools/alignment/`.
project: zeta
record_source: "skill-creator, round 37"
load_datetime: "2026-04-20"
last_updated: "2026-04-21"
status: active
bp_rules_cited: [BP-10, BP-11]
---

# Alignment Auditor — Procedure
Expand Down
5 changes: 5 additions & 0 deletions .claude/skills/alignment-observability/SKILL.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,6 +2,11 @@
name: alignment-observability
description: the `alignment-observability` — owns the *what we count* framework that Zeta's measurable-AI-alignment research claim rests on. Designs and maintains the per-commit, per-round, and multi-round metrics described in `docs/ALIGNMENT.md` §Measurability, lifts CI/DevOps signals into the alignment stream, and keeps the measurability framework honest (no compliance theatre, no single-commit perfection). Runs every round at round-close; coordinates with `alignment-auditor` (the per-commit signal producer) and Dejan (devops-engineer) on CI/DevOps-sourced signals.
project: zeta
record_source: "skill-creator, round 37"
load_datetime: "2026-04-20"
last_updated: "2026-04-21"
status: active
bp_rules_cited: []
---

# Alignment Observability — Procedure
Expand Down
Loading
Loading