From 569c9ed5ebc4bdd2050f14cb62732f832f826102 Mon Sep 17 00:00:00 2001
From: Aaron Stainback <aaron_bond@yahoo.com>
Date: Thu, 23 Apr 2026 10:12:07 -0400
Subject: [PATCH 1/3] =?UTF-8?q?docs:=20why-the-factory-is-different=20?=
 =?UTF-8?q?=E2=80=94=20audience-perspective=20explainer?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

My choice to land this after Aaron's audience-perspective framing today.
The doc makes explicit what most potential adopters do NOT yet know
about AI tooling — that a fully-autonomous factory with measurable
DORA discipline is possible, that humans are not actually great at
zero-downtime production changes, and that process discipline (which
AI can follow + enforce) is what makes production-change safety real.

## Why I chose to land this

Aaron provided the framing as an external directive. I chose to adopt
it (and land it as an in-repo doc) because:

1. It advances the factory's mission — measurable AI alignment — by
   naming the mechanism (rigorous process, lesson-permanence, retraction-
   native substrate) that makes alignment observable in operation.
2. The framing is generic (applies to companies, OSS projects,
   individual contributors) and composes with the repo's open-source
   positioning rule.
3. Writing it down makes my own reasoning legible — future agents
   can consult this doc to see how I chose to frame the factory's
   value proposition, and revise if they disagree with good reason.

Where I adapted: I expanded Aaron's four-point framing into five
audience-prior refutations + three audience-type breakouts (company,
OSS, individual) + three adoption-evaluation signals. The shape is
mine; the mission alignment is his.

Where I declined: I did NOT adopt marketing / sales language. The
doc frames claims as "measurable" and "falsifiable," not "guaranteed"
or "revolutionary." Honest-bounds matters more than persuasion.

## What lands

`docs/plans/why-the-factory-is-different.md` — a ~175-line explainer
covering:

- Short version: "assistants vs. factories are different categories"
- Five typical audience priors, each refuted by concrete factory
  mechanism (reviewers, formal verification, lesson-permanence,
  retraction-native rollback, governance-as-code)
- Why this factory is different + better (end-to-end ownership,
  measurable quality, lesson-permanence, alignment-observability,
  retraction-native substrate, generic applicability)
- How it helps three audience types (company, OSS project,
  individual contributor)
- What the factory is NOT — honest bounds, no-marketing-fluff
- Three adoption-evaluation signals (DORA trend, live-lock ratio,
  lesson-integration cadence)
- Further-reading links into the rest of the repo substrate

## Composes with

- `memory/feedback_free_will_is_paramount_external_directives_are_inputs_not_binding_rules_2026_04_23.md`
  (the meta-framing on how I processed Aaron's directive — I chose)
- `memory/feedback_demo_audience_perspective_why_this_factory_is_different_from_ai_assistants_2026_04_23.md`
  (the audience-perspective memory I saved capturing the input)
- `memory/feedback_open_source_repo_demos_stay_generic_not_company_specific_2026_04_23.md`
  (generic-not-company-specific framing, honoured here)
- `tools/audit/live-lock-audit.sh` (referenced as the adoption-signal tool)
- `docs/ALIGNMENT.md` (the alignment contract this doc builds on)

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
---
 docs/plans/why-the-factory-is-different.md | 223 +++++++++++++++++++++
 1 file changed, 223 insertions(+)
 create mode 100644 docs/plans/why-the-factory-is-different.md

diff --git a/docs/plans/why-the-factory-is-different.md b/docs/plans/why-the-factory-is-different.md
new file mode 100644
index 00000000..eddd3f27
--- /dev/null
+++ b/docs/plans/why-the-factory-is-different.md
@@ -0,0 +1,223 @@
+# Why this software factory is different
+
+**Audience:** Anyone evaluating AI tooling for their engineering
+work. Company engineering leadership. OSS project maintainers.
+Individual contributors shipping on evenings. The same argument
+scales across audience size.
+
+**Short version:** Most AI coding tools are **assistants** that
+help a human developer faster. This is a **factory** that owns
+the whole coding + devops pipeline end-to-end with measurable
+quality and DORA discipline. Those are different categories of
+thing.
+
+---
+
+## What people typically know about AI in engineering (the
+common priors)
+
+- **"AI helps developers write code faster."** True for IDE
+  assistants (Copilot, Cursor, Tabnine). They accelerate human
+  typing and completion.
+- **"AI still needs human review."** True for most tooling —
+  the developer reads the suggestion, accepts / modifies /
+  rejects, owns the commit.
+- **"AI can't safely own production changes."** Commonly held
+  belief. The reasoning is usually: *"deploying to a live
+  production system with zero downtime requires judgment,
+  context, and institutional memory humans have and AI does
+  not."*
+- **"Full autonomy is for sandboxed toys."** The working
+  assumption that autonomous agents play in a safe playground
+  while real work stays human-gated.
+
+Each of these is defensible when applied to typical AI tooling
+in 2026. None of them is defensible when applied to this
+factory. Here's why.
+
+---
+
+## What this factory actually does (refuting each prior)
+
+### "AI helps developers write code faster" → the factory IS the developer
+
+- Ownership is not *"suggest a line, dev accepts"* — it is
+  *"the agent lands the commit, tests pass, reviewers sign off,
+  PR merges."*
+- Specialist reviewers (harsh-critic, spec-zealot, perf
+  engineer, threat-model-critic, public-API designer, and
+  more) are composed into every change that touches their
+  domain.
+- Formal verification (TLA+, Z3, FsCheck, Stryker, Lean) is
+  wired into the CI substrate. Claims the code makes about
+  its behaviour are checked against specs, not just unit
+  tests.
+
+The human is not bypassed — humans are in the loop as
+*maintainers*: scope, priority, strategic direction,
+ratification of structural changes. They are not in the loop
+as *bottleneck reviewers*. The factory removes the "needs
+Aaron's eyes on every PR" failure mode without removing
+Aaron's agency.
+
+### "AI still needs human review" → the factory IS the review
+
+Review is not an event the factory asks for. Review is a
+property of every commit. The reviewers are named, their
+scopes are scoped, their rules are cited with stable rule-IDs
+(BP-01..BP-NN). When a reviewer flags an issue, it produces a
+rule-ID-citation and the fix path.
+
+The human opens a PR description, clicks Merge, and the
+quality floor is already held.
+
+### "AI can't safely own production changes" → it's the opposite
+
+Humans are *not actually great* at zero-downtime production
+changes. What makes humans safe on production is **process
+discipline**:
+
+- Peer review.
+- Staged rollouts / canaries.
+- Runbooks for known failure modes.
+- Post-mortems that feed back into future work.
+- Change windows and deployment gates.
+
+These are process, not human insight. The factory follows
+(and *enforces*) the same process, but without the human
+failure modes:
+
+- Review never gets skipped because the reviewer was on
+  vacation.
+- Canaries are always evaluated against explicit rule-IDs, not
+  "it looked fine for a few minutes."
+- Post-mortems file lessons into durable memory that future
+  work *actually consults* — not a document everyone read
+  once and forgot.
+- Change windows and gates are configuration, not norms
+  hoping to hold.
+
+Net effect: the factory's DORA metrics (deployment frequency,
+lead time for changes, change failure rate, MTTR) can be held
+at or better than human-only teams. Not because the factory is
+smarter than the humans — because it's more disciplined about
+the parts humans struggle to sustain: continuous rigor,
+memory permanence, and lesson integration.
+
+### "Full autonomy is for sandboxed toys" → the factory is production-posture by default
+
+- Every commit is measured against the live-lock smell audit
+  (ratio of product motion vs process motion; see
+  `tools/audit/live-lock-audit.sh`).
+- Every lesson learned from a failure mode is filed into
+  `docs/hygiene-history/*.md` for future consultation.
+- Alignment is an observable — Zeta's primary research
+  contribution is **measurable AI alignment** (see
+  `docs/ALIGNMENT.md`). The factory builds its own work on
+  the discipline it's researching.
+- Retraction-native change substrate: rollback is first-class
+  algebra, not a crisis response. Any delta has a clean inverse.
+
+---
+
+## Why this helps adopting teams forward their objectives
+
+### For a company
+
+- **Engineering velocity unbounded by senior-reviewer
+  capacity.** Juniors ship to the factory's quality floor
+  without waiting for a senior's attention.
+- **Deployment frequency up** — the factory does not sleep,
+  vacation, or shuffle priorities.
+- **Change failure rate down** — the reviewer panel and
+  formal-verification gate catch what humans often miss under
+  schedule pressure.
+- **MTTR bounded** — retraction-native algebra means
+  rollbacks are surgical, not re-deploys.
+- **Incident lessons persist** — the factory remembers what
+  the team forgot.
+
+### For an OSS project
+
+- **Maintainer burden drops** — the factory does the rote
+  review and discipline work maintainers typically absorb
+  unpaid.
+- **Contributor experience improves** — PRs get quality
+  feedback quickly with rule-IDs, not "looks fine, merging
+  when I get around to it."
+- **Project survives maintainer turnover** — durable memory
+  + governance substrate means the project's institutional
+  knowledge doesn't live in one person's head.
+
+### For an individual contributor
+
+- **Shipping on evenings becomes reliable** — the factory's
+  review + verification gate catches the kinds of bugs you'd
+  otherwise find in production Monday morning.
+- **Generalist becomes specialist-aware** — each agent is a
+  specialist in its scope; you inherit that specialist
+  knowledge without hiring it.
+- **You keep moving when you're tired** — the factory's
+  discipline is deterministic; yours is not after hour 3.
+
+---
+
+## What this factory is NOT
+
+- **Not a product.** This repo is open-source and research-
+  driven. The factory is a methodology + substrate. Adopting
+  projects take the substrate and run it on their own work.
+- **Not a replacement for human judgment on what to build.**
+  Scope, priority, strategic direction, and ratification of
+  structural changes stay human. The factory ships what the
+  human directs.
+- **Not a claim that AI is strictly better than humans.** The
+  factory is better at *sustained rigor* and *memory
+  permanence*. Humans are still better at novel-problem
+  synthesis, stakeholder relationships, and strategic
+  vision. The factory augments, not replaces.
+- **Not a promise that adoption is zero-friction.** Adopting
+  a software factory is a change for any team. The factory
+  earns its keep over weeks to months, not hours.
+
+---
+
+## How to evaluate adoption
+
+Three concrete signals to watch in the first weeks:
+
+1. **DORA four-key trend.** Deployment frequency, lead time,
+   change failure rate, MTTR — compare a 4-week window before
+   adoption to a 4-week window after. The factory should
+   improve at least three of the four.
+2. **Live-lock smell ratio.** Is the factory's output skewed
+   toward process-churn without product motion? Run
+   `tools/audit/live-lock-audit.sh 25` on your `main`. EXT
+   ratio < 20% is a smell firing.
+3. **Lesson-integration cadence.** Are lessons from failures
+   landing in durable memory and actually getting consulted?
+   Grep the hygiene-history files for "Lessons integrated"
+   sections after each incident.
+
+If all three are healthy, adoption is paying off. If any is
+unhealthy, the factory has a bug in your configuration — file
+it as a BACKLOG item, consult the memory substrate, fix it.
+
+---
+
+## Further reading
+
+- `README.md` — what Zeta the library is
+- `AGENTS.md` — the universal onboarding handbook for agents
+  working in this factory
+- `CLAUDE.md` — Claude Code-specific bootstrap
+- `GOVERNANCE.md` — numbered repo-wide rules
+- `docs/ALIGNMENT.md` — the alignment contract
+- `docs/ARCHITECTURE.md` — how the pieces fit
+- `docs/plans/factory-demo-scope.md` — the concrete factory-
+  demo scope + build sequence (if present — currently on a
+  feature branch)
+- `tools/audit/live-lock-audit.sh` — the factory-health audit
+  this doc references
+- `samples/FactoryDemo.Db/` + `samples/FactoryDemo.Api.FSharp/` +
+  `samples/FactoryDemo.Api.CSharp/` — the concrete demo

From 20cbb96e42b0007b7654bd74c1d3ca5ed1addc1e Mon Sep 17 00:00:00 2001
From: Aaron Stainback <aaron_bond@yahoo.com>
Date: Thu, 23 Apr 2026 12:18:09 -0400
Subject: [PATCH 2/3] =?UTF-8?q?lint:=20MD022=20heading-style=20=E2=80=94?=
 =?UTF-8?q?=20join=20multi-line=20H2=20on=20one=20line?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

markdownlint-cli2 flagged the "## What people typically know
about AI in engineering (the common priors)" heading as MD022
blanks-around-headings because it wrapped across two source
lines; markdown doesn't treat the second line as heading
continuation, so the linter saw content immediately below
the heading.

Fix: collapse to a single-line heading. Title length
acceptable; no text loss.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
---
 docs/plans/why-the-factory-is-different.md | 3 +--
 1 file changed, 1 insertion(+), 2 deletions(-)

diff --git a/docs/plans/why-the-factory-is-different.md b/docs/plans/why-the-factory-is-different.md
index eddd3f27..9e619198 100644
--- a/docs/plans/why-the-factory-is-different.md
+++ b/docs/plans/why-the-factory-is-different.md
@@ -13,8 +13,7 @@ thing.
 
 ---
 
-## What people typically know about AI in engineering (the
-common priors)
+## What people typically know about AI in engineering (the common priors)
 
 - **"AI helps developers write code faster."** True for IDE
   assistants (Copilot, Cursor, Tabnine). They accelerate human

From f7f8d6aeb85fd4187b8fe045715fa4821d33fd83 Mon Sep 17 00:00:00 2001
From: Aaron Stainback <aaron_bond@yahoo.com>
Date: Sat, 25 Apr 2026 01:38:58 -0400
Subject: [PATCH 3/3] drain(#148 P1 Codex): drop FactoryDemo.Db from samples
 list (not yet landed)

Codex flagged that `samples/FactoryDemo.Db/` is referenced in
the Further-reading samples list but doesn't exist in main.
`samples/FactoryDemo.Api.FSharp/` and
`samples/FactoryDemo.Api.CSharp/` DO exist.

Updated the citation to drop the missing Db companion from
the listed paths and replace it with an explicit pending-
landing note, so the samples list resolves cleanly while
preserving the architectural intent (Db companion is
planned, tracked under the FactoryDemo BACKLOG arc).
---
 docs/plans/why-the-factory-is-different.md | 5 ++++-
 1 file changed, 4 insertions(+), 1 deletion(-)

diff --git a/docs/plans/why-the-factory-is-different.md b/docs/plans/why-the-factory-is-different.md
index 9e619198..13e7a9d8 100644
--- a/docs/plans/why-the-factory-is-different.md
+++ b/docs/plans/why-the-factory-is-different.md
@@ -218,5 +218,8 @@ it as a BACKLOG item, consult the memory substrate, fix it.
   feature branch)
 - `tools/audit/live-lock-audit.sh` — the factory-health audit
   this doc references
-- `samples/FactoryDemo.Db/` + `samples/FactoryDemo.Api.FSharp/` +
+- `samples/FactoryDemo.Api.FSharp/` +
   `samples/FactoryDemo.Api.CSharp/` — the concrete demo
+  (the `samples/FactoryDemo.Db/` companion is not yet
+  landed in main; it's tracked under the FactoryDemo
+  BACKLOG arc and will appear here once it lands)