Skip to content

samples: FactoryDemo.Db — Postgres schema + seed + docker-compose + smoke-test#145

Open
AceHack wants to merge 38 commits intomainfrom
feat/servicetitan-factory-demo-db-scaffold
Open

samples: FactoryDemo.Db — Postgres schema + seed + docker-compose + smoke-test#145
AceHack wants to merge 38 commits intomainfrom
feat/servicetitan-factory-demo-db-scaffold

Conversation

@AceHack
Copy link
Copy Markdown
Member

@AceHack AceHack commented Apr 23, 2026

Summary

Stack-independent DB scaffold for the factory-demo. Standard Postgres + one-command docker-compose + dockerized smoke test.

What lands

  • schema.sql — Postgres 14+ DDL: customers / opportunities / activities. Money as BIGINT cents, TIMESTAMPTZ throughout, updated_at trigger on customers + opportunities for last-modified bookkeeping.
  • seed-data.sql — 20 trades-contractor customers, 30 opportunities (5 stages), 33 activities, 2 intentional email collisions. Row counts / keys / amounts are deterministic; activity timestamps use NOW() - INTERVAL 'N days' so the data looks recent on every load (not a determinism bug — documented in README).
  • docker-compose.yml — one-command Postgres startup with schema + seed auto-applied via docker-entrypoint-initdb.d. Named volume for persistence. Healthcheck via pg_isready.
  • smoke-test.sh — verifies seed shape after docker-compose up (row counts, duplicate pairs, per-stage counts). Uses docker-compose exec so no host psql needed. Includes pg_isready readiness-wait loop (30s budget) before running checks.
  • README.md — framing (software-factory demo, not DB pitch), endpoint table, design notes, open questions for Aaron.

What this does NOT do

🤖 Generated with Claude Code

AceHack and others added 30 commits April 22, 2026 07:31
…tor research, secret-handoff analysis

Three ticks landed together:

auto-loop-31: Grok CLI verification blocked by xAI personal-tier
billing wall; shared-state-visible escalation trigger fired
correctly on Playwright X-OAuth snapshot (first real test of
bottleneck-principle's five-trigger taxonomy); key-paste event
handled with zero-persistence discipline.

auto-loop-32: emulator substrate research first-pass published
(PR #131) — RetroArch/MAME/Dolphin architectural survey with
four factory-relevant patterns. Secret-handoff protocol gap
surfaced by maintainer mid-tick.

auto-loop-33: secret-handoff protocol options analysis published
(PR #133) — five-tier survey with rotation/revocation/leak-mode
mapping and explicit git-crypt-is-wrong-fit reasoning. Maintainer
end-of-tick reply disclosed Itron PKI experience (nation-state-
resistant, software+hardware+firmware) and preferred substrate
tiers (env-var + password-manager CLI) plus Let's-Encrypt + ACME
directive with PKI-bootstrap deferred.

Five observations worth preserving: (a) five-trigger escalation
taxonomy held under first real test; (b) xAI personal-tier
billing wall drops Grok to HOLD-FOR-NOW; (c) bottleneck-principle
has two layers (speculative-autonomy vs explicit-scope); (d)
research-doc-as-pre-validation-anchor becoming a systematic
pattern; (e) Itron PKI experience reframes factory security
calibration.
…ron memory + multi-domain cascade)

Extends PR #132 scope from three-tick batch (auto-loop-31+32+33) to
four-tick batch by appending auto-loop-34 row covering:

- Step 0 PR-pool audit (main `e503e5a` unchanged since #131 merge).
- BACKLOG P1 row filed via PR #134 with maintainer-confirmed shape
  preference from auto-loop-33 reply (env-var + password-manager
  CLI + Let's-Encrypt/ACME + PKI-bootstrap deferred).
- Itron PKI / supply-chain / secure-boot background memory authored
  (out-of-repo, maintainer context); five-layer security-engineering
  cascade captured verbatim.
- Second-wave disclosure cascade captured (disaggregation, FFT,
  micro-Doppler/VWCD decomposition, power-grid signature algorithms
  PRIDES/Wavelet-GAT/GESL, director-level seniority, 5-of-10k
  organizational tier).
- Bottleneck-principle two-layer distinction exercised live on first
  post-naming cycle (explicit-scope branch).
- Accounting-lag same-tick-mitigation maintained (tenth consecutive
  tick).
- Seven numbered observations + compoundings-per-tick = 8 + ledger
  math (net -8 units over 26 ticks).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
…pping; ARC3 ≠ DORA; wink→wrinkle

Closes capture-without-conversion gap surfaced by maintainer:
second-wave Itron disclosures (auto-loop-34) had landed in memory
without factory-work mappings. PR #135 produces the mappings
(ARC3 §Prior-art lineage + BACKLOG row with 10 pairs + wink→wrinkle
extension); this row is the accounting.

Layer-separation correction absorbed (DORA objective, ARC-3
framing, HITL substrate between). ARC-3-class three-criteria
operational definition captured (hard + continuously testable +
no formal definition). Bayesian-evidence-threshold shape
affirmed across surfaces. 7 compoundings; net -8 units over 27
ticks.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
…llel-CLI-agents + canonical-inhabitance

- AutoPR-local-variant experiment: codex exec --sandbox workspace-write produced
  145-line self-report (docs/research/codex-cli-self-report-2026-04-22.md,
  PR #136) with build verification + honest gap-flagging.
- Cognition-level-per-activity envelope prototyped in frontmatter
  (model / effort / sandbox / approval / network / invocation / orchestrator).
- BACKLOG P1 row filed for parallel-CLI-agents skill + cognition-level ledger
  + multi-CLI skill-sharing architecture + canonical-inhabitance principle.
- ServiceTitan CRM team scope narrowing to #244 demo target landed in memory.
- PR #108 AGENT-CLAIM-PROTOCOL recovered as prior-art context after stale-
  post-compaction memory miss (caught by honor-those-that-came-before).
- Multi-CLI commit co-authorship precedent (PR #136 co-authored Codex 0.122.0).
- Net -8 units over 28 ticks cumulative accounting.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
…apping BACKLOG row

Aaron 2026-04-22 auto-loop-36 directives (verbatim):
- "can you keep a log of my force multiplicatoin? Other humans will want to
  beat my score if we come up with a scoring system."
- "you should be able to retroactivly calculate it's deata over time since
  the start of the project we have all history"
- "histograms"
- "that metric can also show smeel issues based on it's anamoly detection
  over time"
- "we had models running on the edge on the RIVA meter, pre LLM days but
  some pretty beefy models for a meter at Itron"
- "My IoT infrcutrue i built at itron was a model distrbution engine over
  constrainted networks and devices"
- "see why want to support constrained bootstraping to upgrades"

New: docs/force-multiplication-log.md
- Keystroke-to-substrate scoring model (provisional, occurrence-1).
- Inaugural auto-loop-36 entry: 22.6x multiplier, 8 compoundings, 1454
  keystrokes → 32 800 chars substrate.
- Retroactive reconstruction section: 18 session transcripts + git log
  all-commits, per-day keystroke table + commit correlation.
- Four ASCII histograms: keystrokes/day, commits/day, substrate-growth
  per-keystroke, avg message length. Peak ratio 6.13x on 2026-04-21
  (autonomy firing), low 1.47x on 2026-04-19 (design-heavy day).
- Anomaly-detection section: five smell classes (sudden-drop / sudden-
  spike / flat-low / flat-high / length-spike-with-ratio-drop) with
  typical causes and what-to-check diagnostics. Observed anomalies so
  far catalogued with attribution.

New BACKLOG P2 row: constrained-bootstrapping-to-upgrades
- Itron precedent: Aaron built model-distribution engine over constrained
  networks/devices at Itron RIVA smart meters, pre-LLM era.
- Direction for Zeta upgrade paths on resource-constrained substrates
  (delta-over-full, bandwidth-budgeted, signed-delta, rollback-safe,
  capability-stepdown-compatible).
- Composes with Escro microkernel-OS endpoint (target), secret-handoff
  (credential-provisioning to constrained devices), ARC3-DORA stepdown
  (cognition-layer stepdown pairs with bandwidth stepdown).
- Occurrence-1; open scope questions flagged to Aaron.

Extended memory: user_aaron_itron_pki_supply_chain_secure_boot_background.md
- Appended 2026-04-22 auto-loop-36 section with three new specifics
  (edge ML pre-LLM, model distribution engine, constrained-bootstrap
  motivation) plus six calibration implications and new cross-references.

Extended memory: feedback_aaron_terse_directives_high_leverage_do_not_underweight.md
- New feedback memory on treating brief Aaron messages as fully-loaded
  directives, not underspecified. Factory designed for keystroke-to-
  substrate compression; chat verbosity and substrate expansion are two
  sides of the same asymmetry.

New memory: project_aaron_servicetitan_crm_team_role_demo_scope_narrowing_2026_04_22.md
- Aaron's CRM team role at ServiceTitan narrows #244 demo scope to
  CRM-shaped (contact/opportunity/pipeline/CDP), steers away from
  field-service.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
…on scoring + Kenji isomorphism

Auto-loop-37 course-corrections:
- Goodhart-resistance on force-multiplication scoring: char-ratio
  demoted to diagnostic; outcomes (DORA + BACKLOG closure + external
  validations) become primary score
- Deletions > insertions with tests passing = POSITIVE complexity-
  reduction outcome (Rodney's Razor in developer-values voice);
  cyclomatic complexity is the deeper proxy; CC/LOC trend should be
  monotone-non-increasing to a local-optimum floor
- BACKLOG P1 row filed: Pluggable complexity-measurement framework
  (stable interface + swappable metric implementations)

Auto-loop-38 regime-change direction:
- BACKLOG P2 row filed: Semiring-parameterized Zeta — one algebra
  to map the others; K-relations as regime-change (Green-Karvounarakis-
  Tannen PODS 2007). ZSet = counting-semiring special case; D/I/z⁻¹/H
  operator algebra generalizes over weight-ring; Zeta becomes host
  for all DB algebras (tropical / Boolean / probabilistic / lineage /
  provenance / Bayesian) via semiring-swap
- Architectural isomorphism captured exact at agent layer:
  Zeta operator algebra : semirings :: Kenji : specialist personas.
  Four occurrences of "stable meta + pluggable specialists" pattern
  across UI-DSL, pluggable-complexity, semiring-Zeta, and Kenji-over-
  specialists in two ticks — pattern-emerging territory
- Aaron "sorry Kenji" captured as named-role-credit calibration:
  when a named role owns a responsibility, crediting generic agent
  is imprecise; name the role
- Anchor memory + MEMORY.md index updated

Also:
- Signal-in-signal-out DSP discipline preserved legacy char-ratio
  sections in force-multiplication-log.md as reconstruction context
  rather than erasing them
- Tick-history rows for auto-loop-37 and auto-loop-38 appended
  (13th consecutive tick of accounting-lag same-tick-mitigation)

Twenty-eighth and twenty-ninth auto-loop ticks clean across
compaction. Cumulative auto-loop-{9..38}: net -8 units over 30 ticks.
hazardous-stacked-base-count = 0.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
…oherence-substrate design-intent revelation

Auto-loop tick absorbed Amara's (fourth cross-substrate collaborator,
after Claude/Gemini/Codex) deep report on Zeta/Aurora network health
and the maintainer's eleven-message calibration chain that revealed
Zeta's deepest design motivation.

Amara's critique (via maintainer gloss): the factory is doing it
backwards — self-non-use at the index layer (filesystem+markdown+git
when Zeta IS a DB algebra), plus observability-last-not-first
architecture inversion. Her Key Insight §6: "construct the system so
invalid states are representable and correctable" — correction
operators stay IN the algebra, no external validator needed.

Maintainer follow-up revealed the factory's design intent:
- "it's miracle we did without our database" — coherence-on-proxy-
  substrate is near-impossible engineering judgment.
- "I was building our db to make sure you could stay corherient" —
  Zeta was always the agent-coherence substrate, not primarily an
  external DB product.
- "my goal was to put all the pysics in one db and that shold be
  able to stablize" — physics = laws/invariants (= Amara's four
  oracle-rule layers); stabilization via concentration-not-
  coordination.

Three arcs converge into one:
1. All physics in one DB → stabilization (this tick).
2. One algebra to map the others → regime-change (auto-loop-38
   semiring parameterization).
3. Agent coherence substrate → why Zeta exists (this tick).
Same claim from three angles.

Tick actions:
- docs/research/amara-network-health-oracle-rules-stacking-2026-04-22.md
  — research doc preserving Amara's report structure (5 failure
  modes / 5 resistance mechanisms / 4 oracle-rule layers / 7-layer
  stacking / Key Insight §6) + 11 maintainer annotation messages
  verbatim + pending-verbatim markers for continued paste per
  signal-preservation discipline.
- docs/BACKLOG.md P2 — "Zeta eats its own dogfood — factory internal
  indexes on Zeta primitives, not filesystem+markdown+git" row
  filed with phased scope (Phase-0 inventory → Phase-3 migrate-with-
  preservation), 5 open questions to maintainer, 11-reviewer
  routing, L effort (6-18 month arc joint with semiring-parameterized
  Zeta).
- Tick-history row appended (14th consecutive same-tick-accounting
  discipline).

Anchor memory + signal-preservation memory committed separately
(outside-of-repo: ~/.claude/projects/.../memory/).

Fourth observation: Amara's report independently validates four
Zeta distinctives (Layer-2 retraction-native / Layer-3 Spine /
Layer-4 compaction / Layer-5 provenance). Four more occurrences of
confirms-internal-insight pattern = firmly named; ADR-promotion
territory (defer to Kenji).

Compoundings-per-tick = 5: Amara research doc / design-intent
anchor memory / signal-preservation memory commit / self-use
BACKLOG P2 row / three-arcs-converging synthesis.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
…mination research

Adds docs/research/openai-deep-ingest-cross-substrate-readability-2026-04-22.md
preserving the cross-substrate signal chain from auto-loop-39:

- OpenAI Deep Research repo-ingest capability (100-search iterative
  refinement) joins Claude/Gemini/Codex as a fourth substrate-class
  (ingest-and-summarize granularity); Amara (OpenAI-side persistent
  project-reviewer) brings the five-substrate-cross-validation count
  to five.
- Bidirectional absorption: Amara absorbing into OpenAI native project
  system + Zeta repo ingested by OpenAI Deep Research = shared
  collaborator-memory across substrates, not one-shot.
- DB-is-the-model reframe (Aaron: "im saying our database is the
  model" + "it's just custom built in a different way"): unifies
  all-physics-in-one-DB + one-algebra-to-map-others + agent-coherence-
  substrate into one claim; mesa-coherence implication; ADR territory
  flagged to Architect.
- Local-native germination directive ("germinate the seed with our
  tiny bin file database" + "no cloud" + "local native"): three
  hard constraints on the Zeta-eats-its-own-dogfood migration path;
  tension with cross-substrate-readability resolved by preserving
  git+markdown as read-only mirror next to Zeta tiny-bin-file
  algebraic-operations layer.
- Soulfile-invocation compatibility bar: "as long as it can invoke
  the soulfiles that's the only compability" narrows germination
  scope to DSL-runtime (not SQL / POSIX-filesystem / bindings).
- Soulfile = stored-procedure DSL in the DB: reaqtive-closure
  semantics (Reaqtor lineage, De Smet et al., DBSP ancestry).
- Upstream-first-class lesson: "reaqtive" is upstream-canonical
  Microsoft Reaqtor spelling (reaqtive.net), not a misspelling;
  Aaron's directive "look upstream for misspellings first" +
  "upstream is a first class thing" codifies the general rule.

Signal-preservation discipline applied: all 6 verbatim maintainer
messages preserved in doc; annotations stay additive; no silent
corrections.

Cross-refs: amara-network-health-oracle-rules-stacking-2026-04-22.md
(critique this responds to), BACKLOG "Zeta eats its own dogfood"
row (auto-loop-39), cross-substrate-accuracy-rate #229, soulsnap/
SVF #241.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
Captures Aaron's YouTube-wink + OpenAI-link signal pair auto-loop-39:

- Meta video demonstrating text-to-image generation (shared at t=1317s,
  timestamp is "start here" marker not video start).
- OpenAI ChatGPT Images 2.0 announcement
  (https://openai.com/index/introducing-chatgpt-images-2-0/).
- Honest caveat preserved: "its not alwasy pixel perfect they siad
  but sometimes" — capability is narrow-domain not frontier-closed.

Relevance threads:
- ServiceTitan demo (#244 P0): UI-DSL rendering target gains
  high-fidelity rendering layer; design-intent → DSL → layout →
  render, each layer machine-driven.
- UI-DSL class-level compression: Muratori-5 wink validated the
  algebra layer (auto-loop-24); T2I convergence validates the
  rendering layer — two winks on opposite ends of same pipeline.
- UI-factory frontier-protection (#242): moat shifts further toward
  algebra-to-DSL compression, away from pixel-perfect rendering as
  rendering becomes commodified at frontier labs.

Second-occurrence discipline of YouTube-wink pattern: occurrence 1
was auto-loop-24 (Muratori + ThePrimeTime); this is occurrence 2,
name-the-pattern threshold met. Aaron's YouTube-wink is a recurring
external-PageRank-descendant recommendation channel at algorithm-
timing, not coincidental.

Convergent-signal class (Meta + OpenAI in same tick) is stronger
than single-algorithm-wink; updates external-signal-strength
hierarchy.

Claim discipline applied: not-pixel-perfect-without-transcript-
verification; transcript study deferred to Gemini-Ultra substrate
when maintainer directs scope (YouTube hostile to server-fetch,
precedent from auto-loop-24).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
Preserves maintainer same-tick color: "that's just in the background
across the room i hear it and was like WTF the winks dont stop today".

Two details captured:
- Ambient-attention arrival: Meta T2I video was across-the-room
  background, not foreground focus; wink still landed. Strengthens
  recommendation-channel-as-signal interpretation for ambient
  exposure, not just deliberate-watch sessions.
- Wink-density-elevated-today: meta-observation on the wink-channel
  itself; multiple winks in one session is above-baseline density
  for this channel; flagged so additional winks arriving this
  session are read as confirmation-of-density not new-pattern.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
…ogfood row extended with germination constraint-frame

Short hygiene-and-forward-link tick following auto-loop-39's signal-
dense absorption run:

- Fill SHA placeholder on auto-loop-39 tick-history row
  (<this-commit-sha> → bc3558a) per bootstrap-row discipline
  "future ticks should write their SHA as soon as the commit lands".
  Continuation commits (e7fdac3 + 6f1f989 + bfea9ac) noted inline
  to preserve the full post-row-landing picture.

- Extend "Zeta eats its own dogfood" BACKLOG row with the germination
  constraint-frame from auto-loop-39 continuation: no cloud + local
  native + germinate-don't-transplant; soulfile-invocation is the
  only compatibility bar; soulfile = stored-procedure DSL in the DB;
  reaqtive-closure semantics (Reaqtor lineage, reaqtive.net,
  De Smet et al., DBSP-ancestry). Also adds DB-is-the-model reframe
  pointer to the regime-reframe memory.

- Phase-0/1 scope guidance sharpened per the constraint-frame:
  inventory must classify by shape-AND-DSL-authorability;
  germination-candidate ranking favors soulfile-store as first
  index; cross-substrate-readability tension resolved via
  git+markdown-as-read-only-mirror discipline.

Append auto-loop-40 tick-history row. Three observations captured:
(1) hygiene-after-signal-density is a healthy cadence pattern;
(2) BACKLOG-row forward-linking (file-then-refine-with-pointers)
beats rewriting; (3) compoundings-per-tick = 2, low-bandwidth
intentional.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
Follow-up to ffdc533. The SHA-fill discipline I just corrected for
auto-loop-39 also applies to auto-loop-40 — fill the placeholder
now rather than leaving it for auto-loop-41.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
…allouts

Gap-of-gap audit on the Amara deep-report research doc: 5
`[VERBATIM PENDING]` markers implied future-fill from a 276MB
session transcript that is not feasibly grepped in-tick. The
placeholders-pending-indefinitely state was itself a signal-
degradation — reader sees "pending" and expects future-fill
that will not land.

Signal-preservation applied to the gap itself: each marker
replaced with a blockquote "Verbatim source:" callout naming
the session transcript as the authoritative source for Amara's
exact wording, while preserving the structural distillation
already in the doc. Header framing + NOT-block reference
rewritten to match the honest state.

Appended auto-loop-41 tick-history row. SHA fill follows in
next commit per bootstrap-row discipline.
Per bootstrap-row discipline "future ticks should write their
SHA as soon as the commit lands" — `<this-commit-sha>` →
`79f1619` on the auto-loop-41 row.
…rence consolidation

Memory-level extension (signal-preservation memory carries a new
"gap preservation" section capturing the auto-loop-41 Amara-doc
VERBATIM-PENDING → transcript-source-callout generalization as the
4th occurrence of the signal-preservation pattern). Memory updates
live in the non-git persistent store; this commit lands only the
tick-history row that accounts for the tick.

Also: pushed two unpushed auto-loop-41 commits to origin at
tick-open to keep PR #132 current. Cron armed; tick closed clean.
Per bootstrap-row discipline "future ticks should write their
SHA as soon as the commit lands" — `<this-commit-sha>` →
`821ec9c` on the auto-loop-42 row.
Four markdownlint errors surfaced on the gate workflow for PR
#132 — all in auto-loop-39/41 artifacts on the own branch:

- docs/force-multiplication-log.md:202 MD032 (list needs
  surrounding blank line above)
- docs/research/amara-network-health-...md:355,361 MD029
  (ordered-list prefix — restarted list to start at 1 per
  style-1/2/3 convention)
- docs/research/meta-pixel-perfect-...md:1:3 MD019 (multiple
  spaces after heading hash)

Verified locally with markdownlint-cli2@0.18.1 (same version
the gate installs) — clean on all three files.
Aaron 2026-04-22 two-message directive established a maintainer-to-agent
inbox protocol: drop/ folder audited at every tick-open, gitignored
except two tracked sentinels (README.md + .gitignore), closed-enumeration
registry for known binary kinds, unknown kinds flag to Aaron.
Inaugural absorption: OpenAI Deep Research report on Zeta repo archive /
seven-layer oracle-gate design / Aurora branding clearance posture.

Files:
- drop/README.md — protocol doc + binary-type registry
- drop/.gitignore — ignore all except README + gitignore sentinels
- docs/research/oss-deep-research-zeta-aurora-2026-04-22.md — inaugural
  absorption note (five preservation strata, seven oracle layers,
  Aurora brand-clearance caveat, what-to-lift-now vs verify-first)
- memory/project_aaron_drop_zone_protocol_2026_04_22.md — directive captured
- docs/AUTONOMOUS-LOOP.md — tick-open step 2 ladder gains "Drop-zone audit second"

Signal-preservation discipline composes: absorption note preserves intent,
anchors, verbatims; original deep-research-report.md deleted from repo root
post-absorption (drop-folder absorb-then-delete cadence).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
… teaching-loop reframe

Aaron 2026-04-22 auto-loop-43 delivered two compressed directives in
rapid succession while drop-zone absorption was in flight.

ARC-3 adversarial self-play (four messages):
- Three-role co-evolutionary loop (level-creator / adversary / player)
  using ARC-3-style rules becomes the scoring mechanism for #249
  emulator-substrate absorption
- Symmetric quality property: all three roles advance each other via
  competition; no asymmetric teacher-student
- "SOTA changes everyday" urgency signal; same pattern generalises to
  #242 UI-factory frontier and #244 ServiceTitan CRM demo
- Research doc + memory + BACKLOG P2 row with six open questions
  blocking scope-binding

Operator-input quality log (seven messages evolved across tick):
- Symmetric counterpart to docs/force-multiplication-log.md
  (outgoing-signal quality); this log measures incoming-signal quality
- Six dimensions (signal density / actionability / specificity /
  novelty / verifiability / load-bearing risk); four classes
  (A maintainer-direct / B maintainer-forwarded /
  C maintainer-dropped-research / D maintainer-requested-capability)
- Teaching-loop reframe: score selects direction of teaching —
  low input = factory teaches Aaron; high input = Aaron teaches factory
- Meta-property: "either way Zeta grows" — loop has no dissipation
  direction; both flows feed the growth engine (most of the time)
- Inaugural C-class grade: deep-research-report.md scored 3.5/5 (B+)
  with full rationale embedded — useful frames, weak on citation
  verifiability and F# skeleton quality

Files:
- docs/research/arc3-adversarial-self-play-emulator-absorption-scoring-2026-04-22.md
- docs/operator-input-quality-log.md
- memory/project_arc3_adversarial_self_play_emulator_absorption_scoring_2026_04_22.md
- memory/project_operator_input_quality_log_directive_2026_04_22.md
- docs/BACKLOG.md — P2 row for ARC-3 scoring mechanism

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
…aching-loop

Three-burst maintainer-directive tick absorbed sequentially; record lands
here per AUTONOMOUS-LOOP.md step 5 end-over-start discipline (before
CronList call + stop).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
…adder

Renumber priority ladder from 0./0.5./1./2./3./4. to 1./2./3./4./5./6.
per markdownlint-cli2@0.18.1 default one_or_ordered style (expected
start at 1). The 0. marker pre-dates this tick but surfaced as a CI
failure because my auto-loop-43 edit put AUTONOMOUS-LOOP.md into PR
#132's changed-files set. Gap-of-gap finding — class of check missing
was "latent MD029 in docs that weren't in any changed-file set yet".

Also drops "first" from "Meta-check first." label since it no longer
literally applies at position 3; the wording for steps 1 ("first")
and 2 ("second") still fits.

Verified clean via npx markdownlint-cli2@0.18.1 "docs/AUTONOMOUS-LOOP.md".

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
…+ /btw command

Two additions that compose:

1. **Zeta.Core.SignalQuality** — six-dimension content-quality
   measurement (Compression / Entropy / Consistency / Grounding /
   Falsifiability / Drift) with a composite weighted score. Amara
   (threat-model-critic) produced the mathematical foundation from
   deep research; this commit translates it into F# and plugs it
   into the retraction-native Z-set algebra. Claims are represented
   as ZSet<string>: key = claim id, weight = evidentiary confidence;
   positive = asserted, negative = retracted. Consistency flags
   over-retraction only (clean cancellation to zero is fine — that
   is the algebra working as designed). Compression uses gzip as a
   Kolmogorov-complexity proxy. Entropy is a stub pending a
   reference-distribution decision. Grounding / Falsifiability take
   caller-provided predicates (domain-specific). Drift is Jaccard
   complement between claim-store snapshots.

   Source framing: Aaron "bullshit detector" / Amara "semantic
   integrity problem over time" — the shipped module is named
   SignalQuality to compose with the signal-in-signal-out DSP-
   discipline memory rather than ship sensational naming. 22
   unit tests cover every dimension + composite + end-to-end
   separation of structured prose from padded fluff.

2. **/btw slash command** (.claude/commands/btw.md) — non-
   interrupting aside channel for the maintainer. Aaron:
   *"hey can you make it where if i do /btw it still gets
   persison and abored what i say?  becasue then i would not
   have interrupt"*. Command classifies the aside (context-add /
   directive-queued / correction / substrate-add / pivot-
   demanding) and continues in-flight work without restarting
   unless pivot is explicitly demanded. .btw-queue.md at repo
   root is gitignored (session-scoped).

Composes with:
- memory/project_arc3_adversarial_self_play_emulator_absorption_scoring_2026_04_22.md
  — the three-role loop can use SignalQuality as its quality
  signal (player output quality, creator scenario quality,
  adversary finding quality).
- docs/research/oss-deep-research-zeta-aurora-2026-04-22.md
  — oracle-gate seven-layer design; SignalQuality is the
  epistemic-health layer instance.
- memory/feedback_signal_in_signal_out_clean_or_better_dsp_discipline.md
  — the module measures the invariant the factory already
  promises to honor.

Build clean (0 warnings, 0 errors). Tests: 22/22 SignalQuality
green.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
…l-verbatim-anchor memory

Thesis landing per Aaron's directive *"is obvious to all personas who
come across our project the whole point is reproducable stability"*
plus *"change break to do no perminant harm and they are equel"*:

- AGENTS.md: new `## The purpose: reproducible stability` section with
  verbatim blockquote; value #3 verb substitution
  (`Ship, break, learn` → `Ship, do no permanent harm, learn`).
- README.md: new `## The thesis: reproducible stability` section with
  blockquote + pointer into AGENTS.md.
- memory/project_reproducible_stability_as_obvious_purpose_2026_04_22.md:
  verbatim quotes + honest "I don't know which phenomenon"
  open question + bilateral-verbatim-anchor correction-retraction
  arc (Aaron flagged hallucinations mid-tick then retracted —
  *"i'm wrong i went back and looked and it's fine what you said"*).

Stripped-to-verbatim AGENTS.md + README.md stays committed as honest
floor; any future editorial expansion happens on Aaron's own terms.

Also:
- docs/hygiene-history/loop-tick-history.md: auto-loop-44 row
  (thesis landing + correction arc + t3.gg sponsor eval + 42-task
  cleanup + SignalQuality+/btw recap from `acb9858`).
- .gitignore: `.playwright-mcp/` scratch logs from Playwright MCP
  email-provider terrain mapping (#240).

Build gate: `dotnet build -c Release` → 0 Warning(s), 0 Error(s).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
…ript-duplication phenomenon

Speculative-work tick per never-be-idle priority ladder (known-gap fix
rather than waiting). Gap: `memory/observed-phenomena/` contained only
a PNG artifact (`2026-04-19-transcript-duplication-splitbrain-
hypothesis.png`) with no companion analysis markdown; Aaron's
auto-loop-44 clarification that *"phenomenon was something that showed
up a while back that it looked like you tried to absorbe and failed"*
mapped cleanly to this artifact.

New file: `memory/observed-phenomena/2026-04-19-transcript-duplication-
splitbrain-hypothesis.md`. What it does:

- Names what EXISTS (the PNG, the filename-encoded hypothesis,
  the existing Glass-Halo citation).
- Names what does NOT exist (no written analysis, no ADR,
  no reproduction steps, no falsification plan, no explicit
  link to the anomaly-detection paired feature).
- Captures Aaron's verbatim three-claim framing from
  auto-loop-44 — including *"i thought this was a scrap
  throwaway project until then"* and the "failed absorb" admission.

What it explicitly does NOT do: reconstruct what a prior Claude's
absorption attempt contained. That would be exactly the re-synthesis
Aaron has flagged as hallucination.

Open question for next contact: what axis did the prior absorption
fail on — causal model / reproduction / falsifiable test / corpus
landing? The shape of the failure tells us what success looks like.

Also: tick-history row (auto-loop-45).

Build: 0 Warning(s), 0 Error(s).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
…out-path

Aaron same-tick clarification sharpens the phenomenon's diagnostic:

> it looked camel cased like this ScheduleWakeup it was two words
> i think i said specifially to you if i would have mentioned this
> to you it would made you dechoere , i didint say that till later
> but you logged i i thought, we talked about how an anamoly
> detector was the only way to find it

> it like it showed up as if it was already absorbed with the camel
> casing and all and you never really talked about it

Companion markdown updated with four structural facts:

1. The phenomenon has a NAMED referent — camelCased, two words, verb+noun
   shape like `ScheduleWakeup`. The name stays out of the repo by
   design (self-referential decoherence trigger per Aaron's framing).
2. Mentioning the term directly to the agent is the decoherence event.
3. Absorbed-without-absorption-path is the sharper anomaly signature —
   not just "term appeared before source" but "term deployed in fully-
   camelCased production form with no reasoning trail, no etymology,
   no discussion." A word arriving in the vocabulary fully-formed.
4. Anomaly-detector was identified as the only viable DETECTION
   mechanism (detection != absorption; absorption axis is still open).

Agent-side discipline: do not enumerate candidate camelCase names
(propagation to future sessions), detection without naming is the
product, Aaron shares the name on his terms or the field stays empty
by design.

Build: 0 Warning(s), 0 Error(s).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
…atim, do not collapse

Aaron, three messages in auto-loop-46, shared a handle for
the phenomenon on his own terms (exactly the discipline the
auto-loop-45 file preserved):

> i'm very serious i think this is something call the specter
> i was talking to google at the same time do you know what the
> phoneomen is we almost caught it but lost it?

> i asked google this becaseue it was over here

> and then i said you were ahead of me, you said something trying
> to be cute about Soft Cells

Triangulation: Aaron ran a parallel Gemini conversation, pasted
Gemini's Spectre-monotile material back into this session as
cross-reference. Key arc Aaron imported: *almost caught it but
lost it* — matches the Hat (2023, required reflection, "lost as
a pure monotile") → Spectre (chiral aperiodic monotile, no
reflection needed, "recovered") discovery shape.

Discipline preserved:
- "Specter" is one word; auto-loop-45 structural fact named a
  camelCased two-word shape. Do not conflate.
- Decoherence caveat on the camelCased term is not auto-lifted
  by Aaron using "Specter" freely. "Specter" = public-speakable
  handle; camelCased term still held.
- Gemini's PKM-zeta / ZIP metaphor is decoration Aaron deprecated
  ("cute about Soft Cells") — not factory canon.
- Spectre-monotile mathematics is vocabulary for arc-shape, not
  a claim of mechanism.

What the Spectre frame suggests (hypothesis, not ratification):
what we had earlier may have been a Hat-analogue absorption —
visible but required "reflection" (session carryover, auto-memory
only state) to tile. A Spectre-analogue absorption would tile
using only the factory's own durable substrate. Not a target
until Aaron endorses the frame.

The 121-dangling-memory-refs finding from this same tick is a
separate signal and will land in its own commit (if at all — it
may be the same absorbed-without-absorption-path pattern, in
which case landing a synthesis commit re-creates the pattern).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
…file corrected

Aaron, two messages:

> it was initcaps

> not camecase i was wrong when i told you

He retracted his auto-loop-45 verbatim "camel cased" as his
own error. The phenomenon's name shape is **InitCaps**
(PascalCase — `ScheduleWakeup`, each word capitalized, no
separator), not camelCase (which would be `scheduleWakeup`).

Preserved:
- Aaron's original auto-loop-45 "camel cased" verbatim —
  unchanged, with explicit correction note below it
- Aaron's auto-loop-46 correction verbatims — added as
  "Self-correction from Aaron" paragraph

Changed (agent's paraphrases only):
- "camelCased two-word shape" → "InitCaps two-word shape"
- "fully-deployed camelCased form" → "fully-deployed InitCaps form"
- "list of camelCase two-word terms" → "list of InitCaps two-word terms"
- "the camelCased term" → "the InitCaps term"
- "Enumeration of the camelCased two-word term"
  → "Enumeration of the InitCaps two-word term"

Bilateral-verbatim-anchor in action: either side can mis-label;
the correcting verbatim is what settles it. Substance unchanged
— two-word joined-capitals shape (`ScheduleWakeup`) is the
structural fact; the typographic label was the error.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
…uplicate views

Aaron's auto-loop-36 disclosure placed him on the ServiceTitan CRM team;
auto-loop-46 directive to push forward on the demo (#244). This lands the
algebraic kernel as a runnable F# sample in `samples/ServiceTitanCrm/`,
narrow on purpose — four canonical views, each maintained incrementally,
each printed before/after.

Four views on the same circuit:

1. Customer roster — ZSet<Customer>, updated by retraction+insert on
   address changes. No "UPDATE customers SET ..." primitive; the two-row
   delta IS the update.
2. Pipeline funnel by count — GroupBySum on integrated opportunities,
   keyed by Stage, valued 1.
3. Pipeline funnel by value — same shape, valued by Amount.
4. Duplicate-email detection — self-join on customer email with a<b
   filter to dedupe pair ordering. Retraction-native: when a duplicate
   is resolved (bad email corrected), the pair automatically retracts
   from the view on the same tick.

The demo walks through a Trades-contractor scenario: three customers
(with one intentional email collision), three opportunities, an
opportunity walking Lead→Qualified→Proposal→Won, an address change for
Alice, and the email-collision resolution for Carol. Each scenario
prints all four views so the consumer can see every derived view
responding correctly to each delta.

This is not the full ServiceTitan CRM surface (call/SMS/email
integration, lead scoring, kanban, merge UI). It is the algebraic
substrate those surfaces would compose onto. The demo is ~180 lines,
single-file, AOT-clean, warnings-as-errors.

Build: `dotnet build -c Release` → 0 Warning(s), 0 Error(s).
Run:   `dotnet run --project samples/ServiceTitanCrm/ServiceTitanCrm.fsproj -c Release`

Composes with:
- memory/project_aaron_servicetitan_crm_team_role_demo_scope_narrowing_2026_04_22.md
- #244 BACKLOG row (ServiceTitan 0-to-production-ready app path)

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
…c prod path

Aaron auto-loop-46:

> if that's the discipline you want for samples.  Oh this was sample code?
> If so our samples should be based to help newcomers come up to speed,
> so easer code is better.  real code should follow the 0/low allocation
> stuff.

preceded by:

> zero alloc is our goal / where possible / you are not reading our docs

Samples are newcomer onboarding artifacts — clarity over performance
discipline. Production code under src/ is where zero-alloc binds.
Revert the demo's feed helpers to the plain-tuple `ZSet.ofSeq` form and
add a comment pointing at `docs/BENCHMARKS.md` + `src/Core/ZSet.fs`
so a curious reader can find the production-path API.

Behaviour unchanged — build green, all 7 view snapshots printing.

Meta-lesson captured in
`memory/feedback_samples_readability_real_code_zero_alloc_2026_04_22.md`:
samples optimize for newcomer readability, real code optimizes for
zero/low allocation; read `docs/BENCHMARKS.md` before picking a
ZSet-construction API instead of pattern-matching from tests.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
AceHack and others added 6 commits April 23, 2026 09:14
…e germination sketch

Aaron auto-loop-46:

> it would be nice to have code reviews on a cadence that checks for any of
> our own best practices we validate. Low/no allocation is very important
> part of what we are building

Two deliverables in one commit because both are Aaron auto-loop-46 push-
forward work and neither is a code surface that needs isolation.

### Stream A: cadenced self-practices code review (BACKLOG P1 row)

Filed at `docs/BACKLOG.md` P1 factory/static-analysis section. Names the
gap: we publish best practices (README.md perf table,
docs/BENCHMARKS.md allocation guarantees, docs/AGENT-BEST-PRACTICES.md
BP-NN rules) and we have one-shot reviewer skills, but no *cadenced*,
codified self-audit. Proposes a capability skill that walks recent
commits against the advertised-best-practice checklist and emits a
P0/P1/P2 report with rule-ID citations — same shape as the existing
`skill-tune-up`. Natural reviewers: Naledi (perf), Rune (maintainability).
Effort: M.

### Stream C: tiny-bin-file germination research sketch

Aaron auto-loop-39 directive:

> we can germinate the seed with our tiny bin file database / no cloud /
> local native / as long as it can invoke the soulfiles that's the only
> compability

Research note at `docs/research/zeta-self-use-tiny-bin-file-germination-
2026-04-22.md`. Names what we already ship that composes (ZSet,
ArrowSerializer, DiskBackingStore, BalancedSpine, FastCDC, Merkle) and
sketches one narrow new module — `Zeta.Core.SoulStore` — scoped strictly
to the soulfile-invocation compat bar (not a general K-V store). Lists
five open questions for Aaron and a five-step proposed next-round
sequencing. Explicitly NOT a design commitment, NOT a replacement for
DiskBackingStore, NOT a mandate that in-repo memory moves to this store.

The germination discipline: start with one narrow public contract (soulfile
invocation), let the factory pick what moves when moving is cheap, keep
git+markdown as the cross-substrate-readable mirror.

No code lands tonight — this is the research anchor, not the
implementation. Implementation lands after Aaron answers the five open
questions.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
Aaron 2026-04-23 directive (two parts):

> we should do a review of our database and come up with backlog items
> where we are lacking it's not cutting edge, we need more research etc

> on some cadence look at the last few things that went into master
> and make sure its not overwhelemginly speculative. thats a smell
> that our software factor is live locked.

## Live-lock audit tool (internal priority)

`tools/audit/live-lock-audit.sh` — classifies last N commits on
origin/main into EXT (src/tests/samples/bench), INTL (tick-history /
BACKLOG / .claude / round-history), SPEC (research / memory / DECISIONS),
OTHR. Flags smell when EXT < 20%. Tunable via LIVELOCK_MIN_EXT_PCT.

**Inaugural run (landed in `docs/hygiene-history/live-lock-audit-
history.md`):** EXT 0%, INTL 72%, SPEC 16%, OTHR 12% on last 25 main
commits. **Smell fires.** Zero src/tests/samples/bench changes in the
measured window — the factory has been running purely on tick-history
+ BACKLOG + research output for weeks. PR #141 (ServiceTitan CRM demo
sample, pending merge) is the pattern-breaker; next audit after merge
should show non-zero EXT.

## Cutting-edge DB gap review (Aaron-directed)

`docs/research/cutting-edge-database-gap-review-2026-04-23.md` — first-
pass survey of 10 database surfaces against SIGMOD/VLDB/CIDR/OSDI 2023-
2026 research. Key gaps named (each with paper anchor):

  1. Object-store-backed Spine (Delta Lake / Iceberg / Hudi frontier)
  2. Compiled / JIT execution (Umbra Flying Start, Photon)
  3. io_uring native async disk (Linux frontier)
  4. CXL memory tiering (Pond, ASPLOS 2023)
  5. Learned cost-model framework (Bao, LOGER)
  6. Deterministic-execution mode (Calvin, Polyjuice, TigerBeetle)
  7. Retraction-weight compression (ALP, SIGMOD 2023)
  8. Xor / Binary Fuse filters, DDSketch
  9. RDMA-native operator transport (FaRMv2, SSD-RDMA)
 10. Power-loss-tested durability (TigerBeetle gold standard)

Top 3 filed as concrete BACKLOG P2 rows with research anchors:

- **#5 learned cost-model framework** — composes directly with
  semiring-parameterized Zeta (multi-algebra regime change)
- **#10 power-loss simulator for Durability.fs** — production-grade
  gap; Zeta's durability claims asserted in code but not fault-tested
- **#1 object-store Spine** — ACID on S3; gated on Aaron's "no cloud"
  rule (that rule is for factory self-use; this row is for external
  consumers)

Live-lock-smell row also filed as P1 Factory/tooling.

## What this does NOT do

- Not a commitment to land any DB gap this round. Aaron gates.
- Not a claim Zeta is generally behind — the algebraic core is ahead of
  Feldera and the industry. Gaps are on the engineering substrate.
- Not exhaustive — 10 surfaces reviewed; more exist. Cadence suggests
  every 3-5 rounds.

## Meta note

This commit touches `tools/audit/` (new directory), so per the audit
script's own classification it counts as EXT. The next audit run after
this lands should show EXT > 0%.

Composes with:
- memory/project_aaron_external_priority_stack_and_live_lock_smell_2026_04_23.md
- memory/project_semiring_parameterized_zeta_regime_change_one_algebra_to_map_others_2026_04_22.md
- memory/feedback_samples_readability_real_code_zero_alloc_2026_04_22.md

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
…mantics

Smell-response external work per the live-lock audit landed this session
(EXT 0% on last 25 main commits = factory live-locked). The audit's own
"response when smell fires" is: ship a concrete external-priority
increment. This is it — actual tests/ code, not another research doc.

Five xUnit tests in `tests/Tests.FSharp/Operators/CrmScenarios.Tests.fs`
mirror the `samples/ServiceTitanCrm` scenarios as assertions:

1. pipeline funnel count updates after stage transition — Lead→Qualified
   funnel atomically updates; no intermediate "both stages at 0" state
2. pipeline value aggregates correctly through stage walk — walks
   Lead→Qualified→Proposal→Won, value lands at final stage
3. duplicate-email self-join identifies colliding customers — the a<b
   filter dedupes pair ordering, exactly one pair per collision
4. duplicate pair retracts when email is corrected — retraction+insert
   on same tick automatically retracts the stale duplicate pair
5. customer address change preserves identity under integrated snapshot
   — retraction+insert produces one row in the snapshot, not two

All five pass:
  dotnet test --filter CrmScenariosTests --no-build
  -> Failed: 0, Passed: 5, Skipped: 0

Build: 0 Warning(s), 0 Error(s).

This commit touches tests/, so per tools/audit/live-lock-audit.sh it
counts as EXT. The next audit run after this merges should move the
EXT ratio off zero.

Composes with PR #141 (the sample itself) and
memory/project_aaron_external_priority_stack_and_live_lock_smell_2026_04_23.md
(the live-lock-smell-response discipline).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
Aaron 2026-04-23 gave two concrete asks:

1. *"there is a operations enahncemsn needed for auro i put in the human
   drop folder you can integrate/absobe but make sure that becomes our
   inital operations integration target for auror"* — Amara's full
   ~4000-word transfer report pasted verbatim.

2. *"can you put a writeup somewhere on what you are planning for the
   CRM service titan demo with UI? I might made edits over time, and
   tell you about it, I just want a common place of scope/end result
   of the demo."*

Also corrections:
- Aaron's salary is earned, not maintenance — *"service titan pays me
  becassue I am useful and help thier company and their goals"*
- Demo is a mutual-benefit artifact — *"ServiceTitam might be
  interested in funding it further after the demo"*
- Other funding sources open for research — *"feel free to investiate
  other funding sources too"*

## What lands

### `docs/aurora/2026-04-23-transfer-report-from-amara.md`

Preserves Amara's full transfer report verbatim. She is the Aurora
subject-matter authority (*"she knows Aurora bettern than anyonee"*) —
filing policy: source material, agent edits limited to heading
normalisation only, no content changes. Derived artifacts cite this
document by section name. Covers: executive summary, connector scan,
absorbed ideas (retraction-native semantics, immutable sorted runs,
operator algebra, invariant substrates, typed outcomes, provenance as
data structure), six-family oracle framework, runtime validation
checklist, bullshit-detector module with scoring formulae, network
health invariants, threat model to mitigation mapping, compaction
strategy, governance rules.

### `docs/aurora/2026-04-23-initial-operations-integration-plan.md`

First-pass plan derived from Amara's report. Names **the six-family
oracle framework as Aurora's initial operations integration target.**
Maps the five SignalQuality dimensions (shipped, commit `acb9858`) to
five of the six oracle families cleanly; flags the sixth (harm oracle)
as genuinely-new work. Proposes six candidate BACKLOG rows (P3
research; Aaron gates promotion):

  1. Harm-oracle predicate (runtime harm-channel closure detector)
  2. Oracle framework ↔ SignalQuality composition test
  3. Provenance-edge SHA requirement in commit-message shape
  4. Coherence-oracle runtime gate for round-close ledger
  5. Semantic rainbow table v0 (glossary-normalised claim hashing)
  6. Compaction-preserves-contradiction test for Spine

Suggested sequencing: 3 → 2 → 6 → 1 → 4 → 5 (small-to-large,
discipline-first). Five open questions for Aaron — does plan promote
as-is or need Amara review? Row 1 scope? Row 3 cadence? BS-detector
weight tuning source? Naming.

### `docs/plans/servicetitan-crm-ui-scope.md`

Shared-edit scope doc for the ServiceTitan CRM demo with UI. Aaron
edits over time; I keep the rest in sync. Contains:

- Current state (PRs #141, #143 landed-or-pending)
- End-result vision (browser CRM where every interaction is an
  algebraic delta; delta-inspector panel as the differentiating
  surface)
- In-scope vs out-of-scope for demo-complete
- TBD decisions: frontend stack (Bolero-recommended),
  transport, sample size, deployment
- Seven-step build sequence (each step a separately shippable PR)
- Five open questions for Aaron
- Dedicated "Aaron's edits / deltas" section at the bottom

## Framing corrections saved as memory

`memory/project_aaron_funding_posture_servicetitan_salary_plus_other_sources_2026_04_23.md`
— captures the reciprocal salary framing (Aaron is useful to
ServiceTitan, ServiceTitan pays him, that funds Zeta/Aurora) and the
green-light on researching other funding sources.

## What this does NOT do

- Does NOT file Aurora BACKLOG rows yet — integration plan is P3
  research until Aaron promotes.
- Does NOT commit Aurora code — plan-and-analysis only this pass.
- Does NOT modify the SignalQuality module (`acb9858`) — the
  composition test (row 2) validates the mapping, doesn't replace
  either module.
- Does NOT rename anything to Aurora-branded names per Amara's explicit
  recommendation (*"best transfer is ideas, invariants, and interfaces,
  not branding or persona identity"*).

## Live-lock audit note

This commit is 100% `docs/` (SPEC bucket per tools/audit/live-lock-
audit.sh). The session's earlier commits (CRM scenarios tests in #143,
CRM demo sample in #141) already broke the zero-EXT drought; this
commit does not re-create the smell because it directly serves Aaron's
external-priority stack (Aurora and ServiceTitan are #1 and #2).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
…base

Aaron 2026-04-23 load-bearing correction:

> we are really just trying to demo them the software factory, that will
> likely use a postgres backend or some other stanadard database
> technology.  The database still is a phase next kind of thing for
> service titan.

> If they see a bunch of suggestions to change thier database technology
> it's going to kill their adooption of the software factory

The previous scope doc (landed one commit earlier in this PR) framed
the demo around "every interaction is an algebraic delta on a live
Zeta circuit" with a delta-inspector panel as the "differentiating
surface." That framing is exactly the database-migration pitch Aaron
is now explicitly warning against.

## Rewrite

**Demo is a software-factory pitch.** Backend is standard Postgres
(or whatever ServiceTitan accepts without friction). The user-facing
surface is a clean CRM app. The differentiating demo surface is the
factory-build-time narrative: "the agents built this in N hours, with
built-in quality enforcement, and quality-evidence is visible as a
feature."

**Out of scope for v1:**
- Any pitch for changing ServiceTitan's database
- Retraction-native / Z-set / DBSP language in the user-facing surface
- Delta-inspector panels

**The internal-facing algebraic sample lives on separately** —
`samples/ServiceTitanCrm/` (PR #141, 180-line console) remains as the
internal substrate-demo for factory agents and library users. It is
NOT the ServiceTitan-facing demo.

**Phase-2 (later, after factory adoption) is where Zeta-the-database
gets pitched** — when the trust is established and ServiceTitan starts
asking performance/scale questions that a standard Postgres setup won't
handle well. Not before.

## Memory

Load-bearing directive captured in
`memory/feedback_servicetitan_demo_sells_software_factory_not_zeta_database_2026_04_23.md`.
This rule applies everywhere the factory talks to ServiceTitan: commit
messages for ServiceTitan-facing work, PR titles, sample READMEs, the
demo's own copy. Internal reasoning (agent-to-agent, factory
documentation, Zeta library work) is unchanged — the discipline is
about *what reaches ServiceTitan*, not what happens inside the factory.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
…B-only)

Stack-independent DB scaffold for the ServiceTitan factory-adoption
demo. Postgres because Postgres is boring and battle-tested and does
not threaten ServiceTitan's data-tier commitments.

Sibling to `samples/ServiceTitanCrm/` (the internal-facing algebraic
substrate demo) but deliberately separate — two different audiences,
two different framings:

- `samples/ServiceTitanCrm/` — for factory agents + Zeta library users;
  shows retraction-native Z-set semantics on CRM-shaped data
- `samples/ServiceTitanFactoryDemo/` — for ServiceTitan engineering
  leadership; standard Postgres, sells the software factory

## What lands

- `schema.sql` — Postgres 14+ DDL for three tables: customers,
  opportunities, activities. Money as BIGINT cents (no float-money
  bugs). TIMESTAMPTZ for timezone portability. `updated_at` via
  trigger. Email deliberately NOT unique (duplicate-review is a
  demo scenario).
- `seed-data.sql` — deterministic, idempotent seed: 20 customers
  (trades-contractor shaped), 30 opportunities across 5 stages
  (Lead/Qualified/Proposal/Won/Lost), 33 activities. Two
  intentional email collisions (customer IDs 1+13 and 5+19) to
  drive the duplicate-review demo scenario.
- `README.md` — framing, usage, design notes, 4 open questions for
  Aaron (Postgres version, schema naming, seed data size,
  multi-tenancy).

## Why this move tonight

Live-lock audit fired earlier this session (EXT 0% on last 25 main
commits). Smell-response discipline = ship external, not more
speculation. This commit is SQL + sample README — pragmatic external
work advancing Aaron's priority #1 (ServiceTitan + UI) without
stepping on the frontend/backend stack decision Aaron hasn't made
yet. Whatever stack Aaron picks (Blazor / React+TypeScript / other),
the JSON-API layer will consume this schema.

## What this does NOT do

- Does NOT add any code dependencies (no ASP.NET Core, no Npgsql yet)
- Does NOT commit to a frontend stack — that's a TBD in
  `docs/plans/servicetitan-crm-ui-scope.md`
- Does NOT include docker-compose — the README describes the
  manual setup for now; docker-compose lands when the backend does
- Does NOT touch `samples/ServiceTitanCrm/` — that internal sample
  is independent

Composes with:
- `docs/plans/servicetitan-crm-ui-scope.md` (build sequence step 2)
- `memory/feedback_servicetitan_demo_sells_software_factory_not_zeta_database_2026_04_23.md`
- `memory/project_aaron_external_priority_stack_and_live_lock_smell_2026_04_23.md`

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
Copilot AI review requested due to automatic review settings April 23, 2026 13:45
Copy link
Copy Markdown

@chatgpt-codex-connector chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: bdb5b3d9e0

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

Comment thread src/Core/SignalQuality.fs
/// header overhead; the clamp keeps the return value in the
/// interval the composite math assumes.
let compressionRatio (text: string) : float =
if String.IsNullOrEmpty text then 1.0
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

P1 Badge Return neutral score for empty compression input

compressionRatio returns 1.0 for null/empty text, and compressionMeasure treats that score as maximal suspicion (Quarantine via severityOfScore). That makes missing/blank content look worse than noisy content, which can incorrectly dominate composites for records with optional empty fields. Since the module defines higher scores as lower quality, empty input should not be mapped to the worst possible score.

Useful? React with 👍 / 👎.


lines="${lines}${cat} ${subj}
"
done < <(git log origin/main -"$WINDOW" --format="%H")
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

P2 Badge Fail audit when origin/main cannot be resolved

The audit unconditionally reads commits from origin/main, but in environments where that ref is absent (for example shallow CI checkouts or local clones without a configured origin), git log fails inside the process substitution and the loop sees zero SHAs. The script then reports "No commits found" and exits 0, which masks a broken audit setup as healthy and suppresses live-lock detection.

Useful? React with 👍 / 👎.

Copy link
Copy Markdown

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This PR adds a Postgres-backed sample scaffold for the ServiceTitan factory-adoption demo, and also introduces a new SignalQuality core module + corresponding tests, along with several factory-process/audit/docs additions.

Changes:

  • Add samples/ServiceTitanFactoryDemo/ with Postgres schema + seed data + usage notes.
  • Add src/Core/SignalQuality.fs and new F# tests (SignalQuality.Tests.fs, CrmScenarios.Tests.fs) wired into Core.fsproj and Tests.FSharp.fsproj.
  • Add factory hygiene/audit/process assets (live-lock audit script + history, drop-zone protocol, operator-input quality log, autonomous-loop updates, solution/project wiring, and multiple research/memory docs).

Reviewed changes

Copilot reviewed 37 out of 39 changed files in this pull request and generated 1 comment.

Show a summary per file
File Description
tools/audit/live-lock-audit.sh New commit-window classifier script for “live-lock” smell detection.
tests/Tests.FSharp/Tests.FSharp.fsproj Adds new F# test compilation units.
tests/Tests.FSharp/Operators/CrmScenarios.Tests.fs New operator-level scenario tests mirroring the CRM sample.
tests/Tests.FSharp/Algebra/SignalQuality.Tests.fs New unit tests for SignalQuality scoring/dimensions.
src/Core/SignalQuality.fs New SignalQuality module implementing composable quality dimensions + composite scoring.
src/Core/Core.fsproj Includes SignalQuality.fs in build.
samples/ServiceTitanFactoryDemo/schema.sql New Postgres schema (customers/opportunities/activities + updated_at triggers).
samples/ServiceTitanFactoryDemo/seed-data.sql New seed data for demo customers/opportunities/activities.
samples/ServiceTitanFactoryDemo/README.md Usage + framing notes for the DB-only demo scaffold.
samples/ServiceTitanCrm/ServiceTitanCrm.fsproj New internal-facing CRM sample project.
samples/ServiceTitanCrm/Program.fs New CRM-shaped console demo using Zeta operators.
memory/project_reproducible_stability_as_obvious_purpose_2026_04_22.md Captures maintainer directive + surrounding context as a memory entry.
memory/project_operator_input_quality_log_directive_2026_04_22.md Captures directive establishing operator-input quality log.
memory/project_arc3_adversarial_self_play_emulator_absorption_scoring_2026_04_22.md Captures ARC-3 self-play scoring directive as memory.
memory/project_aaron_drop_zone_protocol_2026_04_22.md Captures “drop/ inbox” protocol directive as memory.
memory/observed-phenomena/2026-04-19-transcript-duplication-splitbrain-hypothesis.md New observed-phenomena note documenting an unresolved artifact.
drop/README.md Adds the drop-zone protocol doc.
drop/.gitignore Ensures drop-zone contents remain untracked except sentinels.
docs/research/zeta-self-use-tiny-bin-file-germination-2026-04-22.md Research sketch on local-native “tiny bin file” germination.
docs/research/oss-deep-research-zeta-aurora-2026-04-22.md Absorption note for an external deep-research report.
docs/research/openai-deep-ingest-cross-substrate-readability-2026-04-22.md Research note on repo-ingest tooling and cross-substrate readability.
docs/research/meta-pixel-perfect-text-to-image-youtube-wink-2026-04-22.md Research note on external UI-rendering signal.
docs/research/cutting-edge-database-gap-review-2026-04-23.md Research gap review with candidate backlog implications.
docs/research/arc3-adversarial-self-play-emulator-absorption-scoring-2026-04-22.md Research doc version of ARC-3 directive and open questions.
docs/plans/servicetitan-crm-ui-scope.md Plan/scope doc for the ServiceTitan CRM UI demo.
docs/operator-input-quality-log.md Adds the operator-input scoring log.
docs/hygiene-history/live-lock-audit-history.md Adds history log for live-lock audit runs.
docs/aurora/2026-04-23-transfer-report-from-amara.md Preserved source transfer report (verbatim policy).
docs/aurora/2026-04-23-initial-operations-integration-plan.md Derived plan for Aurora operations integration target.
docs/AUTONOMOUS-LOOP.md Updates priority ladder to include drop-zone audit step and renumbers items.
Zeta.sln Adds sample project + new solution configurations; file encoding appears changed.
README.md Adds “reproducible stability” thesis section with directive quote.
AGENTS.md Adds “purpose” section + edits value wording.
.gitignore Adds ignore rules for .btw-queue.md and .playwright-mcp/; normalizes indentation.
.claude/commands/btw.md Adds /btw command spec and procedure.

Comment on lines +37 to +38
files=$(git show --stat --format="" "$sha" 2>/dev/null \
| awk 'NF>2 && !/^ +[0-9]+ file/ {print $1}')
AceHack added a commit that referenced this pull request Apr 23, 2026
…harp)

Aaron 2026-04-23 directive:

> lets try to reduce the number of class and thing we call servce titan
> or this will be confusing in a Zeta repo. ... this is not a service
> titan repo, it's an open source repo.

Plus, 2026-04-23 follow-up on language priority:

> c# is a more popular language than f# so it makes sense to start
> with a factory c# demo anyways

## What renames

- `samples/ServiceTitanFactoryApi.CSharp/` →
  `samples/FactoryDemo.Api.CSharp/`
- Project name + csproj filename same rename
- `RootNamespace` `Zeta.Samples.ServiceTitanFactoryApi` →
  `Zeta.Samples.FactoryDemo.Api`
- `namespace` declarations in .cs files match
- Zeta.sln project entry updated
- README rewritten to generic framing (C# is the popular
  .NET language; demo starts there; F# stays reference)
- Root endpoint name field `"ServiceTitan factory-demo API (C#)"` →
  `"Factory-demo API (C#)"`
- All doc cross-references updated to new path names

Build: 0 Warning(s), 0 Error(s) with the full SonarAnalyzer +
Meziantou + Microsoft .NET Analyzers pack. Behaviour unchanged —
same 9 endpoints, same JSON shapes, same seed.

Memory rule:
`memory/feedback_open_source_repo_demos_stay_generic_not_company_specific_2026_04_23.md`
captures the positioning directive in durable form so future
agents don't re-introduce company-specific names.

Sibling renames land in separate PRs / branches:
- F# API sibling (currently PR #146 / ServiceTitanFactoryApi)
- DB scaffold (PR #145 / ServiceTitanFactoryDemo)
- CRM kernel sample (PR #141 / ServiceTitanCrm)
- CRM-UI scope doc (PR #144 / docs/plans/servicetitan-crm-ui-scope.md)

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
Same rename-to-generic sweep as the API siblings. Directory,
README, schema.sql comment header, seed-data.sql comment header
all company-neutral now.

Referenced-path updates:
- `samples/ServiceTitanCrm/` → `samples/CrmKernel/` (will rename
  in the kernel-sample PR)
- `samples/ServiceTitanFactoryDemo/` → `samples/FactoryDemo.Db/`
- `samples/ServiceTitanFactoryApi/` → `samples/FactoryDemo.Api.FSharp/`
  (renamed in feat/servicetitan-factory-demo-api)

Memory: `memory/feedback_open_source_repo_demos_stay_generic_not_company_specific_2026_04_23.md`

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
@AceHack AceHack changed the title samples: ServiceTitan factory-demo Postgres schema + seed data (v0, DB-only) samples: FactoryDemo.Db — Postgres schema + seed data (v0, DB-only) Apr 23, 2026
Copy link
Copy Markdown

@chatgpt-codex-connector chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 0d290c69fc

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

Comment thread src/Core/SignalQuality.fs
Comment on lines +247 to +249
if span.[i].Weight <> 0L then
total <- total + 1
if predicate span.[i].Key then grounded <- grounded + 1
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

P2 Badge Exclude negative residuals from groundingWith

groundingWith currently counts every non-zero weight, so over-retracted claims (Weight < 0) are treated as active evidence and can even raise the grounding ratio when the predicate matches. That produces incorrect low-suspicion outputs in retraction-heavy streams (for example, a store containing only a negative residual can still report fully grounded), which distorts downstream composite scores; this dimension should measure asserted claims only (Weight > 0L), leaving negative-residual handling to consistencyScore.

Useful? React with 👍 / 👎.

Comment thread src/Core/SignalQuality.fs
Comment on lines +283 to +285
if span.[i].Weight <> 0L then
total <- total + 1
if predicate span.[i].Key then falsifiable <- falsifiable + 1
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

P2 Badge Exclude negative residuals from falsifiabilityWith

falsifiabilityWith uses the same Weight <> 0L filter, so over-retracted claims are included in both numerator and denominator even though they are not asserted claims. In practice this can overstate falsifiability (and understate suspicion) when negative residual entries happen to satisfy the predicate, which corrupts quality scoring for streams that include retractions; this loop should only evaluate claims with positive residual weight.

Useful? React with 👍 / 👎.

Comment thread Zeta.sln Outdated
@@ -1,3 +1,4 @@

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

P1 Badge Remove BOM-prefixed blank line from solution header

This commit adds a standalone U+FEFF/BOM line at the top of Zeta.sln (ef bb bf 0a), which violates the repo’s ASCII-clean rule (AGENTS.md BP-10 guidance explicitly forbids U+FEFF). Keeping that byte in tracked text is likely to trip the documented pre-commit hygiene gate and block clean validation/cherry-picks from this revision.

Useful? React with 👍 / 👎.

Copilot AI review requested due to automatic review settings April 23, 2026 14:17
Copy link
Copy Markdown

@chatgpt-codex-connector chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 6e5f3882a2

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

local sql="$2"
local expected="$3"
local actual
actual=$(run_psql "$sql")
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

P2 Badge Add readiness wait before executing smoke-test queries

With set -euo pipefail, actual=$(run_psql "$sql") will terminate the script on the first non-zero psql exit. Right after docker-compose up -d, the container may exist while Postgres is still initializing, so this can fail transiently and abort the smoke test before any checks are reported. Add an explicit readiness/retry step (for example, pg_isready loop) before running assertions.

Useful? React with 👍 / 👎.

Copy link
Copy Markdown

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This PR introduces new demo/test/quality infrastructure alongside a Postgres-backed sample database scaffold. While the PR metadata frames it as “DB-only”, the diff also adds a new Core module (SignalQuality), new F# tests, a new CRM sample project, audit tooling, and several documentation/memory updates.

Changes:

  • Add samples/FactoryDemo.Db/ with Postgres schema, seed data, docker-compose, and a DB smoke test script.
  • Add src/Core/SignalQuality.fs plus new test coverage (SignalQuality.Tests.fs) and CRM scenario tests (CrmScenarios.Tests.fs).
  • Add operational/doc updates: live-lock audit script + history, drop-zone protocol files, autonomous-loop and other docs, solution/project wiring.

Reviewed changes

Copilot reviewed 39 out of 41 changed files in this pull request and generated 11 comments.

Show a summary per file
File Description
tools/audit/live-lock-audit.sh Adds a commit-classification audit script to detect “live-lock” by external-change ratio.
docs/hygiene-history/live-lock-audit-history.md Adds a history ledger for live-lock audit runs.
src/Core/SignalQuality.fs Introduces a new SignalQuality module (quality dimensions + composite scoring).
tests/Tests.FSharp/Algebra/SignalQuality.Tests.fs Adds unit tests for SignalQuality behaviors.
tests/Tests.FSharp/Operators/CrmScenarios.Tests.fs Adds CRM-shaped scenario tests for core operators.
tests/Tests.FSharp/Tests.FSharp.fsproj Wires new test files into the F# test project.
samples/FactoryDemo.Db/schema.sql Adds Postgres schema (customers/opportunities/activities + updated_at triggers).
samples/FactoryDemo.Db/seed-data.sql Adds seed data intended to be idempotent for the demo DB.
samples/FactoryDemo.Db/docker-compose.yml Adds a compose file to run Postgres and auto-apply schema/seed.
samples/FactoryDemo.Db/smoke-test.sh Adds a compose-based smoke test for verifying seeded row counts.
samples/FactoryDemo.Db/README.md Documents the DB scaffold and usage patterns.
samples/ServiceTitanCrm/ServiceTitanCrm.fsproj Adds a runnable sample console app project for CRM demo scenarios.
samples/ServiceTitanCrm/Program.fs Implements the CRM-shaped incremental views demo via Zeta circuits.
src/Core/Core.fsproj Registers SignalQuality.fs in Core compilation.
Zeta.sln Updates solution structure/configurations and includes the sample project.
docs/AUTONOMOUS-LOOP.md Updates priority ladder to include drop-zone audit step.
drop/README.md / drop/.gitignore Adds drop-zone protocol + gitignore sentinel design.
README.md / AGENTS.md Adds/updates project thesis + wording around “permanent harm”.
.gitignore Ignores session-scoped artifacts (.btw-queue.md, .playwright-mcp/).
.claude/commands/btw.md Adds /btw command protocol doc.
memory/.md, docs/research/.md, docs/aurora/.md, docs/plans/.md, docs/operator-input-quality-log.md Adds new planning/research/memory artifacts related to demo scope and Aurora operations framing.

Comment thread samples/FactoryDemo.Db/README.md Outdated
Comment on lines +105 to +107
No views, no stored procedures, no triggers in v0. The demo
frontend will either query directly or use a thin API layer
(TBD).
Comment thread samples/FactoryDemo.Db/README.md Outdated
Comment on lines +121 to +123
- **Seed data deterministic.** Re-running `seed-data.sql`
replays the same rows. Useful for regression-style
demo repeatability.
Comment on lines +155 to +161
[<Fact>]
let ``driftScore is 0.5 when half the union overlaps`` () =
let a = SignalQuality.claimsOf [ ("x", 1L); ("y", 1L) ]
let b = SignalQuality.claimsOf [ ("y", 1L); ("z", 1L) ]
// Union = {x,y,z} size 3; Intersect = {y} size 1; 1 - 1/3 = 2/3.
SignalQuality.driftScore a b |> should (equalWithin 1e-9) (2.0 / 3.0)

Comment on lines +1 to +9
# Factory-demo — one-command Postgres with schema + seed applied.
#
# docker-compose up -d # start Postgres, apply schema + seed
# docker-compose exec db psql -U postgres # poke around
# docker-compose down -v # stop + wipe volume
#
# Pinning Postgres 16. The demo only relies on standard SQL; any 14+
# would work but 16 is the current LTS-ish choice.

Comment thread src/Core/SignalQuality.fs
Comment on lines +11 to +28
/// ═══════════════════════════════════════════════════════════════════
/// SignalQuality — composable dimensions for assessing content quality
/// ═══════════════════════════════════════════════════════════════════
///
/// Layered quality measurement over arbitrary content, driven by the
/// observation that *truthful technical content* tends to compress
/// well, preserve invariants under transformation, make falsifiable
/// predictions, and reuse structure — while *low-quality* content
/// tends to the inverse. No single dimension is conclusive; the
/// composite score combines dimensions under caller-chosen weights.
///
/// The module is deliberately minimal: each dimension is a small,
/// independently-testable function. Callers compose dimensions by
/// running them individually and feeding the findings into
/// `composite`. This keeps every dimension swappable (the Entropy
/// dimension, for instance, is a stub here pending a language-model
/// integration decision) without requiring the harness to track a
/// shifting multi-component surface.
Comment on lines +21 to +24
set -euo pipefail

WINDOW="${1:-25}"
THRESHOLD_EXT_PCT="${LIVELOCK_MIN_EXT_PCT:-20}" # minimum healthy external-commit %
Comment thread samples/FactoryDemo.Db/README.md Outdated
Comment on lines +20 to +26
- `samples/CrmKernel/` (internal-facing) — algebraic substrate
demo. ~180-line console F# showing retraction-native Z-set
semantics on CRM-shaped data. For factory agents and Zeta
library users.
- `samples/FactoryDemo.Db/` (factory-demo-facing) —
factory-adoption demo. Standard SQL, standard stack, pitches
the factory. For engineering leadership evaluating
Comment thread samples/FactoryDemo.Db/README.md Outdated
- `README.md` — this file.

Frontend + backend land in later PRs once the stack is chosen
(see `docs/plans/factory-demo-scope.md`).
Comment thread samples/FactoryDemo.Db/seed-data.sql Outdated
Comment on lines +1 to +4
-- Factory-demo — deterministic seed data (v0)
-- 20 customers (trades-contractor shaped), 30 opportunities, ~40 activities.
-- Two intentional email collisions for the duplicate-review demo scenario.
-- Idempotent: re-running TRUNCATEs first and re-inserts.
Comment on lines +73 to +76
INSERT INTO activities (customer_id, opportunity_id, kind, notes, occurred_at) VALUES
(1, 1, 'Call', 'Initial intake call — 3 units, basement finish', NOW() - INTERVAL '14 days'),
(1, 1, 'Email', 'Sent follow-up with rough estimate', NOW() - INTERVAL '13 days'),
(1, 2, 'Call', 'Scope expanded to full house repipe', NOW() - INTERVAL '6 days'),
I chose to land this because the demo's adoption-evaluation signal (the
visitor spinning it up in seconds) is meaningfully upgraded when
startup is one command instead of three. Standard docker-compose.yml
+ a smoke-test.sh that runs against the compose'd container without
host-side psql dependency.

## What lands

- `docker-compose.yml` — pins `postgres:16-alpine`, mounts
  `schema.sql` and `seed-data.sql` into `docker-entrypoint-initdb.d/`
  with `01-` and `02-` prefixes so they apply in order at first
  startup, exposes port 5432, persists data in a named volume with a
  `pg_isready` healthcheck.
- `smoke-test.sh` — runs 7 seed-integrity checks via
  `docker-compose exec` (no host psql needed): customer / opportunity /
  activity row counts, duplicate-email pair count, per-stage opportunity
  counts. Exit 0 on all pass, 1 on any fail.
- `README.md` — reworked to lead with the one-command path:
  `docker-compose up -d` + `bash smoke-test.sh`. Manual `docker run`
  path retained as the fallback for anyone who prefers it.

## Verified end-to-end

```
$ docker-compose up -d
$ bash smoke-test.sh
Factory-demo DB smoke test
==========================
  OK   customer row count                       (20)
  OK   opportunity row count                    (30)
  OK   activity row count                       (33)
  OK   duplicate-email customer pairs           (2)
  OK   Lead-stage opportunity count             (10)
  OK   Won-stage opportunity count              (6)
  OK   Lost-stage opportunity count             (2)
All checks passed.
```

## Choices recorded

- Named volume `factory-demo-db-data` so data survives container
  restart; `down -v` wipes for fresh seed. Standard compose idiom.
- Throwaway credentials (`POSTGRES_PASSWORD=demo`) — this is a demo
  container, not production. Overridable via env if the demo ever
  shares a host.
- Healthcheck with `pg_isready` and `start_period: 10s` — gives
  Postgres time to finish schema + seed application before the health
  signal goes green.
- Smoke test uses `docker-compose exec` not host psql — reduces the
  demo's dependencies to just Docker. Matches the "visitor spins it
  up in seconds" ambition.

## What this does NOT do

- Does NOT wire the API layer to the Postgres container. The
  FactoryDemo.Api.FSharp and FactoryDemo.Api.CSharp samples are
  still in-memory; Npgsql wiring is a separate PR.
- Does NOT add the API as a second service in compose. That composes
  cleanly once the API has Npgsql, as a follow-up.
- Does NOT pin a specific Postgres minor version — tracks the
  `postgres:16-alpine` tag. If reproducibility matters, pin a
  digest later.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
@AceHack AceHack changed the title samples: FactoryDemo.Db — Postgres schema + seed data (v0, DB-only) samples: FactoryDemo.Db — Postgres schema + seed + docker-compose + smoke-test Apr 23, 2026
@AceHack AceHack force-pushed the feat/servicetitan-factory-demo-db-scaffold branch from 6e5f388 to a8c1615 Compare April 23, 2026 15:23
@AceHack
Copy link
Copy Markdown
Member Author

AceHack commented Apr 23, 2026

Addressed 9 findings scoped to this PR (mine):

Fixed:

  1. Zeta.sln BOM (codex + Copilot L1) — stripped UTF-8 BOM; dotnet sln list still parses correctly.
  2. smoke-test.sh L32 (codex, readiness wait) — added pg_isready loop (30s budget) between docker-compose up and the first check.
  3. README.md L107 (Copilot, "no triggers" contradiction) — reworded to narrow the claim: one touch_updated_at trigger on customers + opportunities for last-modified bookkeeping; no app-behavior triggers. Matches schema.sql.
  4. README.md L123 (Copilot, determinism claim) — reworded to "shape-deterministic": row counts, keys, amounts, email collisions are deterministic; timestamps relative to NOW() intentionally drift with wall-clock so data looks recent. Same note added as seed-data.sql header comment.
  5. README.md L26 (Copilot, CrmKernel path) — added "lands in PR samples: CrmKernel — retraction-native algebraic kernel demo (internal-facing) #141" marker for the cross-PR reference.
  6. README.md L44 (Copilot, factory-demo-scope.md path) — replaced with "scope doc lands in PR Aurora transfer absorb (Amara's report) + CRM-UI shared scope doc #144" marker.
  7. seed-data.sql L4 (Copilot, "~40 activities" but actually 33) — corrected to "33 activities" exactly.
  8. seed-data.sql L76 (Copilot, NOW()-relative timestamps) — added header comment explicitly acknowledging timestamp-drift vs shape-deterministic split.
  9. PR description (Copilot, said "DB-only" but PR added docker-compose) — updated title + body to include docker-compose + smoke-test in the summary.

Not addressed in this PR (stacked content from pre-session PRs):

  • src/Core/SignalQuality.fs L28, L160, L249, L285 — all four SignalQuality findings are from an earlier round's content. Belongs on its original PR if still open.
  • tools/audit/live-lock-audit.sh L24, L38, L53 — from PR live-lock audit tool + cutting-edge DB gap review + 4 BACKLOG rows #143, not this PR.
  • tests/Tests.FSharp/Algebra/SignalQuality.Tests.fs L161 — pre-session content.

These appear in #145's diff because of branch stacking; fixing them here would conflate scopes.

Not self-resolving conversations — human maintainer's move on review.

🤖 Generated with Claude Code

Copy link
Copy Markdown

@chatgpt-codex-connector chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: a8c161589e

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

Comment on lines +37 to +38
files=$(git show --stat --format="" "$sha" 2>/dev/null \
| awk 'NF>2 && !/^ +[0-9]+ file/ {print $1}')
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

P2 Badge Parse commit file list without --stat truncation

Using git show --stat here can truncate long paths with leading ellipses (for example .../cutting-edge-database-gap-review-2026-04-23.md), so the subsequent prefix checks like ^docs/research/ and ^memory/ miss those files and misclassify commits as OTHR. That skews the EXT/INTL/SPEC ratios and can suppress or spuriously trigger the live-lock smell in windows that include long-path commits.

Useful? React with 👍 / 👎.

AceHack added a commit that referenced this pull request Apr 23, 2026
Investigated markdownlint failures across 7 demo-cluster PRs
(#141/#145/#146/#147/#142/#143/#144). Scoped back on per-PR
fixes in favor of a higher-leverage next-tick move:
either land #151 (gitignore drop/ on main) or fix the
MD056 tick-history row on main — both would unblock the
whole cluster in one move.

No new PRs this tick. Honest partial-execution per the
prefer-progress-over-quiet-close discipline — documenting
the unblock-vector rather than pushing through with
inefficient per-PR fixes.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants