chore: merge main into dev (v0.13.3 telemetry refactor → dev) by Knapp-Kevin · Pull Request #94 · BicameralAI/bicameral-mcp

Knapp-Kevin · 2026-04-29T06:56:31Z

Summary

Brings 22 commits from `main` (up to v0.13.3) onto `dev`. Per maintainer policy, dev should always be ahead of main; this merge restores that invariant after parallel work landed on both branches.

What dev is missing from main (this PR brings in)

Version	Highlights
v0.11.0	CodeGenome Phase 1+2 release artifacts
v0.12.0	Skill-level telemetry (`record_skill_event`), extensible relay, reset wipe_mode
v0.12.1	`bicameral.feedback` tool, `error_class` enum, `rationale` field
v0.12.2	questionary CLI wizards (config, reset)
v0.13.0	Gate telemetry schema (g{N}_ prefix), AskUserQuestion ground truth, liberal ingest filter (speculative proposals)
v0.13.1	Pending decisions surfaced when sync no-ops
v0.13.2	Ratify prompt ordering fix
v0.13.3	Pydantic diagnostic enforcement + telemetry field fix

What dev already has (preserved through the merge)

Windows fixes: fix(#74): make events.writer cross-platform (POSIX fcntl + Windows msvcrt) #80 (events.writer), fix(#69): skip tests of removed preflight contracts (12f25eb) #82 (preflight test skip), fix(#68): normalize Windows backslashes in surrealkv:// URLs #83 (surrealkv URL), fix(#67): validate cwd before subprocess.run to fix Windows WinError 267 #84 (subprocess cwd validation)
Schema: fix(#72): make binds_to.provenance FLEXIBLE so nested keys persist #81 (binds_to.provenance FLEXIBLE)
Docs: docs(#75): add decision-level reference doc + expand schema comment #79 (decision_level reference)
CodeGenome: feat: CodeGenome Phase 3 (#60) — continuity evaluation in link_commit #73 (Phase 3), feat: CodeGenome Phase 4 (#61) — semantic drift evaluation in resolve_compliance (M3) #91 (Phase 4)

Conflict resolution

One conflict in `contracts.py` — both branches modified the pydantic import line. Resolved by combining: `from pydantic import BaseModel, ConfigDict, Field` (dev needed `Field` for line 278; main added `ConfigDict` for the new `IngestDiagnostic` / `PreflightDiagnostic` models). All new diagnostic model classes from main are preserved.

Test results

Full suite post-merge on Windows: 535 pass, 9 fail, 3 skip, 1 xfail.

The 9 failures are pre-existing issues unrelated to the merge:

2× UnicodeDecodeError (cp1252 → UTF-8 markdown read) — separate Windows encoding issue
4× AssertionError on assertion mismatches (test_bind, test_desync_scenarios, test_sync_middleware, test_v0420_history) — falls under #70's AssertionError cluster
3× attribute errors in test_v0420_history — same cluster

Improvement vs pre-merge dev: the Windows subprocess fix from #84 (already on dev) eliminated the WinError 267 cluster, bringing pre-merge dev from 26 failures to 9.

Why this is a merge (not a rebase)

Both branches have merge commits from prior PRs (#73, #80–#84, #91 on dev; many on main). Rebasing dev onto main would flatten those merge commits and rewrite the hashes that referenced PRs reference. Merge preserves history without breaking external references.

🤖 Generated with Claude Code