feat: event-sourced collaboration (Phase 1) by jinhongkuan · Pull Request #2 · BicameralAI/bicameral-mcp

jinhongkuan · 2026-04-10T18:07:07Z

Summary

Adds cross-team decision sharing via append-only JSON event files committed to git
Dual-write adapter: in team mode, every ingest/link_commit writes an event file to .bicameral/events/{git-email}/ before updating the local DB
Watermark-based materializer: on startup, replays peer events into the local ledger — incremental, only processes new events
Zero merge conflicts: per-user directories + append-only files + UUID naming
Zero handler changes: adapter factory swap is invisible to server.py and all handlers

New files

File	Purpose
`events/__init__.py`	Package marker
`events/models.py`	`EventEnvelope` Pydantic model
`events/writer.py`	`EventFileWriter` — atomic JSON writes to per-user dir
`events/materializer.py`	`EventMaterializer` — watermark + glob + replay
`events/team_adapter.py`	`TeamWriteAdapter` — composition wrapper
`tests/test_team_events.py`	20 tests covering writer, materializer, adapter, config

Modified files

File	Changes
`adapters/ledger.py`	`get_ledger()` reads `.bicameral/config.yaml`, returns `TeamWriteAdapter` when `mode: team`
`setup_wizard.py`	Solo/team mode selection, config.yaml creation, `.gitignore` handling

Test plan

20 new unit tests passing (writer, materializer, team adapter, config detection)
Existing test suite unchanged (106 passing, pre-existing failures unaffected)
Manual: bicameral-mcp setup → select team mode → verify config.yaml + .gitignore
Manual: ingest in team mode → verify event file in .bicameral/events/
Manual: copy events from another user dir → restart → verify materialized

🤖 Generated with Claude Code

Summary by CodeRabbit

New Features
- Added team collaboration mode for multi-user workflows alongside solo mode
- Introduced configuration system to set and manage collaboration mode during setup
- Event-sourced collaboration enabling team members to share and synchronize architectural decisions
Documentation
- Added collaboration modes guide with comparison table and configuration instructions
- Enhanced local development setup documentation

…terializer Adds cross-team decision sharing via append-only JSON event files committed to git. In team mode, every ingest/link_commit writes an event file to .bicameral/events/{git-email}/ before updating the local DB. On startup, a watermark-based materializer replays peer events into the local ledger. Zero merge conflicts by design. New: events/ package (models, writer, materializer, team_adapter) Modified: adapters/ledger.py (config-driven adapter factory) Modified: setup_wizard.py (solo/team mode selection) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

coderabbitai · 2026-04-10T18:07:25Z

Warning

Rate limit exceeded

@jinhongkuan has exceeded the limit for the number of commits that can be reviewed per hour. Please wait 3 minutes and 1 seconds before requesting another review.

Your organization is not enrolled in usage-based pricing. Contact your admin to enable usage-based pricing to continue reviews beyond the rate limit, or try again in 3 minutes and 1 seconds.

⌛ How to resolve this issue?

After the wait time has elapsed, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

We recommend that you space out your commits to avoid hitting the rate limit.

🚦 How do rate limits work?

CodeRabbit enforces hourly rate limits for each developer per organization.

Our paid plans have higher rate limits than the trial, open-source and free plans. In all cases, we re-allow further reviews after a brief timeout.

Please see our FAQ for further information.

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 7f2771b4-de99-4c79-a66a-e56156241a88

📥 Commits

Reviewing files that changed from the base of the PR and between b678516 and 5060b7b.

📒 Files selected for processing (4)

events/materializer.py
events/team_adapter.py
setup_wizard.py
tests/test_team_events.py

📝 Walkthrough

Walkthrough

Adds a configuration-driven "team" collaboration mode: new event-sourcing components (EventEnvelope, EventFileWriter, EventMaterializer, TeamWriteAdapter), ledger factory wiring to optionally wrap the SurrealDB adapter in team mode, and setup changes to persist and respect collaboration mode.

Changes

Cohort / File(s)	Summary
Event Sourcing Package `events/__init__.py`, `events/models.py`, `events/writer.py`, `events/materializer.py`, `events/team_adapter.py`	New event-sourcing modules: `EventEnvelope` model, atomic per-author JSON `EventFileWriter`, watermark-driven `EventMaterializer` for incremental replay, and `TeamWriteAdapter` implementing dual-write (emit event file then delegate) with read-pass-through methods.
Ledger Adapter Factory `adapters/ledger.py`	Added `_read_collaboration_mode(repo_path)` and updated `get_ledger()` to construct a base `SurrealDBLedgerAdapter` and, when mode == `team`, wrap it with `TeamWriteAdapter` wired with `EventFileWriter` and `EventMaterializer` using `.bicameral/events/` and `.bicameral/local/`.
Setup Wizard & Config `setup_wizard.py`	Introduced `_select_collaboration_mode()` and `_write_collaboration_config()`, made `_ensure_gitignore(...)` mode-aware (solo vs team) and rewrote gitignore block handling; `run_setup()` now persists mode and creates `.bicameral/events/` in team mode.
Tests `tests/test_team_events.py`	Added comprehensive tests for writer, materializer, `TeamWriteAdapter`, and config detection (in-memory SurrealDB): verifies file formats, watermarking/replay ordering, event dispatch, dual-write behavior, and git-email author detection.
Docs `README.md`	Added "Collaboration Modes" documentation and adjusted local development/install instructions and post-install command grouping.

Sequence Diagram(s)

sequenceDiagram
    participant User
    participant Factory as get_ledger()
    participant Inner as SurrealDBLedgerAdapter
    participant Team as TeamWriteAdapter
    participant Writer as EventFileWriter
    participant Materializer as EventMaterializer
    participant FS as Event Filesystem

    Factory->>Inner: instantiate SurrealDBLedgerAdapter
    Factory->>Writer: create EventFileWriter(.bicameral/events/, author)
    Factory->>Materializer: create EventMaterializer(.bicameral/events/, .bicameral/local/)
    Factory->>Team: wrap Inner with TeamWriteAdapter(Inner, Writer, Materializer)

    Team->>Inner: connect()
    Team->>Materializer: replay_new_events(inner)
    Materializer->>FS: read new event files (*.json)
    Materializer-->>Inner: call ingest_payload / ingest_commit for events

    User->>Team: ingest_payload(payload)
    Team->>Writer: write(event_type=ingest.completed, payload)
    Writer->>FS: atomic write timestamp-uuid.json
    Team->>Inner: inner.ingest_payload(payload)
    Inner-->>Team: result

    User->>Team: get_all_decisions(...)
    Team->>Inner: get_all_decisions(...)
    Inner-->>Team: decisions

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~65 minutes

Possibly related PRs

refactor: port interfaces + source_span for drift pipeline #1 — Related changes to ledger adapter wiring and ingest_commit signature (drift_analyzer forwarding) that overlap with TeamWriteAdapter delegations.

Poem

🐰
I hop and drop an event to share,
timestamps stitched with tidy care,
teammates read what I have penned,
watermarks tell where replays end,
solo or team—our logs declare.

🚥 Pre-merge checks | ✅ 2 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 24.56% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The PR title 'feat: event-sourced collaboration (Phase 1)' directly summarizes the main change—adding event-sourced collaboration infrastructure for cross-team decision sharing via append-only JSON event files, which is the central theme of all file modifications.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch feat/event-sourced-collaboration

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 6

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@events/materializer.py`:
- Around line 57-69: The single scalar watermark (watermark from
_read_watermark) is too coarse because EventFileWriter timestamps only to the
second; change the filtering logic that builds new_events (and the analogous
logic at lines 103-105) to perform per-event bookkeeping for the current
timestamp bucket: persist (or atomically store) a set of processed event
identifiers or filenames for the latest watermark-second and use
_extract_timestamp(f.name) to group by second, allowing events with timestamp ==
watermark-second to be accepted if their filename/ID is not in the persisted
processed set; update the watermark persistence API (or add a companion
persisted processed_ids set) so after processing you append processed file IDs
for that second and advance the watermark when the set is complete, and adjust
the filter to use ">" for seconds newer than watermark-second and "== and id not
in processed_ids" for the current second.

In `@events/team_adapter.py`:
- Around line 41-56: The ingest_payload and ingest_commit methods currently
write events via self._writer.write then delegate to self._inner but never
record that the local event has been materialized; after the await of
self._inner.ingest_payload(...) and self._inner.ingest_commit(...), update the
local replay cursor/state to mark that the written event has been applied (e.g.,
call your replay cursor/state API such as
self._replay_cursor.mark_applied(event_id) or update self._local_state to
include the event), ensuring this occurs only after the inner call succeeds so
replay won’t reapply the same event after restart.
- Around line 32-37: The current materialization only runs in connect(), so add
an internal async method async def _ensure_ready(self) that is idempotent and
called from every public method (e.g., read/write methods, any public API on
this adapter) to guarantee the inner adapter and materializer are initialized;
implement _ensure_ready to await a per-instance asyncio.Lock or an initialized
flag to avoid races, call await self._inner.connect() then await
self._materializer.replay_new_events(self._inner) and log as before, and set a
_ready flag so subsequent calls are no-ops; remove reliance on callers invoking
connect() and instead call await self._ensure_ready() at the start of each
public method.

In `@events/writer.py`:
- Around line 41-45: The constructor stores author_email directly into a
filesystem path which lets values with separators, '..', or absolute paths
escape the events dir; update Writer.__init__ to validate and sanitize
author_email before using it: reject or normalize inputs containing path
separators, leading slashes, or components like '..' (e.g., ensure author_email
== Path(author_email).name or apply a safe-encoding), raise ValueError for
invalid inputs, then set self._author to the sanitized value and build
self._user_dir from events_dir / sanitized_author and mkdir as before; ensure
any external callers still get a clear exception on invalid author strings.

In `@setup_wizard.py`:
- Around line 400-407: The team-mode config currently leaves SURREAL_URL and
CODE_LOCATOR_SQLITE_DB pointing to .bicameral/ledger.db and
.bicameral/code-graph.db which will be tracked; update the config generation
called after _select_collaboration_mode/_write_collaboration_config to accept
the collab_mode flag and, when collab_mode == "team", set SURREAL_URL and
CODE_LOCATOR_SQLITE_DB to files under .bicameral/local/ (e.g.,
.bicameral/local/ledger.db and .bicameral/local/code-graph.db) or add explicit
ignores for those exact paths; modify the function that writes MCP/env config
(where SURREAL_URL and CODE_LOCATOR_SQLITE_DB are emitted) to branch on
collab_mode and ensure the moved paths match what _ensure_gitignore produces.

In `@tests/test_team_events.py`:
- Around line 16-17: The test module currently uses
os.environ.setdefault("SURREAL_URL", "memory://") which can leave an existing
SURREAL_URL and cause tests to hit a persistent DB; replace this with an
unconditional override by assigning os.environ["SURREAL_URL"] = "memory://" at
module import time so the SURREAL_URL is always set to the in-memory SurrealDB
for tests (refer to the SURREAL_URL environment variable and the
tests/test_team_events.py top-level env setup).

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 06293b92-42c6-4788-8add-40ed6e13ec23

📥 Commits

Reviewing files that changed from the base of the PR and between 32f9097 and 800623d.

📒 Files selected for processing (8)

adapters/ledger.py
events/__init__.py
events/materializer.py
events/models.py
events/team_adapter.py
events/writer.py
setup_wizard.py
tests/test_team_events.py

coderabbitai · 2026-04-10T18:17:41Z

+        watermark = self._read_watermark()
+
+        # Glob all event files across all user directories
+        event_files = sorted(
+            self._events_dir.glob("*/*.json"),
+            key=lambda f: f.name,  # lexicographic = chronological
+        )
+
+        # Filter to new events
+        new_events = [
+            f for f in event_files
+            if self._extract_timestamp(f.name) > watermark
+        ]


⚠️ Potential issue | 🟠 Major

A single timestamp watermark is too coarse for these filenames.

EventFileWriter only guarantees second-level timestamps, so once the watermark is 20260410T180000Z, any event file merged later from that same second is skipped forever by the > filter here. You need per-event bookkeeping for the current timestamp bucket (or persistent processed event IDs), not a single scalar timestamp.

Also applies to: 103-105

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@events/materializer.py` around lines 57 - 69, The single scalar watermark (watermark from _read_watermark) is too coarse because EventFileWriter timestamps only to the second; change the filtering logic that builds new_events (and the analogous logic at lines 103-105) to perform per-event bookkeeping for the current timestamp bucket: persist (or atomically store) a set of processed event identifiers or filenames for the latest watermark-second and use _extract_timestamp(f.name) to group by second, allowing events with timestamp == watermark-second to be accepted if their filename/ID is not in the persisted processed set; update the watermark persistence API (or add a companion persisted processed_ids set) so after processing you append processed file IDs for that second and advance the watermark when the set is complete, and adjust the filter to use ">" for seconds newer than watermark-second and "== and id not in processed_ids" for the current second.

coderabbitai · 2026-04-10T18:17:41Z

+    async def ingest_payload(self, payload: dict) -> dict:
+        """Write ingest event, then delegate to inner adapter."""
+        self._writer.write("ingest.completed", payload)
+        return await self._inner.ingest_payload(payload)
+
+    async def ingest_commit(
+        self, commit_hash: str, repo_path: str, drift_analyzer=None,
+    ) -> dict:
+        """Write link_commit event, then delegate to inner adapter."""
+        self._writer.write(
+            "link_commit.completed",
+            {"commit_hash": commit_hash, "repo_path": repo_path},
+        )
+        return await self._inner.ingest_commit(
+            commit_hash, repo_path, drift_analyzer=drift_analyzer,
+        )


⚠️ Potential issue | 🟠 Major

Mark successful local writes as already applied.

These methods append the event file and then apply the same mutation to the local DB, but nothing records that the local event has already been materialized. After a restart, replay can apply that same event to the same ledger again. Once the inner write succeeds, record the written event in the local replay cursor/state.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@events/team_adapter.py` around lines 41 - 56, The ingest_payload and ingest_commit methods currently write events via self._writer.write then delegate to self._inner but never record that the local event has been materialized; after the await of self._inner.ingest_payload(...) and self._inner.ingest_commit(...), update the local replay cursor/state to mark that the written event has been applied (e.g., call your replay cursor/state API such as self._replay_cursor.mark_applied(event_id) or update self._local_state to include the event), ensuring this occurs only after the inner call succeeds so replay won’t reapply the same event after restart.

coderabbitai · 2026-04-10T18:17:41Z

+    def __init__(self, events_dir: Path, author_email: str) -> None:
+        self._events_dir = events_dir
+        self._author = author_email
+        self._user_dir = events_dir / author_email
+        self._user_dir.mkdir(parents=True, exist_ok=True)


⚠️ Potential issue | 🟠 Major

Validate author_email before using it as a path component.

author_email is joined directly into the filesystem path here. A value containing separators, .., or an absolute path can escape .bicameral/events/ and also breaks the materializer's */*.json scan.

Proposed fix

class EventFileWriter: """Writes append-only JSON event files to a per-user directory.""" def __init__(self, events_dir: Path, author_email: str) -> None: + if ( + not author_email + or author_email in {".", ".."} + or any(sep in author_email for sep in ("/", "\\")) + ): + raise ValueError("author_email must be a single directory name") self._events_dir = events_dir self._author = author_email self._user_dir = events_dir / author_email self._user_dir.mkdir(parents=True, exist_ok=True)

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@events/writer.py` around lines 41 - 45, The constructor stores author_email directly into a filesystem path which lets values with separators, '..', or absolute paths escape the events dir; update Writer.__init__ to validate and sanitize author_email before using it: reject or normalize inputs containing path separators, leading slashes, or components like '..' (e.g., ensure author_email == Path(author_email).name or apply a safe-encoding), raise ValueError for invalid inputs, then set self._author to the sanitized value and build self._user_dir from events_dir / sanitized_author and mkdir as before; ensure any external callers still get a clear exception on invalid author strings.

coderabbitai · 2026-04-10T18:17:41Z

+# Ensure in-memory SurrealDB for all tests in this module
+os.environ.setdefault("SURREAL_URL", "memory://")


⚠️ Potential issue | 🟡 Minor

Force the in-memory DB here instead of using setdefault().

setdefault() leaves any preexisting SURREAL_URL untouched, so this module can unexpectedly run against a persistent ledger and leak state across test runs. Use a test-scoped env override that always sets memory://.

As per coding guidelines, "Set SURREAL_URL environment variable to configure SurrealDB connection; use memory:// for tests (no persistence)".

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@tests/test_team_events.py` around lines 16 - 17, The test module currently uses os.environ.setdefault("SURREAL_URL", "memory://") which can leave an existing SURREAL_URL and cause tests to hit a persistent DB; replace this with an unconditional override by assigning os.environ["SURREAL_URL"] = "memory://" at module import time so the SURREAL_URL is always set to the in-memory SurrealDB for tests (refer to the SURREAL_URL environment variable and the tests/test_team_events.py top-level env setup).

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…olation Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

coderabbitai

Actionable comments posted: 1

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@README.md`:
- Around line 488-495: The fenced code block that shows the directory tree
starting with ".bicameral/" is missing a language tag, triggering markdownlint
MD040; update the opening fence from ``` to ```text (or another appropriate
language identifier) for the block that contains ".bicameral/ ├── events/ ..."
so the code fence declares a language and the linter passes.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: ec3ac789-5221-45c5-88a7-5e1ef72787bf

📥 Commits

Reviewing files that changed from the base of the PR and between b615cc0 and b678516.

📒 Files selected for processing (1)

README.md

coderabbitai · 2026-04-10T18:40:16Z

+```
+.bicameral/
+├── events/              ← committed to git (shared decisions)
+│   ├── pm@co.com/       ← PM's ingested PRDs and transcripts
+│   └── dev@co.com/      ← developer's commit syncs
+├── config.yaml          ← committed (mode: solo | team)
+└── local/               ← gitignored (materialized state)
+```


⚠️ Potential issue | 🟡 Minor

Add a language tag to the fenced block (markdownlint MD040).

At Line 488, the code fence should declare a language to satisfy lint rules.

🛠️ Proposed fix

-``` +```text .bicameral/ ├── events/ ← committed to git (shared decisions) │ ├── pm@co.com/ ← PM's ingested PRDs and transcripts │ └── dev@co.com/ ← developer's commit syncs ├── config.yaml ← committed (mode: solo | team) └── local/ ← gitignored (materialized state)

</details> <details> <summary>🧰 Tools</summary> <details> <summary>🪛 markdownlint-cli2 (0.22.0)</summary> [warning] 488-488: Fenced code blocks should have a language specified (MD040, fenced-code-language) </details> </details> <details> <summary>🤖 Prompt for AI Agents</summary>

Verify each finding against the current code and only fix it if needed.

In @README.md around lines 488 - 495, The fenced code block that shows the
directory tree starting with ".bicameral/" is missing a language tag, triggering
markdownlint MD040; update the opening fence from totext (or another
appropriate language identifier) for the block that contains ".bicameral/ ├──
events/ ..." so the code fence declares a language and the linter passes.

</details>  

…arseness 1. DB paths (setup_wizard): team mode now places ledger.db and code-graph.db under .bicameral/local/ so they stay gitignored 2. Lazy connect (team_adapter): _ensure_ready() guard on all public methods ensures materialization runs even without explicit connect() 3. Watermark (materializer): use >= instead of > to handle multiple events in the same second (safe due to upsert idempotency) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Wire the tech spec §4.5 vocab cache pattern into the ingest pipeline. Before running the full BM25 + fuzzy grounding pipeline, check if the ledger already has a similar intent with high-confidence maps_to edges and reuse its code_regions. Key design choices: - Uses existing intent table BM25 index, not the vestigial vocab_cache table - Repo-isolated via code_region[WHERE repo = $repo] in graph traversal - Ranked by maps_to confidence (not search::score which returns 0.0 in v2) - Cached regions validated against live symbol index before acceptance - Qualified name fallback (PaymentService.process → process) - File-path disambiguation when multiple symbols share a name Closes gap #2 from the code locator drift audit. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…uzzy Two fixes from Codex review feedback: 1. P1 #1: Filter weak BM25 scores (< 0.1) when no fuzzy matches exist. Prevents false groundings for under-specified queries that pass the FC-1 guard but have only noise-level BM25 hits. 2. P1 #2: Stage 2 now runs as enrichment after fuzzy direct lookup, not just as fallback. When fuzzy finds symbols from a subset of files, file-level retrieval fills remaining budget with additional files — preserving multi-file feature discovery. 49/49 tests pass. Eval metrics stable: MRR@5=0.605, Recall=20.0%. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…, index hardening 1. Split Phase 1 into 1a (DONE, groundwork) and 1b (TODO, actual enrichment wiring). Branch currently ships zero recall improvement. 2. Added compare-and-set content_hash guard to resolve_compliance. Caller must send the hash of the code it read; server rejects verdict if hash no longer matches current content. 3. Added try/except + atomic write (temp+rename) for symbol index build. Failure falls back to file-level BM25 gracefully. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…d payload V1 plan D1 was originally designed around a server-side search_code(decision.description) to surface relocation candidates. That approach is obsolete after v0.6.4 nuked search_code and shifted all code retrieval to the caller LLM. v0.6.4 already implemented the spirit of D1: - ledger/adapter.py:412-420 emits a symbol_disappeared grounding check on the authoritative ref when resolve_symbol_lines returns None for a tracked region; - handlers/link_commit.py:21 verification_instruction tells the caller to use Grep/Read + validate_symbols / extract_symbols + bicameral.bind; - tests/test_link_commit_grounding.py covers the rename-→-grounding flow end-to-end. V1's actual contribution: add original_lines: [start_line, end_line] to the symbol_disappeared payload so the caller LLM can inspect the symbol's prior position via `git show <prev_ref>:<file_path>` to ground its own retrieval. Single-line addition in ledger/adapter.py; new assertion in the existing regression test. Strictly informational. No new "call bicameral_bind" CTA — Codex pass-10 finding #2 stands: V2 must ship bicameral_rebind with old-binding CAS + a fresh L3 verdict on the new target before any rebind workflow becomes safe. Tests: 63 passed in 4.52s — zero regressions across ast_diff, b2_cosmetic_hint, sync_middleware, bind, link_commit_grounding, phase3 integration. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

…trix New tests/test_desync_scenarios.py — one test per scenario from the Notion "Auto-Grounding Problem" catalog (Notion ID 3332a51619c4813caccec86c36d9bf98), routed through the real handler layer per the Apr 8 PR #84 lesson (tests bypassing handlers miss post-ingest hooks). Self-contained fixture builds a tmp git repo on ``main`` plus a memory ledger per test, so scenarios are independent of each other and of the bicameral-mcp checkout. Scorecard: 12 PASS, 1 XFAIL — meets V1 plan acceptance gate (>=12/13). Pass: 1. New decision, matching code exists → ungrounded → caller binds 2. Code changed after grounded → pending + pending_compliance_check 3. Code deleted after grounded → symbol_disappeared 4. Symbol renamed in file → symbol_disappeared with original_lines (V1 D1) 5. Symbol moved to different file → symbol_disappeared 6. Code added later, decision becomes resolvable → caller binds explicitly 7. Cold start: no matching code in repo → stays ungrounded 9. Intent description supersession → re-ingest succeeds 10. Multiple intents share a symbol → both surface in drift response 11. No server-side BM25 grounding (post-v0.6.0) → stays ungrounded — caller-LLM-driven 12. Line-shift edit does not trigger drift → resolve_symbol_lines re-resolves 13. [Open Question] prefix → ingested as gap-class decision XFAIL: 8. Drifted intent → atomic re-ground → requires V2 bicameral_rebind with old-binding CAS + fresh L3 verdict on new target (design doc 8 D2; Codex pass-10 #2) Scenario 2 explicitly asserts ``pending`` (not ``drifted``) per post-v0.5.0 derive_status semantics: a hash diff WITHOUT a cached ``compliant`` verdict yields ``pending``. ``drifted`` requires a caller-LLM verdict via the verdict cache (V2 territory). Incidental bug surfaced by F1 and fixed in ledger/adapter.py: ``pending_grounding_checks`` for ungrounded decisions read ``d.get("id", "")`` from ``get_all_decisions(filter="ungrounded")``, but that query aliases the field to ``decision_id``. Result: every ungrounded grounding-check entry shipped with ``decision_id=""``, leaving callers no handle to bind against. The existing ``test_pending_grounding_checks_for_ungrounded_decisions`` regression didn't catch it because it only asserted ``len > 0`` on the list, not non-empty IDs. Fix: read ``decision_id`` first, fall back to ``id`` for forward compatibility. Real correctness bug, not a test artifact. Tests: 76 passed, 1 xfailed in 7.35s across test_desync_scenarios, test_b2_cosmetic_hint, test_ast_diff, test_phase3_integration, test_sync_middleware, test_bind, test_link_commit_grounding. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

…don't get bind CTA Codex pass-12 finding #2: D1 added original_lines to the symbol_disappeared payload (richer relocation context) but left the v0.6.4 verification_instruction text directing callers to use bicameral.bind for ALL pending_grounding_checks. For reason='ungrounded' that CTA is correct (no prior binding to retire, no duplicate-binding risk). For reason='symbol_disappeared' it is unsafe: bind on the new location leaves the old edge live under the N:N binds_to relation, producing duplicate-binding state. Atomic rebind that retires the stale edge in the same write ships in V2 (design doc 8 D2; Codex pass-10 finding #2). Net effect of D1 was that we added more relocation context routed through an unsafe CTA. This commit removes the unsafe CTA for relocation cases without reducing the safe CTA for ungrounded cases. Changes in handlers/link_commit.py: - _VERIFICATION_INSTRUCTION (single string) split into _VERIFICATION_INSTRUCTION_BASE + _GROUNDING_INSTRUCTION_UNGROUNDED + _GROUNDING_INSTRUCTION_RELOCATION. - New _build_verification_instruction(pending_compliance, pending_grounding) composer assembles the response text per call based on which reason values actually fired. Symbol-disappeared cases get an explicit "INFORMATIONAL ONLY — do NOT call bicameral.bind" warning citing the duplicate-binding hazard. - Response wiring at the bottom of handle_link_commit swapped from the constant lookup to the composer call. Tests (tests/test_link_commit_grounding.py): - test_pending_grounding_checks_for_ungrounded_decisions now asserts "bicameral.bind" IN the verification_instruction AND "INFORMATIONAL ONLY" NOT in it. - test_pending_grounding_checks_symbol_not_found now asserts "INFORMATIONAL ONLY" IN the instruction AND an explicit bind-prohibition phrase. Net destructive surface change for V1: arguably negative — V1 actively *removes* a v0.6.4 unsafe CTA for relocation, while introducing zero new mutating capabilities. Tests: 75 passed, 1 xfailed in 7.11s (xfail is scenario 8, V2 atomic rebind — by design). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

All four findings verified against current code; only the actionable ones applied. 81 passed + 1 xfailed in 9.02s. #1 — skills/bicameral-preflight/SKILL.md sync_metrics note The .claude/skills copy got the sync_metrics observability note back when V1 A3 shipped, but the canonical skills/ copy never did. Mirror the wording verbatim near step 2 so the rendering guidance and response-field documentation stay in sync. #2 — handlers/detect_drift.py per-entry alignment The cosmetic-hint enrichment was slicing both head_full and wt_full using entry.lines (the baseline anchor). HEAD and the working tree can shift the symbol independently, so a single index range can't align both sides. The narrow consequence: a drifted entry with shifted lines could yield a misleading cosmetic_hint=true on bytes that aren't the bound region. Fix: re-resolve the symbol against each ref via resolve_symbol_lines(file_path, entry.symbol, repo, ref="HEAD") and ref="working_tree" separately, slice each ref using its own resolved range. Resolution failure on either side → safe default of cosmetic_hint=False (matches the V1 contract: "False is cheap, True must be earned"). Empty symbol → skip (new fail-safe path). Test refactor: test_invalid_lines_skipped renamed to test_unresolvable_symbol_skipped — the old test asserted that lines=(0,0) was the failsafe trigger, but entry.lines is no longer the alignment input. New test exercises the resolve_symbol_lines-returns-None path via a nonexistent symbol name, which is the real fail-safe gate now. #3 — V2 guide TOC anchor for §9 GitHub auto-generates fragment IDs from heading text by lowercasing, replacing spaces with hyphens, and dropping punctuation. "## 9. Acceptance criteria for V2" maps to #9-acceptance-criteria-for-v2, but the TOC pointed at #9-acceptance-criteria (truncated). Link broken. Updated to the correct fragment. #4 — V2 guide unlabeled fenced code blocks (markdownlint MD040) Six fenced opens used bare ``` instead of a labeled fence. Tagged each with ```text — the contents are commit listings, ASCII DAG diagrams, pseudocode protocols, and tuple notation, none of which fit a real language tag. The other fenced blocks in the guide (already tagged ```sql / ```python) are unchanged. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>