feat(dashboard): per-agent decompose-to-action ratio + prs_merged/rows_filed in metrics.json (Aaron 2026-05-26 per-agent health visibility)#5115
Merged
AceHack merged 1 commit intoMay 26, 2026
Conversation
… + rows_filed_24h (Aaron 2026-05-26 — "per agent so we can see helath like per trajectory")
Aaron 2026-05-26 substrate-engineering concern:
> 'we need to make sure that decopose is happening an on going
> backlog log or else infinate backlog is just infnate debt'
> 'the decompose to action is what i want background to show
> with stats over time on the github page we have for plant
> metrics that and also prs, i want that per agent so we can
> see helath like per trajectory'
Extends tools/dashboard/generate-metrics.ts to surface per-agent
PR-shipping rate + decompose-to-action ratio in demo/metrics.json
(consumed by the Zeta Factory Dashboard at
lucent-financial-group.github.io/Zeta/demo/index.html).
Three new per-agent fields:
prs_merged_24h — PRs this agent merged in 24h window
rows_filed_24h — PRs whose title matches `backlog(B-NNNN`
(row-filing-only PRs, NOT action-on-rows)
decompose_to_action_ratio — (prs_merged - rows_filed) / max(rows_filed, 1)
→ impl-PRs per row-filing-PR
→ >=1 = strong action-on-rows discipline
→ <1 = filing rows faster than shipping
them = debt-accumulation signal
Attribution via branch-prefix lookup (BRANCH_PREFIX_TO_AGENT) per
.claude/rules/agent-roster-reference-card.md lane discipline:
otto-cli/ + otto-desktop/ + otto-vscode/ + otto/ → Otto;
alexa-kiro/ + alexa/ → Alexa; riven-cursor/ + riven/ → Riven;
vera-codex/ + vera/ → Vera; lior-antigravity/ + lior-gemini/ +
lior/ → Lior. PRs from non-prefixed branches attribute to 'Unknown'
bucket (operator-auditable as missing-attribution surface).
EMPIRICAL validation 2026-05-26 (live run):
Otto: 57 PRs / 30 row-filing → ratio = 0.9 (nearly 1:1; debt signal!)
Lior: 6 PRs / 0 row-filing → ratio = 6 (all action)
Others: 0/0/0 (quiet 24h window)
Otto ratio 0.9 EMPIRICALLY VALIDATES Aaron's concern — this
session filed 6 substantive rows (B-0791..B-0794, B-0796, B-0797)
+ shipped 4 implementation PRs (#5103 iter-5.1+5.2, #5107 iter-5.2.1,
#5113 iter-5.2.2, #5110 draft) — ratio < 1. The metric now exposes
the pattern continuously.
Dashboard HTML render of these new fields is follow-on substrate
(small UI work). The data layer is the load-bearing first step;
operator + Mika can read demo/metrics.json directly until UI lands.
Substrate-honest note: the dashboard generation itself happens on
the autonomous-loop cron tick (per B-0414); per-agent stats will
update on every tick going forward. Time-series tracking (today's
metric vs 7d-ago, 30d-ago) is separate substrate (would need to
preserve historical metrics.json snapshots; deferred to follow-on
iteration).
Composes with .claude/rules/agent-roster-reference-card.md
(branch-prefix attribution), .claude/rules/holding-without-named-
dependency-is-standing-by-failure.md (decompose-to-action discipline),
B-0797 (autonomous-loop sometimes-task; same substrate-engineering
direction).
Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
|
You have reached your Codex usage limits for code reviews. You can see your limits in the Codex usage dashboard. |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Aaron 2026-05-26: per-agent decompose-to-action visibility for the plant-metrics dashboard. New per-agent fields in demo/metrics.json: prs_merged_24h, rows_filed_24h, decompose_to_action_ratio. EMPIRICAL: Otto ratio = 0.9 (filing ~as fast as shipping; confirms Aaron's debt concern). Composes with B-0797. Dashboard HTML render = follow-on.