fix: address bot review comments from sync PRs by stranske · Pull Request #536 · stranske/Workflows

stranske · 2026-01-05T08:31:00Z

Source: Issue #123

Automated Status Summary

Scope

After merging PR #103 (multi-agent routing infrastructure), we need to:

Validate the CLI agent pipeline works end-to-end with the new task-focused prompts
Add GITHUB_STEP_SUMMARY output so iteration results are visible in the Actions UI
Streamline the Automated Status Summary to reduce clutter when using CLI agents
Clean up comment patterns to avoid a mix of old UI-agent and new CLI-agent comments

Context for Agent

Design Decisions & Constraints

1. Clean up comment patterns to avoid a mix of old UI-agent and new CLI-agent comments
The keepalive loop now:
|  | github-actions[bot] | NEW: CLI agent iteration tracking | ✅ Keep for CLI agents |
|  | agents-workflows-bot[bot] | State tracking | ⚠️ Multiple copies accumulate |
|  | stranske | OLD: Instruction comment | ❌ CLI agents dont need this |
The goal: For CLI agents (agent:* label), we should have exactly one updating comment () instead of accumulating 10+ comments per PR.
Requires PR #103 to be merged first
This round you MUST:
Review the Scope/Tasks/Acceptance below, identify the next incomplete task that requires code, implement it, then post a reply comment with the completed items using their exact original text.

Related Issues/PRs

References

https://github.com/stranske/Workflows/compare/main...codex/issue-123?expand=1

Blockers & Dependencies

After merging PR #103 (multi-agent routing infrastructure), we need to:
1. Mark a task checkbox complete ONLY after verifying the implementation works.

Tasks

Pipeline Validation

After PR chore(codex): bootstrap PR for issue #101 #103 merges, create a test PR with agent:codex label
Verify task appendix appears in Codex prompt (check workflow logs)
Verify Codex works on actual tasks (not random infrastructure work)
Verify keepalive comment updates with iteration progress

GITHUB_STEP_SUMMARY

Add step summary output to agents-keepalive-loop.yml after agent run
Include: iteration number, tasks completed, files changed, outcome
Ensure summary is visible in workflow run UI

Conditional Status Summary

Modify buildStatusBlock() in agents_pr_meta_update_body.js to accept agentType parameter
When agentType is set (CLI agent): hide workflow table, hide head SHA/required checks
Keep Scope/Tasks/Acceptance checkboxes for all cases
Pass agent type from workflow to the update_body job

Comment Pattern Cleanup

Acceptance criteria

CLI agent receives explicit tasks in prompt and works on them
Iteration results visible in Actions workflow run summary
PR body shows checkboxes but not workflow clutter when using CLI agents
UI Codex path (no agent label) continues to show full status summary
CLI agent PRs have ≤3 bot comments total (summary, one per iteration update) instead of 10+
State tracking is consolidated in the summary comment, not scattered

Dependencies

- Requires PR chore(codex): bootstrap PR for issue #101 #103 to be merged first

Head SHA: a6a57f5
Latest Runs: ✅ success — Gate
Required: gate: ✅ success

Workflow / Job	Result	Logs
Agents PR meta manager	❔ in progress	View run
CI Autofix Loop	✅ success	View run
Gate	✅ success	View run
Health 40 Sweep	✅ success	View run
Health 44 Gate Branch Protection	✅ success	View run
Health 45 Agents Guard	✅ success	View run
Health 50 Security Scan	✅ success	View run
Keepalive E2E	❔ startup failure	View run
Maint 52 Validate Workflows	✅ success	View run
PR 11 - Minimal invariant CI	✅ success	View run
Selftest CI	✅ success	View run
Validate Sync Manifest	✅ success	View run

github-actions · 2026-01-05T08:32:38Z

Automated Status Summary

Head SHA: fb92ef1
Latest Runs: ⏳ pending — Gate
Required contexts: Gate / gate, Health 45 Agents Guard / Enforce agents workflow protections
Required: core tests (3.11): ⏳ pending, core tests (3.12): ⏳ pending, docker smoke: ⏳ pending, gate: ⏳ pending

Workflow / Job	Result	Logs
(no jobs reported)	⏳ pending	—

Coverage Overview

Coverage history entries: 1

Coverage Trend

Metric	Value
Current	92.21%
Baseline	85.00%
Delta	+7.21%
Minimum	70.00%
Status	✅ Pass

Top Coverage Hotspots (lowest coverage)

File	Coverage	Missing
`scripts/workflow_health_check.py`	62.6%	28
`scripts/classify_test_failures.py`	62.9%	37
`scripts/ledger_validate.py`	65.3%	63
`scripts/mypy_return_autofix.py`	82.6%	11
`scripts/ledger_migrate_base.py`	85.5%	13
`scripts/fix_cosmetic_aggregate.py`	92.3%	1
`scripts/coverage_history_append.py`	92.8%	2
`scripts/workflow_validator.py`	93.3%	4
`scripts/update_autofix_expectations.py`	93.9%	1
`scripts/pr_metrics_tracker.py`	95.7%	3
`scripts/generate_residual_trend.py`	96.6%	1
`scripts/build_autofix_pr_comment.py`	97.0%	2
`scripts/aggregate_agent_metrics.py`	97.2%	0
`scripts/fix_numpy_asserts.py`	98.1%	0
`scripts/sync_test_dependencies.py`	98.3%	1

Updated automatically; will refresh on subsequent CI/Docker completions.

Keepalive checklist

Scope

After merging PR #103 (multi-agent routing infrastructure), we need to:

Validate the CLI agent pipeline works end-to-end with the new task-focused prompts
Add GITHUB_STEP_SUMMARY output so iteration results are visible in the Actions UI
Streamline the Automated Status Summary to reduce clutter when using CLI agents
Clean up comment patterns to avoid a mix of old UI-agent and new CLI-agent comments

Context for Agent

Design Decisions & Constraints

1. Clean up comment patterns to avoid a mix of old UI-agent and new CLI-agent comments
The keepalive loop now:
|  | github-actions[bot] | NEW: CLI agent iteration tracking | ✅ Keep for CLI agents |
|  | agents-workflows-bot[bot] | State tracking | ⚠️ Multiple copies accumulate |
|  | stranske | OLD: Instruction comment | ❌ CLI agents dont need this |
The goal: For CLI agents (agent:* label), we should have exactly one updating comment () instead of accumulating 10+ comments per PR.
Requires PR #103 to be merged first
This round you MUST:
Review the Scope/Tasks/Acceptance below, identify the next incomplete task that requires code, implement it, then post a reply comment with the completed items using their exact original text.

Related Issues/PRs

References

https://github.com/stranske/Workflows/compare/main...codex/issue-123?expand=1

Blockers & Dependencies

After merging PR #103 (multi-agent routing infrastructure), we need to:
1. Mark a task checkbox complete ONLY after verifying the implementation works.

Tasks

Pipeline Validation

After PR chore(codex): bootstrap PR for issue #101 #103 merges, create a test PR with agent:codex label
Verify task appendix appears in Codex prompt (check workflow logs)
Verify Codex works on actual tasks (not random infrastructure work)
Verify keepalive comment updates with iteration progress

GITHUB_STEP_SUMMARY

Add step summary output to agents-keepalive-loop.yml after agent run
Include: iteration number, tasks completed, files changed, outcome
Ensure summary is visible in workflow run UI

Conditional Status Summary

Modify buildStatusBlock() in agents_pr_meta_update_body.js to accept agentType parameter
When agentType is set (CLI agent): hide workflow table, hide head SHA/required checks
Keep Scope/Tasks/Acceptance checkboxes for all cases
Pass agent type from workflow to the update_body job

Comment Pattern Cleanup

Acceptance criteria

CLI agent receives explicit tasks in prompt and works on them
Iteration results visible in Actions workflow run summary
PR body shows checkboxes but not workflow clutter when using CLI agents
UI Codex path (no agent label) continues to show full status summary
CLI agent PRs have ≤3 bot comments total (summary, one per iteration update) instead of 10+
State tracking is consolidated in the summary comment, not scattered

Dependencies

- Requires PR chore(codex): bootstrap PR for issue #101 #103 to be merged first

github-actions · 2026-01-05T08:33:00Z

🤖 Keepalive Loop Status

PR #536 | Agent: Codex | Iteration 0/5

Current State

Metric	Value
Iteration progress	[----------] 0/5
Action	wait (missing-agent-label)
Disposition	skipped (transient)
Gate	success
Tasks	0/28 complete
Keepalive	❌ disabled
Autofix	❌ disabled

🔍 Failure Classification

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: eedf05fc5c

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

.github/scripts/error_classifier.js

- keepalive_prompt_routing.js: Add missing 'ci_failure', 'fix-ci-failure' to FIX_MODES to match FIX_SCENARIOS entries - keepalive_loop.js: Remove redundant .toLowerCase() call since normalise() already returns lowercase strings - agents-guard.js: Clarify security documentation for automated PR bypass logic explaining when label can bypass CODEOWNER approval Note: Kept error_classifier.js patterns unchanged - the matching uses .includes() not regex, so broader patterns are intentional and work with the test suite.

Copilot

Pull request overview

This PR addresses issues identified by bot reviews on sync PRs across consumer repositories. The changes focus on improving consistency, reducing false positives in error classification, eliminating redundant code, and enhancing security documentation.

Key Changes:

Extended FIX_MODES constant to include ci_failure and fix-ci-failure for consistency with FIX_SCENARIOS
Made transient error patterns in error classifier more specific to avoid false positive matches
Removed redundant .toLowerCase() call in keepalive loop verification status check

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 2 comments.

File	Description
.github/scripts/keepalive_prompt_routing.js	Adds missing mode variants to `FIX_MODES` for consistency with scenario definitions
.github/scripts/error_classifier.js	Updates transient error patterns to be more specific and avoid false positives
.github/scripts/keepalive_loop.js	Removes redundant `.toLowerCase()` call on verification status
.github/scripts/agents-guard.js	Enhances security documentation for automated PR bypass logic

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

.github/scripts/error_classifier.js

.github/scripts/keepalive_loop.js

Copilot AI review requested due to automatic review settings January 5, 2026 08:31

stranske temporarily deployed to agent-standard January 5, 2026 08:31 — with GitHub Actions Inactive

Copilot started reviewing on behalf of stranske January 5, 2026 08:31 View session

chatgpt-codex-connector bot reviewed Jan 5, 2026

View reviewed changes

.github/scripts/error_classifier.js Outdated Show resolved Hide resolved

Copilot AI reviewed Jan 5, 2026

View reviewed changes

.github/scripts/error_classifier.js Outdated Show resolved Hide resolved

.github/scripts/keepalive_loop.js Show resolved Hide resolved

stranske force-pushed the fix/bot-review-comments branch from eedf05f to a6a57f5 Compare January 5, 2026 08:34

stranske temporarily deployed to agent-standard January 5, 2026 08:34 — with GitHub Actions Inactive

stranske merged commit 5b8c029 into main Jan 5, 2026
149 checks passed

stranske deleted the fix/bot-review-comments branch January 5, 2026 08:38

github-actions bot mentioned this pull request Jan 5, 2026

[Follow-up] Unmet criteria from PR #536 #537

Closed

27 tasks

stranske mentioned this pull request Jan 5, 2026

fix: correct bot review feedback from PR #536 #538

Merged

45 tasks

github-actions bot mentioned this pull request Jan 5, 2026

[Follow-up] Unmet criteria from PR #538 #539

Closed

23 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: address bot review comments from sync PRs#536

fix: address bot review comments from sync PRs#536
stranske merged 1 commit intomainfrom
fix/bot-review-comments

stranske commented Jan 5, 2026 •

edited by agents-workflows-bot bot

Loading

Uh oh!

github-actions bot commented Jan 5, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Jan 5, 2026 •

edited

Loading

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

stranske commented Jan 5, 2026 • edited by agents-workflows-bot bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Automated Status Summary

Scope

Context for Agent

Design Decisions & Constraints

Related Issues/PRs

References

Blockers & Dependencies

Tasks

Pipeline Validation

GITHUB_STEP_SUMMARY

Conditional Status Summary

Comment Pattern Cleanup

Acceptance criteria

Dependencies

Uh oh!

github-actions bot commented Jan 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Automated Status Summary

Coverage Overview

Coverage Trend

Top Coverage Hotspots (lowest coverage)

Keepalive checklist

Scope

Context for Agent

Design Decisions & Constraints

Related Issues/PRs

References

Blockers & Dependencies

Tasks

Pipeline Validation

GITHUB_STEP_SUMMARY

Conditional Status Summary

Comment Pattern Cleanup

Acceptance criteria

Dependencies

Uh oh!

github-actions bot commented Jan 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🤖 Keepalive Loop Status

Current State

🔍 Failure Classification

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

stranske commented Jan 5, 2026 •

edited by agents-workflows-bot bot

Loading

github-actions bot commented Jan 5, 2026 •

edited

Loading

github-actions bot commented Jan 5, 2026 •

edited

Loading