fix: improve Codex coverage verification to prevent timeouts by stranske · Pull Request #386 · stranske/Workflows

stranske · 2025-12-31T14:44:23Z

Automated Status Summary

Scope

Test coverage is at 71.60% with 17 scripts below 95% coverage. Low coverage makes scripts risky to modify and harder to maintain.

Tasks

Acceptance criteria

Before marking ANY task complete, you MUST:
1. Run pytest tests/ --cov=scripts --cov-report=term-missing
2. Find the script in the output table
3. Confirm the Cover column shows ≥95%
4. Only then mark that specific task complete
Overall coverage ≥95%
Each script in scripts/ shows ≥95% in coverage output
All existing tests pass
New tests in tests/scripts/ directory

Head SHA: 19f0595
Latest Runs: ❔ in progress — Gate
Required: gate: ❔ in progress

Workflow / Job	Result	Logs
Agents PR meta manager	❔ in progress	View run
CI Autofix Loop	❔ in progress	View run
Copilot code review	❔ in progress	View run
Gate	❔ in progress	View run
Health 40 Sweep	✅ success	View run
Health 44 Gate Branch Protection	❔ in progress	View run
Health 45 Agents Guard	✅ success	View run
Health 50 Security Scan	❔ in progress	View run
Maint 52 Validate Workflows	✅ success	View run
PR 11 - Minimal invariant CI	✅ success	View run
Selftest CI	❔ in progress	View run
Validate Sync Manifest	✅ success	View run

- Add @slow marker to 3 integration tests that run external tools - Update all keepalive prompts to use targeted coverage verification: - Use -m 'not slow' to skip slow tests - Use --cov=scripts/specific_module for faster feedback - Run specific test file when available - Prevents Codex timeout issues during coverage verification

chatgpt-codex-connector · 2025-12-31T14:44:28Z

You have reached your Codex usage limits for code reviews. You can see your limits in the Codex usage dashboard.

agents-workflows-bot · 2025-12-31T14:44:56Z

⚠️ Action Required: Unable to determine source issue for PR #386. The PR title, branch name, or body must contain the issue number (e.g. #123, branch: issue-123, or the hidden marker ).

github-actions · 2025-12-31T14:45:15Z

Copilot

Pull request overview

This PR prevents CI timeout issues during Codex coverage verification by optimizing test execution strategies. The solution involves marking slow integration tests for selective exclusion and providing Codex with targeted coverage verification instructions to avoid running the full test suite unnecessarily.

Key Changes:

Added @slow markers to three integration tests that invoke external tools (ruff, isort, black, mypy)
Updated coverage verification instructions across all Codex prompt templates to use targeted coverage checks and skip slow tests
Introduced pytest options (-m "not slow", -x, targeted --cov) to reduce execution time

Reviewed changes

Copilot reviewed 8 out of 8 changed files in this pull request and generated no comments.

Show a summary per file

File	Description
tests/workflows/test_autofix_pipeline_diverse.py	Added `@slow` marker to integration test that handles diverse autofix scenarios
tests/workflows/test_autofix_pipeline.py	Added `@slow` marker to integration test for trivial ruff issues
tests/workflows/test_autofix_full_pipeline.py	Added `@slow` marker to integration test for full lint and typing pipeline
.github/templates/keepalive-instruction.md	Updated coverage verification instructions with targeted pytest commands and slow test exclusion
.github/codex/prompts/keepalive_next_task.md	Updated coverage verification instructions with targeted pytest commands and slow test exclusion
templates/consumer-repo/.github/templates/keepalive-instruction.md	Updated coverage verification instructions with targeted pytest commands and slow test exclusion (template)
templates/consumer-repo/.github/codex/prompts/keepalive_next_task.md	Updated coverage verification instructions with targeted pytest commands and slow test exclusion (template)
agents/codex-prompt.md	Updated coverage verification instructions with targeted pytest commands and slow test exclusion

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

github-actions · 2025-12-31T14:50:13Z

Automated Status Summary

Head SHA: 590246a
Latest Runs: ⏳ pending — Gate
Required contexts: Gate / gate, Health 45 Agents Guard / Enforce agents workflow protections
Required: core tests (3.11): ⏳ pending, core tests (3.12): ⏳ pending, docker smoke: ⏳ pending, gate: ⏳ pending

Workflow / Job	Result	Logs
(no jobs reported)	⏳ pending	—

Coverage Overview

Coverage history entries: 1

Coverage Trend

Metric	Value
Current	0.00%
Baseline	85.00%
Delta	-85.00%
Minimum	70.00%
Status	❌ Below minimum

Updated automatically; will refresh on subsequent CI/Docker completions.

Keepalive checklist

Scope

Test coverage is at 71.60% with 17 scripts below 95% coverage. Low coverage makes scripts risky to modify and harder to maintain.

Tasks

Acceptance criteria

Before marking ANY task complete, you MUST:
1. Run pytest tests/ --cov=scripts --cov-report=term-missing
2. Find the script in the output table
3. Confirm the Cover column shows ≥95%
4. Only then mark that specific task complete
Overall coverage ≥95%
Each script in scripts/ shows ≥95% in coverage output
All existing tests pass
New tests in tests/scripts/ directory

github-actions · 2025-12-31T14:50:40Z

🤖 Keepalive Loop Status

PR #386 | Agent: Codex | Iteration 0/5

Current State

Metric	Value
Iteration progress	[----------] 0/5
Action	wait (missing-agent-label)
Gate	success
Tasks	20/27 complete
Keepalive	❌ disabled
Autofix	❌ disabled

🔍 Failure Classification

Copilot AI review requested due to automatic review settings December 31, 2025 14:44

stranske temporarily deployed to agent-standard December 31, 2025 14:44 — with GitHub Actions Inactive

github-actions bot added the autofix Opt-in automated formatting & lint remediation label Dec 31, 2025

Merge branch 'main' into fix/codex-coverage-verification

0c83c43

Copilot started reviewing on behalf of stranske December 31, 2025 14:44 View session

Copilot AI reviewed Dec 31, 2025

View reviewed changes

stranske merged commit ffdf1d5 into main Dec 31, 2025
23 checks passed

stranske deleted the fix/codex-coverage-verification branch December 31, 2025 14:48

github-actions bot mentioned this pull request Dec 31, 2025

[Follow-up] Unmet criteria from PR #386 #388

Closed

9 tasks

stranske temporarily deployed to agent-standard December 31, 2025 14:50 — with GitHub Actions Inactive

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: improve Codex coverage verification to prevent timeouts#386

fix: improve Codex coverage verification to prevent timeouts#386
stranske merged 2 commits intomainfrom
fix/codex-coverage-verification

stranske commented Dec 31, 2025 •

edited by agents-workflows-bot bot

Loading

Uh oh!

chatgpt-codex-connector bot commented Dec 31, 2025

Uh oh!

agents-workflows-bot bot commented Dec 31, 2025

Uh oh!

github-actions bot commented Dec 31, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

github-actions bot commented Dec 31, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Dec 31, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

stranske commented Dec 31, 2025 • edited by agents-workflows-bot bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Automated Status Summary

Scope

Tasks

Acceptance criteria

Uh oh!

chatgpt-codex-connector bot commented Dec 31, 2025

Uh oh!

agents-workflows-bot bot commented Dec 31, 2025

Uh oh!

github-actions bot commented Dec 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

github-actions bot commented Dec 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Automated Status Summary

Coverage Overview

Coverage Trend

Keepalive checklist

Scope

Tasks

Acceptance criteria

Uh oh!

github-actions bot commented Dec 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🤖 Keepalive Loop Status

Current State

🔍 Failure Classification

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

stranske commented Dec 31, 2025 •

edited by agents-workflows-bot bot

Loading

github-actions bot commented Dec 31, 2025 •

edited

Loading

github-actions bot commented Dec 31, 2025 •

edited

Loading

github-actions bot commented Dec 31, 2025 •

edited

Loading