fix: bypass rate-limit-only Gate cancellations - proceed with work by stranske · Pull Request #702 · stranske/Workflows

stranske · 2026-01-09T16:08:11Z

Source: Issue #696

Automated Status Summary

Scope

Part of Phase 3 workflow rollout validation per langchain-post-code-rollout.md

Context for Agent

Design Decisions & Constraints

Create a decomposable issue with labels/milestone (The agent cannot modify repository settings, which may include labels and milestones. | Provide a predefined issue template with labels/milestones that can be used.)
DCPT03 preserves metadata on children (DCPT03 | List specific metadata that should be preserved.)
Bullets used for non-tasks in the 'Acceptance Criteria' section should be formatted as checkboxes.
The issue is generally well-structured but requires more clarity in tasks and acceptance criteria. Additionally, some tasks may be blocked due to agent limitations.

Related Issues/PRs

#691

References

https://github.com/stranske/Workflows/compare/main...codex/issue-691?expand=1

Blockers & Dependencies

The issue is generally well-structured but requires more clarity in tasks and acceptance criteria. Additionally, some tasks may be blocked due to agent limitations.

Context for Agent

Design Decisions & Constraints

Create a decomposable issue with labels/milestone (The agent cannot modify repository settings, which may include labels and milestones. | Provide a predefined issue template with labels/milestones that can be used.)
DCPT03 preserves metadata on children (DCPT03 | List specific metadata that should be preserved.)
Bullets used for non-tasks in the 'Acceptance Criteria' section should be formatted as checkboxes.
The issue is generally well-structured but requires more clarity in tasks and acceptance criteria. Additionally, some tasks may be blocked due to agent limitations.
| Keepalive | ✅ enabled |

Related Issues/PRs

References

Blockers & Dependencies

The issue is generally well-structured but requires more clarity in tasks and acceptance criteria. Additionally, some tasks may be blocked due to agent limitations.

Tasks

Create a large issue with 8+ subtasks (e.g., 'Build user dashboard with auth, profile, settings, notifications, themes, export, import, admin') in the test repo.
Create an atomic issue (e.g., 'Fix null check in parser') in the test repo.
Create a decomposable issue with labels/milestone in the test repo.
Create a large issue with 8+ subtasks (e.g., 'Build user dashboard with auth, profile, settings, notifications, themes, export, import, admin') in the test repo.
Create an atomic issue (e.g., 'Fix null check in parser') in the test repo.
Create a decomposable issue with labels/milestone in the test repo.

Acceptance criteria

Head SHA: 9c87503
Latest Runs: ✅ success — Gate
Required: gate: ✅ success

Workflow / Job	Result	Logs
Agents PR meta manager	❔ in progress	View run
CI Autofix Loop	✅ success	View run
Gate	✅ success	View run
Health 40 Sweep	✅ success	View run
Health 44 Gate Branch Protection	✅ success	View run
Health 45 Agents Guard	✅ success	View run
Health 50 Security Scan	✅ success	View run
Keepalive E2E	❔ startup failure	View run
Maint 52 Validate Workflows	✅ success	View run
PR 11 - Minimal invariant CI	✅ success	View run
Selftest CI	✅ success	View run
Validate Sync Manifest	✅ success	View run

…ately Rate limits are infrastructure noise, not code quality issues. When Gate is cancelled only due to API rate limits (not actual test failures), the keepalive loop should proceed with work immediately rather than deferring or waiting. This change: - Detects when Gate cancellation was due to rate limits only - Immediately continues with 'run' action instead of 'defer' - Sets reason as 'bypass-rate-limit-gate' for tracking - Preserves the defer fallback only for non-rate-limit cancellations This prevents PRs from getting stuck in 'defer' state waiting for scheduled retry workflows when the underlying issue is just temporary rate limiting from GitHub APIs. Affected PRs (examples): - #696, #698, #699 were stuck with 'gate-cancelled-rate-limit-transient'

github-actions · 2026-01-09T16:09:40Z

Automated Status Summary

Head SHA: bc44e0e
Latest Runs: ⏳ pending — Gate
Required contexts: Gate / gate, Health 45 Agents Guard / Enforce agents workflow protections
Required: core tests (3.11): ⏳ pending, core tests (3.12): ⏳ pending, docker smoke: ⏳ pending, gate: ⏳ pending

Workflow / Job	Result	Logs
(no jobs reported)	⏳ pending	—

Coverage Overview

Coverage history entries: 1

Coverage Trend

Metric	Value
Current	92.21%
Baseline	85.00%
Delta	+7.21%
Minimum	70.00%
Status	✅ Pass

Top Coverage Hotspots (lowest coverage)

File	Coverage	Missing
`scripts/workflow_health_check.py`	62.6%	28
`scripts/classify_test_failures.py`	62.9%	37
`scripts/ledger_validate.py`	65.3%	63
`scripts/mypy_return_autofix.py`	82.6%	11
`scripts/ledger_migrate_base.py`	85.5%	13
`scripts/fix_cosmetic_aggregate.py`	92.3%	1
`scripts/coverage_history_append.py`	92.8%	2
`scripts/workflow_validator.py`	93.3%	4
`scripts/update_autofix_expectations.py`	93.9%	1
`scripts/pr_metrics_tracker.py`	95.7%	3
`scripts/generate_residual_trend.py`	96.6%	1
`scripts/build_autofix_pr_comment.py`	97.0%	2
`scripts/aggregate_agent_metrics.py`	97.2%	0
`scripts/fix_numpy_asserts.py`	98.1%	0
`scripts/sync_test_dependencies.py`	98.3%	1

Updated automatically; will refresh on subsequent CI/Docker completions.

Keepalive checklist

Scope

Part of Phase 3 workflow rollout validation per langchain-post-code-rollout.md

Context for Agent

Design Decisions & Constraints

Create a decomposable issue with labels/milestone (The agent cannot modify repository settings, which may include labels and milestones. | Provide a predefined issue template with labels/milestones that can be used.)
DCPT03 preserves metadata on children (DCPT03 | List specific metadata that should be preserved.)
Bullets used for non-tasks in the 'Acceptance Criteria' section should be formatted as checkboxes.
The issue is generally well-structured but requires more clarity in tasks and acceptance criteria. Additionally, some tasks may be blocked due to agent limitations.

Related Issues/PRs

#691

References

https://github.com/stranske/Workflows/compare/main...codex/issue-691?expand=1

Blockers & Dependencies

The issue is generally well-structured but requires more clarity in tasks and acceptance criteria. Additionally, some tasks may be blocked due to agent limitations.

Context for Agent

Design Decisions & Constraints

Create a decomposable issue with labels/milestone (The agent cannot modify repository settings, which may include labels and milestones. | Provide a predefined issue template with labels/milestones that can be used.)
DCPT03 preserves metadata on children (DCPT03 | List specific metadata that should be preserved.)
Bullets used for non-tasks in the 'Acceptance Criteria' section should be formatted as checkboxes.
The issue is generally well-structured but requires more clarity in tasks and acceptance criteria. Additionally, some tasks may be blocked due to agent limitations.
| Keepalive | ✅ enabled |

Related Issues/PRs

References

Blockers & Dependencies

The issue is generally well-structured but requires more clarity in tasks and acceptance criteria. Additionally, some tasks may be blocked due to agent limitations.

Tasks

Create a large issue with 8+ subtasks (e.g., 'Build user dashboard with auth, profile, settings, notifications, themes, export, import, admin') in the test repo.
Create an atomic issue (e.g., 'Fix null check in parser') in the test repo.
Create a decomposable issue with labels/milestone in the test repo.
Create a large issue with 8+ subtasks (e.g., 'Build user dashboard with auth, profile, settings, notifications, themes, export, import, admin') in the test repo.
Create an atomic issue (e.g., 'Fix null check in parser') in the test repo.
Create a decomposable issue with labels/milestone in the test repo.

Acceptance criteria

github-actions · 2026-01-09T16:10:08Z

🤖 Keepalive Loop Status

PR #702 | Agent: Codex | Iteration 0/5

Current State

Metric	Value
Iteration progress	[----------] 0/5
Action	wait (missing-agent-label)
Disposition	skipped (transient)
Gate	success
Tasks	17/17 complete
Keepalive	❌ disabled
Autofix	❌ disabled

🔍 Failure Classification

Copilot

Pull request overview

This PR modifies the keepalive loop to automatically bypass Gate workflow cancellations that are caused solely by GitHub API rate limits, allowing work to continue immediately instead of deferring to a scheduled retry. The change treats rate limits as temporary infrastructure constraints rather than code quality issues.

Adds automatic bypass logic for rate-limit-only Gate cancellations
Introduces new action reason bypass-rate-limit-gate for tracking
Updates decision flow to prioritize rate limit bypass over the existing forceRetry mechanism

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-01-09T16:10:58Z

.github/scripts/keepalive_loop.js

+      if (gateRateLimit && tasksRemaining) {
+        action = 'run';
+        reason = 'bypass-rate-limit-gate';
+        if (core) core.info('Gate cancelled due to rate limits only - proceeding with work (rate limits are not code quality issues)');
+      } else if (forceRetry && tasksRemaining) {
        action = 'run';
        reason = 'force-retry-cancelled';
        if (core) core.info(`Force retry enabled: bypassing cancelled gate (rate_limit=${gateRateLimit})`);


The new rate limit bypass logic takes precedence over the forceRetry flag. When both gateRateLimit and forceRetry are true, this code will return action='run' with reason='bypass-rate-limit-gate' instead of reason='force-retry-cancelled'. This changes the behavior for existing test case at line 493-517 in keepalive-loop.test.js which expects reason='force-retry-cancelled' when forceRetry is enabled with a rate-limited cancellation.

Consider checking forceRetry first (swap the order of these two conditions) to preserve the existing forceRetry behavior and maintain backward compatibility with the existing test expectations.

Suggested change

if (gateRateLimit && tasksRemaining) {

action = 'run';

reason = 'bypass-rate-limit-gate';

if (core) core.info('Gate cancelled due to rate limits only - proceeding with work (rate limits are not code quality issues)');

} else if (forceRetry && tasksRemaining) {

action = 'run';

reason = 'force-retry-cancelled';

if (core) core.info(`Force retry enabled: bypassing cancelled gate (rate_limit=${gateRateLimit})`);

if (forceRetry && tasksRemaining) {

action = 'run';

reason = 'force-retry-cancelled';

if (core) core.info(`Force retry enabled: bypassing cancelled gate (rate_limit=${gateRateLimit})`);

} else if (gateRateLimit && tasksRemaining) {

action = 'run';

reason = 'bypass-rate-limit-gate';

if (core) core.info('Gate cancelled due to rate limits only - proceeding with work (rate limits are not code quality issues)');

Copilot · 2026-01-09T16:10:58Z

.github/scripts/keepalive_loop.js

+      if (gateRateLimit && tasksRemaining) {
+        action = 'run';
+        reason = 'bypass-rate-limit-gate';
+        if (core) core.info('Gate cancelled due to rate limits only - proceeding with work (rate limits are not code quality issues)');


This change will cause existing tests to fail. The tests at lines 423-469 in keepalive-loop.test.js expect action='defer' and reason='gate-cancelled-rate-limit' when rate limit cancellations are detected. With this new code, when tasksRemaining is true, the action will be 'run' and reason will be 'bypass-rate-limit-gate' instead. The tests need to be updated to reflect this new behavior, or new tests should be added to verify the bypass logic works as intended.

.github/scripts/keepalive_loop.js

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

…704) The reusable CI workflow had a bug where it assumed dev tools (black, ruff, mypy, pytest, etc.) were included in consumer repos' lock files. This caused CI failures with 'black: command not found' errors. Root cause: When has_lock_file=true, the workflow only recorded tools as 'from lock' for reporting but didn't actually install them. Consumer repos' lock files only contain runtime dependencies, not dev tools. This fix: - Always installs dev tools (black, ruff, mypy, pytest, etc.) - Removes the has_lock_file conditional for tool installation - Lock files still work for runtime dependencies - Affects all 4 CI jobs: lint-format, lint-ruff, typecheck-mypy, tests Impact: Fixes CI failures in Travel-Plan-Permission, Template, trip-planner, Collab-Admin and all other consumer repos with lock files.

Tests now expect action='run' with reason='bypass-rate-limit-gate' instead of action='defer' with reason='gate-cancelled-rate-limit'. Rate limits are infrastructure noise, not code quality issues. Work should proceed automatically when Gate cancellation is due to rate limits. Rate limit bypass takes precedence over forceRetry since: 1. Rate limit bypass is automatic infrastructure handling 2. forceRetry is still honored for non-rate-limit cases (cancelled, failed)

Aligns with JS test updates - rate limits are infrastructure noise that should be bypassed immediately rather than causing deferrals.

github-actions · 2026-01-09T17:24:05Z

Copilot AI review requested due to automatic review settings January 9, 2026 16:08

stranske temporarily deployed to agent-standard January 9, 2026 16:08 — with GitHub Actions Inactive

Copilot started reviewing on behalf of stranske January 9, 2026 16:08 View session

Copilot AI reviewed Jan 9, 2026

View reviewed changes

Update .github/scripts/keepalive_loop.js

86b6e45

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

stranske temporarily deployed to agent-standard January 9, 2026 16:28 — with GitHub Actions Inactive

Merge branch 'main' into fix/bypass-rate-limit-gate

79bea4a

stranske temporarily deployed to agent-standard January 9, 2026 16:30 — with GitHub Actions Inactive

Merge branch 'main' into fix/bypass-rate-limit-gate

a0d9761

stranske temporarily deployed to agent-standard January 9, 2026 16:30 — with GitHub Actions Inactive

stranske added 2 commits January 9, 2026 17:19

stranske temporarily deployed to agent-high-privilege January 9, 2026 17:19 — with GitHub Actions Inactive

fix: Update Python rate limit test to expect bypass behavior

9c87503

Aligns with JS test updates - rate limits are infrastructure noise that should be bypassed immediately rather than causing deferrals.

stranske temporarily deployed to agent-high-privilege January 9, 2026 17:23 — with GitHub Actions Inactive

github-actions bot added the autofix Opt-in automated formatting & lint remediation label Jan 9, 2026

stranske merged commit 374ce2f into main Jan 9, 2026
128 checks passed

stranske deleted the fix/bypass-rate-limit-gate branch January 9, 2026 17:25

stranske mentioned this pull request Jan 9, 2026

fix: prevent dual-agent conflict for codex in consumer template #701

Closed

agents-workflows-bot bot mentioned this pull request Jan 9, 2026

fix: prevent dual-agent conflict for codex by skipping post_agent_comment #705

Merged

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: bypass rate-limit-only Gate cancellations - proceed with work#702

fix: bypass rate-limit-only Gate cancellations - proceed with work#702
stranske merged 7 commits intomainfrom
fix/bypass-rate-limit-gate

stranske commented Jan 9, 2026 •

edited by agents-workflows-bot bot

Loading

Uh oh!

github-actions bot commented Jan 9, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Jan 9, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Jan 9, 2026

Uh oh!

Copilot AI Jan 9, 2026

Uh oh!

Uh oh!

github-actions bot commented Jan 9, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

stranske commented Jan 9, 2026 • edited by agents-workflows-bot bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Automated Status Summary

Scope

Context for Agent

Design Decisions & Constraints

Related Issues/PRs

References

Blockers & Dependencies

Context for Agent

Design Decisions & Constraints

Related Issues/PRs

References

Blockers & Dependencies

Tasks

Acceptance criteria

Uh oh!

github-actions bot commented Jan 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Automated Status Summary

Coverage Overview

Coverage Trend

Top Coverage Hotspots (lowest coverage)

Keepalive checklist

Scope

Context for Agent

Design Decisions & Constraints

Related Issues/PRs

References

Blockers & Dependencies

Context for Agent

Design Decisions & Constraints

Related Issues/PRs

References

Blockers & Dependencies

Tasks

Acceptance criteria

Uh oh!

github-actions bot commented Jan 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🤖 Keepalive Loop Status

Current State

🔍 Failure Classification

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Copilot AI Jan 9, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Jan 9, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

github-actions bot commented Jan 9, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

stranske commented Jan 9, 2026 •

edited by agents-workflows-bot bot

Loading

github-actions bot commented Jan 9, 2026 •

edited

Loading

github-actions bot commented Jan 9, 2026 •

edited

Loading