ci: add permission control for public ci tests by yongwww · Pull Request #2397 · flashinfer-ai/flashinfer

yongwww · 2026-01-22T00:00:48Z

📌 Description

Add team-based permission control for the PR test workflow to manage CI costs.

Add gate job to check authorization before running tests
PRs from ci-users team members run automatically
External PRs require run-ci label (via @flashinfer-bot run command from member of ci-users)
Add ci-bot-commands.yml for handling bot commands:
- @flashinfer-bot run - trigger CI
- @flashinfer-bot rerun - rerun all jobs
- @flashinfer-bot rerun failed - rerun failed and cancelled jobs
Add pr-label-cleanup.yml to auto-remove run-ci label when external PRs receive new commits

🔍 Related Issues

🚀 Pull Request Checklist

Thank you for contributing to FlashInfer! Before we review your pull request, please make sure the following items are complete.

✅ Pre-commit Checks

I have installed pre-commit by running pip install pre-commit (or used your preferred method).
I have installed the hooks with pre-commit install.
I have run the hooks manually with pre-commit run --all-files and fixed any reported issues.

If you are unsure about how to set up pre-commit, see the pre-commit documentation.

🧪 Tests

Tests have been added or updated as needed.
All tests are passing (unittest, etc.).

Co-Authored-By: Kevin Vincent kvincent@nvidia.com

Reviewer Notes

Summary by CodeRabbit

New Features
- Add PR comment bot commands to trigger, rerun (all or failed), or stop CI, with reaction feedback.
Chores
- Introduce CI authorization gating so CI only runs when authorized.
- Expand PR workflow triggers and add skip messaging when CI is gated.
Bug Fixes / Maintenance
- Automatically remove CI labels from external contributors and post explanatory guidance.

_{✏️ Tip: You can customize this high-level summary in your review settings.}

gemini-code-assist · 2026-01-22T00:00:57Z

Note

Gemini is unable to generate a summary for this pull request due to the file types involved not being currently supported.

coderabbitai · 2026-01-22T00:00:58Z

📝 Walkthrough

Walkthrough

Adds three GitHub Actions workflows: a bot-driven PR command processor triggered by issue comments that authorizes ci-users to control CI, a PR label cleanup workflow that removes run-ci for external authors, and a gated PR test workflow that checks authorization before running CI jobs.

Changes

Cohort / File(s)	Summary
CI Automation Bot Commands `.github/workflows/ci-bot-commands.yml`	New workflow handling `issue_comment` events mentioning the bot; verifies `ci-users` membership via token, parses commands (`run`, `rerun`, `rerun-failed`, `stop`), and conditionally applies `run-ci`, reruns workflows, cancels runs, or reacts when unauthorized; includes error handling and reactions.
PR Label Cleanup `.github/workflows/pr-label-cleanup.yml`	New workflow on `pull_request` synchronize that checks `author_association`, and removes the `run-ci` label and comments when the author is external; writes `is_external` output.
PR Test Workflow Enhancement `.github/workflows/pr-test.yml`	Added `gate` job exporting `authorized` (team membership / label checks), extended PR triggers, added `pull-requests: write` permission, wired gating into `setup`, `aot-build-import`, `gpu-tests-*` and `test-results-summary` via `needs` and `if`, and posts skip messaging when unauthorized.

Sequence Diagram(s)

sequenceDiagram
    participant User as PR Commenter
    participant GitHub as GitHub API
    participant Workflow as CI-Bot Workflow
    participant TeamAPI as GitHub Team API
    participant CI as CI System

    User->>GitHub: Post comment mentioning bot
    GitHub->>Workflow: Trigger issue_comment event
    Workflow->>Workflow: Verify PR context & bot mention
    Workflow->>TeamAPI: Check ci-users membership
    TeamAPI-->>Workflow: Member / Not member

    alt Authorized
        Workflow->>Workflow: Parse command (run / rerun / rerun-failed / stop)
        alt run
            Workflow->>GitHub: Add `run-ci` label
        else rerun
            Workflow->>CI: Rerun all workflows for PR commit
        else rerun-failed
            Workflow->>CI: Rerun failed workflows for PR commit
        else stop
            Workflow->>CI: Cancel in-progress/queued runs for PR commit
        end
        Workflow->>GitHub: Post thumbs-up reaction
    else Unauthorized
        Workflow->>GitHub: Post confused reaction
    end

sequenceDiagram
    participant GitHubEvents as GitHub Events
    participant Gate as Gate Job
    participant Setup as Setup Job
    participant CIJobs as CI Jobs
    participant Results as Test Results

    GitHubEvents->>Gate: PR opened/synchronize/reopened/labeled or workflow_dispatch
    Gate->>Gate: Check `run-ci` label and ci-users membership
    Gate-->>Setup: Export `authorized` flag

    alt Authorized
        Gate->>Setup: Allow setup to run
        Setup->>CIJobs: Trigger gated CI jobs (aot-build-import, gpu-tests-*)
        CIJobs->>Results: Produce test results
    else Unauthorized
        Gate->>GitHubEvents: Post skip message on PR
        Setup--xCIJobs: Skip (blocked by gate)
        Results->>GitHubEvents: Indicate CI skipped
    end

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~50 minutes

Possibly related PRs

feat: introduce GitHub Actions workflow for PR testing #2326: Modifies .github/workflows/pr-test.yml and relates to gate/authorization logic and job wiring.

Suggested reviewers

yzh119
bkryu
jimmyzho

Poem

🐰 I hop through commits, label in paw,
I listen for commands that open the maw,
"Run" and "Rerun" I check who may call,
Gate swings for trusted friends, otherwise stall,
Thumbs-up and carrots — CI hops to the ball.

🚥 Pre-merge checks | ✅ 3

✅ Passed checks (3 passed)

Check name	Status	Explanation
Title check	✅ Passed	The title clearly and concisely describes the main change: adding permission control for public CI tests, which aligns with the PR objectives.
Docstring Coverage	✅ Passed	No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.
Description check	✅ Passed	The PR description addresses the template requirements with a clear description, issue reference, and completed pre-commit checklist; however, the Tests section is incomplete with only one item checked.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing touches

📝 Generate docstrings

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 1

🤖 Fix all issues with AI agents

In @.github/workflows/ci-bot-commands.yml:
- Around line 64-77: The Parse command step currently interpolates the untrusted
comment body directly into the shell (variable BODY="${{
github.event.comment.body }}"), enabling command injection; instead, stop inline
YAML interpolation and load the comment body from a safe environment or file
(e.g., set an env var or read/parsing from GITHUB_EVENT_PATH) and then perform
the grep checks against that safe variable; update the step named "Parse
command" (id: parse) to populate BODY from an environment variable or
JSON-parsed file before running the echo/grep comparisons so no untrusted
content is directly injected into the shell.

🧹 Nitpick comments (2)

.github/workflows/pr-test.yml (1)

117-120: Duplicate comment check may be fragile.

The grep -c "CI is paused" check could produce false positives if someone quotes this phrase in another comment. Consider using a more unique marker (e.g., an HTML comment like ).
Suggested improvement
-            --jq '.comments[].body' | grep -c "CI is paused" || true)
+            --jq '.comments[].body' | grep -c "" || true)
...
-              --body "**CI is paused for external contributors.**
+              --body "
+**CI is paused for external contributors.**

.github/workflows/ci-bot-commands.yml (1)

148-160: RERUN_COUNT won't reflect actual count due to subshell.

The while read loop runs in a subshell (due to the pipe), so RERUN_COUNT modifications are lost. This is cosmetic since the counter isn't used afterward, but it's worth noting for future debugging.

Fix using process substitution (if count is needed later)

          # Rerun failed workflow runs for this commit
          RERUN_COUNT=0
-         gh run list \
+         while read -r run_id; do
+           if [ -n "$run_id" ]; then
+             echo "Rerunning failed jobs in workflow $run_id..."
+             gh run rerun "$run_id" --repo ${{ github.repository }} --failed || true
+             RERUN_COUNT=$((RERUN_COUNT + 1))
+           fi
+         done < <(gh run list \
            --repo ${{ github.repository }} \
            --commit "$PR_SHA" \
            --status failure \
-           --json databaseId -q '.[].databaseId' | \
-         while read -r run_id; do
-           if [ -n "$run_id" ]; then
-             echo "Rerunning failed jobs in workflow $run_id..."
-             gh run rerun "$run_id" --repo ${{ github.repository }} --failed || true
-             RERUN_COUNT=$((RERUN_COUNT + 1))
-           fi
-         done
+           --json databaseId -q '.[].databaseId')

.github/workflows/ci-bot-commands.yml

yzh119 · 2026-01-22T07:03:13Z

@yongwww can we test the functionality of the CI before it's merged?

coderabbitai

Actionable comments posted: 1

🤖 Fix all issues with AI agents

In @.github/workflows/ci-bot-commands.yml:
- Around line 113-123: The current gh run list pipeline filters out only
"in_progress" runs and can pass non-completed statuses (like "queued") to gh run
rerun; update the jq filter used with gh run list (the command around gh run
list --commit "$PR_SHA" --json databaseId,status -q '.[] | select(.status !=
"in_progress") | .databaseId') to explicitly select only completed runs (e.g.,
change to select(.status == "completed") or an equivalent check for
terminal/completed statuses) so gh run rerun is invoked only for completed runs;
keep the surrounding while read loop and gh run rerun invocation unchanged
except for this jq filter adjustment.

🧹 Nitpick comments (1)

.github/workflows/ci-bot-commands.yml (1)

27-62: Use the team membership endpoint to avoid listing all members.
Fetching all members can be slow and API-heavy for large teams. The membership endpoint is cheaper and avoids pagination.

♻️ Proposed refactor

-          # List team members and check if commenter is in the list
-          MEMBERS=$(gh api \
-            -H "Accept: application/vnd.github+json" \
-            -H "X-GitHub-Api-Version: 2022-11-28" \
-            --paginate \
-            "/orgs/${ORG}/teams/${TEAM}/members" \
-            --jq '.[].login' 2>&1) || {
-            echo "::error::Failed to get team members: $MEMBERS"
-            echo "authorized=false" >> "$GITHUB_OUTPUT"
-            exit 0
-          }
-
-          if echo "$MEMBERS" | grep -qx "$ACTOR"; then
+          # Check membership directly
+          if gh api \
+            -H "Accept: application/vnd.github+json" \
+            -H "X-GitHub-Api-Version: 2022-11-28" \
+            "/orgs/${ORG}/teams/${TEAM}/memberships/${ACTOR}" >/dev/null 2>&1; then
             echo "$ACTOR is a member of $TEAM"
             echo "authorized=true" >> "$GITHUB_OUTPUT"
           else
-            echo "$ACTOR is not a member of $TEAM"
+            echo "$ACTOR is not a member of $TEAM or membership check failed"
             echo "authorized=false" >> "$GITHUB_OUTPUT"
           fi

.github/workflows/ci-bot-commands.yml

yongwww · 2026-01-24T00:47:28Z

@yongwww can we test the functionality of the CI before it's merged?

The issue_comment triggered workflow reads from the default branch (main), not the PR branch, so I can't test the bot commands directly on this PR until it's merged. (Reference: This event will only trigger a workflow run if the workflow file exists on the default branch.)
However, I've tested the functionality in my fork (merged in main, and test in a pr): Test PR: yongwww-org#6.

yongwww added the run-ci label Jan 22, 2026

yongwww marked this pull request as ready for review January 22, 2026 04:17

yongwww mentioned this pull request Jan 22, 2026

Expand Public CI GPU Architecture Coverage via GitHub Actions #2355

Open

coderabbitai bot reviewed Jan 22, 2026

View reviewed changes

.github/workflows/ci-bot-commands.yml Show resolved Hide resolved

coderabbitai bot reviewed Jan 22, 2026

View reviewed changes

.github/workflows/ci-bot-commands.yml Show resolved Hide resolved

yongwww added 7 commits January 23, 2026 09:51

ci: add permission control for PR tests

4cafbff

upd

3bca4f8

upd

a62a813

use flashinfer-bot

2dc220b

resolve comments

d880f1f

improve bot commands with rerun cancel-first and add stop command

0a5c673

remove redundant comments

5c1f56b

yongwww marked this pull request as draft January 23, 2026 18:36

yongwww added 2 commits January 23, 2026 16:04

use rerun failed instead of rerun-failed

0a5dc48

rerun failed to work for cancelled jobs

a4e28f7

yongwww force-pushed the ci_permission branch from b9b6a24 to a4e28f7 Compare January 24, 2026 00:37

yongwww marked this pull request as ready for review January 24, 2026 00:38

yzh119 approved these changes Jan 25, 2026

View reviewed changes

yzh119 merged commit c8d76d3 into flashinfer-ai:main Jan 25, 2026
65 of 74 checks passed

yongwww deleted the ci_permission branch January 25, 2026 12:53

This was referenced Jan 26, 2026

[CI] Add on-demand rerun for spot-terminated jobs #2403

Merged

[CI]: Enable Blackwell & Hopper in public CI testing #2413

Draft

This was referenced Feb 6, 2026

ci: refactor PR tests to hide failed spot jobs from PR status #2500

Merged

Revert "ci: refactor PR tests to hide failed spot jobs from PR status… #2524

Merged

coderabbitai bot mentioned this pull request Mar 11, 2026

fix: block PR merge when CI is skipped due to pending authorization #2761

Merged

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ci: add permission control for public ci tests#2397

ci: add permission control for public ci tests#2397
yzh119 merged 9 commits intoflashinfer-ai:mainfrom
yongwww:ci_permission

yongwww commented Jan 22, 2026 •

edited

Loading

Uh oh!

gemini-code-assist bot commented Jan 22, 2026

Uh oh!

coderabbitai bot commented Jan 22, 2026 •

edited

Loading

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related PRs

Suggested reviewers

Poem

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

yzh119 commented Jan 22, 2026

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

yongwww commented Jan 24, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

yongwww commented Jan 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

📌 Description

🔍 Related Issues

🚀 Pull Request Checklist

✅ Pre-commit Checks

🧪 Tests

Reviewer Notes

Summary by CodeRabbit

Uh oh!

gemini-code-assist bot commented Jan 22, 2026

Uh oh!

coderabbitai bot commented Jan 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related PRs

Suggested reviewers

Poem

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

yzh119 commented Jan 22, 2026

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

yongwww commented Jan 24, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

yongwww commented Jan 22, 2026 •

edited

Loading

coderabbitai bot commented Jan 22, 2026 •

edited

Loading