Add/update multi node/multi GPU test scripts#2410
Conversation
📝 Walkthrough

Centralizes test orchestration: removes a monolithic test runner, adds a reusable test utility library, and introduces task-specific runners for unit, multi-GPU, and multi-node kernel tests. The new scripts discover, sample, dry-run, and execute tests while installing and verifying precompiled kernels.
Sequence Diagram(s)

```mermaid
sequenceDiagram
    participant Runner as Task Script
    participant Utils as test_utils.sh
    participant FS as Artifact / Filesystem
    participant Pytest as pytest

    Runner->>Utils: source, parse_args(), print banner
    Runner->>Utils: install_and_verify() / install_precompiled_kernels()
    Utils->>FS: locate precompiled artifacts (cubin / jit-cache)
    FS-->>Utils: artifact present / missing
    Utils->>Runner: install results (ok / error)
    Runner->>Utils: collect_tests() / find_test_files()
    Utils->>Pytest: run tests (sampled or full), produce JUnit XML
    Pytest-->>Utils: test outcomes
    Utils->>Runner: print_execution_summary(), exit code
```
Estimated code review effort: 🎯 4 (Complex) | ⏱️ ~45 minutes
🚥 Pre-merge checks: ✅ 3 passed
Summary of Changes

Hello @dierksen, I'm Gemini Code Assist! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed. This pull request significantly improves the organization and capabilities of the project's testing infrastructure. It centralizes common test execution logic into a reusable script, renames the main unit test runner for clarity, and introduces dedicated scripts for multi-GPU and multi-node communication kernel tests. These changes aim to make test management more efficient and extensible, while also providing flexible execution options like dry runs and sanity checks.
Code Review
This pull request refactors the testing scripts by introducing a common functions file, renaming the main unit test script, and adding new scripts for multi-GPU/multi-node testing. The changes significantly improve the structure and maintainability of the test suite. My review includes suggestions to improve the robustness of shell scripting practices, such as handling filenames with spaces and simplifying conditional checks. I also found a file that seems to be a leftover from a rename operation and should be removed.
I was unable to create individual review comments, so my feedback is listed below.
scripts/task_test_blackwell_kernels.sh (1)
This file appears to be an artifact of a file rename operation, as mentioned in the PR description. It contains only the name of another script (`task_run_unit_tests.sh`). This is likely a mistake, and the file should be removed to avoid confusion and potential issues in the future.
scripts/common_test_functions.sh (187)
The condition that checks whether `SAMPLED_NODE_IDS` is empty is overly complex, and the `wc -l` part is buggy: when `$SAMPLED_NODE_IDS` is empty, `echo "$SAMPLED_NODE_IDS"` still prints a newline, so `wc -l` returns 1, not 0. The `[ -z "$SAMPLED_NODE_IDS" ]` part already handles the empty case correctly, making the second half of the `||` redundant and incorrect. A simpler check is sufficient and more robust:

```bash
if [ -z "$SAMPLED_NODE_IDS" ]; then
```
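The pitfall described above is easy to reproduce directly in a shell; the variable name below is taken from the script under review, and the rest is a throwaway demo:

```bash
# An empty variable piped through echo still emits one newline,
# so `wc -l` reports 1 line rather than 0.
SAMPLED_NODE_IDS=""
count=$(echo "$SAMPLED_NODE_IDS" | wc -l | tr -d ' ')
echo "wc -l sees: $count"          # prints 1, even though the variable is empty

# The plain -z test is both simpler and correct.
if [ -z "$SAMPLED_NODE_IDS" ]; then
    echo "empty detected"
fi
```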
scripts/common_test_functions.sh (241)
Using `[ ... ]` with `==` for string comparison is a bashism and can behave unexpectedly with word splitting. It is safer, and generally recommended, to use the `[[ ... ]]` keyword for tests in bash scripts, as it provides more predictable behavior and additional features:

```bash
if [[ "$SANITY_TEST" == "true" ]]; then
```
scripts/task_run_unit_tests.sh (31-35)
The pattern `ALL_TEST_FILES=$(find ...)` with `for test_file in $ALL_TEST_FILES` is not robust for filenames containing spaces. Reading `find`'s output into an array with `mapfile` and iterating over that array is a safer approach in bash:

```bash
mapfile -t ALL_TEST_FILES < <(find tests/ -name "test_*.py" -type f | sort)

# Filter out excluded files based on directory exclusions
TEST_FILES=""
for test_file in "${ALL_TEST_FILES[@]}"; do
```
Actionable comments posted: 3
🤖 Fix all issues with AI agents
In `@scripts/common_test_functions.sh`:
- Around lines 88-96: The script uses CUDA_VERSION to derive CUDA_STREAM but does not validate it. Update the block that assigns CUDA_STREAM (referencing CUDA_VERSION and JIT_ARCH_EFFECTIVE) to first check whether CUDA_VERSION is set (e.g. `test -z "${CUDA_VERSION}"`) and either assign a clear default or exit with an error message. Then perform the existing pattern match for values starting with `cu*` and explicit matches such as `12.9.0` to set CUDA_STREAM, so the empty/unset case is handled explicitly instead of falling back implicitly to "cu130".
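One possible shape for that explicit handling, as a sketch only: the `12.9.0` → `cu129` and `13.0.0` → `cu130` mapping is an assumption based on the versions mentioned in this thread, and the demo default on the first line exists purely so the snippet runs standalone:

```bash
# Hypothetical sketch: validate CUDA_VERSION before deriving CUDA_STREAM.
CUDA_VERSION="${CUDA_VERSION:-12.9.0}"   # demo value for illustration only

if [ -z "${CUDA_VERSION}" ]; then
    echo "ERROR: CUDA_VERSION is not set; cannot derive CUDA_STREAM" >&2
    exit 1
fi

case "$CUDA_VERSION" in
    cu*)    CUDA_STREAM="$CUDA_VERSION" ;;   # already a stream name
    12.9.0) CUDA_STREAM="cu129" ;;
    13.0.0) CUDA_STREAM="cu130" ;;
    *)      echo "ERROR: unrecognized CUDA_VERSION '$CUDA_VERSION'" >&2
            exit 1 ;;
esac
echo "CUDA_STREAM=$CUDA_STREAM"
```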
- Around lines 130-142: The script prints error messages when wheels are missing but continues execution. Update the two failure branches that check DIST_CUBIN_DIR and DIST_JIT_CACHE_DIR (the pip install checks) to stop the script with a non-zero exit (e.g. `exit 1`) after printing the error to stderr, or to return a non-zero status if the checks are inside a function, so downstream tests do not run without the expected artifacts.
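A minimal sketch of that fail-fast pattern: `require_dir` is a hypothetical helper, and the directory names mirror the variables named in the prompt. It returns non-zero so callers in a sourced library can decide between `|| exit 1` and propagating the status:

```bash
# Hypothetical helper: abort immediately when an expected artifact
# directory is missing, instead of printing an error and continuing.
require_dir() {
    local dir="$1" label="$2"
    if [ ! -d "$dir" ]; then
        echo "ERROR: $label not found at '$dir'" >&2
        return 1
    fi
}

# Demo: an existing directory passes; a missing one fails.
DIST_CUBIN_DIR=$(mktemp -d)
require_dir "$DIST_CUBIN_DIR" "cubin wheel directory" || exit 1
require_dir "/nonexistent/jit-cache" "jit-cache wheel directory" 2>/dev/null \
    || echo "would exit 1 here before running any tests"
rm -rf "$DIST_CUBIN_DIR"
```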
In `@scripts/task_run_unit_tests.sh`:
- Around lines 14-27: In find_test_files(), the parsing of NORECURSEDIRS incorrectly converts commas to spaces and grabs only a single line with grep. Update the extraction to preserve pytest's space-separated values (drop the `tr ',' ' '` conversion) and to capture multiline continuations of the norecursedirs setting (read the line starting with `norecursedirs` plus any subsequent indented continuation lines, collapsing runs of whitespace into single spaces) before assigning EXCLUDED_DIRS and logging, so EXCLUDED_DIRS reflects the actual space-separated directories.
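One possible shape for the multiline extraction, sketched against a hypothetical `pytest.ini`; the awk program and the file contents here are illustrative, not the project's actual configuration:

```bash
# Create a sample ini file with a continuation line for the demo.
tmpdir=$(mktemp -d)
cat > "$tmpdir/pytest.ini" <<'EOF'
[pytest]
norecursedirs = build dist
    .git
addopts = -q
EOF

# Read the norecursedirs value plus indented continuation lines,
# collapsing runs of whitespace into single spaces.
EXCLUDED_DIRS=$(awk '
    /^norecursedirs[ \t]*=/ { sub(/^norecursedirs[ \t]*=[ \t]*/, ""); val = $0; found = 1; next }
    found && /^[ \t]/       { sub(/^[ \t]+/, " "); val = val $0; next }
    found                   { exit }
    END { if (found) { gsub(/[ \t]+/, " ", val); print val } }
' "$tmpdir/pytest.ini")

echo "Excluded dirs: $EXCLUDED_DIRS"   # Excluded dirs: build dist .git
rm -rf "$tmpdir"
```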
🧹 Nitpick comments (5)
scripts/common_test_functions.sh (3)
181-189: Awk variables should be passed explicitly to avoid shell interpolation issues.

The awk command uses shell variables directly in the pattern. While this works in most cases, it is more robust to pass them as awk variables, avoiding potential issues with special characters or unexpected expansion.

Suggested improvement:

```diff
- SAMPLED_NODE_IDS=$(echo "$all_node_ids" | awk "NR % $SAMPLE_RATE == $SAMPLE_OFFSET")
+ SAMPLED_NODE_IDS=$(echo "$all_node_ids" | awk -v rate="$SAMPLE_RATE" -v offset="$SAMPLE_OFFSET" 'NR % rate == offset')
```
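The suggested form is easy to exercise with a stand-in list of node IDs (the data below is made up for the demo):

```bash
# Pass rate/offset as awk variables rather than interpolating
# shell variables into the awk program text.
all_node_ids=$(printf 'test_a\ntest_b\ntest_c\ntest_d\ntest_e')
SAMPLE_RATE=2
SAMPLE_OFFSET=1
SAMPLED_NODE_IDS=$(echo "$all_node_ids" | \
    awk -v rate="$SAMPLE_RATE" -v offset="$SAMPLE_OFFSET" 'NR % rate == offset')
echo "$SAMPLED_NODE_IDS"   # test_a, test_c, test_e (lines 1, 3, 5)
```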
241: Use POSIX-compliant `=` instead of `==` for string comparison in `[ ]`.

The script uses `[ "$SANITY_TEST" == "true" ]`, which is a bashism. While this works in bash, the POSIX-compliant form uses a single `=`. This appears in multiple places (lines 241, 268, 370, 413, 434). Since the shebang is `#!/bin/bash` this will work, but for consistency with best practices:

Suggested fix (multiple locations):

```diff
- if [ "$SANITY_TEST" == "true" ]; then
+ if [ "$SANITY_TEST" = "true" ]; then
```
405-426: Word splitting on `$test_files` may break with paths containing spaces.

The `for test_file in $test_files` pattern relies on word splitting, which will break if any test file path contains spaces. While this is unlikely for test files, using an array would be more robust. For now this is acceptable given the controlled test file paths, but it is worth noting for future maintainability.
scripts/task_test_multi_gpu_comm_kernels.sh (1)

19-22: Consider adding a TODO comment or tracking issue for disabled tests.

The commented-out test files indicate tests that need fixing. Consider adding a more explicit TODO with a reference to an issue tracker, or documenting what specific fixes are needed.

Suggested improvement:

```diff
 # Define the specific test files for multi-GPU comm tests (single-node)
-# TEST_FILES="tests/comm/test_allreduce_unified_api.py tests/comm/test_allreduce_negative.py tests/comm/test_trtllm_allreduce_fusion.py"
-# Add others back once they are fixed
+# TODO(dierksen): Re-enable once fixed (see issue `#XXXX`):
+# - tests/comm/test_allreduce_negative.py
+# - tests/comm/test_trtllm_allreduce_fusion.py
 TEST_FILES="tests/comm/test_allreduce_unified_api.py"
```

scripts/task_test_multi_node_comm_kernels.sh (1)
10-15: Remove dead commented-out code.

These commented-out lines for cache cleaning are now redundant, since `clean_python_cache()` from the common library is called in `main()`. Leaving this code adds noise and could cause confusion about whether cache cleaning is happening.

Suggested fix:

```diff
-# Clean Python bytecode cache to avoid stale imports (e.g., after module refactoring)
-# echo "Cleaning Python bytecode cache..."
-# find . -type d -name __pycache__ -exec rm -rf {} + 2>/dev/null || true
-# find . -type f -name '*.pyc' -delete 2>/dev/null || true
-# echo "Cache cleaned."
-# echo ""
-
```
Will review it today

/bot run

[FAILED] Pipeline #42837414: 11/20 passed
- Add common test functions
- Add multi-gpu and multi-node comm test scripts
- Update blackwell kernel test script to use common functions
- Rename unit test script to accurately reflect its purpose
- Remove pytest -s flag to reduce CI log verbosity and costs
- Rename common_test_functions.sh to test_utils.sh for brevity
- Remove Python bytecode cache cleanup (no longer needed)
- Delete task_test_blackwell_kernels.sh symlink

The bytecode cleanup was added for a one-time module refactoring (gemm.py -> gemm/ package) in Nov 2025 and is no longer relevant. Modern Python handles cache invalidation automatically.

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
Force-pushed fd03e26 to fedbea5
Actionable comments posted: 1
🤖 Fix all issues with AI agents
In `@scripts/test_utils.sh`:
- Around lines 5-14: SAMPLE_RATE can be zero or non-numeric, which will break arithmetic (e.g. `RANDOM % SAMPLE_RATE`, `100 / SAMPLE_RATE`, the awk modulo). Add a validation right after the SAMPLE_RATE default assignment that ensures SAMPLE_RATE is a positive integer (rejecting empty, non-digit, or <= 0 values); if invalid, reset SAMPLE_RATE to a safe default (e.g. 5) and emit a warning. Then use the validated SAMPLE_RATE when computing SAMPLE_OFFSET and in the sampling arithmetic later in the script.
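A sketch of the suggested validation, assuming 5 as the safe default named in the prompt:

```bash
# Ensure SAMPLE_RATE is a positive integer before it is used in
# modulo/division arithmetic; fall back to 5 with a warning otherwise.
SAMPLE_RATE="${SAMPLE_RATE:-5}"
if ! [[ "$SAMPLE_RATE" =~ ^[1-9][0-9]*$ ]]; then
    echo "WARNING: invalid SAMPLE_RATE '$SAMPLE_RATE'; falling back to 5" >&2
    SAMPLE_RATE=5
fi
SAMPLE_OFFSET=$((RANDOM % SAMPLE_RATE))
echo "Sampling 1/$SAMPLE_RATE of tests, offset $SAMPLE_OFFSET"
```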
```bash
    SAMPLE_OFFSET=$((RANDOM % SAMPLE_RATE))
fi

# Pytest configuration flags
```
do we need the "Clean Python bytecode cache" part before this?
Actionable comments posted: 1
🤖 Fix all issues with AI agents
In `@scripts/task_test_multi_gpu_comm_kernels.sh`:
- Around lines 22-50: TEST_FILES is defined as a quoted space-delimited string, which will be passed as a single argument to execute_tests/execute_dry_run. Change TEST_FILES to a Bash array (e.g. `TEST_FILES=(tests/comm/test_allreduce_unified_api.py ...)`), update the for loop to iterate over `"${TEST_FILES[@]}"`, and call `execute_dry_run "${TEST_FILES[@]}"` and `execute_tests "${TEST_FILES[@]}"`. Ensure parse_args, print_test_mode_banner, and the other surrounding logic remain unchanged, and update any remaining places that reference TEST_FILES as a string to use the array expansion.
```bash
# Define the specific test files for multi-GPU comm tests (single-node)
# TEST_FILES="tests/comm/test_allreduce_unified_api.py tests/comm/test_allreduce_negative.py tests/comm/test_trtllm_allreduce_fusion.py"
# Add others back once they are fixed
TEST_FILES="tests/comm/test_allreduce_unified_api.py"

# Main execution
main() {
    # Parse command line arguments
    parse_args "$@"

    # Print test mode banner
    print_test_mode_banner

    # Install and verify (unless dry run)
    install_and_verify

    # Print test files
    echo "Multi-GPU comm kernel test files (running with: ${PYTEST_COMMAND_PREFIX}):"
    for test_file in $TEST_FILES; do
        echo "  $test_file"
    done
    echo ""

    # Execute tests or dry run
    if [ "$DRY_RUN" == "true" ]; then
        execute_dry_run "$TEST_FILES"
    else
        execute_tests "$TEST_FILES"
    fi
```
Avoid quoting a space-delimited list; use an array for TEST_FILES.

`execute_dry_run "$TEST_FILES"` / `execute_tests "$TEST_FILES"` will pass a single argument once multiple files are re-enabled, which will break execution. Use a Bash array and pass `"${TEST_FILES[@]}"` instead.

Proposed fix:

```diff
-# TEST_FILES="tests/comm/test_allreduce_unified_api.py tests/comm/test_allreduce_negative.py tests/comm/test_trtllm_allreduce_fusion.py"
+# TEST_FILES=(
+#     "tests/comm/test_allreduce_unified_api.py"
+#     "tests/comm/test_allreduce_negative.py"
+#     "tests/comm/test_trtllm_allreduce_fusion.py"
+# )
 # Add others back once they are fixed
-TEST_FILES="tests/comm/test_allreduce_unified_api.py"
+TEST_FILES=("tests/comm/test_allreduce_unified_api.py")
@@
-    for test_file in $TEST_FILES; do
+    for test_file in "${TEST_FILES[@]}"; do
         echo "  $test_file"
     done
@@
     if [ "$DRY_RUN" == "true" ]; then
-        execute_dry_run "$TEST_FILES"
+        execute_dry_run "${TEST_FILES[@]}"
     else
-        execute_tests "$TEST_FILES"
+        execute_tests "${TEST_FILES[@]}"
     fi
```
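The single-argument behavior this comment warns about is easy to demonstrate with a throwaway helper (`count_args` and the file names are hypothetical):

```bash
# A quoted space-delimited string arrives as one argument,
# while an array expansion arrives as one argument per file.
count_args() { echo "$#"; }

TEST_FILES_STR="a.py b.py c.py"
TEST_FILES_ARR=("a.py" "b.py" "c.py")

count_args "$TEST_FILES_STR"          # prints 1 (single argument)
count_args "${TEST_FILES_ARR[@]}"     # prints 3 (one argument per file)
```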
```bash
    echo ""

    if [ -n "${JIT_ARCH}" ]; then
        # 12.0a for CUDA 12.9.0, 12.0f for CUDA 13.0.0
```
[non-blocking] should we have a consolidated source-of-truth file, like our cudnn cmake presets, to specify the (device) x (cuda toolkit) -> arch target list?
That's probably a good idea; keeping track of which SMs support the f modifier and which ones don't can be kind of a headache tbh.
aleozlx left a comment
posted a couple of non-blocking questions
lgtm
can @kahyunnam try approving it per the approver list in the "Merging is blocked" section? thx
## 📌 Description

- Renames the unit testing script to more accurately reflect current usage
- Adds multi-node and multi-GPU testing scripts
- Refactors common code for those 3 scripts into a separate file

## 🔍 Related Issues

## 🚀 Pull Request Checklist

Thank you for contributing to FlashInfer! Before we review your pull request, please make sure the following items are complete.

### ✅ Pre-commit Checks

- [x] I have installed `pre-commit` by running `pip install pre-commit` (or used your preferred method).
- [x] I have installed the hooks with `pre-commit install`.
- [x] I have run the hooks manually with `pre-commit run --all-files` and fixed any reported issues.

> If you are unsure about how to set up `pre-commit`, see [the pre-commit documentation](https://pre-commit.com/).

## 🧪 Tests

- [ ] Tests have been added or updated as needed.
- [ ] All tests are passing (`unittest`, etc.).

## Reviewer Notes

## Summary by CodeRabbit

* **Chores**
  * Centralized and consolidated test orchestration and utilities for consistent unit, multi-GPU, and multi-node test runs.
  * Improved environment handling, install/verify flows, and deterministic test sampling across runners.
* **New Features**
  * Added dry-run mode, clearer test-mode banners, and detailed execution and dry-run summaries for easier test planning and reporting.

---

Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>