[Misc][BE] Type coverage for vllm/compilation [3/3] by Lucaskabela · Pull Request #31748 · vllm-project/vllm

Lucaskabela · 2026-01-05T21:58:55Z

Purpose

We want to provide better type hint coverage in vllm/compilation to improve maintainability, readability, and reduce silent errors

This PR should be applied on top of #31744

Test Plan

mypy vllm/compilation

Test Result

Success: no issues found in 28 source files

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
(Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

Note

Improves type coverage and API clarity across compilation passes with minimal logic adjustments.

Add explicit return/param types, ParamSpec, and helper get_inputs methods across fusion passes (activation, RMSNorm, attention, collective, sequence parallelism, ROCm/AIter)
Tighten function signatures (__init__, register, __call__, uuid, helpers like empty_*) and use precise tuple/list return types in pattern/replacement fns
Safer device capability handling and minor no-op cleanup hooks; unify tracing wrappers (wrap_trace_fn), reshape conversions, and first-return-only helpers with typing
Update decorators to import SourceInfo, mark dynamic/unbacked dims conditionally, and patch configs with typed contexts
Minor typing fixes in distributed parallel_state, rotary embedding __all__ and cache key types

mypy: Success on 28 files (no issues).

^{Written by Cursor Bugbot for commit 06e6343b18cf59113c80149a18f93b7ceeb988ac. This will update automatically on new commits. Configure here.}

Note

Improves type coverage and API clarity across vllm/compilation with minimal logic changes.

Add explicit return/param types (incl. ParamSpec) and tighten signatures for __init__, register, __call__, uuid, and helper fns; pattern/replacement fns now return precise tuples
Introduce typed get_inputs() helpers for patterns and unify tracing via wrap_trace_fn, view→reshape conversions, and no-op permute cleanup
Safer FlashInfer handling (device capability may be None), workspace setup, and one-shot size checks; minor no-op cleanup hook in sequence parallelism
Small typing fixes in distributed/parallel_state and rotary_embedding (__all__, cache key types)

^{Written by Cursor Bugbot for commit 78350f9. This will update automatically on new commits. Configure here.}

gemini-code-assist

Code Review

This pull request focuses on improving type hint coverage across the vllm/compilation module, which is a valuable contribution to code quality and maintainability. The changes are extensive and well-executed, adding type hints to many functions and methods. I've identified one critical issue in vllm/compilation/collective_fusion.py where a replacement function in a pattern matcher returns a list of tensors instead of a single tensor, which will likely lead to a runtime error. Please address this issue.

mergify · 2026-01-08T07:39:27Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @Lucaskabela.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

Signed-off-by: Lucas Kabela <lucaskabela@meta.com>

zou3519 · 2026-01-12T17:42:05Z

If someone asks: this PR does refactor get_inputs to be a methods on some of these classes but doesn't require it in the base class

ProExpertProg · 2026-01-12T18:46:21Z

vllm/compilation/collective_fusion.py

 if find_spec("flashinfer"):
    try:
        import flashinfer.comm as flashinfer_comm

-        flashinfer_comm = (
+        flashinfer_comm: ModuleType | None = (  # type: ignore[no-redef]
            flashinfer_comm
            if hasattr(flashinfer_comm, "trtllm_allreduce_fusion")
            else None
        )
    except ImportError:
-        flashinfer_comm = None
+        flashinfer_comm = None  # type: ignore[assignment]
 else:
-    flashinfer_comm = None
+    flashinfer_comm = None  # type: ignore[assignment]



Can we just set flashinfer_comm=None at the start and avoid the type: ignores?

If we set flashinfer_comm = None above this logic, we still need to add ignores for redef on L35 and L37 so not sure if it would save us much for code cleanliness

I thought that would just be reassignment not definition? Also I see the import is the same as the var, let's rename the import to _flashinfer_comm?

Signed-off-by: Lucas Kabela <lucaskabela@meta.com> Signed-off-by: Tomer Natan <tbarnatan@computelab-frontend-8.nvidia.com>

Signed-off-by: Lucas Kabela <lucaskabela@meta.com>

Signed-off-by: Lucas Kabela <lucaskabela@meta.com> Signed-off-by: dsuhinin <suhinin.dmitriy@gmail.com>

hmellor · 2026-01-27T21:12:23Z

vllm/model_executor/layers/rotary_embedding/__init__.py

-_ROPE_DICT: dict[tuple, RotaryEmbedding] = {}
+_ROPE_DICT: dict[tuple[Any, ...], RotaryEmbedding] = {}
+
+__all__ = ["RotaryEmbedding"]


Why do we need this?

Signed-off-by: Lucas Kabela <lucaskabela@meta.com>

mergify bot added the nvidia label Jan 5, 2026

github-project-automation bot added this to NVIDIA Jan 5, 2026

Lucaskabela force-pushed the lucaskabela/compilation_type_coverage_3 branch from 08aa1b1 to c102a4b Compare January 5, 2026 21:59

gemini-code-assist bot reviewed Jan 5, 2026

View reviewed changes

Lucaskabela mentioned this pull request Jan 5, 2026

[Misc][BE] Turn on strict type coverage for vllm/compilation #31756

Merged

5 tasks

Lucaskabela changed the title ~~[BE] Type coverage for vllm/compilation [3/3]~~ [Misc][BE] Type coverage for vllm/compilation [3/3] Jan 5, 2026

mergify bot added the needs-rebase label Jan 8, 2026

Lucaskabela force-pushed the lucaskabela/compilation_type_coverage_3 branch from c102a4b to 06e6343 Compare January 10, 2026 00:57

mergify bot removed the needs-rebase label Jan 10, 2026

Lucaskabela marked this pull request as ready for review January 10, 2026 00:59

Lucaskabela requested review from ProExpertProg, tjtanaa, youkaichao and zou3519 as code owners January 10, 2026 00:59

Lucaskabela added 3 commits January 12, 2026 08:00

Type coverage for more files

871a8c0

Signed-off-by: Lucas Kabela <lucaskabela@meta.com>

Coverage for compilation and turn on strict type

3c941cd

Signed-off-by: Lucas Kabela <lucaskabela@meta.com>

Fix decorators import

78350f9

Signed-off-by: Lucas Kabela <lucaskabela@meta.com>

Lucaskabela force-pushed the lucaskabela/compilation_type_coverage_3 branch from 06e6343 to 78350f9 Compare January 12, 2026 16:01

zou3519 approved these changes Jan 12, 2026

View reviewed changes

github-project-automation bot moved this to Ready in NVIDIA Jan 12, 2026

zou3519 added the ready ONLY add when PR is ready to merge/full CI is needed label Jan 12, 2026

zou3519 enabled auto-merge (squash) January 12, 2026 17:41

ProExpertProg approved these changes Jan 12, 2026

View reviewed changes

zou3519 merged commit ad8818b into vllm-project:main Jan 12, 2026
61 checks passed

github-project-automation bot moved this from Ready to Done in NVIDIA Jan 12, 2026

sammysun0711 pushed a commit to sammysun0711/vllm that referenced this pull request Jan 16, 2026

[Misc][BE] Type coverage for vllm/compilation [3/3] (vllm-project#31748)

ab513bb

Signed-off-by: Lucas Kabela <lucaskabela@meta.com>

akh64bit pushed a commit to akh64bit/vllm that referenced this pull request Jan 16, 2026

[Misc][BE] Type coverage for vllm/compilation [3/3] (vllm-project#31748)

755490f

Signed-off-by: Lucas Kabela <lucaskabela@meta.com>

dsuhinin pushed a commit to dsuhinin/vllm that referenced this pull request Jan 21, 2026

[Misc][BE] Type coverage for vllm/compilation [3/3] (vllm-project#31748)

6e8450e

Signed-off-by: Lucas Kabela <lucaskabela@meta.com> Signed-off-by: dsuhinin <suhinin.dmitriy@gmail.com>

hmellor mentioned this pull request Jan 27, 2026

[CI] Fix mypy for vllm/attention and vllm/compilation #26482

Closed

hmellor reviewed Jan 27, 2026

View reviewed changes

Lucaskabela deleted the lucaskabela/compilation_type_coverage_3 branch February 19, 2026 16:40

ItzDEXX pushed a commit to ItzDEXX/vllm that referenced this pull request Feb 19, 2026

[Misc][BE] Type coverage for vllm/compilation [3/3] (vllm-project#31748)

63ff908

Signed-off-by: Lucas Kabela <lucaskabela@meta.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Misc][BE] Type coverage for vllm/compilation [3/3]#31748

[Misc][BE] Type coverage for vllm/compilation [3/3]#31748
zou3519 merged 3 commits intovllm-project:mainfrom
Lucaskabela:lucaskabela/compilation_type_coverage_3

Lucaskabela commented Jan 5, 2026 •

edited by github-actions bot

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

mergify bot commented Jan 8, 2026

Uh oh!

zou3519 commented Jan 12, 2026

Uh oh!

ProExpertProg Jan 12, 2026

Uh oh!

Lucaskabela Jan 12, 2026

Uh oh!

ProExpertProg Jan 12, 2026

Uh oh!

Uh oh!

hmellor Jan 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

Lucaskabela commented Jan 5, 2026 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test Plan

Test Result

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

mergify bot commented Jan 8, 2026

Uh oh!

zou3519 commented Jan 12, 2026

Uh oh!

ProExpertProg Jan 12, 2026

Choose a reason for hiding this comment

Uh oh!

Lucaskabela Jan 12, 2026

Choose a reason for hiding this comment

Uh oh!

ProExpertProg Jan 12, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

hmellor Jan 27, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Lucaskabela commented Jan 5, 2026 •

edited by github-actions bot

Loading