[Bugfix] Fix Dynamo unexpected keyword argument by samutamm · Pull Request #34320 · vllm-project/vllm

samutamm · 2026-02-11T08:27:18Z

Purpose

Fix QuantFP8 with torch.compile on ROCm when CustomOP quant_fp8 is disabled with --compilation-config '{"custom_ops": ["-quant_fp8"]}'.

Current main branch raises error:

(EngineCore_DP0 pid=565)   File "/app/vllm/vllm/v1/executor/multiproc_executor.py", line 375, in collective_rpc
(EngineCore_DP0 pid=565)     return aggregate(get_response())
(EngineCore_DP0 pid=565)                      ^^^^^^^^^^^^^^
(EngineCore_DP0 pid=565)   File "/app/vllm/vllm/v1/executor/multiproc_executor.py", line 358, in get_response
(EngineCore_DP0 pid=565)     raise RuntimeError(
(EngineCore_DP0 pid=565) RuntimeError: Worker failed with error 'Observed exception
(EngineCore_DP0 pid=565)   Explanation: Dynamo found no exception handler at the top-level compiled function when encountering an exception. Exception will propagate outside the compiled region.
(EngineCore_DP0 pid=565)   Hint: Dynamo has detected that tracing the code will result in an error when running in eager. Please double check that your code doesn't contain a similar error when actually running eager/uncompiled.
(EngineCore_DP0 pid=565)   Hint: It may be possible to write Dynamo tracing rules for this code. Please report an issue to PyTorch if you encounter this graph break often and it is causing performance issues.
(EngineCore_DP0 pid=565) 
(EngineCore_DP0 pid=565)   Developer debug context: raised exception TypeError([ConstantVariable(str: "Unexpected keyword arguments: ['use_triton']")])
(EngineCore_DP0 pid=565) 
(EngineCore_DP0 pid=565)  For more details about this graph break, please visit: https://meta-pytorch.github.io/compile-graph-break-site/gb/gb0088.html
(EngineCore_DP0 pid=565) 
(EngineCore_DP0 pid=565) from user code:
(EngineCore_DP0 pid=565)    File "/app/vllm/vllm/model_executor/models/qwen3_vl_moe.py", line 133, in forward
(EngineCore_DP0 pid=565)     hidden_states, residual = layer(
(EngineCore_DP0 pid=565) 
(EngineCore_DP0 pid=565) Set TORCHDYNAMO_VERBOSE=1 for the internal stack trace (please do this especially if you're reporting a bug to PyTorch). For even more developer context, set TORCH_LOGS="+dynamo"
(EngineCore_DP0 pid=565) ', please check the stack trace above for the root cause

This was introduced in #33047 .

Test Plan

Server

export VLLM_ROCM_USE_AITER=1
export VLLM_ROCM_USE_AITER_MOE=1
vllm serve Qwen/Qwen3-VL-235B-A22B-Instruct-FP8 \
    -tp 8 \
	--enable-expert-parallel \
	--max-num-batched-tokens 32768 \
	--compilation-config '{"custom_ops": ["-quant_fp8"]}' \
	--max-num-seqs 1024 \
        --distributed-executor-backend mp \
	--kv-cache-dtype fp8 \
	--no-enable-prefix-caching

Test Result

After moving use_triton from kwargs to positional argument, Dynamo error disappears.

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
(Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

Signed-off-by: Samu Tamminen <stammine@amd.com>

gemini-code-assist

Code Review

This pull request addresses a TypeError that occurs with torch.compile on ROCm when the quant_fp8 custom operation is disabled. The error was caused by an unexpected use_triton keyword argument being passed through **kwargs. The fix involves changing the signatures of forward_cuda, forward_hip, and forward_native methods in the QuantFP8 class to explicitly include use_triton as a keyword argument. This change makes the API consistent across different implementations and resolves the issue with Dynamo tracing. The fix is correct, well-targeted, and improves code clarity.

yewentao256

LGTM, thanks for the work!

yewentao256

Could you take a look at CI failure? Maybe related

samutamm · 2026-02-16T07:00:35Z

Could you take a look at CI failure? Maybe related

Looking into it. Many of the CI tests fail with: interrupted by a signal: signal: terminated

Then pytest -v -s tests/compile/correctness_e2e/test_sequence_parallel.py::test_tp_sp_generation[False-False-hmellor/tiny-random-LlamaForCausalLM-parallel_setup14-mp-auto-test_options14] at least passes on ROCm. I'll see if updating branch makes any difference.

DarkLight1337 · 2026-02-16T09:32:21Z

H100 is down, the rest are known failures on main

Signed-off-by: Samu Tamminen <stammine@amd.com> Co-authored-by: Wentao Ye <44945378+yewentao256@users.noreply.github.com> Signed-off-by: athrael-soju <athrael-soju@users.noreply.github.com>

Signed-off-by: Samu Tamminen <stammine@amd.com> Co-authored-by: Wentao Ye <44945378+yewentao256@users.noreply.github.com> Signed-off-by: wzhao18 <wzhao18.sz@gmail.com>

Signed-off-by: Samu Tamminen <stammine@amd.com> Co-authored-by: Wentao Ye <44945378+yewentao256@users.noreply.github.com> Signed-off-by: Eldar Kurtic <research@neuralmagic.com>

Signed-off-by: Samu Tamminen <stammine@amd.com> Co-authored-by: Wentao Ye <44945378+yewentao256@users.noreply.github.com> Signed-off-by: zjy0516 <riverclouds.zhu@qq.com>

Signed-off-by: Samu Tamminen <stammine@amd.com> Co-authored-by: Wentao Ye <44945378+yewentao256@users.noreply.github.com>

move use_triton from kwargs to pos arg to fix dynamo error

a472967

Signed-off-by: Samu Tamminen <stammine@amd.com>

samutamm requested review from mgoin, pavanimajety, robertgshaw2-redhat, tlrmchlsmth and yewentao256 as code owners February 11, 2026 08:27

samutamm mentioned this pull request Feb 11, 2026

[bugfix] Fix Dynamo unexpected keyword argument #34318

Closed

5 tasks

mergify bot added the bug Something isn't working label Feb 11, 2026

gemini-code-assist bot reviewed Feb 11, 2026

View reviewed changes

yewentao256 approved these changes Feb 13, 2026

View reviewed changes

yewentao256 added the ready ONLY add when PR is ready to merge/full CI is needed label Feb 13, 2026

samutamm and others added 2 commits February 14, 2026 12:12

Merge branch 'main' into dynamo_error_use_triton

3c007d4

Merge branch 'main' into dynamo_error_use_triton

1ca1fdb

yewentao256 enabled auto-merge (squash) February 14, 2026 14:46

Merge branch 'main' into dynamo_error_use_triton

4232816

yewentao256 reviewed Feb 15, 2026

View reviewed changes

Merge branch 'main' into dynamo_error_use_triton

398f118

vllm-bot merged commit a5ccc85 into vllm-project:main Feb 16, 2026
58 of 65 checks passed

samutamm deleted the dynamo_error_use_triton branch February 27, 2026 14:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Bugfix] Fix Dynamo unexpected keyword argument #34320

[Bugfix] Fix Dynamo unexpected keyword argument #34320
vllm-bot merged 5 commits intovllm-project:mainfrom
samutamm:dynamo_error_use_triton

samutamm commented Feb 11, 2026 •

edited by github-actions bot

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

yewentao256 left a comment

Uh oh!

yewentao256 left a comment

Uh oh!

samutamm commented Feb 16, 2026

Uh oh!

DarkLight1337 commented Feb 16, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

samutamm commented Feb 11, 2026 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test Plan

Test Result

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

yewentao256 left a comment

Choose a reason for hiding this comment

Uh oh!

yewentao256 left a comment

Choose a reason for hiding this comment

Uh oh!

samutamm commented Feb 16, 2026

Uh oh!

DarkLight1337 commented Feb 16, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

samutamm commented Feb 11, 2026 •

edited by github-actions bot

Loading