Skip to content

[CI][ROCm] (wip) Fix test_async_scheduing#29254

Draft
divakar-amd wants to merge 1 commit intovllm-project:mainfrom
divakar-amd:fix_test_async_sched
Draft

[CI][ROCm] (wip) Fix test_async_scheduing#29254
divakar-amd wants to merge 1 commit intovllm-project:mainfrom
divakar-amd:fix_test_async_sched

Conversation

@divakar-amd
Copy link
Copy Markdown
Contributor

@divakar-amd divakar-amd commented Nov 23, 2025

This PR aims to fix aync test for ROCm. pytest -s -v test_async_scheduling.py

  1. Fix for xgrammar's triton kernel - Fix warp_size in triton kernel for AMD GPUs mlc-ai/xgrammar#476
  2. Unset Flex_attn backend on ROCm since it is not supported for speculative decoding
Test Name Status Notes
test_with_spec_decoding PASSES
test_without_spec_decoding FAILS Accuracy mismatch of outputs

Signed-off-by: Divakar Verma <divakar.verma@amd.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ci/build rocm Related to AMD ROCm v1

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant