[FIX_FOR_VLLM_CUSTOM=0a54df28471be07b3d668ea21c5e411569d3baea] Fix DynamicNTKScalingRotaryEmbedding and HPUCompressedTensorsConfig #1479

Merged
iboiko-habana merged 1 commit into vllm-project:main from pawel-olejniczak:fix/upstream-api-compat-0a54df28-v2
May 22, 2026

pawel-olejniczak (Collaborator) commented May 21, 2026

Root cause

Upstream vLLM at SHA 0a54df28 introduced two API changes that broke vllm-gaudi:

  1. vllm#41277 ("Fix error in Dynamic NTK scaling") added a required max_trained_positions parameter to DynamicNTKScalingRotaryEmbedding.__init__(), so the unit test, which did not pass it, failed with a TypeError.
  2. vllm#43144 ("Remove additional dead code as a follow-up to #42889") removed sparsity_scheme_map and sparsity_ignore_list from CompressedTensorsConfig.__init__(), so HPUCompressedTensorsConfig, which still forwarded them, failed to instantiate during e2e tests.
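
Both breakages are ordinary Python signature mismatches. The sketch below reproduces the two failure modes with stand-in classes. All class names, `head_size`, `scaling_factor`, and `other_options` are assumptions made for illustration; the real vLLM constructors take more parameters than shown here.

```python
# Stand-in classes only: these are NOT the real vLLM signatures.

class RotaryBefore:
    """Hypothetical constructor before the upstream change."""

    def __init__(self, head_size: int, scaling_factor: float) -> None:
        self.head_size = head_size
        self.scaling_factor = scaling_factor


class RotaryAfter:
    """Hypothetical constructor after a required parameter was added."""

    def __init__(self, head_size: int, scaling_factor: float,
                 max_trained_positions: int) -> None:
        self.head_size = head_size
        self.scaling_factor = scaling_factor
        self.max_trained_positions = max_trained_positions


class ConfigAfter:
    """Hypothetical base config after the sparsity parameters were removed."""

    def __init__(self, other_options: dict) -> None:
        self.other_options = other_options


class StaleSubclass(ConfigAfter):
    """Subclass that still forwards the removed keyword arguments."""

    def __init__(self, other_options, sparsity_scheme_map, sparsity_ignore_list):
        super().__init__(
            other_options=other_options,
            sparsity_scheme_map=sparsity_scheme_map,
            sparsity_ignore_list=sparsity_ignore_list,
        )


def try_call(factory, **kwargs):
    """Return the TypeError message, or None if construction succeeds."""
    try:
        factory(**kwargs)
    except TypeError as exc:
        return str(exc)
    return None


old_kwargs = {"head_size": 64, "scaling_factor": 2.0}

# Case 1: the caller omits the newly required argument.
missing_arg_error = try_call(RotaryAfter, **old_kwargs)

# Case 2: the subclass passes keywords the base class no longer accepts.
stale_forward_error = try_call(
    StaleSubclass,
    other_options={},
    sparsity_scheme_map={},
    sparsity_ignore_list=[],
)

print(missing_arg_error)
print(stale_forward_error)
```

In the first case the error comes from the caller (here, the unit test); in the second it comes from the subclass's own super() call, which is why instantiation itself fails.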

Upstream PR

vllm-project/vllm#41277
Added max_trained_positions to DynamicNTKScalingRotaryEmbedding

vllm-project/vllm#43144
Removed sparsity parameters from CompressedTensorsConfig

Fix

  1. Add max_trained_positions parameter to the rotary embedding unit test.
  2. Remove stale sparsity_scheme_map and sparsity_ignore_list from HPUCompressedTensorsConfig init signature and super() call, plus the unused SparsityCompressionConfig import.
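
The shape of the two fixes can be sketched as follows. This is a minimal illustration with stand-in classes, not the actual diff: `BaseConfigSketch`, `HpuConfigSketch`, `RotarySketch`, `other_options`, and the numeric values are all hypothetical.

```python
# Stand-in classes only: these are NOT the real vLLM or vllm-gaudi signatures.

class BaseConfigSketch:
    """Hypothetical upstream base config without sparsity parameters."""

    def __init__(self, other_options: dict) -> None:
        self.other_options = other_options


class HpuConfigSketch(BaseConfigSketch):
    """Subclass whose signature mirrors the slimmed-down base class."""

    def __init__(self, other_options: dict) -> None:
        # The removed sparsity arguments are neither accepted nor forwarded.
        super().__init__(other_options=other_options)


class RotarySketch:
    """Hypothetical rotary embedding that requires max_trained_positions."""

    def __init__(self, head_size: int, scaling_factor: float,
                 max_trained_positions: int) -> None:
        self.head_size = head_size
        self.scaling_factor = scaling_factor
        self.max_trained_positions = max_trained_positions


# Test-side fix: the keyword set now includes the required parameter.
test_kwargs = {
    "head_size": 64,
    "scaling_factor": 2.0,
    "max_trained_positions": 2048,  # illustrative value only
}

rotary = RotarySketch(**test_kwargs)
config = HpuConfigSketch(other_options={})
```

Keeping the subclass signature identical to the base class means any further upstream removal surfaces at the call site instead of inside the super() call.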

…or upstream vllm@0a54df28

Root cause: Upstream vLLM PRs #41277 and #43144 changed APIs in DynamicNTKScalingRotaryEmbedding (added max_trained_positions) and CompressedTensorsConfig (removed sparsity params).
Upstream: vllm-project/vllm#41277, vllm-project/vllm#43144
Fix: Add max_trained_positions to rotary embedding test; remove stale sparsity_scheme_map and sparsity_ignore_list from HPUCompressedTensorsConfig init.

Signed-off-by: Paweł Olejniczak <pawelx.olejniczak@intel.com>

Copilot AI (Contributor) left a comment

Pull request overview

Note

Copilot was unable to run its full agentic suite in this review.

This PR appears to remove sparsity-compression configuration plumbing from the Gaudi compressed-tensors path and updates a rotary-embedding unit test to include a max_trained_positions field.

Changes:

  • Removed SparsityCompressionConfig import and sparsity-related __init__ parameters/forwarding in hpu_compressed_tensors.py.
  • Updated rotary-embedding test config to include max_trained_positions.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 3 comments.

File | Description
vllm_gaudi/ops/hpu_compressed_tensors.py | Drops sparsity config types and constructor parameters from the HPU compressed-tensors integration.
tests/unit_tests/ops/test_hpu_rotary_embedding.py | Adds max_trained_positions to the test configuration for dynamic NTK scaling.

Comment thread vllm_gaudi/ops/hpu_compressed_tensors.py
Comment thread vllm_gaudi/ops/hpu_compressed_tensors.py
Comment thread tests/unit_tests/ops/test_hpu_rotary_embedding.py
@github-actions

✅ CI Passed

All checks passed successfully against the following vllm commit:
0a54df28471be07b3d668ea21c5e411569d3baea

@iboiko-habana iboiko-habana merged commit 7b7bc8f into vllm-project:main May 22, 2026
3 checks passed