Implement bucket corrector for Mamba chunk size - v0.14.1#885

Merged
wpyszka merged 1 commit into vllm-project:releases/v0.14.1 from jbyczkow:mamba_chunk_bucket_corrector_0_14_1
Jan 28, 2026

Conversation

@jbyczkow (Collaborator)

Due to MambaMixer2 implementation requirements, all buckets used for mamba must be a multiple of mamba chunk size.

Signed-off-by: Jakub Byczkowski <jbyczkowski@habana.ai>
Copilot AI review requested due to automatic review settings January 27, 2026 02:10

Copilot AI left a comment

Pull request overview

This PR adds correction logic to ensure all bucket sizes used for Mamba models are multiples of the Mamba chunk size, as required by the MambaMixer2 implementation.

Changes:

  • Added initialization of Mamba layer count and chunk size in the HPU model runner
  • Updated bucket generation to accept and apply Mamba chunk size corrections
  • Implemented a corrector function that rounds query sizes up to the nearest multiple of Mamba chunk size
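
The round-up behavior described in the last bullet can be sketched as follows. This is an illustrative standalone helper, not the actual corrector from the PR: the real function operates on (bs, query, ctx) bucket tuples inside vllm_gaudi/extension/bucketing/common.py, and the helper name and example chunk size below are hypothetical.

```python
import math

def round_up_to_multiple(value: int, multiple: int) -> int:
    """Round value up to the nearest multiple of `multiple`.

    Guarding against multiple <= 0 mirrors the division-by-zero
    concern raised in the Copilot review below.
    """
    if multiple <= 0:
        raise ValueError("multiple must be greater than 0")
    return multiple * math.ceil(value / multiple)

# Example: with a hypothetical Mamba chunk size of 256, a query
# bucket of 300 tokens would be corrected up to 512.
mamba_chunk_size = 256
print(round_up_to_multiple(300, mamba_chunk_size))  # 512
```

Rounding up (rather than down) preserves capacity: a corrected bucket can always hold the original query size, at the cost of some padding.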

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 1 comment.

Files changed:

  • vllm_gaudi/v1/worker/hpu_model_runner.py — Retrieves the Mamba configuration and passes the chunk size to bucket initialization
  • vllm_gaudi/extension/bucketing/common.py — Adds a Mamba chunk size parameter throughout bucket generation and applies the correction logic


def correct_for_max_model_len(bs, query, ctx):
    return (bs, query, min(ctx, bs * math.ceil(max_model_len / block_size)))

def correct_for_mamba_chunk_size(bs, query, ctx):

Copilot AI Jan 27, 2026


Division by zero will occur if mamba_chunk_size is 0. While the corrector is only called when mamba_chunk_size > 0 (line 433), the function itself doesn't enforce this constraint. Add a guard condition at the start of the function to prevent potential misuse.

Suggested change:

def correct_for_mamba_chunk_size(bs, query, ctx):
    if mamba_chunk_size <= 0:
        raise ValueError("mamba_chunk_size must be greater than 0 to avoid division by zero.")

@jbyczkow jbyczkow changed the title Mamba chunk bucket corrector - v0.14.1 Implement bucket corrector for Mamba chunk size - v0.14.1 Jan 27, 2026
@adobrzyn (Collaborator) left a comment


lgtm

@github-actions

✅ CI Passed

All checks passed successfully against the following vllm commit:
d7de043d55d1dd629554467e23874097e1c48993

@wpyszka (Collaborator) left a comment


LGTM

@wpyszka wpyszka merged commit edb9b73 into vllm-project:releases/v0.14.1 Jan 28, 2026
64 of 65 checks passed
slokesha pushed a commit to libinta/vllm-gaudi that referenced this pull request Jan 29, 2026
…ct#885)

Due to MambaMixer2 implementation requirements, all buckets used for
mamba must be a multiple of mamba chunk size.

Signed-off-by: Jakub Byczkowski <jbyczkowski@habana.ai>
Signed-off-by: slokesha <slokeshappa@habana.ai>
4 participants