[NV] update qwen3.5 b200 configs #169

Merged
Fridge003 merged 4 commits into sgl-project:main from kedarpotdar-nv:main on Feb 24, 2026
Conversation

@kedarpotdar-nv
Contributor

No description provided.

@kedarpotdar-nv kedarpotdar-nv marked this pull request as draft February 19, 2026 18:20
functionstackx pushed a commit to SemiAnalysisAI/InferenceX that referenced this pull request Feb 20, 2026
sglang cookbook WIP PR here - sgl-project/sgl-cookbook#169

Updates the Qwen3.5-397B-A17B BF16 SGLang benchmark launch configuration on B200 with optimized server parameters and environment tuning for improved performance.
Signed-off-by: Faradawn Yang <73060648+faradawn@users.noreply.github.com>
Signed-off-by: Faradawn Yang <73060648+faradawn@users.noreply.github.com>
@kedarpotdar-nv kedarpotdar-nv marked this pull request as ready for review February 21, 2026 00:35
Signed-off-by: Faradawn Yang <73060648+faradawn@users.noreply.github.com>
cmd += ` \\\n --disable-radix-cache`;
cmd += ` \\\n --enable-flashinfer-allreduce-fusion`;
if (speculative === 'disabled') {
cmd += ` \\\n --tokenizer-worker-num 6`;
Collaborator

Just curious, how did you arrive at 6 for `--tokenizer-worker-num`?

Contributor

Reference: line 56 of benchmarks/single_node/qwen3.5_bf16_b200.sh in https://github.com/SemiAnalysisAI/InferenceX/pull/758/changes. Let me know if there are any issues or suggestions!
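
For readers following this thread, here is a minimal, self-contained sketch of how the flags in the reviewed hunk end up in the generated command. The function name `buildLaunchCommand` and the bare `python -m sglang.launch_server` base command are assumptions for illustration. The actual generator in this PR builds a longer command, and whatever follows the `if` block in the real diff is not shown in the hunk.

```javascript
// Hypothetical helper (name and base command are assumptions, not the PR's code).
// It mirrors only the four lines visible in the reviewed hunk.
function buildLaunchCommand(speculative) {
  // Placeholder base command; model path and all other flags are omitted here.
  let cmd = 'python -m sglang.launch_server';

  // Each flag is appended on its own line, preceded by a shell line continuation.
  cmd += ` \\\n  --disable-radix-cache`;
  cmd += ` \\\n  --enable-flashinfer-allreduce-fusion`;

  // Per the hunk, the tokenizer worker count is only set when
  // speculative decoding is disabled.
  if (speculative === 'disabled') {
    cmd += ` \\\n  --tokenizer-worker-num 6`;
  }
  return cmd;
}

console.log(buildLaunchCommand('disabled'));
```

Calling `buildLaunchCommand` with any value other than `'disabled'` leaves `--tokenizer-worker-num` out of the command. The value 6 itself is taken from the referenced InferenceX benchmark script rather than derived here.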

@Fridge003 Fridge003 merged commit 0afb3f0 into sgl-project:main Feb 24, 2026
1 check passed
6 participants