[Docs] Add Falcon H1, Hunyuan-Large, Qwen3-Omni support and update Diffusion usage by pokymono · Pull Request #17888 · sgl-project/sglang

pokymono · 2026-01-28T15:09:43Z

Motivation

The documentation for supported models was missing models such as Falcon H1, Hunyuan-Large, and Qwen3-Omni.
Adding a Diffusion to Basic Usage for better guidance.

Modifications

New Model Support:
Generative Models: Added Falcon H1 (0.5B–34B Hybrid Mamba-Transformer) and Hunyuan-Large (389B MoE).
Multimodal Models: Added Step3-VL (10B) and Qwen3-Omni (Omni-modal MoE).

Basic Usage:
Added/Updated Diffusion documentation in the Basic Usage section to provide clear guidance on diffusion language models as requested.

Accuracy Tests

Benchmarking and Profiling

Checklist

Format your code according to the Format code with pre-commit.
Add unit tests according to the Run and add unit tests.
Update documentation according to Write documentations.
Provide accuracy and speed benchmark results according to Test the accuracy and Benchmark the speed.
Follow the SGLang code style guidance.

Review Process

Ping Merge Oncalls to start the PR flow. See the PR Merge Process.
Get approvals from CODEOWNERS and other reviewers.
Trigger CI tests with comments or contact authorized users to do so.
- /tag-run-ci-label, /rerun-failed-ci, /tag-and-rerun-ci
After green CI and required approvals, ask Merge Oncalls to merge.

gemini-code-assist · 2026-01-28T15:10:03Z

Summary of Changes

Hello @pokymono, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly enhances the project's documentation by incorporating several newly supported large language models and multimodal models. It also introduces a structured guide for diffusion models, distinguishing between those used for image/video generation and those for text generation, thereby improving clarity and discoverability for users.

Highlights

New Generative Model Support: Added documentation for new generative models, including Falcon H1 (a hybrid Mamba-Transformer architecture) and Hunyuan-Large (a 389B Mixture of Experts model).
New Multimodal Model Support: Included new multimodal models, Step3-VL (a 10B VLM) and Qwen3-Omni (an omni-modal MoE model), in the supported models list.
Enhanced Diffusion Model Documentation: Introduced a dedicated documentation page for Diffusion models, categorizing them into image/video generation and diffusion language models, and updated existing diffusion model documentation to provide clearer cross-references.

🧠 New Feature in Public Preview: You can now enable Memory to help Gemini Code Assist learn from your team's feedback. This makes future code reviews more consistent and personalized to your project's style. Click here to enable Memory in your admin console.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request updates the documentation to include newly supported models like Falcon H1, Hunyuan-Large, Step3-VL, and Qwen3-Omni. It also adds a new page for diffusion models, improving the overall structure and clarity of the documentation. The changes are well-executed, but I found a minor formatting issue in one of the updated tables.

docs/supported_models/generative_models.md

pokymono · 2026-01-28T15:14:14Z

@Ratish1 Apologies for the previous PR, please check this one

docs/supported_models/generative_models.md

Ratish1 · 2026-01-28T16:21:08Z

docs/basic_usage/diffusion.md

Why do we need this file?, we already have this

sglang/docs/supported_models/diffusion_models.md

Line 1 in 452dce8

# Diffusion Models

basic_usage is model-specific folder only

Adarsh asked me to add Diffusion in basic usage.

Ratish1 · 2026-01-28T16:22:21Z

Also @pokymono , apply the gemini fix, there is a clear render bug for that file. Fix the lint error aswell

Ratish1 · 2026-01-28T16:31:01Z

There still seems to be a lint error, make sure you are running the pre-commit command

Ratish1

LGTM

cc: @adarshxs

docs/supported_models/multimodal_language_models.md

docs/supported_models/generative_models.md

docs/index.rst

adarshxs · 2026-01-29T18:25:08Z

LGTM. cc @mickqian

…rted models

Co-authored-by: Ratish P <114130421+Ratish1@users.noreply.github.com>

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

…_models with sub sections based on categories.

fix fix lint fix

pokymono · 2026-02-06T04:32:24Z

@zhaochenyang20 updated

adarshxs · 2026-02-06T04:37:40Z

LGTM cc @zhaochenyang20

…ffusion usage (sgl-project#17888) Co-authored-by: Rishitshivam <164783543+Rishitshivam@users.noreply.github.com> Co-authored-by: Ratish P <114130421+Ratish1@users.noreply.github.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by: Adarsh Shirawalmath <114558126+adarshxs@users.noreply.github.com> Co-authored-by: zhaochenyang20 <zhaochen20@outlook.com>

zhaochenyang20 · 2026-02-21T04:18:39Z

This doc is vibe-coded and introduces unexpected errors in our documentation. We shall spend more time evaluating this PR. @Ratish1

#19104

…ffusion usage (sgl-project#17888) Co-authored-by: Rishitshivam <164783543+Rishitshivam@users.noreply.github.com> Co-authored-by: Ratish P <114130421+Ratish1@users.noreply.github.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by: Adarsh Shirawalmath <114558126+adarshxs@users.noreply.github.com> Co-authored-by: zhaochenyang20 <zhaochen20@outlook.com>

github-actions bot added documentation Improvements or additions to documentation Multi-modal multi-modal language model labels Jan 28, 2026

pokymono changed the title ~~[Docs]~~ [Docs] Add Falcon H1, Hunyuan-Large, Qwen3-Omni support and update Diffusion usage Jan 28, 2026

gemini-code-assist bot reviewed Jan 28, 2026

View reviewed changes

docs/supported_models/generative_models.md Outdated Show resolved Hide resolved

Ratish1 reviewed Jan 28, 2026

View reviewed changes

docs/supported_models/generative_models.md Outdated Show resolved Hide resolved

Ratish1 reviewed Jan 28, 2026

View reviewed changes

Ratish1 approved these changes Jan 28, 2026

View reviewed changes

adarshxs reviewed Jan 28, 2026

View reviewed changes

docs/supported_models/multimodal_language_models.md Outdated Show resolved Hide resolved

adarshxs reviewed Jan 28, 2026

View reviewed changes

docs/supported_models/generative_models.md Outdated Show resolved Hide resolved

adarshxs reviewed Jan 28, 2026

View reviewed changes

docs/index.rst Outdated Show resolved Hide resolved

adarshxs approved these changes Jan 29, 2026

View reviewed changes

pokymono and others added 10 commits January 30, 2026 12:57

docs: add Falcon H1, Hunyuan-Large, Qwen3-Omni, and Step3-VL to suppo…

b07e0b4

…rted models

docs: add Diffusion to basic usage

5fc0f4a

Update docs/supported_models/generative_models.md

9a960a0

Co-authored-by: Ratish P <114130421+Ratish1@users.noreply.github.com>

Update docs/supported_models/generative_models.md

b0f529b

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

fix: lint error

9948b92

support doc fix

c4d78bc

docs: Introduce a comprehensive documentation structure for supported…

facdddc

…_models with sub sections based on categories.

fix links

7103f16

fix

c72ae1b

lint error fix

d49a9b0

pokymono force-pushed the main branch from 5457e46 to d49a9b0 Compare January 30, 2026 08:15

pokymono and others added 3 commits January 30, 2026 13:49

Merge branch 'sgl-project:main' into main

4e0fa09

line fix

3026c57

fix fix lint fix

lint fix

f8fe1b5

pokymono requested review from ByronHsu, CatherineSue, DarkSharpness, Qiaolin-Yu, hebiao064, iforgetmyname, ishandhanani, ping1jing2, slin1237 and yingluosanqian as code owners February 6, 2026 04:29

github-actions bot added quant LLM Quantization amd dependencies Pull requests that update a dependency file lora deepseek sgl-kernel labels Feb 6, 2026

Merge branch 'sgl-project:main' into main

4e4343b

github-actions bot added blackwell SM100/SM120 npu diffusion SGLang Diffusion mthreads labels Feb 6, 2026

zhaochenyang20 approved these changes Feb 6, 2026

View reviewed changes

zhaochenyang20 merged commit c850a8a into sgl-project:main Feb 6, 2026
46 checks passed

Conversation

pokymono commented Jan 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Modifications

Accuracy Tests

Benchmarking and Profiling

Checklist

Review Process

Uh oh!

gemini-code-assist bot commented Jan 28, 2026

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

pokymono commented Jan 28, 2026

Uh oh!

Uh oh!

Ratish1 Jan 28, 2026

Choose a reason for hiding this comment

Uh oh!

pokymono Jan 28, 2026

Choose a reason for hiding this comment

Uh oh!

Ratish1 commented Jan 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Ratish1 commented Jan 28, 2026

Uh oh!

Ratish1 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

adarshxs commented Jan 29, 2026

Uh oh!

pokymono commented Feb 6, 2026

Uh oh!

adarshxs commented Feb 6, 2026

Uh oh!

Uh oh!

zhaochenyang20 commented Feb 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

pokymono commented Jan 28, 2026 •

edited

Loading

Ratish1 commented Jan 28, 2026 •

edited

Loading