🚧 [llm][npu][quant] Add W4A8 MXFP quantization support for Qwen3 Dense on Ascend NPU #23650

Open
TallMessiWu wants to merge 28 commits into sgl-project:main from TallMessiWu:junlin_qwen3_dense_w4a8

Commits

Commits on Mar 18, 2026

Commits on Mar 19, 2026

Commits on Mar 23, 2026

Commits on Mar 31, 2026

Commits on Apr 1, 2026

Commits on Apr 2, 2026

Commits on Apr 3, 2026

Commits on Apr 7, 2026

Commits on Apr 8, 2026

Commits on Apr 16, 2026