🚧 [llm][npu][quant] Add W4A4 MXFP4 quantization support for Qwen3 Dense on Ascend NPU #23795

Open
TallMessiWu wants to merge 35 commits into sgl-project:main from TallMessiWu:junlin_qwen3_dense_w4a4

Commits

Commits on Mar 18, 2026

Commits on Mar 19, 2026

Commits on Mar 23, 2026

Commits on Mar 31, 2026

Commits on Apr 1, 2026

Commits on Apr 2, 2026

Commits on Apr 3, 2026

Commits on Apr 7, 2026

Commits on Apr 8, 2026

Commits on Apr 16, 2026

Commits on Apr 17, 2026

Commits on Apr 25, 2026

Commits on Apr 27, 2026