:bug: fix(modelslim): fix W4A8_MXFP weight dtype to float8_e4m3fn

456bd14
Open

🚧 [llm][npu][quant] Add W4A8 MXFP quantization support for Qwen3 Dense on Ascend NPU #23650

Annotations

1 warning
label
succeeded Apr 24, 2026 in 7s