Skip to content

[TRTLLM-5863][feat] Support MoE INT8 Weight-Only-Quantization in PyTorch Workflow#6629

Merged
yuxianq merged 2 commits intoNVIDIA:mainfrom
Yuening-wa:user/yueningl/moe_int8_weight_only_quant_support
Aug 15, 2025
Merged

[TRTLLM-5863][feat] Support MoE INT8 Weight-Only-Quantization in PyTorch Workflow#6629
yuxianq merged 2 commits intoNVIDIA:mainfrom
Yuening-wa:user/yueningl/moe_int8_weight_only_quant_support

Commits

Commits on Aug 15, 2025