
Commit d491957 (1 parent: 595d1a8)
Authored by ganyi

new modelslim quantization model support v0.7.3 (#743)

### What this PR does / why we need it?
To support quantization models generated by the new modelslim version, we need to add `quant_description` to `AscendQuantConfig`. Cherry-picked from #719.

### Does this PR introduce _any_ user-facing change?
No.

### How was this patch tested?
Tested locally.

Signed-off-by: ganyi <[email protected]>

File tree: 1 file changed (+1, -1)

vllm_ascend/quantization/quant_config.py (1 addition, 1 deletion)

@@ -70,7 +70,7 @@ def get_min_capability(cls) -> int:
 
     @classmethod
     def get_config_filenames(cls) -> List[str]:
-        return []
+        return ["quant_model_description.json"]
 
     @classmethod
     def from_config(cls, config: Dict[str, Any]) -> "AscendQuantConfig":
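The effect of the one-line change can be sketched with a minimal stand-in for `AscendQuantConfig`. The real class lives in `vllm_ascend/quantization/quant_config.py`; the method names below mirror the diff, but the surrounding constructor and loader logic are assumptions for illustration only, not the actual vllm-ascend implementation.

```python
from typing import Any, Dict, List


class AscendQuantConfig:
    """Minimal sketch: holds the quantization description emitted by modelslim.

    Hypothetical simplification of the real AscendQuantConfig; only the two
    classmethods from the diff are faithful to the source.
    """

    def __init__(self, quant_description: Dict[str, Any]):
        # Assumed field: the parsed contents of quant_model_description.json.
        self.quant_description = quant_description

    @classmethod
    def get_config_filenames(cls) -> List[str]:
        # Before this commit the method returned []; it now tells the config
        # loader which JSON file newer modelslim versions place in the model
        # directory.
        return ["quant_model_description.json"]

    @classmethod
    def from_config(cls, config: Dict[str, Any]) -> "AscendQuantConfig":
        # The loader parses each filename above and passes the dict here.
        return cls(config)


# Usage sketch with an in-memory dict standing in for the parsed JSON file
# (the keys shown are illustrative, not a documented schema):
cfg = AscendQuantConfig.from_config({"w_bit": 8})
print(AscendQuantConfig.get_config_filenames())
print(cfg.quant_description["w_bit"])
```

In vLLM's quantization-config machinery, `get_config_filenames` is how a backend advertises which files to look for when loading a quantized checkpoint, so returning an empty list previously meant the new modelslim description file was never picked up.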
