v2.0-beta4: HQQ Fix and Minor Refinements
-
BUG FIX: HQQ quantization would error out if torch.dtype (dataType) was set to auto, it now force-sets to torch.bfloat16
-
BUG FIX: Add new LLM button re-displays when the HF-Waitress LLM list is closed and re-opened
-
Minor response-formatting adjustment
Full Changelog: v2.0-beta3...v2.0-beta4