v2.0-beta4: HQQ Fix and Minor Refinements

abgulati released this 06 Sep 23:08


8aa7889

BUG FIX: HQQ quantization errored out when torch.dtype (dataType) was set to auto; in that case the dtype is now force-set to torch.bfloat16
BUG FIX: The "Add new LLM" button now re-displays when the HF-Waitress LLM list is closed and re-opened
Minor response-formatting adjustment
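
The HQQ fix above amounts to a dtype fallback. A minimal sketch of that logic follows, assuming the setting is carried as a string; the helper name `resolve_torch_dtype` and its parameters are hypothetical and not taken from the project's code. In a real loader the returned name would be mapped to the matching torch dtype (for example `torch.bfloat16`).

```python
from typing import Optional


def resolve_torch_dtype(quant_method: Optional[str], torch_dtype: str) -> str:
    """Hypothetical helper: pick the dtype name to load the model with."""
    # Assumption: HQQ cannot work with "auto", so force bfloat16 in that case only.
    if quant_method == "hqq" and torch_dtype == "auto":
        return "bfloat16"
    # Any explicit dtype, or any non-HQQ setup, is passed through unchanged.
    return torch_dtype
```

Only the HQQ-plus-auto combination is overridden, so an explicitly chosen dtype is still respected.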

Full Changelog: v2.0-beta3...v2.0-beta4
