Skip to content

v2.0-beta4: HQQ Fix and Minor Refinements

Compare
Choose a tag to compare
@abgulati abgulati released this 06 Sep 23:08
· 22 commits to main since this release
  • BUG FIX: HQQ quantization would error out if torch.dtype (dataType) was set to auto, it now force-sets to torch.bfloat16

  • BUG FIX: Add new LLM button re-displays when the HF-Waitress LLM list is closed and re-opened

  • Minor response-formatting adjustment

Full Changelog: v2.0-beta3...v2.0-beta4