Hi everyone, long time no see! Starting this week, I will spend about four weeks gradually pushing AutoGPTQ to v1.0.0. In the meantime, there will be two or three minor versions released as optimization or feature previews, so that you can try those updates as soon as they are finished and I can hear more community voices and gather more feedback.
My vision is that by the time v1.0.0 is released, AutoGPTQ can serve as an automatic, extensible, and flexible quantization backend for all language models written in PyTorch.
I'm opening this issue to list everything that will be done (optimizations, new features, bug fixes, etc.) and to record development progress, so the contents below will be updated frequently.
Feel free to comment in the thread to give your opinions and suggestions!
Optimizations
refactor the code framework for future extensions while maintaining the important interfaces.
separate quantization logic into a standalone module that serves as a mixin (a rough sketch of this idea follows the list below).
design an automatic structure-recognition strategy to better support different models (hopefully even multi-modal and diffusion models; see the second sketch below).
speed up model packing after quantization.
support kernel fusion for more models to further speed up inference.
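To make the mixin idea above concrete, here is a minimal sketch of what a standalone quantization mixin might look like; the class and method names are hypothetical, not AutoGPTQ's actual API. The point of the mixin is that the quantization logic stays orthogonal to any particular model wrapper:

```python
import torch.nn as nn


class QuantizeMixin:
    """Hypothetical mixin bundling quantization logic, independent of any model class."""

    def quantize(self, examples, bits: int = 4, group_size: int = 128):
        # Walk the wrapped model and quantize each eligible linear layer in turn.
        for name, module in self.model.named_modules():
            if isinstance(module, nn.Linear):
                self._quantize_layer(name, module, examples, bits, group_size)

    def _quantize_layer(self, name, module, examples, bits, group_size):
        # Backend-specific GPTQ math would live here; omitted in this sketch.
        raise NotImplementedError


class CausalLMQuantizer(QuantizeMixin):
    """Hypothetical model wrapper that gains quantization by mixing in QuantizeMixin."""

    def __init__(self, model: nn.Module):
        self.model = model
```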
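Similarly, automatic structure recognition could work roughly as sketched below: instead of hard-coding layer names per model, walk the module tree to find the repeated transformer blocks and the quantizable linear layers inside them. This is only an illustration of the idea, not the planned implementation:

```python
import torch.nn as nn


def find_block_list(model: nn.Module):
    """Return the first homogeneous ModuleList, usually the stack of decoder blocks."""
    for module in model.modules():
        if isinstance(module, nn.ModuleList) and len(module) > 1:
            if len({type(m) for m in module}) == 1:
                return module
    return None


def find_quantizable_layers(block: nn.Module):
    """Collect the qualified names of nn.Linear layers inside one block."""
    return [name for name, m in block.named_modules() if isinstance(m, nn.Linear)]
```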
New Features
Bug Fixes
@PanQiWei Can you rejoin @fxmarty and be more active in code reviews? It feels like the project needs at least two active maintainers to keep it up to speed and not overload any single person.