Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GPTQ fixes #115

Closed
wants to merge 2 commits into from
Closed

GPTQ fixes #115

wants to merge 2 commits into from

Conversation

HDCharles
Copy link
Contributor

@HDCharles HDCharles commented Apr 3, 2024

Stack from ghstack (oldest at bottom):

Summary:

Test Plan:

Reviewers:

Subscribers:

Tasks:

Tags:

Summary:

Test Plan:

Reviewers:

Subscribers:

Tasks:

Tags:

[ghstack-poisoned]
HDCharles added a commit that referenced this pull request Apr 3, 2024
Summary:

Test Plan:

Reviewers:

Subscribers:

Tasks:

Tags:

ghstack-source-id: d29b6d73c90dec5171e12938afee25e5f42e042d
Pull Request resolved: #115
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Apr 3, 2024
@cpuhrsch cpuhrsch changed the base branch from gh/HDCharles/6/base to main April 3, 2024 00:51
@cpuhrsch
Copy link
Contributor

cpuhrsch commented Apr 3, 2024

@HDCharles - Changed the base to main in hope that CI will kick off

def test_gptq_quantizer_gpt_fast(self):
from torchao.quantization.GPTQ import Int8DynActInt4WeightGPTQQuantizer, InputRecorder
# should be similar to TorchCompileDynamicQuantizer
precision = torch.bfloat16
device = "cuda"
checkpoint_path = Path("../gpt-fast/checkpoints/meta-llama/Llama-2-7b-chat-hf/model.pth")
checkpoint_path = Path("/home/cdhernandez/local/gpt-fast/checkpoints/meta-llama/Llama-2-7b-chat-hf/model.pth")
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We definitely can't store this on CI. Maybe open_llama_7b works better. gpt-fast supports that as well.

HDCharles added a commit that referenced this pull request Apr 3, 2024
Summary: adding int4 gptq and eval support. Also fixed a few bugs
relating to quantizing the activation both during gptq calculation and
when calculating the output.

Test Plan: python test/quantization/test_quant_api.py

Reviewers:

Subscribers:

Tasks:

Tags:

ghstack-source-id: d29b6d73c90dec5171e12938afee25e5f42e042d
Pull Request resolved: #115
@HDCharles HDCharles mentioned this pull request Apr 3, 2024
Summary:

Test Plan:

Reviewers:

Subscribers:

Tasks:

Tags:

[ghstack-poisoned]
HDCharles added a commit that referenced this pull request Apr 3, 2024
Summary: adding int4 gptq and eval support. Also fixed a few bugs
relating to quantizing the activation both during gptq calculation and
when calculating the output.

Test Plan: python test/quantization/test_quant_api.py

Reviewers:

Subscribers:

Tasks:

Tags:

ghstack-source-id: 9d293f86255d16fb813c1d20a4c2e0dc5360e1cc
Pull Request resolved: #115
HDCharles added a commit that referenced this pull request Apr 3, 2024
* add int4 gptq and eval

Summary: adding int4 gptq and eval support. Also fixed a few bugs
relating to quantizing the activation both during gptq calculation and
when calculating the output.

Test Plan: python test/quantization/test_quant_api.py

Reviewers:

Subscribers:

Tasks:

Tags:

ghstack-source-id: d29b6d73c90dec5171e12938afee25e5f42e042d
Pull Request resolved: #115

* add int4 gptq and eval

Summary: adding int4 gptq and eval support. Also fixed a few bugs
relating to quantizing the activation both during gptq calculation and
when calculating the output.

Test Plan: python test/quantization/test_quant_api.py

Reviewers:

Subscribers:

Tasks:

Tags:

* remove debug from GPTQ

Summary:

Test Plan:

Reviewers:

Subscribers:

Tasks:

Tags:
dbyoung18 pushed a commit to dbyoung18/ao that referenced this pull request Jul 31, 2024
* add int4 gptq and eval

Summary: adding int4 gptq and eval support. Also fixed a few bugs
relating to quantizing the activation both during gptq calculation and
when calculating the output.

Test Plan: python test/quantization/test_quant_api.py

Reviewers:

Subscribers:

Tasks:

Tags:

ghstack-source-id: d29b6d73c90dec5171e12938afee25e5f42e042d
Pull Request resolved: pytorch#115

* add int4 gptq and eval

Summary: adding int4 gptq and eval support. Also fixed a few bugs
relating to quantizing the activation both during gptq calculation and
when calculating the output.

Test Plan: python test/quantization/test_quant_api.py

Reviewers:

Subscribers:

Tasks:

Tags:

* remove debug from GPTQ

Summary:

Test Plan:

Reviewers:

Subscribers:

Tasks:

Tags:
yanbing-j pushed a commit to yanbing-j/ao that referenced this pull request Dec 9, 2024
Update readme

Update README.md (pytorch#113)

update README.md

Update README.md (pytorch#114)

Update README.md (pytorch#115)

Update Readme.md
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants