Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Subclass API #966

Closed
wants to merge 1 commit into from
Closed

Subclass API #966

wants to merge 1 commit into from

Conversation

metascroy
Copy link
Contributor

Summary: Adds new int8_dynamic_activation_intx_weight quantization with subclass API

Differential Revision: D62464487

Copy link

pytorch-bot bot commented Sep 28, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/966

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

❌ 1 New Failure, 2 Unrelated Failures

As of commit 8d53959 with merge base 60ffb86 (image):

NEW FAILURE - The following job has failed:

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Sep 28, 2024
@facebook-github-bot
Copy link

This pull request was exported from Phabricator. Differential Revision: D62464487

Summary:
Pull Request resolved: pytorch#966

Adds new int8_dynamic_activation_intx_weight quantization with subclass API

Differential Revision: D62464487
@facebook-github-bot
Copy link

This pull request was exported from Phabricator. Differential Revision: D62464487

@metascroy metascroy closed this Oct 2, 2024
metascroy added a commit to metascroy/ao that referenced this pull request Oct 2, 2024
Summary:

Adds new int8_dynamic_activation_intx_weight quantization with subclass API

Differential Revision: D62464487
facebook-github-bot pushed a commit that referenced this pull request Oct 30, 2024
Differential Revision: D62464487

Pull Request resolved: #995
yanbing-j pushed a commit to yanbing-j/ao that referenced this pull request Dec 9, 2024
* add llama 3.1 8b support

* make Model and ModelArgs as model definition entrance

* make model definition support multiple transformer

* make model definition support multiple transformer

* make model definition support multiple transformer

* make input arg static in Model to support export

* fix bugs for gguf and et in new model definition architecture

* retrieve text transformer arg from modelargs

* add set_cache funtion to Model to work around PTEModel issue

* make torchchat rely on torchtune

* remove export_util

* extra torchtune dependency
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. fb-exported
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants