Subclass API #966
Conversation
CI status (Dr. CI, updated every 15 minutes): as of commit 8d53959 with merge base 60ffb86, 1 new failure and 2 unrelated failures. The unrelated failures were already present on the merge base (broken trunk); rebase onto the `viable/strict` branch to avoid them. There is 1 currently active SEV. Note: links to docs will display an error until the docs builds have completed. Artifacts and rendered test results: hud.pytorch.org/pr/pytorch/ao/966
This pull request was exported from Phabricator. Differential Revision: D62464487
Summary: Pull Request resolved: pytorch#966. Adds new int8_dynamic_activation_intx_weight quantization with a subclass API. Differential Revision: D62464487
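The scheme named in the summary pairs dynamically quantized int8 activations with low-bit ("intx") weights. Below is a minimal numpy sketch of that idea only, not torchao's implementation; the function names (`quantize_symmetric`, `quantized_linear`) and the per-tensor granularity are illustrative assumptions.

```python
# Sketch of int8 dynamic-activation / low-bit-weight quantization.
# NOT torchao's API; names and granularity are illustrative only.
import numpy as np

def quantize_symmetric(x, bits):
    """Symmetric per-tensor quantization to a signed `bits`-bit grid."""
    qmax = 2 ** (bits - 1) - 1
    amax = np.max(np.abs(x))
    scale = amax / qmax if amax > 0 else 1.0
    q = np.clip(np.round(x / scale), -qmax - 1, qmax).astype(np.int32)
    return q, scale

def quantized_linear(x, w_q, w_scale):
    """Quantize activations to int8 on the fly (the 'dynamic' part),
    do the matmul in integer arithmetic, then rescale to float."""
    x_q, x_scale = quantize_symmetric(x, bits=8)  # recomputed per call
    acc = x_q.astype(np.int64) @ w_q.T.astype(np.int64)
    return acc.astype(np.float64) * (x_scale * w_scale)

rng = np.random.default_rng(0)
w = rng.standard_normal((16, 32))
x = rng.standard_normal((4, 32))
w_q, w_scale = quantize_symmetric(w, bits=4)  # "intx" weights, e.g. 4-bit
y = quantized_linear(x, w_q, w_scale)
err = np.max(np.abs(y - x @ w.T))  # quantization error vs. float matmul
```

In the real subclass API, the weight tensor is wrapped in a tensor subclass that carries the packed low-bit values and scales, so `nn.Linear` dispatches to a kernel like the one sketched above transparently.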
Force-pushed from 8b7c8fb to 8d53959.
Differential Revision: D62464487. Pull Request resolved: #995
* add llama 3.1 8b support
* make Model and ModelArgs the model definition entrance
* make model definition support multiple transformers
* make input arg static in Model to support export
* fix bugs for gguf and et in new model definition architecture
* retrieve text transformer arg from modelargs
* add set_cache function to Model to work around PTEModel issue
* make torchchat rely on torchtune
* remove export_util
* extra torchtune dependency