Issues: pytorch/ao
Issues list
[tracker] Low precision training for MoEs
tracker
triaged
#2147
opened Apr 29, 2025 by
danielvegamyhre
4 of 10 tasks
KleidiAI int4 kernels not loading properly on aarch64 Linux
quantize
triaged
#2143
opened Apr 28, 2025 by
vctrmn
AO/GemLite tensors produce incorrect outputs in vLLM
integration
Issues related to integrations with other libraries, like huggingface, vllm, sglang, gemlite etc.
quantize
triaged
#2141
opened Apr 28, 2025 by
mobicham
AssertionError During Quantization of torch.empty_like(), torch.ones_like, and torch.randn_like
pt2e_quant
pt2 export quantization
triaged
#2146
opened Apr 28, 2025 by
defaultd661
QAT model drops accuracy after converting with torch.ao.quantization.convert
qat
triaged
#2138
opened Apr 28, 2025 by
tranngocduvnvp
int8 quantization with FSDP for inference error
distributed
quantize
triaged
#2127
opened Apr 25, 2025 by
Andy0422
Question about dtype check in marlin_qqq validation for w4a8 functionality
quantize
triaged
#2115
opened Apr 23, 2025 by
xxw11
[PT2E] observers do not handle inputs with different shapes correctly
pt2e_quant
pt2 export quantization
triaged
#2112
opened Apr 23, 2025 by
Xia-Weiwen
Got unexpected low speed using quantization inference on qwen models.
performance
triaged
#2102
opened Apr 22, 2025 by
HaoKang-Timmy
[Tracker] TorchAO activation sparsity acceleration 🚀
sparsity
tracker
triaged
#2095
opened Apr 22, 2025 by
jcaip
2 of 9 tasks
[Quant][PT2E] AffineQuantized observers failed Resnet18
pt2e_quant
pt2 export quantization
triaged
#2094
opened Apr 22, 2025 by
Xia-Weiwen
How to automatically install the latest TorchAO nightly wheel
distribution
triaged
#2086
opened Apr 21, 2025 by
MingxuZh
Refactor torchao and tests to use model architectures from torchao.testing.model_architectures
good first issue
Good for newcomers
triaged
#2078
opened Apr 18, 2025 by
jainapurva
Dynamo error with large mesh + AdamWFp8 + bf16 stochastic rounding
bug
Something isn't working
distributed
optimizer
triaged
#2074
opened Apr 18, 2025 by
cassanof
Make lm_eval optional dependency
topic: for developers
Use this tag if this PR is mainly developer facing
triaged
#2073
opened Apr 18, 2025 by
jainapurva
Remove old subclass implementation to reduce maintenance cost
topic: deprecation
Use this tag if this PR deprecates a feature
triaged
#2056
opened Apr 14, 2025 by
jerryzh168
Making RCEIL the default for MXFP scale derivation
mx
triaged
#2035
opened Apr 10, 2025 by
frsun-nvda
Fix remaining issues when running on H100 machines
topic: for developers
Use this tag if this PR is mainly developer facing
triaged
#2028
opened Apr 8, 2025 by
jerryzh168
Torchao import time
topic: improvement
Use this tag if this PR is an improvement (doesn't fit into any of the other categories)
triaged
#1944
opened Mar 24, 2025 by
felipemello1
[Bug] FSDP2 FP8 compatibility problem with nn.Linear layers (GPU count > out_features)
distributed
float8
triaged
#1938
opened Mar 24, 2025 by
HIT-cwh