-
Notifications
You must be signed in to change notification settings - Fork 87
Issues: Lightning-AI/lightning-thunder
Label tracking meta-issue (edit me to get automatically CC'ed...
#72
opened Mar 25, 2024 by
carmocca
Open
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Grad Transform generates inconsistent saved_for_backward between forward and backward trace.
#1732
opened Feb 1, 2025 by
jjsjann123
dividing a float16 tensor by a python float is inaccurate with nvfuser
nvfuser
#1724
opened Jan 30, 2025 by
beverlylytle
cudnn SDPA : cudnn sdpa is not used for bigcode/starcoder2-7b
cudnn
#1722
opened Jan 30, 2025 by
kshitij12345
Add HF models PEFT benchmarks
benchmarking
nemo
Issues needed to support NVIDIA NeMo models.
#1716
opened Jan 29, 2025 by
riccardofelluga
4 tasks done
Run LitGPT benchmarking with a custom Attention implementation priority.
benchmarking
#1714
opened Jan 29, 2025 by
wprazuch
Input upcast is missing in Thunder's implementation of torch.nn.functional.rms_norm
operators
#1713
opened Jan 29, 2025 by
IvanYashchuk
symbolic cache policy can't handle string inputs properly.
symbolic values
#1710
opened Jan 28, 2025 by
jjsjann123
[Reporting Tool] Modular report classes for saving non-Thunder-specific repro scripts
reporting
thunderfx
for things that could be applicable to the dynamo+thunder frontend
#1700
opened Jan 27, 2025 by
kiya00
Implement max_norm argument for torch.nn.functional.embedding
enhancement
New feature or request
in-place
operators
#1699
opened Jan 27, 2025 by
IvanYashchuk
check memory location of things tagged STATIC_MEMORY_LOCATION by default
cudagraphs
enhancement
New feature or request
#1686
opened Jan 24, 2025 by
t-vi
Transforming traces should always precede a domination check
enhancement
New feature or request
#1684
opened Jan 22, 2025 by
ali-alshaar7
Investigate bf16 rms norm numerics
numerical accuracy
operators
thunderfx
for things that could be applicable to the dynamo+thunder frontend
#1678
opened Jan 22, 2025 by
t-vi
Connect New feature or request
nvfuser
thunderfx
for things that could be applicable to the dynamo+thunder frontend
prims.copy_with_setitem
to nvFuser's Executor
enhancement
#1676
opened Jan 22, 2025 by
kevinstephano
thunder.jit
has a relatively high CPU overhead when processing small graphs with small inputs.
performance
#1657
opened Jan 17, 2025 by
kiya00
nvFuser using more memory than inductor for HF CausalLMLoss
memory use
nvfuser
#1654
opened Jan 17, 2025 by
riccardofelluga
backward creates inconsistent proxies between args and unpacking them
autograd
tracing architecture
#1633
opened Jan 10, 2025 by
t-vi
avoid joint trace in rematerialize forward backward
rematerialization
#1618
opened Jan 8, 2025 by
t-vi
make traces own proxies and bsyms
enhancement
New feature or request
tracing architecture
#1606
opened Jan 6, 2025 by
t-vi
nvFuser has a faster RMSNorm fusion definition than thunder's RMSNorm decomposition
operators
performance
#1582
opened Dec 23, 2024 by
mruberry
Get dynamic shapes to work with Phi-3-mini-128k-instruct
enhancement
New feature or request
nemo
Issues needed to support NVIDIA NeMo models.
#1579
opened Dec 20, 2024 by
tfogal
Consider adding is_leaf attribute to TensorProxies
enhancement
New feature or request
#1577
opened Dec 20, 2024 by
beverlylytle
Previous Next
ProTip!
Follow long discussions with comments:>50.