Replies: 1 comment 4 replies
-
what about el honcho - the big cheese - loading a checkpoint? |
Beta Was this translation helpful? Give feedback.
4 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
@bghira I was able to hack together a dirty script for FP8 LoRA training through
torchao
: https://gist.github.com/sayakpaul/2743810ed362cdc841b90956e2b155a1But the results are not there yet. So, opening this discussion up for debugging this together. But some desired things already work:
transformer
into an FP8-compatible module.accelerator.prepare()
on thetransformer
.Training command and other necessary instructions are in the gist itself.
Cc @AmericanPresidentJimmyCarter who might be interested too.
Beta Was this translation helpful? Give feedback.
All reactions