Skip to content

25% less mem and 10% faster training: Do not upcast lm_head and embedding to float32#1186

Merged
danielhanchen merged 2 commits intounslothai:nightlyfrom
Datta0:lm_head_no_upcast
Oct 25, 2024
Merged

25% less mem and 10% faster training: Do not upcast lm_head and embedding to float32#1186
danielhanchen merged 2 commits intounslothai:nightlyfrom
Datta0:lm_head_no_upcast

Commits