Skip to content

Conversation

@rasbt
Copy link
Contributor

@rasbt rasbt commented Apr 24, 2024

Allows selecting the precision for pretraining.

@rasbt rasbt mentioned this pull request Apr 24, 2024
Copy link
Contributor

@carmocca carmocca left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. This was hardcoded because originally it was just meant to be a script to reproduce tinyllama. Probably worth letting Adrian take a final look

@awaelchli
Copy link
Contributor

In #882 I set it to mixed because of convergence stability. It would be great if we could leave the TinyLlama defaults untouched to be consistent with the original repro settings. For pretraining, I see mixed as the better default, even for the toy examples. Can we keep that please?

@rasbt
Copy link
Contributor Author

rasbt commented Apr 25, 2024

Arg yes, typing "bf16-true" must have been a weird muscle memory reflex. I didn't intentionally mean to change this. Thanks for the note.

@rasbt rasbt enabled auto-merge (squash) April 25, 2024 13:25
@rasbt rasbt merged commit b9ddd8b into main Apr 25, 2024
@rasbt rasbt deleted the fix-auto-precision branch April 25, 2024 13:39
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants