FastChat-T5 doc+fix data processing #1430

Merged
merged 5 commits into from
May 22, 2023

@DachengLi1 (Collaborator) commented May 21, 2023

Why are these changes needed?

This PR adds instructions on how to train fastchat-t5 and how to handle potential saving problems. It also fixes distributed data processing bugs and prevents a race condition on very small datasets, e.g. the playground dataset.

Related issue number (if applicable)

#1339, #643

Checks

  • I've run format.sh to lint the changes in this PR.
  • I've included any doc changes needed.
  • I've made sure the relevant tests are passing (if applicable).

@DachengLi1 changed the title from "Dacheng fst5" to "FastChat-T5 doc+fix data processing" on May 21, 2023
@merrymercy merged commit 621bc89 into main on May 22, 2023
@merrymercy deleted the dacheng-fst5 branch on May 22, 2023 01:06