Skip to content

Conversation

@jla524
Copy link
Contributor

@jla524 jla524 commented Mar 21, 2024

What does this PR do?

Fixes #29697 (issue)

  • match the torch_dtype argument in run_clm
  • max_seq_length is very similar to block_size and is left unchanged

Who can review?

@galtay and @amyeroberts

Copy link
Contributor

@amyeroberts amyeroberts left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for adding this!

@amyeroberts amyeroberts merged commit ef6e371 into huggingface:main Mar 21, 2024
@jla524 jla524 deleted the run_mlm_dtype branch April 22, 2024 22:13
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

run_mlm example is missing block_size and torch_dtype args (present in run_clm)

2 participants