Skip to content

Conversation

@younesbelkada
Copy link
Contributor

@younesbelkada younesbelkada commented Dec 16, 2022

What does this PR do?

This PR is a follow-up PR of #50 - it is an attempt to remove the nbdev dependency from trl
This PR also adds a patch to support the latest version of transformers
Draft for now as I have to figure out how to deal with the documentation

For now testing that everything works with python examples/04-ppo-sentiment.py

cc @lvwerra

- move notebooks to `examples/notebooks``
- removed `_nbdev`file
- refactored `gpt2.py` to make it work with more recent `transformers`
- update `requirements` to add recent `transformers`
@younesbelkada younesbelkada marked this pull request as ready for review December 19, 2022 21:25
@younesbelkada
Copy link
Contributor Author

Moved all images inside the org https://huggingface.co/trl-internal-testing and fixed all image links on README + notebooks with the correct ones
Also as discussed, I removed the 3 first notebooks ;) Let me know what is missing here!

Copy link
Member

@lvwerra lvwerra left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @younesbelkada, looks great!

A few minor things to fix:

  • remove the numbering from notebooks and scripts - i think this is no longer necessary
  • remove the CI (should be in .github/workflows)
    I wonder if it's possible to remove generated from note at the top.

@lvwerra
Copy link
Member

lvwerra commented Dec 20, 2022

@younesbelkada
Copy link
Contributor Author

Thanks for the review!
I should have removed the CI, done the renaming of the files ;-)

@younesbelkada younesbelkada merged commit 6fa5248 into huggingface:master Dec 20, 2022
yxliu-TAMU pushed a commit to mincheolseong/ECEN743-GRPO-Project-Proposal that referenced this pull request Apr 20, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants