-
Notifications
You must be signed in to change notification settings - Fork 247
Issues: OpenRLHF/OpenRLHF
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
assert state_dict_keys.issubset( [rank0]: AssertionError: mismatch keys
#501
opened Nov 5, 2024 by
anoxia-1
[RFC] Support SGLang generation in RLHF
enhancement
New feature or request
#487
opened Oct 28, 2024 by
hijkzzz
Evaluate the PPO Process: Compatibility issues between DeepSpeed checkpoints and Transformers models
#426
opened Aug 20, 2024 by
Ricardokevins
A worker died or was killed while executing a task by an unexpected system error.
#360
opened Jul 15, 2024 by
lusongshuo-mt
使用Deepseek-lite训练DPO,显示expected mat1 and mat2 to have the same type, but got: float != c10: : BFLoat16
#306
opened May 27, 2024 by
victorShawFan
Previous Next
ProTip!
Follow long discussions with comments:>50.