We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Hi,请问PDF版的P150页6.2“奖励模型”这一章节的第一句话“基于人类反馈训练的奖励模型可以很好的人类的偏好”里“很好的”和“人类偏好”之间是否漏掉了诸如“拟合”,“对齐”这样的动词
The text was updated successfully, but these errors were encountered:
No branches or pull requests
Hi,请问PDF版的P150页6.2“奖励模型”这一章节的第一句话“基于人类反馈训练的奖励模型可以很好的人类的偏好”里“很好的”和“人类偏好”之间是否漏掉了诸如“拟合”,“对齐”这样的动词
The text was updated successfully, but these errors were encountered: