Skip to content

Conversation

@realtmxi
Copy link
Collaborator

  • update the verl/utils/dataset, sync with verl latest code
  • update requirements.txt, to successfully run train_ppo.sh dependencies
  • extract webshop testset from AgentEval dataset
  • run_sft script
  • base_eval_webshop.sh & distributed_eval_webshop.sh

@Kunlun-Zhu
Copy link
Contributor

@realtmxi need to have another requirement.txt files for the rollout system

@Kunlun-Zhu Kunlun-Zhu merged commit 25ff2d4 into main May 11, 2025
1 of 5 checks passed
realtmxi added a commit that referenced this pull request May 23, 2025
* [feat]: offline rollout evaluation

* [feat]: sync with verl/utils/dataset/ lastest code

* feat: update requirements.txt

* [feat]: extract the webshop testset from AgentEval dataset

* feat: run_sft script

* feat: webshop evaluation script

---------

Co-authored-by: Kunlun Zhu <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants