Skip to content
Merged
13 changes: 13 additions & 0 deletions examples/bots/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,13 @@
# 🤖🤖🤖 BOTS: A Unified Framework for Bayesian Online Task Selection in LLM Reinforcement Finetuning

<p align="center">
<a href="https://arxiv.org/abs/1234.56789">
<img alt="Paper" src="https://img.shields.io/badge/Paper-arXiv%3A1234.56789-b31b1b?style=flat&logo=arxiv">
</a>
</p>

### Repository Status

This repository hosts the upcoming Trinity version of our code, which is still under development and not yet released.

For complete reproduction of the results in our paper, please use the verl version available [here](https://dail-wlcb.oss-cn-wulanchabu.aliyuncs.com/public/BOTS_verl_version.zip).
Loading