Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization.
To install all the required dependencies:
- Install the MuJoCo engine, which can be downloaded from here.
- Install the Python packages listed in `requirements.txt` using `pip install -r requirements.txt`. You should specify the version of `mujoco-py` in `requirements.txt` depending on the version of the MuJoCo engine you have installed.
- Manually download and install the `d4rl` package from here.
- Manually download and install the `neorl` package from here.
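After installation, a quick check that the packages can be found may save a failed training run. The following is a minimal sketch, not part of this repository: it assumes the import names are `mujoco_py`, `d4rl`, and `neorl`, and the helper `missing_modules` is hypothetical.

```python
import importlib.util

# Import names assumed for the packages listed above.
REQUIRED_MODULES = ["mujoco_py", "d4rl", "neorl"]


def missing_modules(names):
    """Return the module names that cannot be located by this interpreter."""
    # find_spec only locates a top-level module; it does not import it.
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED_MODULES)
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All required packages were found.")
```

This only confirms that the packages are discoverable. It does not verify that `mujoco-py` matches the installed MuJoCo engine version.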
Run `train.py` and specify the task name. Other hyperparameters are loaded automatically from `config`.
python train.py --task [TASKNAME]
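To train on several tasks in sequence, the command above can be wrapped in a small launcher. This is a sketch under assumptions: only the `--task` flag comes from this README, the helpers `build_command` and `run_tasks` are hypothetical, and the task names are placeholders to be replaced with names the repository actually supports.

```python
import subprocess
import sys


def build_command(task, script="train.py"):
    """Build the argument list for one training run of the given task."""
    return [sys.executable, script, "--task", task]


def run_tasks(tasks, dry_run=True):
    """Build one command per task; execute them only when dry_run is False."""
    commands = [build_command(task) for task in tasks]
    if not dry_run:
        for cmd in commands:
            # check=True stops the sequence at the first failing run.
            subprocess.run(cmd, check=True)
    return commands


if __name__ == "__main__":
    # Placeholder task names (assumption): substitute supported ones.
    example_tasks = ["hopper-medium-v2", "walker2d-medium-v2"]
    for cmd in run_tasks(example_tasks, dry_run=True):
        print(" ".join(cmd))
```

With `dry_run=True` the launcher only prints the commands, which makes it easy to review them before starting long training runs.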
If you find this repository useful for your research, please cite:
@inproceedings{mobile,
  title={Model-Bellman Inconsistency Penalized Offline Policy Optimization},
  author={Yihao Sun and Jiaji Zhang and Chengxing Jia and Haoxin Lin and Junyin Ye and Yang Yu},
  booktitle={International Conference on Machine Learning},
  year={2023}
}