Skip to content

Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization

License

Notifications You must be signed in to change notification settings

yihaosun1124/mobile

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Model-Bellman Inconsistency Penalized Offline Policy Optimization (MOBILE)

License: MIT

Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization.

Requirements

To install all the required dependencies:

  1. Install MuJoCo engine, which can be downloaded from here.
  2. Install Python packages listed in requirements.txt using pip install -r requirements.txt. You should specify the version of mujoco-py in requirements.txt depending on the version of MuJoCo engine you have installed.
  3. Manually download and install d4rl package from here.
  4. Manually download and install neorl package from here.

Usage

Just run train.py with specifying the task name. Other hyperparameters are automatically loaded from config.

python train.py --task [TASKNAME]

Citation

If you find this repository useful for your research, please cite:

@inproceedings{
    mobile,
    title={Model-Bellman Inconsistency Penalized Offline Policy Optimization},
    author={Yihao Sun and Jiaji Zhang and Chengxing Jia and Haoxin Lin and Junyin Ye and Yang Yu},
    booktitle={International Conference on Machine Learning},
    year={2023}
}

About

Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages