Bidirectional Model-based Policy Optimization

This is the TensorFlow implementation for the paper Bidrectional Model-based Policy Optimization.

Requirements

pip install -r requirements.txt

python main.py --config=config.hopperNT

To change hyper-parameters, please modify the corresponding config file in config/.

This code is mainly modified based on the mbpo codebase.