You Only Evaluate Once (YOEO)

Prerequisite

mujoco200
conda

Remember to add the mujoco directory to LD_LIBRARY_PATH environment variable.
All the other dependencies will be handled in the following installation script via conda.

Install

git clone <anonymized for dual blind>
cd <cloned-dir>
conda create --file env.yaml --name YOEO
conda activate YOEO
# in the case of error during creation, use conda update commands:
# conda env update --file env.yaml
pip install git+https://github.com/rail-berkeley/d4rl@master#egg=d4rl

Run

Train Y

```
python -m YOEO.scripts.value_training --log_dir ./log/directory/you/want --seed {seed you want} --config_file ./experiments/Y.gin ./experiments/envs/{hopper,walker2d,etc}.gin --config_params env_id=\'{d4rl env id, like hopper-medium-replay-v0}\'
```

Train YOEO (and ablations)

YOEO

python -m YOEO.scripts.offline_rl --log_dir ./log/directory/you/want --seed 1 --config_file ./experiments/ValidQbeta.gin ./experiments/envs/{hopper,walker2d,etc}.gin --config_params env_id=\'{d4rl env id, like hopper-medium-replay-v0}\' Y_chkpt=\'./path/to/trained/Y/value.tf\' lambda={0.1 or 1.0}

$Q^\beta$ without regularizations

python -m YOEO.scripts.offline_rl --log_dir ./log/directory/you/want --seed 1 --config_file ./experiments/Qbeta.gin ./experiments/envs/{hopper,walker2d,etc}.gin --config_params env_id=\'{d4rl env id, like hopper-medium-replay-v0}\'

For other ablation studies, you can just change the gin file, or you can provide the hyperparameters using --config_params arguments, then the provided arguments will overide the arguments in gin file.

Observe training statistics & results

Use Tensorboard
```
tensorboard --logdir <log directory>
```

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
YOEO		YOEO
.gitignore		.gitignore
README.md		README.md
env.yaml		env.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

You Only Evaluate Once (YOEO)

Prerequisite

Install

Run

Train Y

Train YOEO (and ablations)

Observe training statistics & results

About

Releases

Packages

Languages

hiwonjoon/YOEO-public

Folders and files

Latest commit

History

Repository files navigation

You Only Evaluate Once (YOEO)

Prerequisite

Install

Run

Train Y

Train YOEO (and ablations)

Observe training statistics & results

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages