GitHub - gravesec/actor-critic-with-emphatic-weightings: Experiment code for our project on actor-critic algorithms with emphatic weightings.

Installation:

Install python3 (3.7.5) if necessary. On MacOS using Homebrew to install python3 is pretty good.
Create a new python virtual environment in the 'actor-critic-with-emphatic-weightings' directory (named 've' in this case):

$ python3 -m venv ve

Activate the virtual environment:

$ source ve/bin/activate

Install the required python package dependencies:

(ve)$ pip install -r requirements.txt

Running the experiment scripts on Compute Canada:

Activate the virtual environment (if necessary):

$ cd $SCRATCH/actor-critic-with-emphatic-weightings/
$ source ve/bin/activate

Read the help output for each script to determine which arguments you want to run your experiment with.

(ve)$ python generate_experience.py --help
(ve)$ python sweep.py --help
(ve)$ python run_ace.py --help
(ve)$ python evaluate_policies.py --help

Generate the data to use to train the agents:

(ve)$ python generate_experience.py

Generating the data ahead of time is more efficient than doing it for each agent, and is possible due to off-policy learning.

Run the sweep.py python script to generate bash scripts for SLURM to run:

(ve)$ python sweep.py

The script will give you a really rough estimate of how long the job might take and the number of nodes necessary to complete the job in the amount of time specified via the "--num_hours" argument. If requesting that number of nodes is ok with you, type "y", hit enter, and the script will generate the individual bash scripts for each node.

Schedule the generated script(s) to run via SLURM:

(ve)$ sbatch mountain-car/sweep0.sh

Evaluate the resulting policies:

(ve)$ python evaluate_policies.py

Use a jupyter notebook to explore the data and generate plots of performance.

Name		Name	Last commit message	Last commit date
Latest commit History 292 Commits
figures		figures
src		src
test		test
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
ace.ipynb		ace.ipynb
evaluate_policies.py		evaluate_policies.py
generate_experience.py		generate_experience.py
mountain-car.ipynb		mountain-car.ipynb
puddle-world.ipynb		puddle-world.ipynb
requirements.txt		requirements.txt
run_ace.py		run_ace.py
run_ace_only_eval.py		run_ace_only_eval.py
run_ace_virtual_office.py		run_ace_virtual_office.py
run_ideal_ace.py		run_ideal_ace.py
sweep.py		sweep.py
virtual-office.ipynb		virtual-office.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Installation:

Running the experiment scripts on Compute Canada:

About

Releases 2

Packages

Contributors 4

Languages

License

gravesec/actor-critic-with-emphatic-weightings

Folders and files

Latest commit

History

Repository files navigation

Installation:

Running the experiment scripts on Compute Canada:

About

Resources

License

Stars

Watchers

Forks

Releases 2

Packages 0

Contributors 4

Languages

Packages