WIS-LSTD Experiments

The main purpose of this project is to provide the python implementation of WIS-LSTD introduced by Mahmood, van Hasselt and Sutton (2014). Additionally, it also provides a random walk experiment to illustrate the usage of this algorithm.

It can be imported as an Eclipse Pydev project.

Read or execute runwislstdexperiments.sh for an example of running the experiment.

##References

Mahmood, A.R., van Hasselt, H., Sutton, R.S. (2014). Weighted importance sampling for off-policy learning with linear function approximation. Advances in Neural Information Processing Systems 27.

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
pysrc		pysrc
pysrctest		pysrctest
results/wislstdexperiments		results/wislstdexperiments
.gitignore		.gitignore
.project		.project
.pydevproject		.pydevproject
README.md		README.md
runwislstdexperiments.sh		runwislstdexperiments.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

WIS-LSTD Experiments

About

Releases

Packages

Languages

armahmood/wislstd-experiments

Folders and files

Latest commit

History

Repository files navigation

WIS-LSTD Experiments

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages