TPolicies

Policies for reinforcement learning (RL) and imitation learning (IL) applications. Built on the tf 1.x APIs, in the style of tf.contrib.layers and tf-slim. Some of the code implementing the policy gradient loss is borrowed from deepmind/trfl and openai/baselines. Some of the transformer layer code is borrowed from here.

Install

cd to the repository folder and run:

pip install -e .

TPolicies also requires tensorflow==1.15.0. Please install it manually.

Quick Example

See the test files at net_zoo/net_name/net_name_test.py, which can serve as examples.

Some Notes

Use tf.contrib.framework.nest (e.g., nest.flatten, nest.map_structure) to work with nested structures. It is helpful, for example, when computing losses over all the action heads.
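
As a rough illustration of the note above, the sketch below applies nest.map_structure and nest.flatten to a nested dict of per-head losses. The head names ("ability", "move"), the loss values, and the weights are hypothetical and are not taken from TPolicies; plain Python floats stand in for loss tensors. So that the snippet also runs where tensorflow 1.15 is unavailable, it falls back to tf.nest and then to a small pure-Python stand-in; that stand-in is an assumption of this example and only handles dicts, lists, and tuples.

```python
try:
    # TF 1.x location mentioned in the note above.
    from tensorflow.contrib.framework import nest
except Exception:
    try:
        # Newer TensorFlow releases expose the same helpers as tf.nest.
        import tensorflow as tf
        nest = tf.nest
    except Exception:
        # Hypothetical stand-in so the sketch runs without TensorFlow.
        class nest(object):
            @staticmethod
            def flatten(structure):
                # Dicts are traversed in sorted-key order, like the real nest.
                if isinstance(structure, dict):
                    return [leaf for key in sorted(structure)
                            for leaf in nest.flatten(structure[key])]
                if isinstance(structure, (list, tuple)):
                    return [leaf for item in structure
                            for leaf in nest.flatten(item)]
                return [structure]

            @staticmethod
            def map_structure(fn, *structures):
                first = structures[0]
                if isinstance(first, dict):
                    return {key: nest.map_structure(
                                fn, *[s[key] for s in structures])
                            for key in first}
                if isinstance(first, (list, tuple)):
                    return type(first)(nest.map_structure(fn, *items)
                                       for items in zip(*structures))
                return fn(*structures)

# Hypothetical per-head losses and weights with the same nested layout.
head_losses = {"ability": 0.5, "move": {"x": 0.25, "y": 0.25}}
head_weights = {"ability": 1.0, "move": {"x": 2.0, "y": 2.0}}

# Apply the weight to every leaf; the nested layout is preserved.
weighted = nest.map_structure(lambda loss, w: loss * w,
                              head_losses, head_weights)

# Flatten all heads into one list and sum them into a single scalar.
total_loss = sum(nest.flatten(weighted))
print(weighted, total_loss)
```

With real loss tensors the final reduction would typically be tf.add_n(nest.flatten(weighted)) instead of Python's sum; the structure-handling calls stay the same.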

Disclaimer

This is not an officially supported Tencent product. The code and data in this repository are for research purposes only. No representation or warranty whatsoever, expressed or implied, is made as to its accuracy, reliability or completeness. We assume no liability and are not responsible for any misuse or damage caused by the code and data. Your use of the code and data is subject to applicable laws and is at your own risk.
