Skip to content

tencent-ailab/TPolicies

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

TPolicies

Policies used in various RL or IL applications. Based on tf 1.x APIs. In the style of tf.contrib.layers and tf-slim. Some code of implementing policy gradient loss is borrowed from deepmind/trfl and openai/baselines. Some code for transformer layer is borrowed from here.

Install

cd to the folder and run the command:

pip install -e .

Moreover, we require tensorflow==1.15.0, please install it manually!

Quick Example

See the testing file net_zoo/net_name/net_name_test.py which can serve as examples.

some notes

Use tf.contrib.framework.nest, e.g., nest.flatten, nest.map_structure. It is helpful when, for example, computing losses from all the action heads.

Disclaimer

This is not an officially supported Tencent product. The code and data in this repository are for research purpose only. No representation or warranty whatsoever, expressed or implied, is made as to its accuracy, reliability or completeness. We assume no liability and are not responsible for any misuse or damage caused by the code and data. Your use of the code and data are subject to applicable laws and your use of them is at your own risk.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages