sc_a3c: another attempt at an SC2 AI

This is the A3C method that DeepMind applied in 2017 to train an SC2 AI. A3C has proved successful on the minigames.

dqfd_new.py: an older DQfD implementation, which proved unsuccessful.

a3c_tl: A3C for continuous control, based on TensorLayer. Copied from TensorLayer's GitHub.

a3c_discrete: A3C for discrete decision problems. Copied from Morvan Zhou.

a3c_sc: My implementation of A3C on pysc2.
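
For readers new to A3C, here is a minimal sketch of the n-step return and advantage computation that an A3C worker typically performs before pushing gradients to the global network. It is not taken from the scripts above; the function names `discounted_returns` and `advantages` and the default `gamma` are assumptions chosen for illustration.

```python
def discounted_returns(rewards, bootstrap_value, gamma=0.99):
    """Compute n-step discounted returns for one rollout.

    rewards: rewards collected by a worker, in time order.
    bootstrap_value: critic's value estimate for the state after the
        last step (use 0.0 if the episode ended).
    """
    returns = []
    running = bootstrap_value
    # Walk backwards so each return reuses the one that follows it.
    for reward in reversed(rewards):
        running = reward + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def advantages(returns, values):
    """Advantage = n-step return minus the critic's value estimate."""
    return [g - v for g, v in zip(returns, values)]


if __name__ == "__main__":
    rets = discounted_returns([1.0, 1.0, 1.0], bootstrap_value=0.0, gamma=0.5)
    print(rets)
    print(advantages(rets, [1.0, 1.0, 1.0]))
```

The advantages weight the policy-gradient term, while the returns serve as regression targets for the critic.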

As of version 1.1, most of the work is finished. See todo.txt for planned improvements.

En Taro Tassadar, and enjoy yourself.

References:
