train.py is an implementation of A3C or A2C ? #6

Open
wanziyu opened this issue Jun 5, 2022 · 1 comment

Comments

wanziyu commented Jun 5, 2022

In train.py, I see a central agent, an SL agent, and RL agents. They run on different CPU cores using the multiprocessing package, and the RL agents get the weights of the policy and value networks from the central agent through a Queue. train_a3c.py looks very similar to train.py. Are these two files both implementations of the A3C algorithm?
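
For reference, the setup described above (a central agent handing weights to RL agents through queues, each agent in its own process) can be sketched roughly as follows. This is a minimal illustration, not the code in train.py: the names `rl_agent`, `central_agent`, `compute_gradient`, and `apply_gradient` are hypothetical, the "network" is a plain list of floats, and the gradient is a toy stand-in for a rollout plus backpropagation. In this sketch the central agent applies each gradient as soon as it arrives, which is the asynchronous (A3C-style) pattern. Waiting for every agent before each update would be the synchronous (A2C-style) variant.

```python
import multiprocessing as mp
import random


def compute_gradient(weights, seed):
    """Toy stand-in for an agent's rollout and backprop (assumption).

    Returns the gradient of sum(w^2)/2, i.e. w itself, plus small noise.
    """
    rng = random.Random(seed)
    return [w + rng.uniform(-0.01, 0.01) for w in weights]


def apply_gradient(weights, grad, lr=0.1):
    """One plain SGD step on the central copy of the weights."""
    return [w - lr * g for w, g in zip(weights, grad)]


def rl_agent(agent_id, weight_q, grad_q, num_steps):
    """Worker process: pull weights, compute a gradient, send it back."""
    for step in range(num_steps):
        weights = weight_q.get()  # blocks until the central agent sends weights
        grad = compute_gradient(weights, seed=agent_id * 1000 + step)
        grad_q.put((agent_id, grad))


def central_agent(num_agents=2, num_steps=3):
    """Central process: owns the weights and updates them asynchronously."""
    weights = [1.0, -2.0]
    weight_qs = [mp.Queue() for _ in range(num_agents)]
    grad_q = mp.Queue()
    procs = [
        mp.Process(target=rl_agent, args=(i, weight_qs[i], grad_q, num_steps))
        for i in range(num_agents)
    ]
    for p in procs:
        p.start()
    for q in weight_qs:
        q.put(weights)  # initial weights for every agent

    remaining = [num_steps - 1] * num_agents
    for _ in range(num_agents * num_steps):
        agent_id, grad = grad_q.get()
        # Apply the gradient immediately, without waiting for other agents.
        weights = apply_gradient(weights, grad)
        if remaining[agent_id] > 0:
            remaining[agent_id] -= 1
            weight_qs[agent_id].put(weights)  # send fresh weights to that agent only

    for p in procs:
        p.join()
    return weights


if __name__ == "__main__":
    print(central_agent())
```

The part that distinguishes the two algorithms is the loop in `central_agent`: whether updates are applied per arriving gradient or only after a full round from all agents.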

@pengyanghua (Owner)

@wanziyu I think both implement the A3C algorithm. You can run "diff train.py train_a3c.py" to see the differences.
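
If the `diff` command is not available (for example on Windows), the same comparison can be done with Python's standard `difflib` module. The helper names `diff_lines` and `diff_files` below are made up for this sketch, and it assumes both files sit in the current directory.

```python
import difflib


def diff_lines(a_text, b_text, a_name="train.py", b_name="train_a3c.py"):
    """Return unified-diff lines between two text blobs (empty list if equal)."""
    a = a_text.splitlines(keepends=True)
    b = b_text.splitlines(keepends=True)
    return list(difflib.unified_diff(a, b, fromfile=a_name, tofile=b_name))


def diff_files(a_path, b_path):
    """Read two files and return their unified diff as a list of lines."""
    with open(a_path) as fa, open(b_path) as fb:
        return diff_lines(fa.read(), fb.read(), a_path, b_path)


if __name__ == "__main__":
    import os

    # Only runs the comparison when both files are actually present.
    if os.path.exists("train.py") and os.path.exists("train_a3c.py"):
        print("".join(diff_files("train.py", "train_a3c.py")))
```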
