plot rewards as a function of number of timesteps #30

Open
rachit-dubey opened this issue Oct 4, 2018 · 1 comment

Comments

@rachit-dubey

Hi, thanks so much for the excellent codebase. Just wondering, is there any way to plot the training curve as a function of timesteps (as opposed to plotting the training curve as a function of time passed)?

Thanks!

@dgriff777
Owner

Because the training workers run asynchronously, there is no single shared timestep count to plot against. If you need that data, want an accurate number, and do not want to hinder training performance, it is probably best to keep a tally in each parallel process. Then combine the tallies by sending them to a newly created process that does the actual summation.
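A minimal sketch of that idea, using only the Python standard library rather than this repository's actual training code. The names `worker`, `aggregate`, and `run_demo` are hypothetical, and the `episodes` argument is a stand-in for real rollouts: each worker reports how many steps its finished episode took and the reward it earned, and the aggregator turns those reports into (global timesteps, reward) points that can be plotted.

```python
import multiprocessing as mp


def worker(rank, episodes, queue):
    """Hypothetical training worker.

    `episodes` is an iterable of (steps, reward) pairs standing in for
    real rollouts. Each finished episode is reported once, so the
    training loop is not slowed by per-step communication.
    """
    for steps, reward in episodes:
        queue.put((rank, steps, reward))
    queue.put((rank, None, None))  # sentinel: this worker is done


def aggregate(queue, num_workers):
    """Sum per-worker step tallies into one global count.

    Returns a list of (global_steps, reward) points, in arrival order.
    """
    total_steps = 0
    finished = 0
    curve = []
    while finished < num_workers:
        rank, steps, reward = queue.get()
        if steps is None:
            finished += 1
            continue
        total_steps += steps
        curve.append((total_steps, reward))
    return curve


def run_demo(num_workers=4):
    """Launch workers as processes and aggregate in the calling process.

    The placeholder episodes below are illustrative values only.
    """
    queue = mp.Queue()
    episodes = [(100, 1.0), (120, 2.0)]
    procs = [
        mp.Process(target=worker, args=(rank, episodes, queue))
        for rank in range(num_workers)
    ]
    for p in procs:
        p.start()
    curve = aggregate(queue, num_workers)  # drain the queue before joining
    for p in procs:
        p.join()
    return curve
```

Here `aggregate` runs in the calling process for brevity; it could equally be the target of its own `mp.Process`, as suggested above. When starting processes, call `run_demo()` from under an `if __name__ == "__main__":` guard. The returned points can be passed to any plotting tool with global timesteps on the x-axis.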
