Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Question] Batch Size Selection for a Finite MDP #2024

Open
4 tasks done
DavidLudl opened this issue Oct 22, 2024 · 4 comments
Open
4 tasks done

[Question] Batch Size Selection for a Finite MDP #2024

DavidLudl opened this issue Oct 22, 2024 · 4 comments
Labels
question Further information is requested

Comments

@DavidLudl
Copy link

DavidLudl commented Oct 22, 2024

❓ Question

Hello.

I would like to ask if I have a finite MDP, where each episode has a same fixed timestep $T$. Then during the training, do I have to choose batch size with $n\times T$? Or any other batch size is also OK?

Thank you for your time,

Best regards,

Checklist

@DavidLudl DavidLudl added the question Further information is requested label Oct 22, 2024
@araffin
Copy link
Member

araffin commented Oct 29, 2024

Then during the training, do I have to choose batch size with n × T ? Or any other batch size is also OK?

What algorithm are you using?
I guess you are talking about on policy algorithm (A2C/PPO) and the n steps parameter?

In that case, you can use any number of steps (this parameter can impact performance though).

@DavidLudl
Copy link
Author

I am using PPO. They parameter I want to ask is the batch_size. Should I set the batch_size (default 64) in PPO algorithm to n*T ?

@araffin
Copy link
Member

araffin commented Oct 29, 2024

The mini batch size can be kept as is, it is only about the gradient step.

@DavidLudl
Copy link
Author

Thank you, now I understand.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants