Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Tuple action space with stable baselines PPO2 [question] #107

Closed
sahilgupta2105 opened this issue Dec 1, 2018 · 3 comments
Closed

Tuple action space with stable baselines PPO2 [question] #107

sahilgupta2105 opened this issue Dec 1, 2018 · 3 comments
Labels
enhancement New feature or request help wanted Help from contributors is needed

Comments

@sahilgupta2105
Copy link

Hi,

I am trying to train a controller using PPO2 algorithm. The action space for my problem consists of 2 continuous and one discrete action. I tried using a tuple action space (similar to examples on gym website), but PPO2 (I also tried TRPO) throws a not implemented error. I tried a workaround: I defined the action space as Box with 3 actions and before stepping the environment, I check if the value is below a threshold value, I change the action value to 0, else 1. But this simplification is making it hard for the controller to learn the task. Is there a way to use tuple action spaces, or do you have ideas from similar problems?

@sahilgupta2105 sahilgupta2105 changed the title Tuple action space with stable baselines PPO2 [question] Tuple action space with stable baselines PPO2 Dec 1, 2018
@sahilgupta2105 sahilgupta2105 changed the title Tuple action space with stable baselines PPO2 Tuple action space with stable baselines PPO2 [question] Dec 1, 2018
@araffin araffin added enhancement New feature or request help wanted Help from contributors is needed labels Dec 1, 2018
@araffin
Copy link
Collaborator

araffin commented Dec 1, 2018

Hello,

Tuple action space is currently not supported, but I recommend you to read @hill-a comment on that issue #100 (comment)

Support is not currently planned but we are open to PR ;)

@sahilgupta2105
Copy link
Author

Hi,

Thanks for the prompt reply. I saw the comment. So, I am guessing just implementing a probability distribution for a tuple space will suffice. I will update you if I am able to successfully implement it.

@araffin
Copy link
Collaborator

araffin commented Feb 12, 2019

Closing in favor of #133 to avoid duplicated issues ;)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request help wanted Help from contributors is needed
Projects
None yet
Development

No branches or pull requests

2 participants