
Refactor parse_args() #118

Merged
2 commits merged on Feb 22, 2022
Conversation

@vwxyzjn (Owner) commented Feb 21, 2022

This PR refactors a few arguments in parse_args():

  1. It moves gym_id, learning_rate, and total_timesteps down to the algorithm-specific arguments. Currently, a file diff between c51_atari.py and ppo_continuous_action.py looks like the following screenshot:
    (screenshot)
    However, it is more desirable to have all of the differences in one place, as shown below:
    (screenshot)

  2. It also renames gym-id to env-id. The reason is that as we adopt different environments, some of them don't necessarily have a gym ID, but they do have an environment name. For example, Procgen has an env-id but not a gym-id.

Curious about your thoughts @yooceii @dosssman on whether this is necessary.
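For illustration, here is a minimal sketch of what the refactored parse_args() could look like. Argument names (env-id, learning-rate, total-timesteps) are from this PR; the defaults, the exp-name/seed arguments, and the argv parameter (added here so the function can be exercised outside sys.argv) are illustrative assumptions, not taken from the repo:

```python
import argparse


def parse_args(argv=None):
    parser = argparse.ArgumentParser()
    # Common arguments shared by every algorithm (illustrative subset)
    parser.add_argument("--exp-name", type=str, default="ppo_continuous_action",
        help="the name of this experiment")
    parser.add_argument("--seed", type=int, default=1,
        help="seed of the experiment")

    # Algorithm-specific arguments: env-id, learning-rate, and
    # total-timesteps live here, so a file diff between two algorithm
    # files shows all of the differences in one place
    parser.add_argument("--env-id", type=str, default="HalfCheetah-v2",
        help="the id of the environment")
    parser.add_argument("--learning-rate", type=float, default=3e-4,
        help="the learning rate of the optimizer")
    parser.add_argument("--total-timesteps", type=int, default=1000000,
        help="total timesteps of the experiment")
    return parser.parse_args(argv)
```

Note that argparse exposes dashed flags as underscored attributes, so --env-id is read as args.env_id in the script body.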

@gitpod-io (bot) commented Feb 21, 2022

@dosssman (Collaborator) left a comment

I concur on the gym-id to env-id change, as the latter is more general indeed.

As for the position of the arguments in the argparser list: since some algorithms are more or less made for specific tasks (SAC -> Bullet / MuJoCo), it does make sense to put env-id in the algorithm-specific arguments. learning-rate too, because SAC, for example, actually has two learning rates, so this would make the parameterization cleaner.

While total-timesteps is also general, different algorithms train for different numbers of steps (for example, Atari requires 10M+ steps vs. Bullet's 1M), which can be used as justification for this change.

In any case, it is all good on my side.
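To make the two-learning-rate point concrete, here is a hypothetical sketch of a SAC-style argument parser. The flag names (--policy-lr, --q-lr), defaults, and the argv parameter are illustrative assumptions, not the repo's actual SAC arguments:

```python
import argparse


def parse_sac_args(argv=None):
    parser = argparse.ArgumentParser()
    # SAC uses separate optimizers for the policy and the Q-networks,
    # so a single shared --learning-rate argument would not fit it
    parser.add_argument("--policy-lr", type=float, default=3e-4,
        help="learning rate of the policy network optimizer")
    parser.add_argument("--q-lr", type=float, default=1e-3,
        help="learning rate of the Q-network optimizers")
    # Bullet/MuJoCo tasks are often trained for ~1M steps, vs 10M+
    # for Atari, which is why this is algorithm-specific as well
    parser.add_argument("--total-timesteps", type=int, default=1000000,
        help="total timesteps of the experiment")
    return parser.parse_args(argv)
```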

@vercel (bot) commented Feb 22, 2022

This pull request is being automatically deployed with Vercel.
To see the status of your deployment, click below or on the icon next to each commit.

🔍 Inspect: https://vercel.com/vwxyzjn/cleanrl/AAxRK8ytYUqJoht2izi8wtmoQzwA
✅ Preview: https://cleanrl-git-refactor-arg-vwxyzjn.vercel.app

@vwxyzjn (Owner, Author) commented Feb 22, 2022

The matplotlib utilities might break, but that's OK since we are re-building Open RL Benchmark anyway (#115).
