Added RollBall env #366

Merged
merged 9 commits into haosulab:main on Jun 13, 2024
Conversation

guru-narayana (Contributor, Author)

A simple task where the objective is to push and roll a ball to a goal region at the other end of the table.
When testing with the baseline PPO, please use max_steps of 60; the ball takes time to roll.

@StoneT2000 (Member) left a comment

Can you provide the exact command-line script for PPO that I can just copy and paste?

Thank you.

@guru-narayana (Contributor, Author)

Updated everything in accordance with your recommendations.

Use the following command to run PPO:

python examples/baselines/ppo/ppo.py --env_id="RollBall-v1"  --num_envs=1024 --update_epochs=8 --num_minibatches=32 --seed=100 --total_timesteps=100_00_000 --eval_freq=8 --num-steps=60 --num_eval_steps=60 --gamma 0.95


[Video: RollBall-v1 environment demo, https://github.com/haosulab/ManiSkill/raw/main/figures/environment_demos/RollBall-v1.mp4]
@StoneT2000 (Member) commented Jun 5, 2024

I just checked the video. How did you generate it? It looks like the first frame is from another episode. (Asking how you generated it because, if you used our tools, this could be a bug in one of them.)
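For context, demo videos like this are typically produced with ManiSkill's RecordEpisode wrapper; a sketch under that assumption (the output path and kwargs shown are illustrative):

```python
import gymnasium as gym
import mani_skill.envs
from mani_skill.utils.wrappers import RecordEpisode

env = gym.make("RollBall-v1", obs_mode="state", render_mode="rgb_array")
# Saves one .mp4 per episode; a stale render buffer carried across resets
# would appear as a first frame belonging to another episode.
env = RecordEpisode(env, output_dir="videos", save_video=True,
                    save_trajectory=False)

env.reset(seed=0)
for _ in range(60):
    env.step(env.action_space.sample())
env.close()
```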

@guru-narayana (Contributor, Author)

I think I found where the bug could be: when I save the trajectory from PPO with "--evaluate", its reset params are empty, so that could be the root cause. Is there a way to prevent this?

@StoneT2000 (Member)

Ah, so you are running the PPO code in the repo directly. I will check this.

@StoneT2000 (Member)

@guru-narayana I am still training the environment to check that it solves in a reasonable time (the PPO script uses 100M steps, which is a lot; if it solves in about an hour on a 3080, I will merge this in anyway).

I recommend trying to tune the reward function a bit more (maybe a staged reward function can work better). Another point is that the reward gets the agent to be 0.05m behind the ball, but this can be suboptimal as you need to hit the ball at an angle.

@StoneT2000 (Member)

Actually, for the example PPO script, did you mean to write 100_000_000? It says 100_00_000.

@guru-narayana (Contributor, Author) commented Jun 11, 2024

> @guru-narayana I am still training the environment to check that it solves in a reasonable time (the PPO script uses 100M steps, which is a lot; if it solves in about an hour on a 3080, I will merge this in anyway).
>
> I recommend trying to tune the reward function a bit more (maybe a staged reward function can work better). Another point is that the reward gets the agent to be 0.05m behind the ball, but this can be suboptimal as you need to hit the ball at an angle.

I made the suggested modifications: the agent now needs to be 0.05 m behind the ball, along the direction to the goal, to receive the reaching reward. I also made the reward function staged.
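A minimal sketch of what such a staged reward can look like (names, distances, and weights here are illustrative assumptions, not the exact reward merged in this PR):

```python
import numpy as np

def staged_reward(tcp_pos, ball_pos, goal_pos):
    """Stage 1: reach a point 0.05 m behind the ball along the
    ball-to-goal direction. Stage 2: roll the ball toward the goal."""
    to_goal = goal_pos - ball_pos
    goal_dir = to_goal / (np.linalg.norm(to_goal) + 1e-8)

    # Pushing from 0.05 m behind the ball, opposite the goal direction,
    # sends the ball toward the goal rather than off at an angle.
    hit_pos = ball_pos - 0.05 * goal_dir

    reach_dist = np.linalg.norm(tcp_pos - hit_pos)
    reward = 1.0 - np.tanh(5.0 * reach_dist)  # stage 1: reaching

    if reach_dist < 0.01:  # stage 2 unlocks once the agent is in place
        ball_goal_dist = np.linalg.norm(ball_pos - goal_pos)
        reward += 1.0 - np.tanh(ball_goal_dist)
    return reward
```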

Please use this modified command to test the environment:
python ManiSkill/examples/baselines/ppo/ppo.py --env_id="RollBall-v1" --num_eval_envs=8 --num_envs=1024 --update_epochs=8 --num_minibatches=32 --total_timesteps=20_000_000 --eval_freq=10 --num-steps=80 --num_eval_steps=80 --gamma=0.95

@guru-narayana (Contributor, Author) commented Jun 11, 2024

> Actually, for the example PPO script, did you mean to write 100_000_000? It says 100_00_000.

100_00_000 (i.e. 10M) was correct previously, but please now use the command in my recent comment instead.

@StoneT2000 (Member)

Furthermore, can you merge in the main branch? It seems you may have used a version of the main branch that had a small bug with ManiSkillVectorEnv (apologies for that). Things should now run faster and correctly. Otherwise I can verify this task works correctly; my only small concern is the need for a delayed boolean.

@guru-narayana (Contributor, Author)

> Furthermore, can you merge in the main branch? It seems you may have used a version of the main branch that had a small bug with ManiSkillVectorEnv (apologies for that). Things should now run faster and correctly. Otherwise I can verify this task works correctly; my only small concern is the need for a delayed boolean.

Done

@StoneT2000 merged commit 144b5b6 into haosulab:main on Jun 13, 2024