Dense Rews PullCube & LiftPegUpright #403

Xander-Hinrichsen · 2024-07-03T01:00:34Z

Adds dense rewards for the PullCube and LiftPegUpright environments.

The PullCube dense reward mirrors the PushCube one; the difference is that the reaching reward targets the opposite side of the cube.
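To illustrate the "opposite side" idea, here is a minimal sketch, not the code from this PR. It assumes the cube moves along the x axis, that the reach target sits just outside the cube face, and that the reaching term uses a `1 - tanh(5 * distance)` shaping. The function names, the `margin` value, and the `side` parameter are all hypothetical.

```python
import math


def reach_target(cube_pos, half_size, side, margin=0.005):
    """Point next to the cube that the gripper should reach.

    side = -1 -> behind the cube (push-style target, assumed)
    side = +1 -> opposite face of the cube (pull-style target, assumed)
    The x axis as the motion axis and the margin are assumptions.
    """
    x, y, z = cube_pos
    return (x + side * (half_size + margin), y, z)


def reaching_reward(tcp_pos, target):
    """Dense reaching term in (0, 1]; equals 1 when the gripper is on target.

    The tanh shaping and its scale of 5 are assumed for illustration.
    """
    dist = math.dist(tcp_pos, target)
    return 1.0 - math.tanh(5.0 * dist)


cube = (0.0, 0.0, 0.02)
push_target = reach_target(cube, half_size=0.02, side=-1)
pull_target = reach_target(cube, half_size=0.02, side=+1)
```

With the cube at the origin of the x axis, the two targets are mirror images of each other, which is the only difference this sketch models between the push and pull variants.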

The LiftPegUpright dense reward is the sum of a rotation reward, a center-of-mass distance reward, and a reaching/gripping reward. The rotation reward is implemented as the cosine similarity between the unit vector from the peg's center of mass toward the end of the peg and its goal orientation (see the implementation comments for details).
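The rotation term can be sketched in plain Python as below. This is an illustration of cosine similarity between the peg axis and a goal direction, not the implementation in `lift_peg_upright.py`. The function names are hypothetical, and the goal direction of +z (peg standing upright) is an assumption.

```python
import math


def unit(v):
    """Return v scaled to unit length."""
    norm = math.sqrt(sum(c * c for c in v))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero-length vector")
    return [c / norm for c in v]


def rotation_reward(peg_com, peg_end, goal_dir=(0.0, 0.0, 1.0)):
    """Cosine similarity between the peg axis and the goal direction.

    peg_com: peg center of mass; peg_end: one end of the peg.
    goal_dir defaults to +z, which is an assumption for this sketch.
    Returns a value in [-1, 1]: 1 when aligned, 0 when perpendicular,
    -1 when pointing the opposite way.
    """
    axis = unit([e - c for e, c in zip(peg_end, peg_com)])
    goal = unit(goal_dir)
    return sum(a * g for a, g in zip(axis, goal))


upright = rotation_reward((0.0, 0.0, 0.1), (0.0, 0.0, 0.2))
lying_flat = rotation_reward((0.0, 0.0, 0.1), (0.1, 0.0, 0.1))
upside_down = rotation_reward((0.0, 0.0, 0.1), (0.0, 0.0, 0.0))
```

Because both vectors are normalized first, the term depends only on the peg's orientation and not on its length or position, so it can be added to the distance and reaching/gripping terms without rescaling.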

Bash commands to reproduce the PPO experiments:
PullCube:
for i in {1..3}; do python ppo.py --env_id="PullCube-v1" --exp-name="pullcubeseed${i}" --num_envs=2048 \
--update_epochs=8 --num_minibatches=32 --total_timesteps=4_000_000 --eval_freq=10 --num-steps=20 \
--seed=${i} --gamma=0.8; done

LiftPegUpright:
for i in {1..3}; do python ppo.py --env_id="LiftPegUpright-v1" --exp-name="liftpegseed${i}" --num_envs=2048 \
--update_epochs=8 --num_minibatches=32 --total_timesteps=9_000_000 --eval_freq=10 --num-steps=20 \
--seed=${i} --gamma=0.9; done

Comparison between PullCube and PushCube:

PushCube was run with its default settings, but with more steps so that training continues past convergence.

pushcube:
for i in {1..3}; do python ppo.py --env_id="PushCube-v1" --exp-name="pushcubeseed${i}" --num_envs=2048 \
--update_epochs=8 --num_minibatches=32 --total_timesteps=4_000_000 --eval_freq=10 --num-steps=20 \
--seed=${i} --gamma=0.8; done

21.mp4

9.mp4

mani_skill/envs/tasks/tabletop/lift_peg_upright.py

pullcube&LiftPegrew

e99ffeb

Xander-Hinrichsen requested a review from StoneT2000 July 3, 2024 01:00

StoneT2000 requested changes Jul 3, 2024

mani_skill/envs/tasks/tabletop/lift_peg_upright.py (outdated, resolved)

black and isort

6e4a48f

Xander-Hinrichsen requested a review from StoneT2000 July 3, 2024 01:55

StoneT2000 approved these changes Jul 4, 2024

StoneT2000 merged commit b86e4a6 into haosulab:main Jul 4, 2024

Xander-Hinrichsen deleted the pullcube_liftpeg_rew branch July 4, 2024 21:55
