Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dense Rews PullCube & LiftPegUpright #403

Merged
merged 2 commits into from
Jul 4, 2024

Conversation

Xander-Hinrichsen
Copy link
Collaborator

@Xander-Hinrichsen Xander-Hinrichsen commented Jul 3, 2024

Dense Rewards added for PullCube & LiftPegUpright Envs

PullCube dense reward is mirror of PushCube, difference is reaching reward on opposite side of cube

LiftPegUpright dense reward is sum of rotation reward (implemented as cosine similarity of unit vector from peg center of mass toward end of peg and it's goal orientation -see implementation comments for details) + center of mass distance reward + reaching/gripping reward

bash command to re-create experiments with PPO below:
PullCube:
for i in {1..3}; do python ppo.py --env_id="PullCube-v1" --exp-name="pullcubeseed${i}" --num_envs=2048 \
--update_epochs=8 --num_minibatches=32 --total_timesteps=4_000_000 --eval_freq=10 --num-steps=20 \
--seed=${i} --gamma=0.8; done

LiftPegUpright:
for i in {1..3}; do python ppo.py --env_id="LiftPegUpright-v1" --exp-name="liftpegseed${i}" --num_envs=2048 \
--update_epochs=8 --num_minibatches=32 --total_timesteps=9_000_000 --eval_freq=10 --num-steps=20 \
--seed=${i} --gamma=0.9; done

image
image

Comparison between pullcube and pushcube:

pushcube ran as default, with more steps to overshoot convergence

pushcube:
for i in {1..3}; do python ppo.py --env_id="PushCube-v1" --exp-name="pushcubeseed${i}" --num_envs=2048 \
--update_epochs=8 --num_minibatches=32 --total_timesteps=4_000_000 --eval_freq=10 --num-steps=20 \
--seed=${i} --gamma=0.8; done

image

21.mp4
9.mp4

@StoneT2000 StoneT2000 merged commit b86e4a6 into haosulab:main Jul 4, 2024
@Xander-Hinrichsen Xander-Hinrichsen deleted the pullcube_liftpeg_rew branch July 4, 2024 21:55
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants