-
Notifications
You must be signed in to change notification settings - Fork 78
3. Getting Started
A quick guide for simulating robots helping people with Assistive Gym.
We will first want to install Assistive Gym and dependencies.
If you prefer to install using a python virtual environment, see the Install Guide.
python3 -m pip install git+https://github.com/Zackory/bullet3.git
git clone https://github.com/Healthcare-Robotics/assistive-gym.git
cd assistive-gym
python3 -m pip install .
We can quickly visualize some of the existing assistive robotics environments using env_viewer.py
.
Here, a Sawyer robot will help clean off a person's arm who is lying in a resting pose on a bed.
python3 env_viewer.py --env "BedBathingSawyer-v0"
We can also visualize a collaborative assistance environment where both the robot and human take random actions.
python3 env_viewer.py --env "ScratchItchJacoHuman-v0"
When creating new environments, this viewer can also be helpful for visualizing and debugging changes.
We provide trained robot/human control policies for all of the available environments.
These control policies were trained using Proximal Policy Optimization (PPO) through a PyTorch library.
If you do not have wget
installed on your machine, you can download the models directly from the GitHub release page.
python3 -m pip install git+https://github.com/Zackory/pytorch-a2c-ppo-acktr --no-cache-dir
python3 -m pip install git+https://github.com/openai/baselines.git
# Download pretrained policies
wget -O trained_models/ppo/pretrained_policies.zip https://github.com/Healthcare-Robotics/assistive-gym/releases/download/0.100/pretrained_policies.zip
unzip trained_models/ppo/pretrained_policies.zip -d trained_models/ppo
Here we will run one of these policies for a PR2 providing drinking assistance to a person who has learned to actively collaborate with the robot through co-optimization.
python3 -m ppo.enjoy_coop --env-name "DrinkingPR2Human-v0"
New control policies for Assistive Gym can be trained using any library that works with the OpenAI Gym interface.
Here, we can train a new policy for a Jaco robot helping to scratch an itch using the PyTorch library from above.
python3 -m ppo.train --env-name "ScratchItchJaco-v0" --num-env-steps 20000 --save-dir ./trained_models_new/
As an example, we will only train for 20,000 time steps (100 simulation rollouts), yet increasing num-env-steps
can greatly improve the learned policy. The pretrained policies we provide were each trained using 10,000,000 time steps (50,000 simulation rollouts).
We can now visualize the newly learned control policy:
python3 -m ppo.enjoy --env-name "ScratchItchJaco-v0" --load-dir trained_models_new/ppo