We envision a future with safe, interactive robots that can coexist with people, which is why we chose the topic of social navigation. Social navigation is navigation in which the agent aims to reach its goal while avoiding conflicts with pedestrians in the environment. SARL, the state-of-the-art method proposed by Chen et al. [1], explores this problem in a simple environment without any obstacles. In our work, we investigate the problem under more challenging conditions and share our insights in overcoming the resulting challenges.
In the following sections we describe our experiments, the challenges we faced, and the results we obtained from this project.
- Install Python-RVO2 library
- Install crowd_sim and crowd_nav as editable pip packages
pip install -e .
This repository is organized in two parts: the crowd_sim/ folder contains the simulation environment, and the crowd_nav/ folder contains the code for training and testing the policies. Details of the simulation framework can be found here. Below are the instructions for training and testing policies; they should be executed inside the crowd_nav/ folder.
- Train a policy.
python train.py --policy gat4sn
- Test policies with 500 test cases.
python test.py --policy gat4sn --model_dir data/output --phase test
- Run a policy for one episode and visualize the result.
python test.py --policy gat4sn --model_dir data/output --phase test --visualize --test_case 0
- Plot training curves.
python utils/plot.py data/output/output.log
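The plotting script reads the training log produced during training. As a minimal illustration of how such a log could be parsed, here is a hedged sketch that assumes a hypothetical line format ("... episode N has success rate: X"); the actual format of output.log may differ:

```python
import re

# Hypothetical sample: the real output.log format depends on the training
# script, so this embedded text stands in for it.
SAMPLE_LOG = """\
TRAIN in episode 100 has success rate: 0.62
TRAIN in episode 200 has success rate: 0.85
"""

def parse_success_rates(text):
    # Extract (episode, success_rate) pairs from log lines matching the
    # assumed pattern above.
    pattern = re.compile(r"episode (\d+) has success rate: ([\d.]+)")
    return [(int(m.group(1)), float(m.group(2)))
            for m in pattern.finditer(text)]
```

The extracted pairs can then be passed to any plotting library to draw the training curve.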
GAT4SN | SARL
---|---

Only Humans | With Static Obstacles
---|---
Curriculum learning makes the training scenarios gradually harder, which allows the robot to learn to navigate in more difficult environments. We increase the difficulty by allowing larger obstacles in the environment.
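As an illustration of the idea (not the repository's actual schedule), a curriculum could grow the maximum obstacle radius per stage and advance to the next stage only once the agent reaches a success-rate threshold. All names and values below are hypothetical:

```python
def obstacle_radius_for_stage(stage, base_radius=0.3, step=0.1, max_radius=1.0):
    # Hypothetical schedule: the largest allowed obstacle radius grows
    # linearly with the curriculum stage, capped at max_radius.
    return min(base_radius + step * stage, max_radius)

def next_stage(stage, success_rate, threshold=0.9):
    # Advance the curriculum only once the agent masters the current stage.
    return stage + 1 if success_rate >= threshold else stage
```

A schedule like this keeps early training tractable while still exposing the agent to the hardest environments eventually.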
To train the agent with curriculum learning, use `cl_train.py` instead of `train.py`.

To test or visualize the results of curriculum learning training, use `cl_test.py` instead of `test.py`.
The curriculum learning hyperparameters can be configured in the `[curriculum]` block of `train.config`.
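A hypothetical example of what such a block might look like (the key names and values here are purely illustrative; the actual keys are defined in `train.config`):

```ini
[curriculum]
; illustrative values only; consult train.config for the real key names
initial_stage = 0
success_threshold = 0.9
max_obstacle_radius = 1.0
```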
To increase the complexity of the task, the robot's field of view can be limited. To activate the limited field of view, set the robot's sensor to `RGB` in `env.config`.
The limited-FOV implementation is based on the one in DSRNN.
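To illustrate the idea behind a limited FOV (a hedged sketch, not the DSRNN or repository code), a human can be considered visible when the bearing from the robot to the human lies within half the FOV angle of the robot's heading:

```python
import math

def in_fov(robot_pos, robot_heading, human_pos, fov=math.pi):
    # Bearing from robot to human, relative to the robot's heading.
    dx = human_pos[0] - robot_pos[0]
    dy = human_pos[1] - robot_pos[1]
    angle = math.atan2(dy, dx) - robot_heading
    # Wrap the relative angle into [-pi, pi].
    angle = (angle + math.pi) % (2 * math.pi) - math.pi
    # Visible if within half the FOV on either side of the heading.
    return abs(angle) <= fov / 2
```

With the default `fov=math.pi`, a human directly behind the robot is treated as unseen, while one directly ahead is visible.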
We added some heuristics for handling the unseen agents. To toggle them, change `unseen_mode` in the `[humans]` block of `env.config`.
Additionally, the rate at which the unseen agents slow down, or at which their radii increase, is governed by the `uncertainty` parameter. Its growth rate can be changed via `uncertainty_growth` under the `[sim]` block of `env.config`. The options are:

- `linear`
- `exponential`
- `logarithmic`
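As a hedged sketch of how the three growth options could behave (the function name and signature below are hypothetical, not the repository's implementation), the uncertainty of an agent that has been out of view for `t` timesteps might grow as:

```python
import math

def uncertainty(t, rate=0.1, mode="linear"):
    # Hypothetical growth curves for the three configurable options.
    if mode == "linear":
        return rate * t
    if mode == "exponential":
        return math.exp(rate * t) - 1.0  # shifted so uncertainty(0) == 0
    if mode == "logarithmic":
        return rate * math.log(1.0 + t)  # shifted so uncertainty(0) == 0
    raise ValueError(f"unknown mode: {mode}")
```

All three curves start at zero when the agent is first lost from view; they differ in how quickly the uncertainty (and hence the assumed slowdown or radius inflation) accumulates.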
This repository is based on our fork of CrowdNav. Please check their repository for more details on the license conditions.