We use data from the Waymo Open Dataset. The files can be downloaded directly from the website as tar files or from the Google Cloud Storage bucket as individual tfrecord files.
The core files in this repository are organized as follows:
- exploratory_data_analysis.py: to confirm correct bounding boxes in the images and analyze the dataset
- explore_augmentation.py: to test augmentations on the dataset
- create_splits.py: to split the data into training, validation, and test sets
- edit_config.py: to create a new configuration for training
- model_main_tf2.py: to launch training
- exporter_main_v2.py: to create an inference model
Jupyter notebooks:
- Exploratory Data Analysis.ipynb
- Explore augmentations.ipynb
Utility scripts:
- process_waymo_data.py: to transform the downloaded .tfrecord files of the Waymo Open Dataset into the Object Detection API format
- inference_video.py: to create a video of object detection results
- modify_tf_record.py: to import and modify a tfrecord file, then export it again (a rough sketch follows below)
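As an illustration of the round trip `modify_tf_record.py` performs, the sketch below reads a tfrecord, rewrites a feature in each example, and saves the result. The filenames and the edited feature are illustrative assumptions, not the script's actual behavior:

```python
import tensorflow as tf

# Read every serialized tf.train.Example from an existing tfrecord,
# modify a feature, and write the results to a new file.
dataset = tf.data.TFRecordDataset("data/processed_data/sample.tfrecord")
with tf.io.TFRecordWriter("data/processed_data/sample_modified.tfrecord") as writer:
    for raw_record in dataset:
        example = tf.train.Example.FromString(raw_record.numpy())
        # Example edit: overwrite the stored filename feature
        # ("image/filename" is the OD API's standard key).
        example.features.feature["image/filename"].bytes_list.value[:] = [b"new_name.jpg"]
        writer.write(example.SerializeToString())
```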
The experiments folder is organized as follows:
📂experiments/
- 📂experiment0/...: (initial pipeline_config)
- 📂experiment1/...: (modified pipeline_config)
- 📂experiment2/...: (modified pipeline_config)
The data folder contains:
📂data/
- 📂processed_data/...: contains the processed data from the Waymo Open Dataset (empty to start)
- 📂test/...: contains the test data (empty to start)
- 📂train/...: contains the train data (empty to start)
- 📂val/...: contains the val data (empty to start)
The training folder contains:
📂training/
- 📂pretrained-models/...: contains the checkpoints of the pretrained models (empty to start)
- 📂reference/...: contains the config and outputs of the training runs (empty to start)
Use the provided Dockerfile and requirements in the build directory.
Follow the README to create a Docker container and install all prerequisites.
Alternatively, you can install each library and its Python dependencies individually. The following local setup has been verified to run the program successfully:
- Ubuntu 20.04
- Python 3.8.10
- CUDA 11.2
- cuDNN 8.1
- TensorFlow 2.6
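A quick way to verify the stack above is visible to TensorFlow (a standalone check, not part of the repository):

```python
import tensorflow as tf

# Expect 2.6.x, CUDA support compiled in, and at least one GPU listed.
print("TensorFlow:", tf.__version__)
print("Built with CUDA:", tf.test.is_built_with_cuda())
print("GPUs:", tf.config.list_physical_devices("GPU"))
```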
Note that in `model_lib_v2.py` (TensorFlow Object Detection API), `from object_detection import eval_util` should be placed as the last import to avoid a potential segmentation fault.
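Illustratively, the tail of the file's import block then looks like this (the surrounding imports are abbreviated, not the file's exact list):

```python
# model_lib_v2.py -- end of the import section (other imports elided)
from object_detection import inputs
from object_detection import model_lib
from object_detection import eval_util  # keep this as the last import
```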
- Clone this repo:

```bash
git clone https://github.com/PoChang007/Object_Detection_in_Urban_Env.git
cd Object_Detection_in_Urban_Env
```
Process the downloaded Waymo Open Dataset by using `process_waymo_data.py`:

```bash
python3 process_waymo_data.py --src_dir {temp_dir_for_raw_files} --dst_dir data/processed_data
```
Or use `download_process.py` to download and process the Waymo Open Dataset:

```bash
python3 download_process.py --data_dir data/processed_data --temp_dir {temp_dir_for_raw_files}
```
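For reference, the raw Waymo frames that either script converts are read roughly as follows; the segment filename is a placeholder:

```python
import tensorflow as tf
from waymo_open_dataset import dataset_pb2

# Each record in a raw Waymo .tfrecord is a serialized Frame proto holding
# the camera images (frame.images) and their 2D boxes (frame.camera_labels),
# which get re-encoded into the Object Detection API format.
raw_dataset = tf.data.TFRecordDataset(
    "{temp_dir_for_raw_files}/segment.tfrecord", compression_type="")
for raw_record in raw_dataset:
    frame = dataset_pb2.Frame()
    frame.ParseFromString(raw_record.numpy())
```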
Execute `create_splits.py` to split the dataset into training, validation, and test sets:

```bash
python3 create_splits.py --data_dir data/processed_data
```
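A split along these lines can be as simple as shuffling the processed files and moving them into the three data folders. The 80/10/10 ratio below is an assumption; see `create_splits.py` for the actual logic:

```python
import glob
import random
import shutil

# Shuffle the processed tfrecords and distribute them 80/10/10.
files = glob.glob("data/processed_data/*.tfrecord")
random.shuffle(files)

n = len(files)
splits = {
    "data/train": files[: int(0.8 * n)],
    "data/val": files[int(0.8 * n): int(0.9 * n)],
    "data/test": files[int(0.9 * n):],
}
for dst_dir, split_files in splits.items():
    for f in split_files:
        shutil.move(f, dst_dir)
```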
The config that we will use for this project is `pipeline.config`, which is the config for an SSD ResNet-50 640x640 model. First, download the pretrained model and move it to `training/pretrained-models/`.
Now we need to edit the config file to change the locations of the training and validation files, as well as the locations of the label_map file and the pretrained weights. We also need to adjust the batch size. To do so, run the following:

```bash
python3 edit_config.py --train_dir data/train/ --eval_dir data/val/ --batch_size 4 --checkpoint ./training/pretrained-models/ssd_resnet50_v1_fpn_640x640_coco17_tpu-8/checkpoint/ckpt-0 --label_map label_map.pbtxt
```
A new config file, `pipeline_new.config`, has been created.
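Under the hood, the rewrite roughly amounts to the following sketch built on the Object Detection API's `pipeline_pb2` proto; the actual `edit_config.py` may differ in its details:

```python
import glob

import tensorflow as tf
from google.protobuf import text_format
from object_detection.protos import pipeline_pb2

# Load the template config into a TrainEvalPipelineConfig proto.
pipeline = pipeline_pb2.TrainEvalPipelineConfig()
with tf.io.gfile.GFile("pipeline.config", "r") as f:
    text_format.Merge(f.read(), pipeline)

# Point the config at our data, label map, and pretrained checkpoint.
pipeline.train_config.batch_size = 4
pipeline.train_config.fine_tune_checkpoint = (
    "./training/pretrained-models/ssd_resnet50_v1_fpn_640x640_coco17_tpu-8"
    "/checkpoint/ckpt-0")
pipeline.train_input_reader.label_map_path = "label_map.pbtxt"
pipeline.train_input_reader.tf_record_input_reader.input_path[:] = glob.glob(
    "data/train/*.tfrecord")
pipeline.eval_input_reader[0].label_map_path = "label_map.pbtxt"
pipeline.eval_input_reader[0].tf_record_input_reader.input_path[:] = glob.glob(
    "data/val/*.tfrecord")

# Write the result out as pipeline_new.config.
with tf.io.gfile.GFile("pipeline_new.config", "w") as f:
    f.write(text_format.MessageToString(pipeline))
```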
Launch an experiment with the TensorFlow Object Detection API. Create a folder `training/reference` and move `pipeline_new.config` into it. We will now have to launch two processes:
- a training process:

```bash
python3 model_main_tf2.py --model_dir=training/reference/ --pipeline_config_path=training/reference/pipeline_new.config
```
- an evaluation process:

```bash
python3 model_main_tf2.py --model_dir=training/reference/ --pipeline_config_path=training/reference/pipeline_new.config --checkpoint_dir=training/reference/
```
NOTE: both processes will display some TensorFlow warnings.
To monitor the training, launch a TensorBoard instance by running `tensorboard --logdir=training`.
The initial experiment may not yield optimal results. However, we can make multiple changes to the config file to improve the model, for example by applying data augmentation. The `preprocessor.proto` file contains the different data augmentation methods available in the TF Object Detection API.
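For example, augmentation options from `preprocessor.proto` can be appended to the `train_config` section of `pipeline_new.config`; the option names below exist in the proto, but the values are illustrative:

```
train_config {
  ...
  data_augmentation_options {
    random_horizontal_flip {
    }
  }
  data_augmentation_options {
    random_adjust_brightness {
      max_delta: 0.2
    }
  }
}
```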
Once training is finished, create an inference model with `exporter_main_v2.py`:

```bash
python3 ./exporter_main_v2.py --input_type image_tensor --pipeline_config_path training/reference/pipeline_new.config --trained_checkpoint_dir training/reference --output_directory training/experiment0/exported_model/
```
Finally, use `inference_video.py` to create a video of the object detection results on a test tfrecord:

```bash
python3 inference_video.py --labelmap_path label_map.pbtxt --model_path training/experiment0/exported_model/saved_model --tf_record_path data/test/segment-tripID.tfrecord --config_path training/experiment0/exported_model/pipeline.config --output_path animation.mp4
```