Pedestrian and Balloon Instance Segmentation projects using Pytorch and Mask R-CNN.
Instance segmentation is a challenging computer vision task that requires the prediction of object instances and their per-pixel segmentation mask. This makes it a hybrid of semantic segmentation and object detection.
Deep Learning methodologies are able to do end-to-end object detection using Convolutional Neural Networks. The algorithms designed to do object detection are based on two approaches - one-stage object detection and two-stage object detection. One-stage detectors have high inference speeds and two-stage detectors have high localization and recognition accuracy. The two stages of a two-stage detector can be divided by a RoI (Region of Interest) Pooling layer. One of the prominent two-stage object detectors is Faster R-CNN. It has the first stage called RPN, a Region Proposal Network to predict candidate bounding boxes. In the second stage, features are by RoI pooling operation from each candidate box for the following classification and boundingbox regression tasks. Mask R-CNN is an extention of Faster R-CNN for instance segmentation. [Bibliometric Analysis of One-stage and Two-stage Object Detection]
Image | Predictions | Ground Truth |
---|
Image | Predictions | Ground Truth |
---|