Advancing instance segmentation and WBC classification in peripheral blood smear through domain adaptation: A study on PBC and the novel RV-PBS datasets
Automating blood cell counting and detection from smear slides holds significant potential for aiding doctors in disease diagnosis through blood tests. However, existing literature has not adequately addressed using whole slide data in this context. This study introduces the novel RV-PBS dataset, comprising ten distinct peripheral blood smear classes, each featuring multiple multi-class White Blood Cells per slide, specifically designed, for instance segmentation benchmarks. While conventional instance segmentation models like Mask R-CNN exhibit promising results in segmenting medical artifact instances, they face challenges in scenarios with limited samples and class imbalances within the dataset. This challenge prompted us to explore innovative techniques such as domain adaptation using a similar dataset to enhance the classification accuracy of Mask R-CNN, a novel approach in the domain of medical image analysis. Our study has successfully established a comprehensive pipeline capable of segmenting, detecting, and classifying blood samples from slides, striking an optimal balance between computational complexity and accurate classification of medical artifacts. This advancement enables precise cell counting and classification, facilitating doctors in refining their diagnostic analyses.
This repository hosts the code created for the paper titled Advancing instance segmentation and WBC classification in peripheral blood smear through domain adaptation: A study on PBC and the novel RV-PBS datasets.
This is the part of my Master's thesis where we segmented white blood cell via Mask RCNN (Aniket's part) and used Domain Adaptation to detect the cells.
pip install --upgrade --no-cache-dir gdown
The dataset is annotated using CVAT. We are planning to release an extended version of this dataset soon. If you are a haematologist, then you could help us by annotating and adding more data. Please make sure that the data is ethically cleared before uploading new data in public servers, such as Github.
Please study the paper for getting more insights. Here are some snapshots from the paper:
@article{PAL2024123660,
title = {Advancing instance segmentation and WBC classification in peripheral blood smear through domain adaptation: A study on PBC and the novel RV-PBS datasets},
journal = {Expert Systems with Applications},
volume = {249},
pages = {123660},
year = {2024},
issn = {0957-4174},
doi = {https://doi.org/10.1016/j.eswa.2024.123660},
url = {https://www.sciencedirect.com/science/article/pii/S0957417424005268},
author = {Jimut Bahan Pal and Aniket Bhattacharyea and Debasis Banerjee and Br. Tamal Maharaj},
keywords = {Automated blood test, Detection, Domain adaptation, Instance segmentation, Peripheral blood smear}
}