Michael Eickenberg, PhD; Amanda Paulson, PhD; Katarina Slama, PhD Candidate; Joe Carlson, PhD
Slides from our talk at the CTF's NF Conference 2019: https://docs.google.com/presentation/d/1jzv8J13AJjroT9qJaDryHM1RdNtApft5ZMUGcPi9Oi4/edit?usp=sharing
We have evaluated techniques for an automated assay of tumor volume in neurofibromatosis patients using a machine learning algorithm trained on a set of clinical, whole-body MRI images. A fast and accurate automated method would facilitate volumetric analysis and time-series tracking, and reduce the burden on radiologists. Using a Synapse dataset of 48 manually segmented NF patient MRIs, we use a PyTorch implementation of Unet for pixel-wise segmentation of whole-body MRI images. We ask whether a T1 sequence can improve accuracy over segmentation using the STIR sequence alone.
Neurofibromatosis is a rare, debilitating genetic predisposition to benign tumors of the nervous system throughout the body, which may progress and metastasize into malignant lesions. While no therapy presently exists, the decision for surgical resection of the tumors requires accurately measuring tumor involvement and monitoring disease progression. In addition, a clinician evaluating potential therapies requires accurate assessment to determine the efficacy of treatment. Magnetic Resonance Imaging (MRI), with its sensitivity to soft-tissue tumors, is the current gold standard for non-invasive evaluation. The preferred protocol for tumors of this nature is a short tau inversion recovery (STIR) acquisition sequence, because of the high contrast it gives soft-tissue tumors.
After tumor involvement is determined, a technician must manually determine the tumor area on a set of images and compile the results into a volumetric estimate. However, this process is painstaking, time-consuming, and expensive. There are often discrepancies in ground-truth segmentations among different raters, institutions, and MRI sequences. An automated segmentation method would reduce the cost and time required for analysis, as well as facilitate an analysis pipeline.
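For context, once per-slice tumor masks exist (manual or automated), compiling them into a volumetric estimate is mechanical. A minimal sketch, assuming pydicom is available and the standard PixelSpacing/SliceThickness tags are present in the series (the function name and interface are ours, not part of any existing pipeline):

```python
import numpy as np
import pydicom

def tumor_volume_ml(mask_stack, dicom_paths):
    """Estimate tumor volume in mL from per-slice binary masks.

    mask_stack: (n_slices, H, W) boolean array, one mask per DICOM slice.
    dicom_paths: paths to the corresponding DICOM files, in the same order.
    """
    volume_mm3 = 0.0
    for mask, path in zip(mask_stack, dicom_paths):
        ds = pydicom.dcmread(path)
        row_mm, col_mm = (float(v) for v in ds.PixelSpacing)   # in-plane voxel size
        thickness_mm = float(ds.SliceThickness)                # slice thickness
        volume_mm3 += mask.sum() * row_mm * col_mm * thickness_mm
    return volume_mm3 / 1000.0  # 1 mL = 1000 mm^3
```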
Our goal was to perform binary classification of each pixel as 'tumor' or 'not tumor'. We used a training dataset of 36 patients' MRIs from a single institution with corresponding segmentation masks as ground truth. Preprocessing of the DICOMs included masking out label annotations on the images and normalizing the data to the range (-1, 1), ignoring outliers.
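As an illustration of the normalization step only (the percentile cutoffs below are our assumption; the actual outlier handling may have differed), a minimal sketch:

```python
import numpy as np

def normalize_slice(pixels, low_pct=1, high_pct=99):
    """Scale one MRI slice to the range (-1, 1), ignoring intensity outliers.

    Intensities are clipped at assumed percentiles before rescaling, so that
    a few extreme voxels do not dominate the dynamic range.
    """
    lo, hi = np.percentile(pixels, [low_pct, high_pct])
    clipped = np.clip(pixels.astype(np.float32), lo, hi)
    return 2.0 * (clipped - lo) / max(hi - lo, 1e-6) - 1.0
```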
We augmented the data by creating 64 x 64 pixel patches using a 1-pixel stride. There was a drastic class imbalance between tumor and non-tumor pixels: tumor pixels comprised only ~0.5% of all pixels. To address this imbalance, we defined a patch with at least 1 tumor pixel as a 'tumor' patch and then randomly selected an equal number of non-tumor patches. Because of the overlapping augmentation, we had 250 million patches of each class. For ease of use, we randomly subsetted this down to 50,000 patches of each type.
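The patching and balancing logic, in outline (a coordinates-only sketch; the exact sampling code we used may differ, and the per-image patch budget here is illustrative):

```python
import numpy as np

def sample_balanced_patches(mask, patch=64, n_per_class=1000, seed=0):
    """Return top-left corners of 'tumor' and 'non-tumor' patches for one slice.

    A patch counts as 'tumor' if it contains at least one tumor pixel;
    an equal number of non-tumor patches is then drawn at random.
    """
    rng = np.random.default_rng(seed)
    h, w = mask.shape
    tumor, background = [], []
    for i in range(h - patch + 1):        # 1-pixel stride over rows
        for j in range(w - patch + 1):    # 1-pixel stride over columns
            (tumor if mask[i:i + patch, j:j + patch].any() else background).append((i, j))
    n = min(n_per_class, len(tumor), len(background))
    if n == 0:
        return [], []
    keep_t = rng.choice(len(tumor), size=n, replace=False)
    keep_b = rng.choice(len(background), size=n, replace=False)
    return [tumor[k] for k in keep_t], [background[k] for k in keep_b]
```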
Using PyTorch, we trained a Unet architecture from @milesial with a batch size of 32 images, for 100,000 patches per epoch over 10 epochs. We used the Adam optimizer, which adapts the learning rate during training. The loss was a per-pixel logistic loss, averaged over all pixels.
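In outline, the training loop looked roughly like the sketch below. `UNet` refers to the @milesial implementation (github.com/milesial/Pytorch-UNet); the random tensors stand in for our real 64x64 patch dataset, and the learning rate shown is an assumed value rather than a recorded hyperparameter:

```python
import torch
from torch import nn
from torch.utils.data import DataLoader, TensorDataset
from unet import UNet  # U-Net implementation from github.com/milesial/Pytorch-UNet

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
model = UNet(n_channels=1, n_classes=1).to(device)          # STIR channel in, one tumor logit per pixel out
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)   # Adam; lr is an assumed value
criterion = nn.BCEWithLogitsLoss()                          # per-pixel logistic loss, averaged over pixels

# Dummy stand-in for our patch dataset: 64x64 patches with float {0, 1} masks
patches_t = torch.randn(256, 1, 64, 64)
masks_t = (torch.rand(256, 1, 64, 64) > 0.99).float()
loader = DataLoader(TensorDataset(patches_t, masks_t), batch_size=32, shuffle=True)

for epoch in range(10):
    for patches, masks in loader:
        patches, masks = patches.to(device), masks.to(device)
        optimizer.zero_grad()
        loss = criterion(model(patches), masks)
        loss.backward()
        optimizer.step()
```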
Figure 6.a. Unthresholded predictions.
Figure 6.b. Thresholded predictions.
Figure 6.d. Ground-truth contours on the anatomical patch.
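The thresholding shown in Figure 6.b is just a cutoff on the sigmoid of the per-pixel logits; a minimal sketch (the 0.5 cutoff is an assumed default, and as discussed in the next steps below, whole-body screening may require a much stricter value):

```python
import torch

def threshold_prediction(logits, cutoff=0.5):
    """Convert per-pixel logits into a binary tumor mask."""
    return torch.sigmoid(logits) > cutoff
```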
Much of our time went into scraping the existing data together from Synapse. The data are great. More would of course be even better, but the fact that annotations exist and that the scans were very similar to each other was very useful.
Many shortcuts were taken in the analysis. Next steps include:
- Do dimensioning work on the Unet architecture:
  - Are 64x64 patches good? Is 128x128 or 256x256 better? Or just a few images cropped around the tumor?
  - Is the number of layers and the size of the layers appropriate?
  - We randomly selected an epoch to be 100,000 patches, but there were 25,000,000 useful ones; training on more of them would probably be better.
- The different types of tumor could be detected.
- A conversation with medical professionals would be useful to understand the utility of such approaches in practice:
  - Is full-body detection really useful? This requires very stringent thresholds.
  - Maybe local detection and segmentation, informed by the practitioner, is actually more useful?
- We did not exhaust the available resources on Google Cloud. For a full architecture search, we might benefit from being able to run tests on many different GPU machines.
- Somebody with an exceptional sixth sense for neural-network tweaking would complement us very well.
The Docker image contains Jupyter notebooks of all analyses and the dependencies to run them.
Instructions for running the notebooks (see https://github.com/Sage-Bionetworks/nf-hackathon-2019 for example containers and instructions):
docker pull <your dockerhub repo>/<this container>
Pull the image from DockerHub.
docker run <your dockerhub repo>/<this container>
Run the Docker image from the master shell script.