Datasets and methods

This repository contains code used for the paper "A Benchmark of Medical Out of Distribution Detection."ArXiv.

Code is based on the repository from "Does Your Model Know the Digit 6 Is Not a Cat? A Less Biased Evaluation of Outlier Detectors." ArXiv.

This code is provided "as-is" and is not guaranteed to work out-of-the-box.

Datasets and methods

Our additions include:

Datasets:
- ANHIR: Automatic Non-rigid Histological Image Registration Challenge link.
- DRD: High-resolution retina images with presence of diabetic retinopathy in each image labeled on a scale of 0 to 4. We convert this into a classification task where 0 corresponds to healthy and 1-4 corresponds to unhealthy. link
- DRIMDB: Fundus images of various qualities labeled as good/bad/outlier. We use the images labeled as bad/outlier in evaluation 3, use-case 2.
- Malaria Image of cells in blood smear microscopy collected from healthy persons and patients with malaria. Used in evaluation 4 use-case 1.link
- MURA: MUsculoskeletal RAdiographs is a large dataset of skeletal X-rays. We use its validation split in evaluation 1 and 2's use-case 1. Images are grayscale and the square cropped. link
- NIH Chest: This NIH Chest X-ray Dataset is comprised of 112,120 X-ray images with 14 condition labels. The x-rays images are in posterior-anterior view (X-tray traverses back to front). link
- PAD Chest: This is a large scale chest X-ray dataset. It is labeled with 117 radiological findings - we use the subset with correspondence to the 14 condition labels in the NIH Chest dataset. Images are in 5 different views: posterior-anterior (PA), anterior-posterior (AP), lateral, AP horizontal, and pediatric. link
- PCAM: Patch Camelyon dataset is composed of histopathologic scans of lymph node sections. Images are labeled for presence of cancerous tissue. link
- RIGA: Fundus imaging dataset for glaucoma analysis. Images are marked by physicians for regions of disease. We use this dataset for evaluation 3, use-case 3.
OoD Detection Methods:
- ALI + Reconstruction Threshold: uses Adversarially Learning Inference link to train auto-encoder.
- Mahalanobis: Uses Gaussian discriminant analysis in classifier feature space to distinguish In/Out of distribution link.

Code Structure

We largely kept the same code structure as OD-test with the following additions:

In preproc are code for preprocessing some medical datasets. High resolution images are converted to 224x244 resolution, and images with useful labels are selected.
In setup are code for training NNs on source datasets (DRD, NIH Chest, PAD Chest, PCAM). Default hyperparameters are used.
[IN_dataset_name]_eval_rand_seeds.py are main scripts for evaluating OD methods on datasets. Some OD methods may be commented out and should be uncommented in the __main__ block. methods_64 are methods that uses 64x64 resolution, while methods use 224x224 resolution.

Name		Name	Last commit message	Last commit date
Latest commit History 163 Commits
datasets		datasets
docs		docs
methods		methods
models		models
preproc		preproc
setup		setup
utils		utils
.gitattributes		.gitattributes
.gitignore		.gitignore
Chest_eval_binary.py		Chest_eval_binary.py
Chest_eval_rand_seeds.py		Chest_eval_rand_seeds.py
DRD_eval_rand_seeds.py		DRD_eval_rand_seeds.py
LICENSE		LICENSE
PADChest_eval_rand_seeds.py		PADChest_eval_rand_seeds.py
PCAM_eval_rand_seeds.py		PCAM_eval_rand_seeds.py
README.md		README.md
Time_methods.py		Time_methods.py
Untitled.py		Untitled.py
densenet121-a639ec97.pth		densenet121-a639ec97.pth
eval3d.py		eval3d.py
eval3d_cifar_on_uc2.py		eval3d_cifar_on_uc2.py
eval3d_nih.py		eval3d_nih.py
eval3d_nih_3264.py		eval3d_nih_3264.py
eval_drd.sh		eval_drd.sh
eval_nihcc.sh		eval_nihcc.sh
eval_nihcc_binary.sh		eval_nihcc_binary.sh
eval_nihcc_vector.sh		eval_nihcc_vector.sh
eval_padchest.sh		eval_padchest.sh
eval_pcam.sh		eval_pcam.sh
eval_time.sh		eval_time.sh
global_vars.py		global_vars.py
jupyter.sh		jupyter.sh
launch_visdom.sh		launch_visdom.sh
make_aggregate_figures.py		make_aggregate_figures.py
makefullfigure.py		makefullfigure.py
model.pth.tar		model.pth.tar
mono_model.pth.tar		mono_model.pth.tar
process_results.py		process_results.py
process_results_multiple.py		process_results_multiple.py
results.csv		results.csv
run_eval3d_cifar_uc2.sh		run_eval3d_cifar_uc2.sh
run_eval3d_nih.sh		run_eval3d_nih.sh
run_eval3d_nih_3264.sh		run_eval3d_nih_3264.sh
run_tsne.sh		run_tsne.sh
run_tsne_vae.sh		run_tsne_vae.sh
setup_datasets.py		setup_datasets.py
train_DRD.sh		train_DRD.sh
train_DRD2.sh		train_DRD2.sh
train_DRDAEs.sh		train_DRDAEs.sh
train_NIHAE.sh		train_NIHAE.sh
train_NIHAEs.sh		train_NIHAEs.sh
train_NIHAEs_vector.sh		train_NIHAEs_vector.sh
train_NIHALI_MSE.sh		train_NIHALI_MSE.sh
train_NIHBinary.sh		train_NIHBinary.sh
train_NIHRESAE.sh		train_NIHRESAE.sh
train_NIHVAE.sh		train_NIHVAE.sh
train_PADChest.sh		train_PADChest.sh
train_PADChestAEs.sh		train_PADChestAEs.sh
train_PADChest_vector.sh		train_PADChest_vector.sh
train_PCAM.sh		train_PCAM.sh
train_PCAMAEs.sh		train_PCAMAEs.sh
train_PCAM_ALI.sh		train_PCAM_ALI.sh
train_PCAM_ALI_single.sh		train_PCAM_ALI_single.sh
tsne_encoder.py		tsne_encoder.py
workspace		workspace

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Datasets and methods

Code Structure

About

Releases

Packages

Contributors 2

Languages

License

caotians1/OD-test-master

Folders and files

Latest commit

History

Repository files navigation

Datasets and methods

Code Structure

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages