Skip to content

Latest commit

 

History

History
110 lines (95 loc) · 8.32 KB

README.md

File metadata and controls

110 lines (95 loc) · 8.32 KB

SimCLR

A PyTorch Implementation Of SimCLR

Introduction:

SimCLR, short for "Simple Contrastive Learning of Visual Representations," is a powerful self-supervised learning framework for learning high-quality image representations without requiring manual labels.It leverages contrastive learning, where the model is trained to pull together similar images and push apart dissimilar ones in a learned feature space.

Image Source

Folder Structure:

/
│
├── Sim_CLR/
│   ├── Adam/
│   │   ├── Contrastive Training Loss per Epoch.png
│   │   ├── SIMclr_confusion_matrix.png
|   |   ├── training_validation_metrics_finetuning_simclr.png
|   |   ├── tsne_finetuning_dataset.png
|   |   ├── tsne_test_dataset.png
|   |   ├── Adam.ipynb
│   ├── SGD/
│   │   ├── Contrastive Training Loss per Epoch.png
│   │   ├── SIMclr_confusion_matrix.png
|   |   ├── training_validation_metrics_finetuning_simclr.png
|   |   ├── tsne_finetuning_dataset.png
|   |   ├── tsne_test_dataset.png
|   |   ├── SGD.ipynb
│   ├── Augmentation.png
├── Supervised_Resnet18_as_backbone/
│   ├── supervised_ Resnet18 as Feature Extractor_confusion_matrix.png
│   ├── training_validation_metrics_supervised.png
│   ├── tsne_finetuning_dataset_pretrained_resnet18.png
│   ├── tsne_test_dataset_pretrained_resnet18.png
│   ├── Supervised.ipynb
├── .gitignore
├── LICENSE
├── README.md
├── requirements.txt

Implementation Overview:

In this implementation CIFAR-10 dataset has been used. For contrastive pretraining Resnet-18(pretrained=False) has been used as backbone. Before the pretraining, we performed two types of augmentation on the dataset, namely random cropping and color jittering.

Dataset Augmentation

Then it is pretrained for 100 epochs using Adam(Adam.ipynb) & Nesterov accelerated SGD(SGD.ipynb) and NT-Xent loss (Normalized temperature-scaled cross-entropy loss).

Image Source: Medium

After pretraining I threw away the projection head and I made a finetuning head and finetuned it using 6500 labeled datapoints for 20 epochs using Adam optimizer.

I also implemented this project using supervised approach(Supervised.ipynb).At first downloaded imagenet pretrained Resnet-18 and used it as a feature extractor for 6500 labeled datapoints and then made a finetuning head and finetuned it using those extracted features.

How To Run:

  • At first, make sure your project environment contains all the necessary dependencies by running
pip install -r requirements.txt
  • Then to run the corresponding notebooks, download them and run the cells sequentially.
  • To use the trained models, download them from the links given below and change the paths in the notebooks accordingly.

Trained Models:

  • SimCLR_Adam:

    • simclr_backbone.pth->This is Resnet-18(pretrained=False) backbone, which was pretrained by contrastive learning using Adam optimizer.
    • simclr_projection_head.pth->This is the projection head used in contrastive pretraining to project the images in 128 dimension and calculate NT-Xent loss, after pretraining I threw it away.
    • finetuned_model.pth->This is the finetuning head which was finetuned by 6500 labeled datapoints(after feature extraction of those datapoints by contrastive pretrained Resnet-18).
  • SimCLR_SGD:

    • simclr_backbone.pth->This is Resnet-18(pretrained=False) backbone, which was pretrained by contrastive learning using Nesterov accelerated SGD optimizer.
    • simclr_projection_head.pth->This is the projection head used in contrastive pretraining to project the images in 128 dimension and calculate NT-Xent loss, after pretraining I threw it away.
    • finetuned_model.pth->This is the finetuning head which was finetuned by 6500 labeled datapoints(after feature extraction of those datapoints by contrastive pretrained Resnet-18).
  • Supervised:

    • supervised_model.pth->This is the finetuning head of supervised approach, which was used after feature extraction by imagenet pretrained Resnet-18.

Results:

All the experiments were performed on CIFAR-10 dataset.

Method Batch Size Backbone Projection Dimension Contrastive Pretraining Epochs Finetuning Epochs Optimizer Test Accuracy Test F1 Score
SimCLR 256 ResNet18(pretrained=False) 128 100 20 Adam(pretraining),Adam(finetuning) 0.6137 0.6122
SimCLR 256 ResNet18(pretrained=False) 128 100 20 NAG(pretraining),Adam(finetuning) 0.6884 0.6854
Supervised 256 ResNet18(pretrained=True) 1000 - 20 Adam(finetuning) 0.7558 0.7543

Visualizations:

Contrastive Training Loss per Epoch (Adam) Contrastive Training Loss per Epoch (NAG)
Contrastive Training Loss per Epoch (Adam) Contrastive Training Loss per Epoch (NAG)
l_A_sgd l_a_s
Train and Validation Metrics in Finetuning of SimCLR Method Train and Validation Metrics in Finetuning of Supervised Method
CF(NAG) CF (s)
NAG Optimizer Based SimCLR Supervised Approach

Reference: