This repository provides a reproduction of the Supervised Contrastive Learning framework introduced by Khosla et al. The project focuses on training neural networks with a supervised contrastive loss and evaluating their performance on benchmark datasets.
- Implementation of the Supervised Contrastive Loss (`supin` and `supout` variants); a minimal loss sketch follows this feature list.
- Support for multiple ResNet backbones: ResNet-34, ResNet-50, ResNet-101, and ResNet-200.
- Pretraining with supervised contrastive loss for improved feature representation.
- Fine-tuning and training classifiers from scratch for comparative evaluations.
- Data augmentation strategies including CutMix, MixUp, and AutoAugment.
- Configurable training settings to adapt to different tasks and datasets.
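For orientation, the `supout` variant corresponds to the L_out formulation in Khosla et al.; a minimal PyTorch sketch of that loss is shown below. The function name, signature, and masking details are illustrative only and do not mirror the repository's `losses/SupOut.py` API.

```python
import torch

def supcon_out_loss(features, labels, temperature=0.1):
    """Illustrative "L_out" supervised contrastive loss (not the repo's exact implementation).

    features: (N, D) L2-normalized projections of N augmented views.
    labels:   (N,)   class label of each view.
    """
    n = features.size(0)
    device = features.device

    # Cosine similarities scaled by temperature (features are assumed normalized).
    logits = features @ features.T / temperature
    # Subtract the row-wise max for numerical stability.
    logits = logits - logits.max(dim=1, keepdim=True).values.detach()

    self_mask = torch.eye(n, dtype=torch.bool, device=device)
    # Positives: views sharing the anchor's label, excluding the anchor itself.
    pos_mask = labels.unsqueeze(0).eq(labels.unsqueeze(1)) & ~self_mask

    # Denominator sums over every view except the anchor.
    exp_logits = torch.exp(logits).masked_fill(self_mask, 0.0)
    log_prob = logits - torch.log(exp_logits.sum(dim=1, keepdim=True))

    # "Out" variant: average log-probabilities over each anchor's positives, then over anchors.
    mean_log_prob_pos = (pos_mask * log_prob).sum(dim=1) / pos_mask.sum(dim=1).clamp(min=1)
    return -mean_log_prob_pos.mean()
```

In practice each image contributes two augmented views per batch, so every anchor has at least one positive with the same label.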
git clone https://github.com/DragonBebe/MLA_SCL.git
cd MLA_SCL
- Create the conda environment from the provided `environment.yml` file:
  conda env create --file environment.yml
- Activate the environment:
  conda activate SCL
The project structure is organized as follows:
Supervised-Contrastive-Learning/
├── Contrastive_Learning/
│ ├── __init__.py # Marks the directory as a Python package
│ ├── config_con.py # Configuration file for supervised contrastive learning
│ ├── train_con.py # Main training script for contrastive learning
├── data_augmentation/
│ ├── __init__.py # Marks the directory as a Python package
│ ├── CutMix.py # Implementation of CutMix augmentation
│ ├── CutOut.py # Implementation of CutOut augmentation
│ ├── MixUp.py # Implementation of MixUp augmentation
│ ├── data_augmentation_con.py # Augmentation pipeline for contrastive learning
├── losses/
│ ├── __init__.py # Marks the directory as a Python package
│ ├── SupIn.py # Implementation of SupIn loss
│ ├── SupOut.py # Implementation of SupOut loss
│ ├── CrossEntropy.py # Implementation of CrossEntropy loss
├── models/
│ ├── __init__.py # Marks the directory as a Python package
│ ├── ResNet34.py # Implementation of ResNet-34 backbone
│ ├── ResNet50.py # Implementation of ResNet-50 backbone
│ ├── ResNet101.py # Implementation of ResNet-101 backbone
│ ├── ResNet200.py # Implementation of ResNet-200 backbone
│ ├── Projectionhead.py # Implementation of the projection head
├── saved_models/ # Directory for saving pretrained models and weights
│ ├── classification/ # Contains weights for classification tasks
│ │ ├── pretrain/ # Pretrained classification models
│ │ └── scratch/ # Models trained from scratch
│ ├── pretraining/ # Pretrained weights for contrastive learning
├── my_logs/ # Stores training logs
├── main_con.py # Entry point for contrastive learning pretraining
├── train_pretrained_classifier.py # Fine-tuning pretrained models
├── train_scratch_classifier.py # Training classifiers from scratch
├── test_pretrained_classifier.py # Evaluating pretrained models
├── test_scratch_classifier.py # Evaluating classifiers trained from scratch
└── environment.yml # Python dependencies for setting up the environment
To pretrain the model with the supervised contrastive loss, run the following command (parameters can be adjusted as needed):
python main_con.py --batch_size 32 --learning_rate 0.5 --epochs 700 --temp 0.1 --log_dir ./my_logs --model_save_dir ./saved_models/pretraining --gpu 0 --dataset ./data --dataset_name cifar10 --model_type ResNet34 --loss_type supout --input_resolution 32 --feature_dim 128 --num_workers 2
To fine-tune the pretrained model for classification, run the following command (parameters can be adjusted as needed):
python train_pretrained_classifier.py --model_type ResNet34 --pretrained_model ./saved_models/pretraining/ResNet34/ResNet34_cifar10_feat128_supout_epoch241_batch32.pth --save_dir ./saved_models/classification/pretrained --batch_size 32 --epochs 3 --learning_rate 0.001 --dataset_name cifar10 --dataset ./data --gpu 0
To train a classifier from scratch, without pretraining, run the following command (parameters can be adjusted as needed):
python train_scratch_classifier.py --model_type ResNet34 --batch_size 32 --epochs 3 --learning_rate 0.1 --dataset_name cifar10 --dataset ./data --save_dir ./saved_models/classification/scratch --gpu 0
In this project, Supervised Contrastive Learning is implemented as a pretraining strategy that effectively clusters data representations before classification. The training process is divided into three distinct phases:
The first step is to pretrain the model with the supervised contrastive loss. This clusters the feature representations by class, preparing them for the downstream classification task. Use the `main_con.py` script for this step; the pretrained weights are saved automatically.
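Conceptually, one pretraining epoch looks like the sketch below; the loader, module, and function names are hypothetical, and the actual logic lives in `main_con.py` and `Contrastive_Learning/train_con.py`.

```python
import torch
import torch.nn.functional as F

def pretrain_one_epoch(loader, encoder, proj_head, criterion, optimizer, device):
    """Illustrative supervised contrastive pretraining step.

    loader yields ((view1, view2), labels), i.e. two augmentations of each image;
    encoder is a ResNet backbone, proj_head the projection MLP,
    criterion a supervised contrastive loss such as the sketch above.
    """
    encoder.train()
    proj_head.train()
    for (view1, view2), labels in loader:
        images = torch.cat([view1, view2], dim=0).to(device)    # (2B, C, H, W)
        labels = torch.cat([labels, labels], dim=0).to(device)  # (2B,)

        feats = F.normalize(proj_head(encoder(images)), dim=1)  # (2B, feature_dim)
        loss = criterion(feats, labels)

        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
```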
After pretraining, the next step is to fine-tune the pretrained weights for linear classification. Use the `train_pretrained_classifier.py` script to load the pretrained weights and perform the classification task; a minimal loading sketch follows the notes below.
Important Notes:
- Both training steps must use the same backbone network (e.g., ResNet-34) and dataset (e.g., CIFAR-10) for consistency.
- Ensure that the correct pretrained weights are loaded during the fine-tuning step.
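A minimal sketch of that loading step follows; the state-dict keys, the feature dimension, and whether the backbone stays frozen are assumptions here, and `train_pretrained_classifier.py` remains the authoritative logic.

```python
import torch
import torch.nn as nn

def build_classifier_from_pretrained(encoder, checkpoint_path, feat_dim, num_classes,
                                     device, freeze_backbone=True):
    """Attach a linear classifier to a contrastive-pretrained backbone (illustrative sketch)."""
    state = torch.load(checkpoint_path, map_location=device)
    # strict=False tolerates projection-head keys that the classifier does not use (assumption).
    encoder.load_state_dict(state, strict=False)
    if freeze_backbone:
        for p in encoder.parameters():      # linear evaluation: train only the classifier head
            p.requires_grad = False
    classifier = nn.Linear(feat_dim, num_classes)  # feat_dim = backbone output size, e.g. 512
    return nn.Sequential(encoder, classifier).to(device)
```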
For comparison, the `train_scratch_classifier.py` script trains a classifier from scratch on the same dataset without any pretraining. This serves as a baseline for quantifying the improvement introduced by supervised contrastive pretraining.
During training, the scripts automatically save the model weights with the best performance (e.g., highest accuracy). These saved weights can be used for further evaluations or deployment.
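Accuracy in the tables below is reported as Top-1 and Top-5. A compact sketch of that metric is given here; the test scripts in the repository may compute it slightly differently.

```python
import torch

def topk_accuracy(logits, targets, ks=(1, 5)):
    """Percentage of samples whose true class appears among the top-k predictions."""
    _, pred = logits.topk(max(ks), dim=1)    # (N, max_k) predicted class indices
    hits = pred.eq(targets.unsqueeze(1))     # (N, max_k) boolean matches against the labels
    return [hits[:, :k].any(dim=1).float().mean().item() * 100.0 for k in ks]
```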
By structuring the training process this way, the project ensures:
- Efficient feature extraction through pretraining.
- Robust evaluation of the performance benefits of supervised contrastive learning.
- Direct comparison between pretrained and non-pretrained approaches.
We evaluated the performance of Supervised Contrastive Learning (SupCon) and Cross-Entropy (CE) loss functions on classification tasks using CIFAR-10 and CIFAR-100 datasets. Results include Top-1 and Top-5 accuracies for two ResNet variants:
- ResNet-34-org: Original ResNet-34 architecture
- ResNet-34-new: Optimized ResNet-34 architecture with improvements (SE modules, GELU activations, etc.)
| Dataset   | Loss          | Architecture  | Test Top-1 (%) | Test Top-5 (%) |
|-----------|---------------|---------------|----------------|----------------|
| CIFAR-10  | Cross-Entropy | ResNet-34-org | 85.34          | 96.98          |
| CIFAR-10  | SupCon        | ResNet-34-org | 90.30          | 99.52          |
| CIFAR-10  | Cross-Entropy | ResNet-34-new | 89.94          | 99.61          |
| CIFAR-10  | SupCon        | ResNet-34-new | 91.70          | 99.73          |
| CIFAR-100 | Cross-Entropy | ResNet-50-org | 81.68          | 97.86          |
| CIFAR-100 | SupCon        | ResNet-50-org | 91.22          | 98.60          |
| CIFAR-100 | Cross-Entropy | ResNet-34-new | 63.71          | 87.58          |
| CIFAR-100 | SupCon        | ResNet-34-new | 65.88          | 89.01          |
- SupCon consistently outperforms Cross-Entropy, achieving higher Top-1 and Top-5 accuracy across all architectures and datasets.
- The optimized ResNet-34 (ResNet-34-new) improves over the original ResNet-34 on CIFAR-10 under both loss functions.
We evaluated the impact of different data augmentation methods on the accuracy of Supervised Contrastive Learning (SupCon) and Cross-Entropy (CE) loss functions using the CIFAR-10 dataset. The study utilized the ResNet-34-new model with three data augmentation methods:
- MixUp: Linearly combines two images and their labels (a minimal sketch follows this list).
- CutMix: Replaces a portion of one image with a patch from another image, mixing labels accordingly.
- AutoAugment: Applies a sequence of predefined augmentation operations to improve generalization.
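As a concrete illustration of the first method, a minimal MixUp sketch is given below; it is not the repository's `data_augmentation/MixUp.py`, and the `alpha` default is only an assumption.

```python
import torch

def mixup_batch(images, labels_onehot, alpha=1.0):
    """Blend a batch with a shuffled copy of itself; labels are blended identically (sketch)."""
    lam = torch.distributions.Beta(alpha, alpha).sample().item()  # mixing coefficient in (0, 1)
    perm = torch.randperm(images.size(0))
    mixed_images = lam * images + (1.0 - lam) * images[perm]
    mixed_labels = lam * labels_onehot + (1.0 - lam) * labels_onehot[perm]
    return mixed_images, mixed_labels
```

CutMix follows the same label-mixing idea but swaps a rectangular patch of pixels instead of blending whole images.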
| Loss          | Augmentation | Test Top-1 (%) | Test Top-5 (%) |
|---------------|--------------|----------------|----------------|
| Cross-Entropy | MixUp        | 83.34          | 98.23          |
| Cross-Entropy | CutMix       | 90.30          | 99.49          |
| Cross-Entropy | AutoAugment  | 89.94          | 99.61          |
| SupCon        | MixUp        | 85.68          | 98.73          |
| SupCon        | CutMix       | 91.22          | 99.42          |
| SupCon        | AutoAugment  | 91.70          | 99.73          |
- Impact of Data Augmentation:
  - AutoAugment gives the best SupCon results (Test Top-1 of 91.70) and the highest Cross-Entropy Top-5 (99.61).
  - CutMix performs closely, with Test Top-1 accuracies of 91.22 (SupCon) and 90.30 (Cross-Entropy); the latter is the best Cross-Entropy Top-1.
  - MixUp shows the weakest performance, with Test Top-1 accuracies of 85.68 (SupCon) and 83.34 (Cross-Entropy).
- Advantages of SupCon:
  - SupCon consistently outperforms Cross-Entropy across all augmentation methods. For instance, with AutoAugment, SupCon achieves a Top-1 accuracy of 91.70, surpassing Cross-Entropy's 89.94.
For any inquiries, feel free to reach out:
Zhuoxuan Cao
Email: [email protected]
- Khosla, Prannay, et al. "Supervised Contrastive Learning." arXiv preprint arXiv:2004.11362, v5 (revised March 10, 2021). DOI: 10.48550/arXiv.2004.11362
- Chen, Ting, et al. "A Simple Framework for Contrastive Learning of Visual Representations." arXiv preprint arXiv:2002.05709, v3 (revised July 1, 2020). DOI: 10.48550/arXiv.2002.05709
- He, Kaiming, et al. "Deep Residual Learning for Image Recognition." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016, pp. 770–778. DOI: 10.1109/CVPR.2016.90
- Hu, Jie, et al. "Squeeze-and-Excitation Networks." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018, pp. 7132–7141. DOI: 10.1109/CVPR.2018.00745
- Hendrycks, Dan, and Kevin Gimpel. "Gaussian Error Linear Units (GELUs)." arXiv preprint arXiv:1606.08415, 2016.
- Zhang, Hongyi, et al. "mixup: Beyond Empirical Risk Minimization." arXiv preprint arXiv:1710.09412, v2 (revised April 27, 2018). DOI: 10.48550/arXiv.1710.09412
- Yun, Sangdoo, et al. "CutMix: Regularization Strategy to Train Strong Classifiers with Localizable Features." arXiv preprint arXiv:1905.04899, v2 (revised August 7, 2019). DOI: 10.48550/arXiv.1905.04899