prakashchhipa/Functional_Knowledge_Transfer_SSL
Functional Knowledge Transfer with Self-supervised Representation Learning (ICIP 2023)
Title
Functional Knowledge Transfer with Self-supervised Representation Learning
Venue
Accepted at IEEE International Conference on Image Processing (ICIP 2023)
Chhipa, Prakash Chandra, Muskaan Chopra, Gopal Mengi, Varun Gupta, Richa Upadhyay, Meenakshi Subhash Chippa, Kanjar De, Rajkumar Saini, Seiichi Uchida, and Marcus Liwicki. "Functional Knowledge Transfer with Self-supervised Representation Learning." In 2023 IEEE International Conference on Image Processing (ICIP), pp. 3339-3343. IEEE, 2023.
Article
Poster & Presentation Video
Click here for enlarged view
Video presentation (5+ minutes) describing the work
Abstract
This work investigates the unexplored usability of self-supervised representation learning in the direction of functional knowledge transfer. In this work, functional knowledge transfer is achieved by joint optimization of self-supervised learning pseudo task and supervised learning task, improving supervised learning task performance. Recent progress in self-supervised learning uses a large volume of data, which becomes a constraint for its applications on small-scale datasets. This work shares a simple yet effective joint training framework that reinforces human-supervised task learning by learning self-supervised representations just-in-time and vice versa. Experiments on three public datasets from different visual domains, Intel Image, CIFAR, and APTOS, reveal a consistent track of performance improvements on classification tasks during joint optimization. Qualitative analysis also supports the robustness of learnt representations.
Method
SimCLR contrastive learning method employed for self-supervised representation learning part.
Datasets
Three publically available datasets from diverse visual domains are chosen for exprimentations.
- CIFAR10 - The CIFAR-10 dataset consists of 60000 32x32 colour images in 10 classes (airplane, automobile, bird, cat, deer, dog, frog, horse, ship, and truck), with 6000 images per class. There are 50000 training images and 10000 test images.
- Intel Images - This is image data of Natural Scenes around the world. This Data contains around 25000 images (of which 17034 used) of size 150x150 distributed under six categories (buildings, forest, glaciar, mountain, sea, and street).
- APTOS 2019 - A set of 3662 retina images of fundus photography under a variety of imaging conditions. A clinician has rated each image for the severity of diabetic retinopathy on a scale of 0 to 4 (0: No DR, 1: Mild, 2: Moderate, 3: Severe, and 4: Proliferative DR).
Results
All the expriments have batch size of 256 and ResNet50 encoder.
Qualitative
Model Weights
-
Functional Knowledge Transfer Jointly Trained Models
a. ResNet50 on CIFAR10
b. ResNet50 on Aptos 2019
c. ResNet50 on Intel Images -
SSL Pretrained Models
a. ResNet50 on CIFAR10
b. ResNet50 on Aptos 2019
c. ResNet50 on Intel Images
Commands
- Pretrain (for representational transfer)
python -m pretrain <resnet_version> <device> <dataset>
- Finetune - downstream task
python -m finetune train <resnet_version> <device> <dataset> <pretrained_model_weights_path>
- Joint training (for Functional represetation transfer)
python -m joint_train <resnet_version> <device> <dataset>


