Knowledge distillation is a technique where we use a large pretrained model, called the teacher, to train a smaller model, called the student. The aim is a student model with faster inference than the teacher while giving up as little accuracy as possible. This technique can be used to quickly train smaller models for specialised tasks, even when little training data is available.
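The sketch below shows the standard distillation objective: a KL-divergence term between temperature-softened teacher and student outputs, mixed with an ordinary cross-entropy term on the ground-truth labels. It is only an illustration of the idea; the temperature `T` and weight `alpha` are placeholder values, not the settings used in the CIFAR experiments.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
    """Blend soft teacher targets with hard ground-truth targets.

    T and alpha are illustrative defaults, not the values used in this repo.
    """
    # Soft term: KL divergence between temperature-scaled distributions,
    # scaled by T^2 to keep gradient magnitudes comparable across temperatures.
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=1),
        F.softmax(teacher_logits / T, dim=1),
        reduction="batchmean",
    ) * (T * T)
    # Hard term: standard cross-entropy against the true labels.
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard
```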
The CIFAR code uses a ResNet50 model (with the classification head changed to predict 10 classes) as the teacher and a ResNet18 model as the student.
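A minimal sketch of that teacher/student setup using torchvision, assuming an ImageNet-pretrained teacher and a randomly initialised student; the actual construction in the CIFAR code may differ in details such as weight initialisation.

```python
import torch.nn as nn
from torchvision import models

# Teacher: ResNet50 with its classification head replaced for 10 CIFAR classes.
teacher = models.resnet50(weights=models.ResNet50_Weights.DEFAULT)
teacher.fc = nn.Linear(teacher.fc.in_features, 10)

# Student: smaller ResNet18, also with a 10-class head.
student = models.resnet18(weights=None)
student.fc = nn.Linear(student.fc.in_features, 10)
```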
To learn more about knowledge distillation, refer to Report.pdf.
Most of the code used in the CIFAR experiments was adapted from this repository.