Skip to content

kushalgarg101/I-JEPA-Model-Implementation-

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

I-JEPA Paper Implementation

My code implementation for I-JEPA Model from paper titled as : Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture.

Details

  • Implemented Vision Transformer from scratch as used in all three blocks of model : Target Encoder, Context Encoder and Predictor.

  • The paper mentions they have not used [CLS] token in any of the blocks. For Predictor block they have also mentioned of keeping number of self-attention heads equal to that of the backbone context-encoder but changing depth of predictor.

  • NOTE: Working on preparing image input for context encoder and combining these two blocks with Predictor model.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published