Skip to content

Evaluating state-of-the-art embedding models with domain-specific instructions to improve semantic representation and enhance explainability in semantic search.

Notifications You must be signed in to change notification settings

ZikunFu/Embedding-Model-with-Instructions

Repository files navigation

Embedding Model with Instructions

Current Progress

  • Datasets
  • Embedding Models
    • Implemented:
      • Bert
      • Bert-Large
      • Instructor
      • T5
      • GPT2 (Medium)
    • TODO:
  • Instructions
    • Experiment Log
    • TODO:
      • Effects of instructions: study how sensitive models are to instructions.
  • Evaluation
    • SVM (Linear)
    • MLP

Research Questions:

  • Does the inclusion of instructions improve the embeddings?
  • How can prompt engineering be used to further enhance embedding models?

Setup Instructions

Clone the Repository

git clone https://github.com/ZikunFu/Embedding-Model-with-Instructions.git
cd Embedding-Model-with-Instructions

For Conda

conda create --name embed --file environment.yml
conda activate embed

For Pip

pip install -r requirements.txt

Usage

  1. Clone the repository.
  2. Create and activate the environment.
  3. Run the Jupyter Notebook.

Acknowledgments

  • Hugging Face for providing the pre-trained models.

About

Evaluating state-of-the-art embedding models with domain-specific instructions to improve semantic representation and enhance explainability in semantic search.

Topics

Resources

Stars

Watchers

Forks