Skip to content

[EMNLP 2023] Bridging Continuous and Discrete Spaces: Interpretable Sentence Representation Learning via Compositional Operations

Notifications You must be signed in to change notification settings

jyhuang36/InterSent

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

InterSent

Code for our paper Bridging Continuous and Discrete Spaces: Interpretable Sentence Representation Learning via Compositional Operations at EMNLP 2023

Requirements

  • transformers == 4.18.0
  • pytorch-lightning == 1.6.1

Data

└── data 
    └── paranmt
        └── para-nmt-5m-processed.txt
    └── discofuse
        ├── discofuse-train-balanced.txt
        └── discofuse-valid-balanced.txt
        └── discofuse-test-balanced.txt
    └── wikisplit
        ├── wikisplit-train.txt
        └── wikisplit-valid.txt
        └── wikisplit-test.txt
    └── google
        ├── sent-comp-train.txt
        └── sent-comp-test.txt

Training

To train InterSent from scratch, run the following:

bash train.sh

Evaluation

To evaluate InterSent on interpretability, run the following with your checkpoint path:

bash test.sh

To evaluate InterSent on STS, run the following with your checkpoint path:

bash stseval.sh

About

[EMNLP 2023] Bridging Continuous and Discrete Spaces: Interpretable Sentence Representation Learning via Compositional Operations

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published