thesis_CSAI

This repository contains the code Jupyter Notebook of the thesis: Evaluating Sentence Embeddings on Syntactic Information Through Representational Similarity Analysis.

The notebook expects the Penn Treebank data set, however, this data set is not publicly available.

The notebook is dependent on the following packages:

nltk
ursa
pickle
os
matplotlib
numpy
tensorflow
skip-thoughts
scipy
pandas
jsonlines
BERT

Further, the analysis in the notebook is dependent on the pre-trained BERT model and the pre-trained uni-skip and bi-skip models.

The BERT model can be downloaded via the following link: https://storage.googleapis.com/bert_models/2018_10_18/uncased_L-12_H-768_A-12.zip Store the contents in a folder named: "uncased_L-12_H-768_A-12"

The Skip-Thought models can be download via the following instructions: https://github.com/tensorflow/models/tree/master/research/skip_thoughts#download-pretrained-models-optional Store the contents in a folder named: "skipthought"

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

thesis_CSAI

Files

README.md

Latest commit

History

README.md

File metadata and controls

thesis_CSAI