Skip to content

honghanhh/terminology2024

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

61 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

LlamATE: Automated Term Extraction Using Large-scale Generative Language Models

This repo implements the code for LlamATE, a framework to verify the impact of domain specificity on ATE when using in-context learning prompts in open-sourced LLM-based chat models.

Requirements

Please install all the necessary libraries noted in requirements.txt using this command:

pip install -r requirements.txt

Data

The experiments were conducted on ACTER datasets:

ACTER dataset
Languages English, French, and Dutch
Domains Corruption, Wind energy, Equitation, Heart failure

Download the ACTER dataset at here and save into ACTER folder.

License

@inproceedings{tran2024llamate,
  title={LlamATE: Automated Term Extraction Using Large-scale Generative Language Models},
  author={Tran, Hanh Thi Hong and González-Gallardo, Carlos-Emiliano and Doucet, Antoine and Pollak, Senja},
  booktitle={Computational Terminology Special Issue - Terminology},
  year={2024},
  note={Accepted}
}

About

Do Large-scale Language Models Eliminate the Need for Domain-Specificity in Automatic Term Extraction?

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published