The dataset used to evaluate JobBERT on the task of job title normalization.

jensjorisdecorte/JobBERT-evaluation-dataset


JobBERT evaluation dataset 💾

JobBERT publication

This is the official repository for the evaluation data used in the JobBERT paper. The dataset is a list of vacancy titles, each tagged with an ESCO occupation. The full dataset is split into two files, stratified by class distribution: titles.csv was used for validation during training, and titles.test.csv was used to measure the final performance of the trained models.

ℹ️ This data was automatically collected from a large governmental job board.
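As a quick sanity check after downloading, the two files can be loaded and their label distributions compared. The snippet below is only a minimal sketch: it assumes both files are comma-separated, UTF-8 encoded, have a header row, and sit in the current directory. The column names are not documented here, so the helper takes the label column as a parameter and the script prints the header it finds.

```python
import csv
from collections import Counter
from pathlib import Path


def load_rows(path):
    """Read a CSV file with a header row into a list of dicts."""
    with open(path, newline="", encoding="utf-8") as handle:
        return list(csv.DictReader(handle))


def label_shares(rows, label_column):
    """Return the relative frequency of each value in `label_column`.

    `label_column` is whatever header holds the ESCO occupation; its
    actual name is an assumption the caller has to check.
    """
    counts = Counter(row[label_column] for row in rows)
    total = sum(counts.values())
    return {label: count / total for label, count in counts.items()}


if __name__ == "__main__":
    for name in ("titles.csv", "titles.test.csv"):
        if not Path(name).exists():
            print(f"{name}: not found in the current directory")
            continue
        rows = load_rows(name)
        columns = list(rows[0].keys()) if rows else []
        print(f"{name}: {len(rows)} rows; columns: {columns}")
```

Once the header is known, calling `label_shares` on both files with the occupation column should give similar proportions per class if the split is stratified as described.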

BibTeX Citation

If you use this dataset in a scientific publication, we would appreciate it if you cited the following paper:

@inproceedings{8720079,
  author       = {Decorte, Jens-Joris and Van Hautte, Jeroen and Demeester, Thomas and Develder, Chris},
  booktitle    = {{FEAST, ECML-PKDD 2021 Workshop, Proceedings}},
  language     = {{eng}},
  location     = {{Online}},
  pages        = {{9}},
  title        = {{JobBERT : understanding job titles through skills}},
  url          = {{https://feast-ecmlpkdd.github.io/papers/FEAST2021_paper_6.pdf}},
  year         = {{2021}},
}
