ssciwr/mailcom

mailcom

A tool that parses the email body from email text (eml file) and retains only the text, with names removed, for French or Spanish emails.

Installation

Install using
python -m pip install mailcom

For an overview of the available languages and models, check the spaCy website.

Usage

The package uses spaCy for sentence splitting, based on the default language models, and transformers for named entity recognition (NER). Currently, you have to set the language and the eml file directory manually at the top of parse.py; the default directory is data/in. Then run python parse.py. After the run, the output can be found in data/out.
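
To illustrate the general idea of extracting the body and removing names, here is a minimal sketch that uses only the Python standard library. It is not the code in parse.py: the function names extract_body and redact, the [name] placeholder, and the hand-written character offsets are assumptions made for illustration. In the real package, the name positions would come from the NER model rather than from a string search.

```python
from __future__ import annotations

from email import message_from_string, policy


def extract_body(eml_text: str) -> str:
    """Return the plain-text body of an email given its raw eml source."""
    msg = message_from_string(eml_text, policy=policy.default)
    part = msg.get_body(preferencelist=("plain",))
    return part.get_content().strip() if part is not None else ""


def redact(text: str, spans: list[tuple[int, int]], placeholder: str = "[name]") -> str:
    """Replace each (start, end) character span with a placeholder."""
    # Work from right to left so that earlier offsets stay valid.
    for start, end in sorted(spans, reverse=True):
        text = text[:start] + placeholder + text[end:]
    return text


# A tiny example email (hypothetical content).
raw = (
    "From: a@example.org\n"
    "To: b@example.org\n"
    "Subject: Bonjour\n"
    "Content-Type: text/plain; charset=utf-8\n"
    "\n"
    "Bonjour Marie, on se voit demain.\n"
)

body = extract_body(raw)

# Stand-in for NER output: the character span of the person name "Marie".
start = body.index("Marie")
spans = [(start, start + len("Marie"))]

cleaned = redact(body, spans)
print(cleaned)
```

Replacing spans from right to left avoids having to recompute offsets after each substitution, which matters as soon as a text contains more than one name.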
