Skip to content

Extraction script for Paracrawl tmx files for use in WMT19

License

Notifications You must be signed in to change notification settings

jgwinnup/wmt19-tmx-extract

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 

Repository files navigation

wmt19-tmx-extract

Extraction script for Paracrawl tmx files for use in WMT19

Adapted from Apertium TMX tools http://wiki.apertium.org/wiki/Tools_for_TMX

use:

gzcat tmx.gz | ./tmx-extract-parallel.py -b base_filename -s src_lang -t tgt_lang -c (optional - removes crlf in )

About

Extraction script for Paracrawl tmx files for use in WMT19

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages