Problem:
Unclear or outdated documentation of build_langpack.py
My situation:
I really like this app, but sadly it doesn't have Hungarian language support.
I was trying to use build_langpack.py to create a Hungarian language pack, but the example command python build_langpack.py --lang fr --output langpack-fr.zip doesnt seem to work, since the script requires an --input option too:
Usage: python3 build_langpack.py --lang fr --name "French" --input french_words.txt --output langpack-fr.zip
So I scraped a Hungarian Wikipedia page for testing, and ran the script with the resulting txt as an input:
magyar.txt
The resulting unigrams.txt seems to be correct, but the dictionary only contains 73 words

Probably the .txt file is in a wrong format, but I didn't find any instructions on how the .txt input should be formatted.
My request
The readme need to be updated with clear instructions on how to create and add custom language packs.
Problem:
Unclear or outdated documentation of build_langpack.py
My situation:
I really like this app, but sadly it doesn't have Hungarian language support.
I was trying to use build_langpack.py to create a Hungarian language pack, but the example command
python build_langpack.py --lang fr --output langpack-fr.zipdoesnt seem to work, since the script requires an--inputoption too:Usage: python3 build_langpack.py --lang fr --name "French" --input french_words.txt --output langpack-fr.zipSo I scraped a Hungarian Wikipedia page for testing, and ran the script with the resulting txt as an input:
magyar.txt
The resulting unigrams.txt seems to be correct, but the dictionary only contains 73 words

Probably the .txt file is in a wrong format, but I didn't find any instructions on how the .txt input should be formatted.
My request
The readme need to be updated with clear instructions on how to create and add custom language packs.