fix dic paths

sgarda · sgarda · commit a3939d7d272c · 2018-02-15T12:36:38.000+01:00
diff --git a/data/dataset/README.md b/data/dataset/README.md
@@ -14,6 +14,9 @@ The intermediate step is done in order to get rid of missing tweets ( and tweets
 ## SPELL CHECKED
 
 Once you created the the file with the dependency it is possible to apply spell checking. For complete reproducibility the files using for spell check are provided.
+
+    $ python3 generate_data_file.py -p tweet_file_parsed -o output_path
+
 This step is performed now for the following reason:
 - need for tokenization
 - need for pos tags (provided by parsing) for avoiding parsing urls and emoticons
diff --git a/data/dataset/generate_data_file.py b/data/dataset/generate_data_file.py
@@ -12,7 +12,7 @@
 
 
 DISCARD = ['\n', 'Twitter / Account gesperr\n']
-DICS = ['/usr/share/hunspell/en_US.dic', '/usr/share/hunspell/en_US.aff']
+DICS = ['./en_US.dic', './en_US.aff']
 SPELLER = HunSpell(*DICS)
 NO_SPELL = ['^','Z','L','M','!','Y','#','@','~','U','E',',','G','S']