Use these NLP, text mining, and machine learning code samples and tools to solve real-world text data problems.
Links in the first column take you to the subfolder/repository with the source code.
Task | Related Article | Source Type |
---|---|---|
Large Scale Phrase Extraction | phrase2vec article | Python script |
Word Cloud for Jupyter Notebook and Python Web Apps | word_cloud article | Python script + notebook |
Gensim Word2Vec (with dataset) | word2vec article | notebook |
Reading files and word count with Spark | spark article | Python script |
Extracting Keywords with TF-IDF and SKLearn (with dataset) | tfidf article | notebook |
Text Preprocessing | text preprocessing article | notebook |
- For more articles, please see this list
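
To give a feel for the kind of task covered above, here is a minimal sketch of TF-IDF keyword extraction. It is not the code from the tfidf notebook: it uses only the Python standard library instead of SKLearn, and the names `STOP_WORDS`, `tokenize`, and `top_keywords` are hypothetical, chosen just for this illustration. The IDF weighting is a common smoothed variant, assumed here for simplicity.

```python
import math
import re
from collections import Counter

# Tiny illustrative stop word list (an assumption; real projects use a fuller list).
STOP_WORDS = {"the", "a", "an", "on", "in", "of", "and"}


def tokenize(text):
    """Lowercase the text, split it into alphanumeric tokens, drop stop words."""
    return [t for t in re.findall(r"[a-z0-9]+", text.lower()) if t not in STOP_WORDS]


def top_keywords(docs, doc_index, k=3):
    """Return the k highest-scoring (term, score) pairs for docs[doc_index]."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)

    # Document frequency: in how many documents each term appears.
    df = Counter()
    for tokens in tokenized:
        df.update(set(tokens))

    # Term frequency within the target document, normalized by its length.
    tf = Counter(tokenized[doc_index])
    total = sum(tf.values())
    if total == 0:
        return []

    # Smoothed IDF: ln((1 + N) / (1 + df)) + 1.
    scores = {
        term: (count / total) * (math.log((1 + n_docs) / (1 + df[term])) + 1)
        for term, count in tf.items()
    }
    # Sort by descending score, breaking ties alphabetically.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))[:k]
```

Calling `top_keywords(docs, 0)` on a small list of strings returns the terms that are frequent in the first document but rare across the others. For real datasets, the notebook linked in the table is the better starting point.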
This repository is maintained by Kavita Ganesan. Please contact me directly if you have questions.