Machine Learning with Spark Streaming

This project uses Spark Streaming and Spark MLlib (via PySpark) to perform sentiment analysis on a dataset of tweets. It was done as part of the Big Data (UE19CS322) course at PES University.

Models Implemented

  • Logistic Regression
  • Naive Bayes
  • Linear SVM
  • K-Means Clustering

All four models above have also been implemented with incremental learning via scikit-learn.

Team Members