Skip to content

Latest commit

 

History

History
8 lines (5 loc) · 413 Bytes

README.md

File metadata and controls

8 lines (5 loc) · 413 Bytes

Twitter_sentiment_analysis

This project is about the sentiment analysis of twitter data using PySpark.

The project uses the dataset downloaded from kaggle https://www.kaggle.com/kazanova/sentiment140/code

The tweet data is classified into three labels 'positive', 'neutral', 'negative'.

Two different classification models 'Logistic Regression' and 'Naive Bayes' classifier models are trained and evaluated.