This project builds sentiment analysis models for an English movie-review dataset and an Arabic blog-post dataset.
- Python 3.*
- Python libraries:
  - scikit-learn (sklearn)
  - pandas
  - string
  - nltk
  - re
English
The IMDB dataset contains 50K movie reviews and is intended for binary sentiment classification.
Arabic
The BBN Blog Posts Sentiment Corpus contains a random subset of 1,200 sentences chosen from the BBN Arabic-Dialect–English Parallel Text.
In this project, we used Gaussian Naive Bayes from scikit-learn. The data were split into training and test sets at a ratio of 80:20.
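The training setup described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the toy reviews and labels are hypothetical stand-ins for the real datasets, and the bag-of-words features from CountVectorizer are an assumption, since the feature extraction step is not described here. GaussianNB needs dense input, so the sparse matrices are converted with toarray().

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB

# Hypothetical toy data standing in for the real corpora (1 = positive, 0 = negative).
texts = [
    "a wonderful and moving film",
    "great acting and a great story",
    "i loved every minute of it",
    "brilliant direction and beautiful music",
    "one of the best movies this year",
    "a boring and predictable plot",
    "terrible acting and a weak script",
    "i hated the ending",
    "a complete waste of time",
    "one of the worst movies this year",
]
labels = [1, 1, 1, 1, 1, 0, 0, 0, 0, 0]

# 80:20 split into training and test sets, keeping the class balance.
train_texts, test_texts, y_train, y_test = train_test_split(
    texts, labels, test_size=0.2, random_state=42, stratify=labels
)

# Assumed feature extraction: bag-of-words counts, fitted on the training set only.
vectorizer = CountVectorizer()
X_train = vectorizer.fit_transform(train_texts).toarray()  # GaussianNB requires dense arrays
X_test = vectorizer.transform(test_texts).toarray()

model = GaussianNB()
model.fit(X_train, y_train)

predictions = model.predict(X_test)
accuracy = accuracy_score(y_test, predictions)
print("Test accuracy:", accuracy)
```

Fitting the vectorizer on the training portion only keeps test vocabulary out of training. On the full 50K-review dataset a dense matrix can become very large, so limiting the vocabulary (for example with the max_features parameter of CountVectorizer) may be necessary.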
English
Arabic
Here are some predictions from the model.