Synopsis

Our Analysis of Yelp Data Set to predict user sentiments based on their review.

Data Cleaning

Lowercase
Remove numbers
Remove stop words using nltk
Porter Stemming
Create sparse matrix representation using scikit.

Exploratory Data Analysis

Frequency vs Rank for a sample of yelp review dataset
To find out the stop words we are using inverse term document frequency.
To create a baseline for evaluating the algorithm, we are plotted the distribution of star category ratings.
To get a better intuition of the text data we plotted the most common and recurring words in each of the reviews.

Analysis

Bag of Words Generation - Bag of words representation of the user reviews.
Word Embeddings- Word embeddings representation of the user reviews.
Create models to predict sentiments based on user review and rating
1. Support Vector Machine
2. Long Short Term Memory Neural Network

Installation

Clone the repository

git clone https://github.com/hrushikesh-dhumal/Yelp-Data-Challlenge.git

Dependencies

Install the requirements using pip install -r requirements.txt

It is suggested that you have Anaconda which covers majority of the dependencies.

Example

The entire work is in form of python notebook. Execute the playbooks in order of their serial number.

Author Information

Hrushikesh Dhumal([email protected])

Parth Patel([email protected])

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
data/csv		data/csv
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
json_to_csv_converter.py		json_to_csv_converter.py
requirement.txt		requirement.txt
yelp_01dataCleaning.ipynb		yelp_01dataCleaning.ipynb
yelp_02EDA.ipynb		yelp_02EDA.ipynb
yelp_03bagOfWords.ipynb		yelp_03bagOfWords.ipynb
yelp_04word2vec.ipynb		yelp_04word2vec.ipynb
yelp_06SVM.ipynb		yelp_06SVM.ipynb
yelp_08LSTM.ipynb		yelp_08LSTM.ipynb
yelp_utils.py		yelp_utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Synopsis

Data Cleaning

Exploratory Data Analysis

Analysis

Installation

Example

Author Information

About

Releases

Packages

Contributors 2

Languages

License

patelparth30j/yelp-sentiment-analysis

Folders and files

Latest commit

History

Repository files navigation

Synopsis

Data Cleaning

Exploratory Data Analysis

Analysis

Installation

Example

Author Information

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages