Amazon Electronics Review Sentiment Analysis

A comprehensive sentiment analysis project analyzing Amazon Electronics product reviews using VADER and TextBlob sentiment analysis techniques, featuring an interactive visualization dashboard.

Project Overview

This project conducts sentiment analysis on Amazon product reviews in the Electronics category. Using Natural Language Processing (NLP) techniques, VADER and TextBlob sentiment analyzers, we analyze customer sentiment patterns and derive insights from user feedback through an interactive dashboard.

Summary of Key Insights

Sentiment Distribution:
- Positive sentiment: 82.9% of reviews
- Neutral sentiment: 11% of reviews
- Negative sentiment: 6.1% of reviews
Rating-Sentiment Correlation:
- Higher star ratings align strongly with positive sentiment.
- Mixed sentiments are more common in 3-star reviews.
Category and Brand Insights:
- Certain product categories and brands exhibit consistently higher positive sentiment.
- Technical products have more detailed sentiment patterns.
Product-Level Analysis:
- High-review-count products show balanced sentiment distribution.
- Price and technical specifications are key drivers of sentiment.

Key Findings

Sentiment Distribution

Majority of reviews show positive sentiment (82.9%).
Neutral reviews account for 11% of total.
Negative reviews represent 6.1% of the dataset.

Rating-Sentiment Correlation

Strong correlation between star ratings and sentiment analysis results.
Higher star ratings consistently show more positive sentiment.
Mixed sentiments appear more frequently in 3-star reviews.

Category Insights

Certain product categories show consistently higher positive sentiment.
Technical products tend to have more detailed and nuanced sentiment patterns.
Price sensitivity varies significantly across categories.

Brand Performance

Top brands maintain consistently higher positive sentiment ratios.
Brand sentiment varies significantly by product category.
Customer service and product reliability are key factors in brand sentiment.

Project Structure

Five-Star/
├── data/                    # Data files and analysis results
├── docs/                    # Project documentation and rubrics
├── notebooks/              # Jupyter notebooks for analysis
├── src/                    # Python source code files
├── templates/              # HTML templates
├── .gitignore             # Git ignore file
├── environment.yml        # Conda environment configuration
├── Final_Report.pdf       # Final report
├── Final_SA_Amazon_Presentation.pptx  # Final presentation
└── README.md              # Project documentation

Data Overview

Raw data source: https://jmcauley.ucsd.edu/data/amazon/index_2014.html Processed reviews: final_sentiment_analysis_data.csv

Core Review Data

Column Name	Description
reviewer_id	Unique identifier for each reviewer
asin	Amazon product identifier
review_text	Full text of the review
overall	Star rating (1-5 scale)
summary	Short review title/summary

Metadata

Column Name	Description
helpful	List of [helpful_votes, total_votes]
helpful_ratio	Ratio of helpful to total votes
unix_review_time	Review timestamp (Unix format)
review_time	Review date (MM DD, YYYY)
review_date	Review date (YYYY-MM-DD)

Text Analysis

Column Name	Description
cleaned_text	Preprocessed review text
processed_text	Tokenized/stemmed text
review_length	Character count
word_count	Number of words
sentiment	Calculated sentiment (positive/neutral/negative)

Prerequisites and Installation

Prerequisites

Python 3.8 or higher
Conda (Anaconda/Miniconda)
Git (for cloning the repository)

Required Packages

name: sentiment_analysis_env
dependencies:
  - python=3.9
  - pandas
  - numpy
  - matplotlib
  - seaborn
  - tqdm
  - wordcloud
  - flask
  - scikit-learn
  - spacy
  - pip
  - pip:
      - vaderSentiment
      - notebook
      - plotly
      - nbformat
      - textblob

Installation

Clone the repository:

git clone https://github.com/yourusername/Five-Star.git
cd Five-Star

Create the Conda environment:

conda env create -f environment.yml

Activate the environment:

conda activate sentiment_analysis_env

Install the spaCy language model:

python -m spacy download en_core_web_sm

Verify the setup:

python -c "import pandas, numpy, matplotlib, seaborn, tqdm, wordcloud, flask, vaderSentiment, notebook, plotly, nbformat, textblob, spacy; print('Setup successful')"

Analysis and Dashboard

Usage Instructions

Start by exploring the Jupyter notebooks in the notebooks/ directory:

jupyter notebook

Load and preprocess data using 02_preprocessing_reviews_data_part3.ipynb.
Run sentiment analysis on desired product reviews using 03_sentiment_analysis_indepth_part2.ipynb.

Dashboard Features

The project includes an interactive Flask-based dashboard to visualize results.

Overall Sentiment Distribution: Interactive pie charts showing sentiment breakdowns.
Rating Analysis:
- Sentiment distribution across star ratings.
- Grouped bar charts showing sentiment patterns.
Category and Brand Analysis:
- Top categories/brands by review count.
- Sentiment distribution within each category/brand.
Product Insights:
- Top 5 positive and negative products.
- Sentiment ratios and review counts.

Running the Dashboard

Activate the environment:

conda activate sentiment_analysis_env

Navigate to the src directory and run:

python3 04_data_visualization_advanced_part4.py

Key Features of Analysis

Sentiment classification using VADER and TextBlob.
Word frequency analysis.
Brand and product category sentiment trends.
Temporal sentiment analysis.
Review helpfulness correlation.

Team Members

License

This project is part of the DS5110 course at Northeastern University.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Amazon Electronics Review Sentiment Analysis

Project Overview

Summary of Key Insights

Key Findings

Sentiment Distribution

Rating-Sentiment Correlation

Category Insights

Brand Performance

Project Structure

Data Overview

Core Review Data

Metadata

Text Analysis

Prerequisites and Installation

Prerequisites

Required Packages

Installation

Analysis and Dashboard

Usage Instructions

Dashboard Features

Running the Dashboard

Key Features of Analysis

Team Members

License

References

About

Releases

Packages

Contributors 2

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 71 Commits
data		data
docs		docs
notebooks		notebooks
src		src
templates		templates
.gitignore		.gitignore
Final_Report.pdf		Final_Report.pdf
Final_SA_Amazon_Presentation.pptx		Final_SA_Amazon_Presentation.pptx
README.md		README.md
environment.yml		environment.yml

yizucodes/Five-Star

Folders and files

Latest commit

History

Repository files navigation

Amazon Electronics Review Sentiment Analysis

Project Overview

Summary of Key Insights

Key Findings

Sentiment Distribution

Rating-Sentiment Correlation

Category Insights

Brand Performance

Project Structure

Data Overview

Core Review Data

Metadata

Text Analysis

Prerequisites and Installation

Prerequisites

Required Packages

Installation

Analysis and Dashboard

Usage Instructions

Dashboard Features

Running the Dashboard

Key Features of Analysis

Team Members

License

References

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages