Introduction

The dataset I used is a compiled dataset from Kaggle (https://www.kaggle.com/petersunga/google-amazon-facebook-employee-reviews) that contains 67k employee reviews for the most prestigious tech companies in the United States, such as Google, Netflix, Amazon, Facebook, Apple, and Microsoft. The project requires a prescriptive analysis, so I will close with a recommendation of which tech company is the best to work for, based on the results of my analysis.

Image of Overall-Ratings

Approach-Methodology

As the project requires, I will upload the revised dataset to a local database. I will then apply the techniques covered in the lecture portion of CIS545-11, along with some Exploratory Data Analysis (EDA). Below is a list of the items I will cover in this project:

  • EDA
  • Descriptive Analysis (Scatterplots, Histograms, Correlation, etc.)
  • Linear Regression
  • Logistic Regression
  • Database Connections
  • Calculating Averages
  • Prescriptive Analysis
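
As a rough illustration of how the "Database Connections" and "Calculating Averages" items could fit together, here is a minimal sketch that loads reviews into a local SQLite database and averages the overall rating per company. It assumes SQLite as the local database and a table named reviews with an overall_ratings column; the function name, the table layout, and the toy rows (company_a, company_b) are placeholders for illustration, not values from the Kaggle dataset and not necessarily what this project uses.

```python
import sqlite3


def average_rating_by_company(rows):
    """Load (company, overall_rating) pairs into SQLite and average them per company.

    Assumption: an in-memory SQLite database stands in for the local database.
    """
    conn = sqlite3.connect(":memory:")
    try:
        # Hypothetical table layout; the column is named after "overall-ratings".
        conn.execute("CREATE TABLE reviews (company TEXT, overall_ratings REAL)")
        conn.executemany("INSERT INTO reviews VALUES (?, ?)", rows)
        cursor = conn.execute(
            "SELECT company, AVG(overall_ratings) FROM reviews "
            "GROUP BY company "
            "ORDER BY AVG(overall_ratings) DESC, company"
        )
        return cursor.fetchall()
    finally:
        conn.close()


# Toy placeholder rows, NOT taken from the real dataset.
toy_rows = [
    ("company_a", 5.0),
    ("company_a", 3.0),
    ("company_b", 4.0),
    ("company_b", 5.0),
]
averages = average_rating_by_company(toy_rows)
for company, mean_rating in averages:
    print(company, mean_rating)
```

Sorting by the average in descending order puts the highest-rated company first, which is the kind of ranking a prescriptive recommendation could start from.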

Data Cleaning

I cleaned the dataset in Excel, dropping the columns that are not pertinent to the types of analysis I will perform.

The revised content of the dataset contains the following columns:

  • company
  • pros
  • cons
  • overall-ratings
  • work-balance-stars
  • culture-values-stars
  • career-opportunities-stars
  • comp-benefit-stars
  • senior-management-stars
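
The cleaning itself was done in Excel, but the same column selection could be reproduced in code. The sketch below is one possible way to do it with Python's standard csv module, assuming the data is available as a CSV file whose header uses the column names listed above. The helper keep_columns, the extra-column header, and the single sample row are hypothetical and exist only to show a column being dropped.

```python
import csv
import io

# Columns retained after cleaning, as listed above.
KEPT_COLUMNS = [
    "company",
    "pros",
    "cons",
    "overall-ratings",
    "work-balance-stars",
    "culture-values-stars",
    "career-opportunities-stars",
    "comp-benefit-stars",
    "senior-management-stars",
]


def keep_columns(csv_file, columns=KEPT_COLUMNS):
    """Read CSV rows and keep only the requested columns, in the given order."""
    reader = csv.DictReader(csv_file)
    return [{name: row[name] for name in columns} for row in reader]


# Placeholder input, NOT real review data; "extra-column" is a hypothetical
# column standing in for whatever was dropped during cleaning.
sample = io.StringIO(
    "company,extra-column,pros,cons,overall-ratings,work-balance-stars,"
    "culture-values-stars,career-opportunities-stars,comp-benefit-stars,"
    "senior-management-stars\n"
    "company_a,dropped,placeholder pro,placeholder con,4.0,3.0,4.0,4.0,5.0,3.0\n"
)
cleaned = keep_columns(sample)
print(list(cleaned[0].keys()))
```

Note that csv.DictReader returns every value as a string, so the star ratings would still need converting to numbers before any averaging or regression.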