Segment customers of Arvato Financial Solutions into distinct categories using PCA and K-Means Clustering

peter-ohara/customer-segmentation

customer-segmentation

Segment customers of Arvato Financial Solutions into distinct categories using PCA and K-Means Clustering

This project uses Principal Component Analysis (PCA) and K-Means clustering to look for similarities among customers of Arvato Financial Solutions, then uses those similarities to segment the customers into distinct categories. The segmentation is intended to help the business make more informed marketing and product decisions.
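As a rough illustration of the approach described above, the following is a minimal sketch that chains feature scaling, PCA, and K-Means using scikit-learn. It is not the code from this repository: the function name `segment`, the choice of scikit-learn, the scaling step, the values of `n_components` and `n_clusters`, and the synthetic input are all assumptions made for the example.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.decomposition import PCA
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler


def segment(features, n_components=5, n_clusters=4, seed=0):
    """Scale features, reduce them with PCA, then cluster with K-Means.

    All parameter defaults are illustrative assumptions, not values
    taken from the project.
    """
    pipeline = make_pipeline(
        StandardScaler(),  # put every feature on a comparable scale
        PCA(n_components=n_components, random_state=seed),
        KMeans(n_clusters=n_clusters, n_init=10, random_state=seed),
    )
    # fit_predict fits each step in order and returns one cluster label per row
    labels = pipeline.fit_predict(features)
    return pipeline, labels


# Synthetic stand-in for the private demographic data (500 rows x 20 features)
rng = np.random.default_rng(0)
demo_features = rng.normal(size=(500, 20))

pipeline, labels = segment(demo_features)
cluster_sizes = np.bincount(labels, minlength=4)
print(cluster_sizes)
```

In practice the number of principal components and clusters would be chosen by inspecting explained variance and a cluster-quality measure rather than fixed up front as they are here.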

Data

The data files for this project are private to Arvato Financial Solutions and could not be included in this public repository. They consist of:

  • Udacity_AZDIAS_Subset.csv: Demographic data for the general population of Germany; 891211 persons (rows) x 85 features (columns).
  • Udacity_CUSTOMERS_Subset.csv: Demographic data for customers of a mail-order company; 191652 persons (rows) x 85 features (columns).
  • Data_Dictionary.md: Information file about the features in the provided datasets.
  • AZDIAS_Feature_Summary.csv: Summary of feature attributes for demographic data.
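Because the files are not distributed with the repository, a small loading check can confirm that a locally obtained copy matches the dimensions listed above. The sketch below assumes pandas and a semicolon delimiter; the helper name `load_demographics`, the delimiter, and the column names in the sample are hypothetical, so adjust them to the actual files.

```python
import io

import pandas as pd


def load_demographics(source, expected_columns=85, sep=";"):
    """Read a demographics CSV and verify its column count.

    `sep=";"` is an assumption about the file format; the expected
    column count of 85 comes from the dataset description above.
    """
    frame = pd.read_csv(source, sep=sep)
    if frame.shape[1] != expected_columns:
        raise ValueError(
            f"expected {expected_columns} columns, found {frame.shape[1]}"
        )
    return frame


# Tiny in-memory stand-in with hypothetical column names, since the real
# data is private. With the real file you would pass its path instead.
sample = io.StringIO("FEATURE_A;FEATURE_B\n1;2\n3;1\n")
demo = load_demographics(sample, expected_columns=2)
print(demo.shape)
```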
