The dataset contains data about 150 stores of the FOODmart supermarket chain, one of the leading grocery chains in Australia.
It describes each store in terms of sales, gross profit, location, holidays, home delivery, parking spaces, number of staff, managers' information, and the cost of a basket of food items.
- Analyse the nature of the stores in this supermarket chain.
- Help the company make informed decisions about changes to its stores.
- Identify customer preferences by analysing the features that improve store sales.
- Data transformations
- Feature selection
- Handling outliers
- Data visualization
- Variable statistics summary
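The outlier-handling step can be sketched as follows. This is a minimal illustration that assumes the common 1.5 × IQR rule with clipping; the project may use a different rule. The `sales` values are made up and the function names are hypothetical.

```python
import numpy as np


def iqr_bounds(values, k=1.5):
    """Return the (low, high) fences of the k * IQR rule."""
    q1, q3 = np.percentile(values, [25, 75])
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


def clip_outliers(values, k=1.5):
    """Clip values that fall outside the IQR fences back to the fences."""
    low, high = iqr_bounds(values, k)
    return np.clip(values, low, high)


# Made-up sales figures in $m; the last value is an obvious outlier.
sales = np.array([4.2, 5.1, 4.8, 5.5, 4.9, 5.0, 19.7])
clipped = clip_outliers(sales)
print(clipped)
```

Clipping keeps all 150 rows in the dataset, which matters for a small sample; dropping the rows instead is an equally common choice.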
- Regression
Implement Multiple Linear Regression
Implement Simple Linear Regression
Plot the fitted line and errors of each model.
- Clustering (My Implementation)
Apply the elbow method and visualize it with the yellowbrick library to find the elbow point for the k value in k-means.
Use PCA to reduce dimensionality.
- Neural Network
Multi-Layer Perceptron Neural Network
Make predictions on new data and get the predicted sales
Look at the importance scores to detect the effect each feature has on sales.
- Association Rule
Detect relations between features, such as Sales $m and Number of Staff, Sales $m and Car Spaces, and Female managers and Advertising.
1- From Regression:
- Advertising expenses have the most positive impact on sales compared to "Wages $m", "Competitors", and "Basket:2014".
- The number of effective staff has a high positive impact on sales: each additional unit increases sales by about $886,800.
- The available car spaces also have a high positive impact on sales: each additional unit increases sales by about $646,300.
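To show how per-unit effects like these are read off a fitted model, here is a minimal sketch of multiple linear regression using ordinary least squares. It runs on synthetic data: the predictors, the coefficients in `true_coef`, and the helper `per_unit_dollars` are all made up for illustration and are not the project's actual model. It assumes the target is measured in $m, as the "Sales $m" column name suggests.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 150  # same number of rows as the dataset, but the values are synthetic

# Hypothetical predictors standing in for staff count and car spaces.
staff = rng.uniform(20, 80, n)
car_spaces = rng.uniform(10, 100, n)

# Made-up "true" effects in $m per unit, used only to generate the data.
true_intercept = 2.0
true_coef = np.array([0.05, 0.02])
sales_m = (true_intercept + true_coef[0] * staff
           + true_coef[1] * car_spaces + rng.normal(0, 0.1, n))

# Ordinary least squares: the column of ones estimates the intercept.
X = np.column_stack([np.ones(n), staff, car_spaces])
beta, *_ = np.linalg.lstsq(X, sales_m, rcond=None)
intercept, coef = beta[0], beta[1:]


def per_unit_dollars(coef_in_millions):
    """Convert a coefficient on a $m target into dollars per unit."""
    return coef_in_millions * 1_000_000


for name, c in zip(["staff", "car_spaces"], coef):
    print(f"{name}: {per_unit_dollars(c):,.0f} dollars per additional unit")
```

Under that assumption, a coefficient of 0.8868 on a $m target corresponds to the $886,800 per unit quoted above. Each coefficient describes the change in sales for one extra unit of that feature with the other features held fixed.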
2- From Clustering:
- Cluster 0 has the highest gross profit. This can help identify features that could improve the other clusters, such as offering home delivery services.
- Stores assigned to cluster 1 have high feature values, and the company can apply their strategies to improve the other stores.
- Cluster 2 stores have the lowest feature values and need more attention from the company's managers. The company can improve some of their features and replace their managers with more experienced ones.
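The clustering workflow behind these findings can be sketched roughly as below. This is not the project's own k-means implementation: it uses scikit-learn's `KMeans`, prints plain inertia values in place of the yellowbrick elbow plot, and runs on synthetic blobs instead of the store data. Choosing k = 3 mirrors the three clusters discussed above, and using PCA only for a 2-D projection is an assumption.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for the store features (150 rows, 5 numeric columns).
X, _ = make_blobs(n_samples=150, n_features=5, centers=3, random_state=0)
X_scaled = StandardScaler().fit_transform(X)

# Elbow method: inertia (within-cluster sum of squares) for k = 1..8.
ks = list(range(1, 9))
inertias = [
    KMeans(n_clusters=k, n_init=10, random_state=0).fit(X_scaled).inertia_
    for k in ks
]
for k, inertia in zip(ks, inertias):
    print(f"k={k}: inertia={inertia:.1f}")

# Final model with k = 3.
kmeans = KMeans(n_clusters=3, n_init=10, random_state=0)
labels = kmeans.fit_predict(X_scaled)

# Project the scaled features onto two principal components for plotting.
X_2d = PCA(n_components=2).fit_transform(X_scaled)

# Cluster profile: mean of each original feature per cluster.
profiles = np.vstack([X[labels == c].mean(axis=0) for c in range(3)])
print(profiles.round(2))
```

Comparing the rows of `profiles` is one way to arrive at statements such as "cluster 0 has the highest gross profit": each row summarises the typical store in that cluster.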
3- From Neural Network:
- The neural network performed reasonably well, with relatively low MSE, RMSE, MAE, and MAPE and a high explained variance.
- This model can be used to predict the sales of a store with specific characteristics and help the supermarket chain make informed decisions about opening a new store.
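As a rough illustration of how these metrics and predictions can be produced, the sketch below trains a scikit-learn `MLPRegressor` on synthetic data. The layer size, solver, train/test split, and the use of permutation importance for the importance scores are all assumptions, not the project's actual configuration.

```python
import numpy as np
from sklearn.inspection import permutation_importance
from sklearn.metrics import (
    explained_variance_score,
    mean_absolute_error,
    mean_absolute_percentage_error,
    mean_squared_error,
)
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPRegressor
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic features and target (not the FOODmart data).
rng = np.random.default_rng(0)
X = rng.uniform(0, 1, size=(150, 4))
y = 5.0 + X @ np.array([3.0, 2.0, -1.0, 0.5]) + rng.normal(0, 0.1, 150)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)

# Scale inputs, then fit a small multi-layer perceptron.
model = make_pipeline(
    StandardScaler(),
    MLPRegressor(hidden_layer_sizes=(16,), solver="lbfgs",
                 max_iter=2000, random_state=0),
)
model.fit(X_train, y_train)
y_pred = model.predict(X_test)

mse = mean_squared_error(y_test, y_pred)
metrics = {
    "MSE": mse,
    "RMSE": float(np.sqrt(mse)),
    "MAE": mean_absolute_error(y_test, y_pred),
    "MAPE": mean_absolute_percentage_error(y_test, y_pred),
    "Explained variance": explained_variance_score(y_test, y_pred),
}
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")

# Predict sales for a new, hypothetical store.
new_store = np.array([[0.5, 0.5, 0.5, 0.5]])
predicted_sales = model.predict(new_store)

# Permutation importance: drop in score when one feature is shuffled.
result = permutation_importance(model, X_test, y_test,
                                n_repeats=10, random_state=0)
importances = result.importances_mean
```

Evaluating on a held-out test set, as done here, is what makes "relatively low" error figures meaningful; metrics computed on the training data alone would be optimistic.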
4- From Association Rule:
- Sales and the number of effective staff are strongly related and have a mutual effect on each other.
- Sales and the number of available parking spaces are strongly related.
- Stores with female managers are more likely to have low advertising expenses.
- Stores with a high number of competing supermarkets are expected to have low profit.
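Statements like these are usually backed by the support, confidence, and lift of a rule. The sketch below computes the three measures by hand for a single rule on made-up boolean flags. The function name `rule_metrics` and the data are hypothetical, and numeric columns such as sales would first need to be binned into categories (for example "high" and "low"), which is assumed to have happened already.

```python
def rule_metrics(antecedent, consequent):
    """Return (support, confidence, lift) for the rule antecedent -> consequent.

    Both arguments are equal-length sequences of booleans, one per store.
    Assumes the antecedent and the consequent each occur at least once.
    """
    n = len(antecedent)
    n_a = sum(antecedent)
    n_b = sum(consequent)
    n_both = sum(a and b for a, b in zip(antecedent, consequent))

    support = n_both / n          # share of stores where both hold
    confidence = n_both / n_a     # share of antecedent stores where consequent holds
    lift = confidence / (n_b / n) # confidence relative to the consequent's base rate
    return support, confidence, lift


# Made-up flags for 10 hypothetical stores (not the FOODmart data).
female_manager = [True, True, True, True, False,
                  False, False, False, False, False]
low_advertising = [True, True, True, False, True,
                   False, False, False, False, False]

support, confidence, lift = rule_metrics(female_manager, low_advertising)
print(f"support={support:.2f}, confidence={confidence:.2f}, lift={lift:.3f}")
```

A lift above 1 means the consequent is more common among stores matching the antecedent than in the chain as a whole, which is the sense in which two features are "related" here. It indicates co-occurrence, not causation.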
- Large parking spaces
- High number of full-time staff
- Low prices of main basket items
- Home delivery
- Opens on Sundays