INSURANCE-CROSS-SELLING-PREDICTION

Data 200

Quick Links

Overview

Repository Structure

Getting Started

Installation

Running Insurance-Cross-Selling-Prediction

Contributing

Acknowledgments

Overview

The Health Insurance Cross Sell Prediction project aimed to develop a classification model that predicts whether health insurance policyholders are interested in subscribing to vehicle insurance. The project was led by Team Data 200 from Rakamin Academy. The dataset contains demographic and behavioral information about health insurance customers, together with their interest in vehicle insurance. The team conducted an exploratory data analysis (EDA) to examine the relationships between the features and customers' interest in vehicle insurance. The EDA showed that middle-aged adults had the highest interest in vehicle insurance, with interest rates of approximately 17.3% for males and 18.7% for females. It also showed that customers who had not previously subscribed to vehicle insurance had a higher interest rate of 22.5%, while those who had previously subscribed showed minimal interest. These insights helped explain customer behavior and formed the basis for the subsequent modeling phase.
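The segment-level interest rates described above amount to a grouped mean of the target column. The following is a minimal, dependency-free sketch of that calculation, not the team's notebook code. The column names `Previously_Insured` and `Response` are assumptions about the dataset schema, and the rows are invented toy values rather than project data.

```python
from collections import defaultdict

# Toy rows invented for illustration only (not taken from the project's dataset).
# Column names are assumed, not confirmed by this README.
rows = [
    {"Previously_Insured": 0, "Response": 1},
    {"Previously_Insured": 0, "Response": 0},
    {"Previously_Insured": 0, "Response": 0},
    {"Previously_Insured": 0, "Response": 1},
    {"Previously_Insured": 1, "Response": 0},
    {"Previously_Insured": 1, "Response": 0},
]


def interest_rate_by(rows, key, target="Response"):
    """Return {group value: share of rows in that group with target == 1}."""
    totals = defaultdict(int)
    positives = defaultdict(int)
    for row in rows:
        totals[row[key]] += 1
        positives[row[key]] += row[target]
    return {group: positives[group] / totals[group] for group in totals}


rates = interest_rate_by(rows, "Previously_Insured")
print(rates)
```

The same helper can be pointed at any categorical column (for example a gender or age-category column) to reproduce the other breakdowns.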

During preprocessing, the team cleansed the data by handling missing values and duplicates. Feature transformation covered class-imbalance handling, feature encoding, and outlier treatment. Feature extraction created new features: age categories, the interaction between vehicle damage and age, and the ratio of annual premium to age. Feature selection used Pearson correlation and mutual information to identify the most influential features. The selected features were previously insured, region code, vehicle age, vehicle damage, policy sales channel, age category, and the vehicle damage and age interaction. These features were then used in the modeling phase.
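As an illustration of the three extracted features, here is a minimal sketch under stated assumptions. The README does not give the age cut-offs or the exact form of the interaction term, so the bin boundaries, the "damage flag times age" encoding, the output feature names, and the input column names (`Age`, `Vehicle_Damage`, `Annual_Premium`) are all hypothetical.

```python
def age_category(age):
    """Bucket an age into a coarse group. The cut-offs are hypothetical."""
    if age < 30:
        return "young"
    if age < 50:
        return "middle_aged"
    return "senior"


def extract_features(row):
    """Return a copy of the row with three derived features added."""
    out = dict(row)
    out["Age_Category"] = age_category(row["Age"])
    # Assumed interaction encoding: 0 when there is no past vehicle damage,
    # otherwise the customer's age.
    damaged = 1 if row["Vehicle_Damage"] == "Yes" else 0
    out["Vehicle_Damage_Age"] = damaged * row["Age"]
    # Ratio of annual premium to age.
    out["Premium_Age_Ratio"] = row["Annual_Premium"] / row["Age"]
    return out


# Invented example customer, for illustration only.
customer = {"Age": 40, "Vehicle_Damage": "Yes", "Annual_Premium": 30000.0}
features = extract_features(customer)
print(features)
```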

The modeling phase compared the performance of several machine learning models, including XGBoost, Random Forest, AdaBoost, LightGBM, CatBoost, and KNN. The models were evaluated on accuracy, precision, recall, F1 score, and mean ROC-AUC, and they performed similarly across these metrics. The feature importance analysis showed that the "previously insured" feature had the strongest influence on customer interest in vehicle insurance. Based on these findings, the team recommended prioritizing customers who had not previously subscribed to vehicle insurance, focusing on region code 28, and targeting customers with a medium premium-to-age ratio in marketing efforts. The team's simulation showed that using the predictive model for marketing increased the conversion rate by 47.1%, leading to a significant improvement in revenue and a reduction in customer acquisition costs.
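For readers unfamiliar with the threshold-based metrics listed above, the sketch below computes accuracy, precision, recall, and F1 from binary labels in plain Python. It is illustrative only: the labels are invented, the function name is hypothetical, and the project notebooks may well rely on library implementations instead. ROC-AUC is omitted because it needs predicted scores rather than hard labels.

```python
def classification_metrics(y_true, y_pred):
    """Compute accuracy, precision, recall and F1 for binary 0/1 labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)

    accuracy = (tp + tn) / len(pairs)
    # Guard against division by zero when a class is never predicted or absent.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Invented labels: 1 = interested in vehicle insurance, 0 = not interested.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]
metrics = classification_metrics(y_true, y_pred)
print(metrics)
```

Because the target is imbalanced, precision and recall are usually more informative here than accuracy alone.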

Documentation Details

Repository Structure

└── Insurance-Cross-Selling-Prediction/
    └── Code
        ├── EDA - Data200.ipynb
        ├── Modeling
        │   ├── AdaBoostClassifier_Ujang.ipynb
        │   ├── Catboost_Arifin.ipynb
        │   ├── decision trees_Ramlan.ipynb
        │   ├── k_nearest_neighbors.ipynb
        │   ├── Lightgbm_Arifin.ipynb
        │   ├── LogisticRegression.ipynb
        │   ├── Random_Forest_iqbal.ipynb
        │   └── XGBoost_Arifin.ipynb
        ├── modeling - Data 200.ipynb
        └── preprocessing - Data200.ipynb

Getting Started

Requirements

Ensure you have the following dependencies installed on your system:

Jupyter Notebook with Python 3.9

Installation

Clone the Insurance-Cross-Selling-Prediction repository:

git clone https://github.com/Theofilusarifin/Insurance-Cross-Selling-Prediction

Change to the project directory:

cd Insurance-Cross-Selling-Prediction

Install the dependencies:

pip install -r requirements.txt

Running Insurance-Cross-Selling-Prediction

Use the following command to execute a notebook, replacing notebook.ipynb with the path to one of the notebooks under Code (for example, "Code/EDA - Data200.ipynb"):

jupyter nbconvert --execute notebook.ipynb
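To run several notebooks in sequence, the commands can be generated from a list of paths. The sketch below only builds and prints the commands. The notebook paths come from the repository structure above, but the order (EDA, then preprocessing, then modeling) is an assumption, and `NOTEBOOKS` and `nbconvert_command` are hypothetical names introduced for this example.

```python
import shlex

# Paths taken from the repository structure; the execution order is assumed.
NOTEBOOKS = [
    "Code/EDA - Data200.ipynb",
    "Code/preprocessing - Data200.ipynb",
    "Code/modeling - Data 200.ipynb",
]


def nbconvert_command(notebook):
    """Build the argument list that executes a notebook and saves it in place."""
    return ["jupyter", "nbconvert", "--to", "notebook",
            "--execute", "--inplace", notebook]


commands = [nbconvert_command(nb) for nb in NOTEBOOKS]
for cmd in commands:
    # shlex.join quotes the paths that contain spaces, so each printed line
    # can be pasted into a POSIX shell.
    print(shlex.join(cmd))
```

To run the notebooks from Python instead of printing the commands, each list can be passed to `subprocess.run(cmd, check=True)`.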

Tests

To execute tests, run:

pytest notebook_test.py

Contributing

Contributions are welcome! Here are several ways you can contribute:

Submit Pull Requests: Review open PRs, and submit your own PRs.
Join the Discussions: Share your insights, provide feedback, or ask questions.
Report Issues: Report bugs or log feature requests for Insurance-Cross-Selling-Prediction.

Contributing Guidelines

Fork the Repository: Start by forking the project repository to your GitHub account.
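Clone Locally: Clone the forked repository to your local machine using a Git client. In the URL below, replace the account name with your own GitHub username so that you clone your fork rather than the original repository.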
```
git clone https://github.com/Theofilusarifin/Insurance-Cross-Selling-Prediction
```
Create a New Branch: Always work on a new branch, giving it a descriptive name.
```
git checkout -b new-feature-x
```
Make Your Changes: Develop and test your changes locally.
Commit Your Changes: Commit with a clear message describing your updates.
```
git commit -m 'Implemented new feature x.'
```
Push to GitHub: Push the changes to your forked repository.
```
git push origin new-feature-x
```
Submit a Pull Request: Create a PR against the original project repository. Clearly describe the changes and their motivations.

Once your PR is reviewed and approved, it will be merged into the main branch.
