Skip to content

Latest commit

 

History

History
21 lines (11 loc) · 903 Bytes

README.md

File metadata and controls

21 lines (11 loc) · 903 Bytes

Safe-T:

Analysed malware classifier datasets to advise electric vehicle (EV) companies for reducing malfunction rates within their EVs.

Overview:

  1. Data cleaning

  2. SMOTE analysis to identify the minority classes in order to randomly oversample

  3. Association rule mining was then used to determine relationships between co-occurring variables.

  4. Data visualisations were built with Plotly.

  5. We used Logistic Regression, Random Forest, and Neural Networks to predict malfunction detection capabilities

    • Logistic Regression, Random Forest and Neural Networks were built twice with and without feature selection with Boruta.

    • K-fold cross validation was used to improve Logistic Regression performances while Grid Search Algorithm was used to improve Random Forest and Neural Network performances.

  6. All findings are consolidated on an interactive Tableau dashboard.