This repository stores the codes for cleaning the data.
One of the challenges for data scientists is to transform the dirty data to clean data. In order to build a machine learning model, we need to form our data to what we want.
we have three fields which are:
- Skills
- Educations
- Experiences
For skills, data binning technique is used to categorize skills into categories below.
For educations, we divide people into four categories. The categories are below.
For experiences which are also our output and input, we have categories below.