Classification of patients as healthy or sick based on their medical data, including morphological test results, collected during screening for thyroid disease. The data comes from the "sick" dataset in the UCI Machine Learning Repository.

p-przybylek/sick_dataset_analysis

Sick dataset analysis

The project is being carried out as part of the course Fundamentals of Data Processing.

Note: The project itself is written in Polish.

Project Goal

The goal of the project is to classify a patient as healthy or sick based on patients' medical data, including morphological test results. The work falls into two parts: analysis of the data, and prediction on the data processed according to that analysis.

A similar project was previously carried out in SAS, so an additional goal is to reproduce those analyses in Python and compare selected results between the two. The repository also includes a report on the SAS project.

Dataset

The data on patients screened for thyroid disease comes from the UCI Machine Learning Repository and was provided in 1987 by the Garavan Institute and J. Ross Quinlan of the New South Wales Institute in Sydney, Australia. The project uses the "sick" dataset, which contains 3772 records (patients).
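As a rough illustration of a first loading step for data of this kind, the sketch below parses comma-separated patient records using only the Python standard library. It is not the project's own code. It assumes a layout in which missing values are written as `?` and the class label carries a trailing `.|<id>` suffix. The column names in `COLUMNS` and the rows in `SAMPLE` are hypothetical and shortened for illustration; they are not taken from the real dataset.

```python
import csv
import io

# Hypothetical sample rows in the assumed layout (made up for illustration).
SAMPLE = """\
41,F,f,1.3,negative.|3733
70,M,t,?,sick.|1442
?,F,f,2.1,negative.|2965
"""

# Hypothetical, shortened column list; the real file has many more attributes.
COLUMNS = ["age", "sex", "on_thyroxine", "TSH", "label"]


def parse_records(text):
    """Parse comma-separated rows into dicts, mapping '?' to None."""
    records = []
    for row in csv.reader(io.StringIO(text)):
        if not row:
            continue  # skip blank lines
        record = {
            name: (None if value == "?" else value)
            for name, value in zip(COLUMNS, row)
        }
        # Strip the assumed '.|<id>' suffix from the class label.
        if record["label"] is not None:
            record["label"] = record["label"].split(".|")[0]
        records.append(record)
    return records


def class_counts(records):
    """Count how many records fall into each class label."""
    counts = {}
    for record in records:
        counts[record["label"]] = counts.get(record["label"], 0) + 1
    return counts


records = parse_records(SAMPLE)
print(class_counts(records))
```

Counting the labels early is a cheap way to check class balance before any model is trained, since in a screening dataset the "sick" class can be expected to be the minority.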
