Skip to content

Commit

Permalink
data
Browse files Browse the repository at this point in the history
  • Loading branch information
HyunjunA committed Nov 21, 2022
1 parent dd00f57 commit acfb7c8
Show file tree
Hide file tree
Showing 104 changed files with 86,249 additions and 0 deletions.
9 changes: 9 additions & 0 deletions data/datasets/pmlb_small/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,9 @@
# Benchmark data sets

This directory contains over 150 data sets for benchmarking supervised machine learning algorithms.

Each subdirectory corresponds to a separate data set, and will have a README file providing some basic information about the data set.

# High-level summary of data sets

[in progress]
80 changes: 80 additions & 0 deletions data/datasets/pmlb_small/allbp/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,80 @@
# allbp

## Summary Stats

#instances: 3772

#features: 29

#binary_features: 21

#integer_features: 8

#float_features: 0

Endpoint type: integer

#Classes: 3

Imbalance metric: 0.8755228428707819

## Feature Types

age:discrete

sex:discrete

on thyroxine:binary

query on thyroxine:binary

on antithyroid medication:binary

sick:binary

pregnant:binary

thyroid surgery:binary

I131 treatment:binary

query hypothyroid:binary

query hyperthyroid:binary

lithium:binary

goitre:binary

tumor:binary

hypopituitary:binary

psych:binary

TSH measured:binary

TSH:discrete

T3 measured:binary

T3:discrete

TT4 measured:binary

TT4:discrete

T4U measured:binary

T4U:discrete

FTI measured:binary

FTI:discrete

TBG measured:binary

TBG:binary

referral source:discrete

3,773 changes: 3,773 additions & 0 deletions data/datasets/pmlb_small/allbp/allbp.csv

Large diffs are not rendered by default.

80 changes: 80 additions & 0 deletions data/datasets/pmlb_small/allhyper/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,80 @@
# allhyper

## Summary Stats

#instances: 3771

#features: 29

#binary_features: 21

#integer_features: 8

#float_features: 0

Endpoint type: integer

#Classes: 4

Imbalance metric: 0.9301422508626843

## Feature Types

age:discrete

sex:discrete

on thyroxine:binary

query on thyroxine:binary

on antithyroid medication:binary

sick:binary

pregnant:binary

thyroid surgery:binary

I131 treatment:binary

query hypothyroid:binary

query hyperthyroid:binary

lithium:binary

goitre:binary

tumor:binary

hypopituitary:binary

psych:binary

TSH measured:binary

TSH:discrete

T3 measured:binary

T3:discrete

TT4 measured:binary

TT4:discrete

T4U measured:binary

T4U:discrete

FTI measured:binary

FTI:discrete

TBG measured:binary

TBG:binary

referral source:discrete

Loading

0 comments on commit acfb7c8

Please sign in to comment.