Russell Peavey
An example workflow demonstrating use of pandas for data processing and sklearn for modeling and evaluation.
Row ordered anonymized clinical patient data of 4000 patients with ~1500 different diagnosis codes across provider visits from MIMIC-III
- SVMLight format of diagnosis code counts, min-maxed normalized, in patient observation window
- Decision Tree
- Logistic Regression
- SVM
K-fold and randomized K-fold methods.
- F1 score
- Accuracy
- Precision
- Recall