Machine learning's wheelhouse is out-of-sample prediction, but these powerful methods can also be deployed in service of causal inference. This two-session workshop introduces the basics of machine learning prediction methods, including lasso and random forests, and shows how they feature in causal inference methods such as double machine learning (DML) and post-double selection lasso (PDS lasso). The course covers the conceptual and theoretical basis for the methods and also gets into the nuts and bolts of implementation in Python and Stata using real-world data.
-
What’s your question? (prediction vs. causality)
-
Standard tools of causal inference
- gold standard: RCT
- multiple regression
-
ML prediction tools
- prediction objective
- bias-variance tradeoff
- lasso
- random forest
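To make the prediction objective and the bias-variance tradeoff concrete, here is a minimal Python sketch that compares held-out mean squared error for OLS, lasso, and a random forest. It is an illustration only, not the workshop's own material: the data are simulated, and the sample size, number of features, and tuning choices are assumptions made for this example.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.linear_model import LassoCV, LinearRegression
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split

# Simulated data (an assumption for illustration): 100 candidate
# predictors, of which only the first three affect the outcome.
rng = np.random.default_rng(0)
n, p = 400, 100
X = rng.normal(size=(n, p))
y = 2 * X[:, 0] - X[:, 1] + np.sin(X[:, 2]) + rng.normal(size=n)

# Hold out half the sample so performance is judged out of sample.
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.5, random_state=0)

models = {
    "ols": LinearRegression(),                     # no regularization
    "lasso": LassoCV(cv=5),                        # penalty chosen by cross-validation
    "random_forest": RandomForestRegressor(n_estimators=200, random_state=0),
}

# Fit on the training half, evaluate on the held-out half.
test_mse = {
    name: mean_squared_error(y_te, model.fit(X_tr, y_tr).predict(X_te))
    for name, model in models.items()
}

for name, mse in test_mse.items():
    print(f"{name}: held-out MSE = {mse:.2f}")
```

With many predictors relative to the training sample, unregularized OLS tends to overfit, while lasso accepts some bias in exchange for lower variance.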
-
Where does ML prediction fit within causal inference?
- flexibly adjust for covariates
- estimate heterogeneous treatment effects
-
Post-Double Selection Lasso
- Theory
- Implementation
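As a rough illustration of the PDS lasso steps, the Python sketch below runs a lasso of the outcome on the controls, a lasso of the treatment on the controls, and then OLS of the outcome on the treatment plus the union of selected controls. It rests on several assumptions: the data are simulated, the function name `pds_lasso` is hypothetical, and the penalty is chosen by cross-validation (`LassoCV`) rather than by the theory-driven penalty usually paired with this method.

```python
import numpy as np
from sklearn.linear_model import LassoCV


def pds_lasso(y, d, X):
    """Hypothetical helper: post-double selection estimate of the effect of d on y."""
    # Step 1: lasso of the outcome on the controls.
    sel_y = np.flatnonzero(LassoCV(cv=5).fit(X, y).coef_)
    # Step 2: lasso of the treatment on the controls.
    sel_d = np.flatnonzero(LassoCV(cv=5).fit(X, d).coef_)
    # Step 3: OLS of y on d and the union of controls selected in either step.
    keep = np.union1d(sel_y, sel_d)
    Z = np.column_stack([np.ones(len(y)), d, X[:, keep]])
    beta, *_ = np.linalg.lstsq(Z, y, rcond=None)
    return beta[1], keep  # coefficient on d, indices of selected controls


# Simulated data (an assumption for illustration): X[:, 0] confounds
# the relationship between d and y, and the true effect of d is 0.5.
rng = np.random.default_rng(0)
n, p = 500, 50
X = rng.normal(size=(n, p))
d = X[:, 0] + 0.5 * X[:, 1] + rng.normal(size=n)
y = 0.5 * d + X[:, 0] + X[:, 2] + rng.normal(size=n)

theta_pds, selected = pds_lasso(y, d, X)
print(f"PDS lasso estimate: {theta_pds:.2f}; controls kept: {len(selected)}")
```

Selecting controls from both equations guards against dropping a variable that predicts the treatment strongly but the outcome only weakly. The simulated controls are already on a common scale; with real data they would normally be standardized before the lasso steps.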
-
Double Machine Learning
- Theory
- Implementation
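The Python sketch below illustrates DML for a partially linear model: random forests predict the outcome and the treatment from the covariates using cross-fitting, and the effect is estimated by regressing the outcome residuals on the treatment residuals. It is a minimal sketch under stated assumptions: the data are simulated, the function name `dml_plr` is hypothetical, and the choice of learner, number of folds, and tuning values are illustrative rather than recommended.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import KFold, cross_val_predict


def dml_plr(y, d, X, n_folds=5, seed=0):
    """Hypothetical helper: cross-fitted DML for a partially linear model."""
    cv = KFold(n_splits=n_folds, shuffle=True, random_state=seed)

    def make_learner():
        return RandomForestRegressor(
            n_estimators=100, min_samples_leaf=5, random_state=seed
        )

    # Cross-fitting: each observation's prediction comes from a model
    # trained on the other folds only.
    y_hat = cross_val_predict(make_learner(), X, y, cv=cv)
    d_hat = cross_val_predict(make_learner(), X, d, cv=cv)

    # Residual-on-residual regression (no intercept).
    y_res, d_res = y - y_hat, d - d_hat
    theta = np.sum(d_res * y_res) / np.sum(d_res ** 2)

    # Standard error based on the orthogonal score.
    psi = d_res * (y_res - theta * d_res)
    se = np.sqrt(np.mean(psi ** 2) / len(y)) / np.mean(d_res ** 2)
    return theta, se


# Simulated data (an assumption for illustration): covariates affect
# both treatment and outcome nonlinearly; the true effect of d is 0.5.
rng = np.random.default_rng(0)
n = 1000
X = rng.normal(size=(n, 5))
d = np.sin(X[:, 0]) + 0.5 * X[:, 1] + rng.normal(size=n)
y = 0.5 * d + np.cos(X[:, 0]) + 0.5 * X[:, 1] + rng.normal(size=n)

theta_dml, se_dml = dml_plr(y, d, X)
print(f"DML estimate: {theta_dml:.2f} (s.e. {se_dml:.2f})")
```

Cross-fitting keeps each observation out of the sample used to train the models that predict it, which limits the overfitting bias that flexible learners would otherwise pass on to the effect estimate.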
The following introductory readings on machine learning and causal inference are listed in a suggested reading order:
Kleinberg, Ludwig, Mullainathan, and Obermeyer (2015)
Mullainathan and Spiess (2017)