Please rotate your device to landscape mode to view the charts.

Key Findings

Double Machine Learning Framework

Introduces a novel approach combining Neyman-orthogonal scores and cross-fitting to obtain valid inferential statements about treatment effects when using machine learning methods

Robust Estimation Method

Provides asymptotically unbiased and normally distributed estimates of Average Treatment Effects (ATE) and Average Treatment Effects on the Treated (ATTE)

Sample Splitting Innovation

Develops a K-fold cross-fitting procedure that reduces overfitting and enables valid inference when using complex machine learning methods

Double Machine Learning Process

  • Data is split into K equal folds for cross-fitting
  • ML methods estimate nuisance parameters on auxiliary samples
  • Treatment effects estimated using Neyman-orthogonal scores

Uncertainty Quantification

  • Standard errors account for both sampling uncertainty and partition variation
  • Median-based approach provides robustness to outliers
  • Multiple partitions enable more accurate uncertainty estimates

Estimation Framework Components

  • Integration of machine learning with classic econometric theory
  • Combines efficiency of ML with valid statistical inference
  • Maintains asymptotic efficiency while allowing flexible ML methods

Contribution and Implications

  • Bridges gap between machine learning and causal inference in economics
  • Enables reliable inference with complex ML methods in treatment effect estimation
  • Provides practical framework for empirical researchers using modern ML techniques

Data Sources

  • Flow chart based on methodology description in Section I and II of the paper
  • Uncertainty quantification visualization derived from Section III discussion
  • Components chart reflects the theoretical framework presented in Sections I-II