Please rotate your device to landscape mode to view the charts.

Key Findings

Machine Learning Improves Default Prediction

Random Forest model outperforms traditional Logit models in predicting mortgage defaults, with a 5.1% improvement in precision score and 14.3% improvement in R-squared value

Unequal Distribution of Benefits

Benefits of improved prediction accuracy disproportionately favor White non-Hispanic and Asian borrowers, while Black and Hispanic borrowers see less favorable outcomes

Higher Rate Dispersion for Minorities

Machine learning technology leads to greater dispersion in interest rates, particularly affecting minority borrowers with increases of 8-10 basis points vs 5-6 for non-minority borrowers

Model Performance Comparison

  • Random Forest model shows superior performance across all metrics
  • Greatest improvement seen in R-squared value (14.3% increase)
  • ROC-AUC shows 0.8% improvement with Random Forest

Acceptance Rates by Race/Ethnicity

  • Clear disparities in acceptance rates across racial groups
  • Asian and White non-Hispanic borrowers have highest acceptance rates
  • Black borrowers face lowest acceptance rates under both models

Interest Rate Dispersion Effects

  • Interest rate dispersion increases progressively across racial groups
  • Black borrowers face 43% higher rate dispersion than Asian borrowers
  • Pattern suggests amplified pricing uncertainty for minority borrowers

Contribution and Implications

  • First comprehensive analysis of distributional effects of machine learning adoption in mortgage lending
  • Demonstrates that technological advancement can amplify existing inequalities despite improving overall prediction accuracy
  • Highlights need for careful consideration of fairness in deployment of new lending technologies

Data Sources

  • Performance comparison chart based on Table III - comparing model performance metrics
  • Acceptance rates visualization derived from Table VII - showing equilibrium outcomes across racial groups
  • Interest rate dispersion chart constructed from Table VII columns 5-6 - standard deviation of SATO data