Please rotate your device to landscape mode to view the charts.

Background and Context

The Research Question

This study investigates whether machine learning techniques can predict individual stock option returns and identify which characteristics drive these predictions.

Data Coverage

The analysis examines over 12 million observations of U.S. equity options from 1996 to 2020, using 273 different option and stock characteristics as predictors.

Methodology

The researchers compare linear and nonlinear machine learning models to predict delta-hedged option returns, with particular focus on allowing for complex interactions between different predictive factors.

Superior Performance of Nonlinear vs Linear Models

  • Nonlinear models achieve more than twice the predictive power of linear models
  • The nonlinear ensemble achieves 2.04% out-of-sample R² compared to 0.9% for linear models
  • This demonstrates the importance of capturing complex relationships between predictive factors

High-Minus-Low Portfolio Returns Across Model Types

  • Trading strategy based on nonlinear model predictions generates significantly higher returns
  • Nonlinear ensemble achieves 2.04% monthly returns compared to 1.30% for linear ensemble
  • Returns remain significant even after accounting for transaction costs

Predictability Across Information Friction Quintiles

  • Predictability increases monotonically with information frictions
  • Highest information friction quintile shows 5.32% R² compared to near-zero for lowest quintile
  • Suggests predictability is driven by informational inefficiencies

Option Mispricing and Return Predictability

  • Higher levels of mispricing lead to greater predictability and returns
  • High mispricing portfolio generates 2.74% monthly returns vs 0.76% for low mispricing
  • Confirms that option return predictability is linked to market inefficiencies

Contribution and Implications

  • First comprehensive study applying machine learning to predict individual option returns using both option and stock characteristics
  • Demonstrates that nonlinear interactions between characteristics are crucial for predicting option returns
  • Shows that predictability is economically significant and remains robust after accounting for transaction costs
  • Provides evidence that option return predictability is driven by informational frictions and market inefficiencies

Data Sources

  • Model comparison chart based on Figure 1 and Table 2
  • Portfolio returns visualization based on Table 3
  • Feature importance chart based on Figure 7
  • Information friction analysis based on Figure 12
  • Mispricing effect visualization based on Table 9