Key Findings
Nonlinear Models Outperform
Nonlinear machine learning models significantly outperform linear models in predicting option returns, with gradient-boosted regression trees achieving out-of-sample R² of 2.26%
Profitable Trading Strategy
Machine learning predictions generate monthly returns of 2.04% even after accounting for transaction costs and margin requirements
Key Predictive Factors
Option contract characteristics and market frictions are the most important predictors, with implied volatility being the single most influential factor
Model Performance Comparison
- Nonlinear models (GBR, RF, Dart, N-En) consistently achieve positive R² values
- Linear models all produce negative R² values
- Nonlinear ensemble (N-En) achieves the highest predictive power at 2.45%
Trading Strategy Returns
- N-En strategy remains profitable even with high transaction costs
- Returns decrease as trading costs increase but remain positive
- N-En consistently outperforms L-En across all cost scenarios
Feature Group Importance
- Contract-based features are most important, accounting for 35% of predictive power
- Illiquidity and risk measures are second and third most important
- Traditional stock characteristics have relatively lower importance
Contribution and Implications
- First comprehensive study applying machine learning to predict individual option returns using both option and stock characteristics
- Demonstrates importance of nonlinear interactions in option markets
- Provides practical framework for implementing profitable option trading strategies
- Shows that option return predictability is driven by informational frictions and market inefficiencies
Data Sources
- Model Performance Comparison chart based on Figure 1 and Table 2 from the paper
- Trading Strategy Returns visualization constructed using data from Table 6
- Feature Group Importance chart created using data from Figure 7 and related discussion