FinTechAI.org.uk

(Re-)imag(in)ing Price Trends
Journal of Finance, 2023

Jiang, Jingwen; Kelly, Bryan; Xiu, Dacheng
We reconsider trend-based predictability by employing flexible learning methods to identify price patterns that are highly predictive of returns, as opposed to testing predefined patterns like momentum or reversal. Our predictor data are stock-level price charts, allowing us to extract the most predictive price patterns using machine learning image analysis techniques. These patterns differ significantly from commonly analyzed trend signals, yield more accurate return predictions, enable more profitable investment strategies, and demonstrate robustness across specifications. Remarkably, they exhibit context independence, as short-term patterns perform well on longer time scales, and patterns learned from U.S. stocks prove effective in international markets.

Artificial Intelligence, Education, and Entrepreneurship
Journal of Finance, 2024

Gofman, Michael; Jin, Zhao
We document an unprecedented brain drain of Artificial Intelligence (AI) professors from universities from 2004 to 2018. We find that students from the affected universities establish fewer AI startups and raise less funding. The brain-drain effect is significant for tenured professors, professors from top universities, and deep-learning professors. Additional evidence suggests that unobserved city- and university-level shocks are unlikely to drive our results. We consider several economic channels for the findings. The most consistent explanation is that professors' departures reduce startup founders' AI knowledge, which we find is an important factor for successful startup formation and fundraising.

Regulatory Arbitrage or Random Errors? Implications of Race Prediction Algorithms in Fair Lending Analysis
Journal of Financial Economics, 2024

Greenwald, Daniel L.; Howell, Sabrina T.; Li, Cangyuan; Yimfor, Emmanuel
When race is not directly observed, regulators and analysts commonly predict it using algorithms based on last name and address. In small business lending--where regulators assess fair lending law compliance using the Bayesian Improved Surname Geocoding (BISG) algorithm--we document large prediction errors among Black Americans. The errors bias measured racial disparities in loan approval rates downward by 43%, with greater bias for traditional vs. fintech lenders. Regulation using self-identified race would increase lending to Black borrowers, but also shift lending toward affluent areas because errors correlate with socioeconomics. Overall, using race proxies in policymaking and research presents challenges.

Charting by Machines
Journal of Financial Economics, 2024

Murray, Scott; Xia, Yusen; Xiao, Houping
We test the efficient market hypothesis by using machine learning to forecast stock returns from historical performance. These forecasts strongly predict the cross-section of future stock returns. The predictive power holds in most subperiods and is strong among the largest 500 stocks. The forecasting function has important nonlinearities and interactions, is remarkably stable through time, and captures effects distinct from momentum, reversal, and extant technical signals. These findings question the efficient market hypothesis and indicate that technical analysis and charting have merit. We also demonstrate that machine learning models that perform well in optimization continue to perform well out-of-sample.

Missing Values Handling for Machine Learning Portfolios
Journal of Financial Economics, 2024

Chen, Andrew Y.; McCoy, Jack
We characterize the structure and origins of missingness for 159 cross-sectional return predictors and study missing value handling for portfolios constructed using machine learning. Simply imputing with cross-sectional means performs well compared to rigorous expectation-maximization methods. This stems from three facts about predictor data: (1) missingness occurs in large blocks organized by time, (2) cross-sectional correlations are small, and (3) missingness tends to occur in blocks organized by the underlying data source. As a result, observed data provide little information about missing data. Sophisticated imputations introduce estimation noise that can lead to underperformance if machine learning is not carefully applied.

The Virtue of Complexity in Return Prediction
Journal of Finance, 2024

Kelly, Bryan; Malamud, Semyon; Zhou, Kangying
Much of the extant literature predicts market returns with "simple" models that use only a few parameters. Contrary to conventional wisdom, we theoretically prove that simple models severely understate return predictability compared to "complex" models in which the number of parameters exceeds the number of observations. We empirically document the virtue of complexity in U.S. equity market return prediction. Our findings establish the rationale for modeling expected returns through machine learning.

Artificial Intelligence, Firm Growth, and Product Innovation
Journal of Financial Economics, 2024

Babina, Tania; Fedyk, Anastassia; He, Alex; Hodson, James
We study the use and economic impact of AI technologies. We propose a new measure of firm-level AI investments using employee resumes. Our measure reveals a stark increase in AI investments across sectors. AI-investing firms experience higher growth in sales, employment, and market valuations. This growth comes primarily through increased product innovation. Our results are robust to instrumenting AI investments using firms' exposure to universities' supply of AI graduates. AI-powered growth concentrates among larger firms and is associated with higher industry concentration. Our results highlight that new technologies like AI can contribute to growth and superstar firms through product innovation.

Informed Trading Intensity
Journal of Finance, 2024

Bogousslavsky, Vincent; Fos, Vyacheslav; Muravyev, Dmitriy
We train a machine learning method on a class of informed trades to develop a new measure of informed trading, informed trading intensity (ITI). ITI increases before earnings, mergers and acquisitions, and news announcements, and has implications for return reversal and asset pricing. ITI is effective because it captures nonlinearities and interactions between informed trading, volume, and volatility. This data-driven approach can shed light on the economics of informed trading, including impatient informed trading, commonality in informed trading, and models of informed trading. Overall, learning from informed trading data can generate an effective informed trading measure.

Goal Setting and Saving in the FinTech Era
Journal of Finance, 2024

Gargano, Antonio; Rossi, Alberto G.
We study the effectiveness of saving goals in increasing individuals' savings using data from a Fintech app. Using a difference-in-differences identification strategy that randomly assigns users into a group of beta testers who can set goals and a group of users who cannot, we find that setting goals increases individuals' savings rate. The increased savings within the app do not reduce savings outside the app. Moreover, goal setting helps those individuals previously identified as having the lowest propensity to save. Matching App user survey responses to their behavior highlights the relative merits of monitoring and concreteness channels in explaining our findings.

Lender Automation and Racial Disparities in Credit Access
Journal of Finance, 2024

Howell, Sabrina T.; Kuchler, Theresa; Snitkof, David; Stroebel, Johannes; Wong, Jun
Process automation reduces racial disparities in credit access by enabling smaller loans, broadening banks' geographic reach, and removing human biases from decision making. We document these findings in the context of the Paycheck Protection Program (PPP), where private lenders faced no credit risk but decided which firms to serve. Black-owned firms obtained PPP loans primarily from automated fintech lenders, especially in areas with high racial animus. After traditional banks automated their loan processing procedures, their PPP lending to Black-owned firms increased. Our findings cannot be fully explained by racial differences in loan application behaviors, preexisting banking relationships, firm performance, or fraud rates.