Sophisticated machine learning (ML) models to inform trading in the financial sector create problems of interpretability and risk management. Seemingly robust forecasting models may behave erroneously in out of distribution settings. In 2020, some of the world's most sophisticated quant hedge funds suffered losses as their ML models were first underhedged, and then overcompensated. We implement a gradient-based approach for precisely stress-testing how a trading model's forecasts can be manipulated, and their effects on downstream tasks at the trading execution level. We construct inputs -- whether in changes to sentiment or market variables -- that efficiently affect changes in the return distribution. In an industry-standard trading pipeline, we perturb model inputs for eight S&P 500 stocks. We find our approach discovers seemingly in-sample input settings that result in large negative shifts in return distributions. We provide the financial community with mechanisms to interpret ML forecasts in trading systems. For the security community, we provide a compelling application where studying ML robustness necessitates that one capture an end-to-end system's performance rather than study a ML model in isolation. Indeed, we show in our evaluation that errors in the forecasting model's predictions alone are not sufficient for trading decisions made based on these forecasts to yield a negative return.
翻译:向金融部门贸易提供信息的精密机器学习模式(ML)在金融行业贸易中造成了解释性和风险管理问题。看起来强有力的预测模式可能在分销环境之外出现错误行为。2020年,世界一些最尖端的对冲基金由于ML模式最初被冲淡,然后过度补偿而蒙受了损失。我们采用了基于梯度的方法,精确地测试如何操纵贸易模式的预测,及其对贸易执行阶段下游任务的影响。我们建构了有效影响回报分配变化的投入 -- -- 无论是情绪变化还是市场变量 -- -- 。在行业标准贸易管道中,我们渗透了8个S & P 500股票的模型投入。我们发现我们的方法发现,似乎在模拟投入环境中发现,导致回报分配的大幅负转移。我们为金融界提供了解释贸易体系中ML预测的机制。对于安全界,我们提供了令人信服的应用方法,即研究ML稳健性要求人们能够捕捉到最终到系统的业绩,而不是单独研究ML模型。我们的方法发现,在孤立的工业标准交易模型中,我们发现这些预测中的反向收益预测是充分的。