Successful quantitative investment usually relies on precise predictions of the future movement of the stock price. Recently, machine learning based solutions have shown their capacity to give more accurate stock prediction and become indispensable components in modern quantitative investment systems. However, the i.i.d. assumption behind existing methods is inconsistent with the existence of diverse trading patterns in the stock market, which inevitably limits their ability to achieve better stock prediction performance. In this paper, we propose a novel architecture, Temporal Routing Adaptor (TRA), to empower existing stock prediction models with the ability to model multiple stock trading patterns. Essentially, TRA is a lightweight module that consists of a set of independent predictors for learning multiple patterns as well as a router to dispatch samples to different predictors. Nevertheless, the lack of explicit pattern identifiers makes it quite challenging to train an effective TRA-based model. To tackle this challenge, we further design a learning algorithm based on Optimal Transport (OT) to obtain the optimal sample to predictor assignment and effectively optimize the router with such assignment through an auxiliary loss term. Experiments on the real-world stock ranking task show that compared to the state-of-the-art baselines, e.g., Attention LSTM and Transformer, the proposed method can improve information coefficient (IC) from 0.053 to 0.059 and 0.051 to 0.056 respectively. Our dataset and code used in this work are publicly available: https://github.com/microsoft/qlib/tree/main/examples/benchmarks/TRA.
翻译:成功的量化投资通常依赖于对未来股票价格变化的准确预测。最近,基于机器学习的解决方案显示它们有能力提供更准确的股票预测,并成为现代量化投资系统中不可或缺的组成部分。然而,现有方法背后的假设与股票市场中存在不同的交易模式不一致,这不可避免地限制了它们实现更好的股票预测绩效的能力。在本文件中,我们提议建立一个新颖的结构,即“时间路由调适(TRA)”,以赋予现有股票预测模型以模拟多种股票交易模式的能力。基本上,TRA是一个轻量级模块,由一套独立的预测器组成,用于学习多种模式,并成为向不同预测者发送样本的路由器。然而,由于缺乏明确的模式识别器,因此很难培训一个有效的基于交易模式的模式。为了应对这一挑战,我们进一步设计了一个基于最佳运输(Ot)的学习算法,以获得预测或分配的最佳样本,并通过一个辅助性损失术语有效地优化路由这种分配的路由器。在现实世界股票排名上进行实验,以学习多种模式以及将样本发送给不同预测的路径。 与目前使用的Slimbleg-al3/real-liglement数据库数据库数据库数据库数据库数据库数据库数据库数据库数据库数据库数据库数据库数据库数据库数据库数据库数据库数据库相比, 和目前使用的系统数据库数据库数据库数据库数据库数据库数据库数据库数据库数据库数据库可以改进。