Price movement forecasting aims at predicting the future trends of financial assets based on the current market conditions and other relevant information. Recently, machine learning(ML) methods have become increasingly popular and achieved promising results for price movement forecasting in both academia and industry. Most existing ML solutions formulate the forecasting problem as a classification(to predict the direction) or a regression(to predict the return) problem in the entire set of training data. However, due to the extremely low signal-to-noise ratio and stochastic nature of financial data, good trading opportunities are extremely scarce. As a result, without careful selection of potentially profitable samples, such ML methods are prone to capture the patterns of noises instead of real signals. To address the above issues, we propose a novel framework-LARA(Locality-Aware Attention and Adaptive Refined Labeling), which contains the following three components: 1)Locality-aware attention automatically extracts the potentially profitable samples by attending to their label information in order to construct a more accurate classifier on these selected samples. 2)Adaptive refined labeling further iteratively refines the labels, alleviating the noise of samples. 3)Equipped with metric learning techniques, Locality-aware attention enjoys task-specific distance metrics and distributes attention on potentially profitable samples in a more effective way. To validate our method, we conduct comprehensive experiments on three real-world financial markets: ETFs, the China's A-share stock market, and the cryptocurrency market. LARA achieves superior performance compared with the time-series analysis methods and a set of machine learning based competitors on the Qlib platform. Extensive ablation studies and experiments demonstrate that LARA indeed captures more reliable trading opportunities.
翻译:价格流动预测旨在根据当前市场条件和其他相关信息预测金融资产的未来趋势。最近,在学术界和行业中,机器学习(ML)方法越来越受欢迎,在价格流动预测方面,在学术界和行业中,机器学习(ML)方法已经取得了令人乐观的结果。大多数现有的ML解决方案将预测问题表述为整个培训数据组的分类(预测方向)或回归(预测回报)问题。然而,由于信号对噪音比率极低,金融数据具有随机性,良好的交易机会极为稀少。因此,在不认真选择潜在盈利的样本的情况下,这类ML方法很容易获取噪音的格局,而不是真实信号。为了解决上述问题,我们提出了一个新的框架-LARA(当地-意识关注和适应性更新标签),它包含以下三个组成部分:(1) 地方认识(认识)通过关注其标签信息自动提取潜在盈利的样本,从而在这些选定的样本上构建一个更准确的分类。(2) 精确地改进标签,进一步反复改进标签,降低成本的规律关系,而不是真正的信号。我们要解决成本的噪音,在市场中学习一个更高的路路标。 平台平台 3 学习一个潜在的路标。