More and more stock trading strategies are constructed using deep reinforcement learning (DRL) algorithms, but DRL methods originally widely used in the gaming community are not directly adaptable to financial data with low signal-to-noise ratios and unevenness, and thus suffer from performance shortcomings. In this paper, to capture the hidden information, we propose a DRL based stock trading system using cascaded LSTM, which first uses LSTM to extract the time-series features from stock daily data, and then the features extracted are fed to the agent for training, while the strategy functions in reinforcement learning also use another LSTM for training. Experiments in DJI in the US market and SSE50 in the Chinese stock market show that our model outperforms previous baseline models in terms of cumulative returns and Sharp ratio, and this advantage is more significant in the Chinese stock market, a merging market. It indicates that our proposed method is a promising way to build a automated stock trading system.
翻译:越来越多的股票交易战略是利用深度强化学习算法(DRL)构建的,但最初在赌博界广泛使用的DRL方法并不直接适应信号到噪音比率低和不均衡的金融数据,因此也存在性能缺陷。 在本文中,为了捕捉隐藏的信息,我们建议使用级联LSTM建立基于DRL的股票交易系统,该系统首先使用LSTM从股票每日数据中提取时间序列特征,然后将提取的特征提供给培训代理,而加强学习的战略功能也使用另一种LSTM进行培训。 美国市场的DJI实验和中国股票市场的SSE50实验表明,从累积收益和夏普率来看,我们的模型比以往基线模型要好,而这一优势在中国股票市场更为显著,即合并市场。 它表明,我们提出的方法对于建立自动化股票交易系统来说是很有希望的方法。