This scientific research paper presents an innovative approach based on deep reinforcement learning (DRL) to solve the algorithmic trading problem of determining the optimal trading position at any point in time during a trading activity in stock markets. It proposes a novel DRL trading strategy so as to maximise the resulting Sharpe ratio performance indicator on a broad range of stock markets. Denominated the Trading Deep Q-Network algorithm (TDQN), this new trading strategy is inspired from the popular DQN algorithm and significantly adapted to the specific algorithmic trading problem at hand. The training of the resulting reinforcement learning (RL) agent is entirely based on the generation of artificial trajectories from a limited set of stock market historical data. In order to objectively assess the performance of trading strategies, the research paper also proposes a novel, more rigorous performance assessment methodology. Following this new performance assessment approach, promising results are reported for the TDQN strategy.
翻译:这份科学研究文件提出了基于深层强化学习(DRL)的创新性方法,以解决在股票市场交易活动期间任何时候确定最佳贸易地位的算法交易问题,提出了新的DRL贸易战略,以便在广泛的股票市场上最大限度地实现由此产生的夏普比率业绩指标。这项新的贸易战略源于流行的DQN算法,并在很大程度上适应了手头的具体算法交易问题。因此产生的强化学习(RL)剂的培训完全基于从有限的一批股票市场历史数据中生成的人工轨迹。为了客观评估贸易战略的绩效,研究文件还提出了新的、更严格的绩效评估方法。根据这种新的业绩评估方法,为TDQN战略报告了有希望的成果。