With the recent advancement in Deep Reinforcement Learning in the gaming industry, we are curious if the same technology would work as well for common quantitative financial problems. In this paper, we will investigate if an off-the-shelf library developed by OpenAI can be easily adapted to mean reversion strategy. Moreover, we will design and test to see if we can get better performance by narrowing the function space that the agent needs to search for. We achieve this through augmenting the reward function by a carefully picked penalty term.
翻译:随着最近赌博业深强化学习的进步,我们很想知道同样的技术是否能同样解决共同的量化金融问题。 在本文中,我们将调查开放会计师协会开发的现成图书馆能否很容易地被调整为反转战略。 此外,我们将设计和测试,看看我们能否通过缩小该代理商需要搜索的功能空间而取得更好的业绩。我们通过谨慎选择的处罚期来增加奖励功能来实现这一目标。