Reinforcement learning (RL) has demonstrated great potential, but is currently full of overhyping and pipe dreams. We point to some difficulties with current research which we feel are endemic to the direction taken by the community. To us, the current direction is not likely to lead to "deployable" RL: RL that works in practice and can work in practical situations yet still is economically viable. We also propose a potential fix to some of the difficulties of the field.
翻译:强化学习(RL)已显示出巨大的潜力,但目前充满了过度和空洞的梦想。我们指出目前研究的一些困难,我们认为,目前研究是社区所走的方向所特有的。对我们来说,目前的方向不可能导致“可部署的”RL:在实际情况下工作,在实际情况下工作,但在经济上仍然可行。我们还提出了解决该领域某些困难的可能办法。