The desire to make applications and machines more intelligent and the aspiration to enable their operation without human interaction have been driving innovations in neural networks, deep learning, and other machine learning techniques. Although reinforcement learning has been primarily used in video games, recent advancements and the development of diverse and powerful reinforcement algorithms have enabled the reinforcement learning community to move from playing video games to solving complex real-life problems in autonomous systems such as self-driving cars, delivery drones, and automated robotics. Understanding the environment of an application and the algorithms' limitations plays a vital role in selecting the appropriate reinforcement learning algorithm that successfully solves the problem on hand in an efficient manner. Consequently, in this study, we identify three main environment types and classify reinforcement learning algorithms according to those environment types. Moreover, within each category, we identify relationships between algorithms. The overview of each algorithm provides insight into the algorithms' foundations and reviews similarities and differences among algorithms. This study provides a perspective on the field and helps practitioners and researchers to select the appropriate algorithm for their use case.
翻译:使应用程序和机器更智能的愿望,以及使这些应用程序和机器在没有人际互动的情况下运作的愿望,一直在推动神经网络、深层学习和其他机器学习技术方面的创新。虽然强化学习主要用于电子游戏,但最近的进展和各种强大强化算法的发展使强化学习界能够从玩电子游戏转向解决自主系统中复杂的现实问题,如自行驾驶汽车、投送无人机和自动机器人。了解应用程序的环境和算法的局限性在选择能够有效解决手头问题的适当强化学习算法方面发挥着至关重要的作用。因此,在本研究中,我们确定了三种主要环境类型,并根据这些环境类型对强化学习算法进行分类。此外,我们在每一类别中都确定了各种算法之间的关系。每种算法的概览都提供了对算法基础的洞察力,并审查了各种算法之间的相似和差异。本研究从实地的角度出发,帮助从业人员和研究人员选择使用这些算法的适当算法。