Deep learning has provided new ways of manipulating, processing and analyzing data. It sometimes may achieve results comparable to, or surpassing human expert performance, and has become a source of inspiration in the era of artificial intelligence. Another subfield of machine learning named reinforcement learning, tries to find an optimal behavior strategy through interactions with the environment. Combining deep learning and reinforcement learning permits resolving critical issues relative to the dimensionality and scalability of data in tasks with sparse reward signals, such as robotic manipulation and control tasks, that neither method permits resolving when applied on its own. In this paper, we present recent significant progress of deep reinforcement learning algorithms, which try to tackle the problems for the application in the domain of robotic manipulation control, such as sample efficiency and generalization. Despite these continuous improvements, currently, the challenges of learning robust and versatile manipulation skills for robots with deep reinforcement learning are still far from being resolved for real world applications.
翻译:深层次的学习为数据操纵、处理和分析提供了新的方法,有时可以取得与人类专家业绩相似或超额的成果,并成为人工智能时代的灵感来源。另一个机器学习的子领域称为强化学习,试图通过与环境的相互作用找到最佳行为战略。深层次的学习和强化学习相结合,可以解决与数据维度和可缩放性有关的关键问题,而用微薄的奖励信号,如机器人操纵和控制任务,无法在应用时自行解决。在本文件中,我们介绍了深层强化学习算法最近取得的重大进展,这些算法试图解决机器人操纵控制领域的应用问题,例如抽样效率和概括化。尽管目前不断有改进,但为具有深层强化学习的机器人学习强大和多功能的操控技能的挑战仍然远远没有得到解决,用于真正的世界应用。