面对真实的2020模拟机器人竞赛的开放式学习架构 (An open-ended learning architecture to face the REAL 2020 simulated robot competition)

Open-ended learning is a core research field of machine learning and robotics aiming to build learning machines and robots able to autonomously acquire knowledge and skills and to reuse them to solve novel tasks. The multiple challenges posed by open-ended learning have been operationalized in the robotic competition REAL 2020. This requires a simulated camera-arm-gripper robot to (a) autonomously learn to interact with objects during an intrinsic phase where it can learn how to move objects and then (b) during an extrinsic phase, to re-use the acquired knowledge to accomplish externally given goals requiring the robot to move objects to specific locations unknown during the intrinsic phase. Here we present a 'baseline architecture' for solving the challenge, provided as baseline model for REAL 2020. Few models have all the functionalities needed to solve the REAL 2020 benchmark and none has been tested with it yet. The architecture we propose is formed by three components: (1) Abstractor: abstracting sensory input to learn relevant control variables from images; (2) Explorer: generating experience to learn goals and actions; (3) Planner: formulating and executing action plans to accomplish the externally provided goals. The architecture represents the first model to solve the simpler REAL 2020 'Round 1' allowing the use of a simple parameterised push action. On Round 2, the architecture was used with a more general action (sequence of joints positions) achieving again higher than chance level performance. The baseline software is well documented and available for download and use at https://github.com/AIcrowd/REAL2020_starter_kit.

翻译：开放式学习是机器学习和机器人的核心研究领域,目的是建立能够自主获取知识和技能并再利用这些知识和技能以完成新任务。开放式学习带来的多重挑战已在机器人竞赛“现实2020”中实施。这需要模拟相机-报警式高压机器人,以便(a) 自主地学习如何移动物体,在内在阶段与物体互动,然后(b) 在外部阶段,重新使用获得的知识,以实现外部给定的目标,要求机器人将物体转移到内在阶段未知的具体地点。我们在这里展示了一个“基础启动架构”,用于解决挑战,作为“现实2020”的基线模型。很少有模型具备解决“现实2020”基准所需的全部功能,尚未测试。我们提议的架构由三个组成部分组成:(1) 抽象:抽象的感官投入,以便从图像中学习相关的控制变量;(2) 探索者:创造学习目标和行动的经验;(3) 规划者:制定和执行行动计划,以完成外部提供的目标。这里我们展示了“基础启动”架构,作为“现实2020”的基线模型。我们提议的架构以更简单的2020年基准为基准,再次使用“实现”的模型。