Telerobotic systems must adapt to new environmental conditions and deal with high uncertainty caused by long-time delays. As one of the best alternatives to human-level intelligence, Reinforcement Learning (RL) may offer a solution to cope with these issues. This paper proposes to integrate RL with the Model Mediated Teleoperation (MMT) concept. The teleoperator interacts with a simulated virtual environment, which provides instant feedback. Whereas feedback from the real environment is delayed, feedback from the model is instantaneous, leading to high transparency. The MMT is realized in combination with an intelligent system with two layers. The first layer utilizes Dynamic Movement Primitives (DMP) which accounts for certain changes in the avatar environment. And, the second layer addresses the problems caused by uncertainty in the model using RL methods. Augmented reality was also provided to fuse the avatar device and virtual environment models for the teleoperator. Implemented on DLR's Exodex Adam hand-arm haptic exoskeleton, the results show RL methods are able to find different solutions when changes are applied to the object position after the demonstration. The results also show DMPs to be effective at adapting to new conditions where there is no uncertainty involved.
翻译:远程操作系统必须适应新的环境条件,并应对长期延误造成的高度不确定性。作为人类一级智能的最佳替代方法之一,加强学习(RL)可以提供处理这些问题的解决方案。本文件建议将RL与模范中媒体操作(MMMT)概念相结合。远程操作器与模拟虚拟环境互动,提供即时反馈。真实环境的反馈被延迟,来自模型的反馈是瞬时的,导致高透明度。MMMT与两个层次的智能系统相结合实现。第一层使用动态微调(DMP),其中说明阿凡尔环境的某些变化。第二层则使用RL方法处理模型中不确定性造成的问题。还提供强化现实,将电动装置和虚拟环境模型连接起来,供远程操作器使用。在DLRL的Exodex Adam 手动武器外骨骼上,结果显示RL方法能够在对演示后的对象位置应用变化时找到不同的解决方案。在演示结果中,没有应用新的不确定性。