学习6-D线性符合要求动议的模拟学习基于学习的框架 (Imitation learning-based framework for learning 6-D linear compliant motions)

We present a novel method for learning from demonstration 6-D tasks that can be modeled as a sequence of linear motions and compliances. The focus of this paper is the learning of a single linear primitive, many of which can be sequenced to perform more complex tasks. The presented method learns from demonstrations only, without any prior information, how to take advantage of mechanical gradients in in-contact tasks, such as assembly, both for translations and rotations. The method assumes there exists a desired linear direction in 6-D which, if followed by the manipulator, leads the robot's end-effector to the goal area shown in the demonstration, either in free space or by leveraging contact through compliance. First, demonstrations are gathered where the teacher explicitly shows the robot how the mechanical gradients can be used as guidance towards the goal. From the demonstrations, a set of directions is computed which would result in the observed motion at each timestep during a demonstration of a single primitive. By observing which direction is included in all these sets, we find a single desired direction which can reproduce the demonstrated motion. Finding the number of compliant axes and their directions in both rotation and translation is based on the assumption that in the presence of a desired direction of motion, all other observed motion is caused by the contact force of the environment, signalling the need for compliance. We evaluate the method on a KUKA LWR4+ robot with test setups imitating typical tasks where a human would use compliance to cope with positional uncertainty. Results show that the method can successfully learn and reproduce compliant motions by taking advantage of the geometry of the task, therefore reducing the need for localization accuracy.

翻译：我们提出了一个从演示 6-D 任务中学习的新方法,该方法可以作为线性运动和合规的一系列线性运动和合规的样板。本文件的重点是学习单一线性原始原始,其中许多可以顺序排列,以完成更复杂的任务。演示方法仅从演示中学习,而没有事先的任何信息,如何利用在接触任务中机械性梯度,例如组装、翻译和轮换。该方法假定在6-D 中存在一个理想的线性方向,如果在6-D 中出现操纵者,则将机器人的终端效应引向演示中显示的目标领域,要么在自由空间中,要么通过在合规过程中利用动作。首先,在教师明确展示机械性梯度如何用作实现目标的指南时,将进行演示。从演示中计算出一套方向,在单个原始任务演示期间的每个时间段,将产生观察到的动作。通过观察所有这些组合中包含的方向,我们找到一个可以复制演示运动的预期方向。在自由空间或通过合规性接触,或者通过在合规性运动中利用接触。首先显示机械性梯度位置,我们根据观察到的典型的动作的动作,在学习过程的进度,从而显示常规接触的进度,我们根据学习的进度,可以显示整个的进度,从而显示整个的进度,从而显示整个。