Learning from humans allows non-experts to program robots with ease, lowering the resources required to build complex robotic solutions. Nevertheless, such data-driven approaches often lack the ability of providing guarantees regarding their learned behaviors, which is critical for avoiding failures and/or accidents. In this work, we focus on reaching/point-to-point motions, where robots must always reach their goal, independently of their initial state. This can be achieved by modeling motions as dynamical systems and ensuring that they are globally asymptotically stable. Hence, we introduce a novel Contrastive Learning loss for training Deep Neural Networks (DNN) that, when used together with an Imitation Learning loss, enforces the aforementioned stability in the learned motions. Differently from previous work, our method does not restrict the structure of its function approximator, enabling its use with arbitrary DNNs and allowing it to learn complex motions with high accuracy. We validate it using datasets and a real robot. In the former case, motions are 2 and 4 dimensional, modeled as first and second order dynamical systems. In the latter, motions are 3, 4, and 6 dimensional, of first and second order, and are used to control a 7DoF robot manipulator in its end effector space and joint space. More details regarding the real-world experiments are presented in: https://youtu.be/OM-2edHBRfc.
翻译:人类的学习使非专家能够轻松地编程机器人,降低建立复杂的机器人解决方案所需的资源。然而,这类数据驱动的方法往往缺乏提供其学习行为保障的能力,这对于避免失败和(或)事故至关重要。在这项工作中,我们侧重于达到/点对点动议,机器人必须始终达到其目标,而不论其初始状态如何。这可以通过作为动态系统的模拟动作和确保它们在全球是无干扰的稳定性来实现。因此,我们为培训深神经网络(DNNN)引入了新的竞争学习损失,当与模拟学习失败一起使用时,往往缺乏对其学习行为提供保障的能力,而这对于避免失败和(或)事故至关重要。在这项工作中,我们的方法并不侧重于达到/点对点动议的排序结构,即机器人必须始终独立于其初始状态,从而能够使用任意的 DNNNP,并允许其以高度精确地学习复杂的动作。我们用数据集和真正的机器人来验证它。在前一例中,动作是2和4维,模型作为第一和第二级动态动态网络系统。在后期,运动是真实的MOR 3 和六维 。在实际操作中, 4 。在实际操作中,MLA 4 4 4 和6 。在实际操作中, 4 4 。在实际操作中, 4 4 4 4 4 4 4 和6-RBRBRBI 4 。在实际操作中, 。在实际操作中, 4 。