Recent years have seen a growth in the number of industrial robots working closely with end-users such as factory workers. This growing use of collaborative robots has been enabled in part due to the availability of end-user robot programming methods that allow users who are not robot programmers to teach robots task actions. Programming by Demonstration (PbD) is one such end-user programming method that enables users to bypass the complexities of specifying robot motions using programming languages by instead demonstrating the desired robot behavior. Demonstrations are often provided by physically guiding the robot through the motions required for a task action in a process known as kinesthetic teaching. Kinesthetic teaching enables users to directly demonstrate task behaviors in the robot's configuration space, making it a popular end-user robot programming method for collaborative robots known for its low cognitive burden. However, because kinesthetic teaching restricts the programmer's teaching to motion demonstrations, it fails to leverage information from other modalities that humans naturally use when providing physical task demonstrations to one other, such as gaze and speech. Incorporating multimodal information into the traditional kinesthetic programming workflow has the potential to enhance robot learning by highlighting critical aspects of a program, reducing ambiguity, and improving situational awareness for the robot learner and can provide insight into the human programmer's intent and difficulties. In this extended abstract, we describe a preliminary study on multimodal kinesthetic demonstrations and future directions for using multimodal demonstrations to enhance robot learning and user programming experiences.
翻译:近年来,与工厂工人等最终用户密切合作的工业机器人数量有所增加; 合作机器人的使用越来越多,部分是由于最终用户机器人编程方法的可用性,允许非机器人程序员的用户教授机器人任务行动; 示范编程(PbD)是一种终端用户编程方法,使用户能够绕过使用编程语言具体指定机器人动议的复杂性,而不用演示所希望的机器人行为; 演示通常通过在被称为 " 传感学教学 " 的进程中采取任务行动所需的动作,由实际指导机器人提供; 植保教学使用户能够直接展示机器人配置空间中的任务行为,使用户能够以较低的认知负担为名,为协作机器人提供一种受欢迎的终端用户机器人程序编程方法。 然而,由于传感美学教学限制了程序员的教学以运动演示为目的,因此它无法利用人类在提供实际任务演示时自然使用的其他模式的信息,例如视觉和演讲。 将多式联运信息纳入传统的传感编程工作流程,有可能通过强调程序的关键方面,来提高机器人学习机器人在机器人配置空间中的行为行为行为,减少模糊性,并改进对机器人的机性研究的深度学习,从而了解我们学习的机型图理学学习。