Robots are expected to replace menial tasks such as housework. Some of these tasks include nonprehensile manipulation performed without grasping objects. Nonprehensile manipulation is very difficult because it requires considering the dynamics of environments and objects. Therefore imitating complex behaviors requires a large number of human demonstrations. In this study, a self-supervised learning that considers dynamics to achieve variable speed for nonprehensile manipulation is proposed. The proposed method collects and fine-tunes only successful action data obtained during autonomous operations. By fine-tuning the successful data, the robot learns the dynamics among itself, its environment, and objects. We experimented with the task of scooping and transporting pancakes using the neural network model trained on 24 human-collected training data. The proposed method significantly improved the success rate from 40.2% to 85.7%, and succeeded the task more than 75% for other objects.
翻译:机器人预计将取代诸如家务等隐性任务。 其中一些任务包括: 不掌握物件而操作非致命操作。 非致命操作非常困难, 因为它需要考虑环境和物体的动态。 因此, 模仿复杂的行为需要大量的人类演示。 在此研究中, 提议进行自监督学习, 以考虑动态, 实现非致命操作的可变速度。 拟议方法只收集和微调在自主操作期间获得的成功动作数据 。 通过微调成功的数据, 机器人可以学习自身、 环境和对象之间的动态 。 我们实验了使用根据24个人类收集的培训数据培训的神经网络模型来提取和运输煎饼的任务。 拟议的方法将成功率从40.2% 提高到85.7%, 并成功完成其他对象超过75%的任务 。