Food packing industry workers typically pick a target amount of food by hand from a food tray and place them in containers. Since menus are diverse and change frequently, robots must adapt and learn to handle new foods in a short time-span. Learning to grasp a specific amount of granular food requires a large training dataset, which is challenging to collect reasonably quickly. In this study, we propose ways to reduce the necessary amount of training data by augmenting a deep neural network with models that estimate its uncertainty through self-supervised learning. To further reduce human effort, we devise a data collection system that automatically generates labels. We build on the idea that we can grasp sufficiently well if there is at least one low-uncertainty (high-confidence) grasp point among the various grasp point candidates. We evaluate the methods we propose in this work on a variety of granular foods -- coffee beans, rice, oatmeal and peanuts -- each of which has a different size, shape and material properties such as volumetric mass density or friction. For these foods, we show significantly improved grasp accuracy of user-specified target masses using smaller datasets by incorporating uncertainty.
翻译:食品包装业工人通常手工从食物托盘中提取一定数量的食品,并将其安置在容器中。由于菜单种类繁多且变化频繁,机器人必须适应并学习如何在很短的时间里处理新的食物。学会掌握特定数量的颗粒食品需要大量的培训数据集,而这些数据收集得相当快。在这项研究中,我们建议了减少必要数量的培训数据的方法,方法是增加一个深层神经网络,模型通过自我监督的学习来估计其不确定性。为了进一步减少人类的努力,我们设计了一个自动生成标签的数据收集系统。我们以这样的想法为基础,如果在不同的控制点候选者中至少有一个低不确定性(高度自信)的掌握点,我们就能很好地抓住这个想法。我们评估了我们在这项工作中建议的各种颗粒食品 -- -- 咖啡豆、大米、燕麦和花生 -- -- 每种食品都有不同的大小、形状和物质特性,例如数量密度或摩擦。对于这些食品,我们通过纳入不确定性来大大改进用户对目标质量的准确性。