In this paper, we present a semi supervised deep quick learning framework for instance detection and pixel-wise semantic segmentation of images in a dense clutter of items. The framework can quickly and incrementally learn novel items in an online manner by real-time data acquisition and generating corresponding ground truths on its own. To learn various combinations of items, it can synthesize cluttered scenes, in real time. The overall approach is based on the tutor-child analogy in which a deep network (tutor) is pretrained for class-agnostic object detection which generates labeled data for another deep network (child). The child utilizes a customized convolutional neural network head for the purpose of quick learning. There are broadly four key components of the proposed framework semi supervised labeling, occlusion aware clutter synthesis, a customized convolutional neural network head, and instance detection. The initial version of this framework was implemented during our participation in Amazon Robotics Challenge (ARC), 2017. Our system was ranked 3rd, 4th and 5th worldwide in pick, stow-pick and stow task respectively. The proposed framework is an improved version over ARC17 where novel features such as instance detection and online learning has been added.
翻译:在本文中,我们展示了一个半受监督的深层快速学习框架,例如检测和像素和像素的语义分解,图像在密闭的物品中产生标签数据。这个框架可以通过实时数据采集和自己生成相应的地面真相,快速和逐步地以在线方式学习新项目。要了解各种项目组合,它可以实时合成杂乱的场景。这个总体方法基于一个教师-孩子类比,在这个类比中,一个深层次的网络(图托尔)经过预先训练,为另一个深层网络(儿童)生成标签数据。这个儿童使用一个定制的进化神经网络头,以便快速学习。这个拟议框架的半监管性标签、封闭性认识结晶合成、一个定制进化的进化神经网络头和体格检测有四个主要组成部分。这个框架的最初版本是在2017年我们参加亚马逊机器人挑战(ARC)期间实施的。我们的系统在全世界排名第3、第4和第5位,分别用于采集、收缩和存储任务。拟议的框架有经过改进的新版本。