Recognizing objects in dense clutter accurately plays an important role to a wide variety of robotic manipulation tasks including grasping, packing, rearranging and many others. However, conventional visual recognition models usually miss objects because of the significant occlusion among instances and causes incorrect prediction due to the visual ambiguity with the high object crowdedness. In this paper, we propose an interactive exploration framework called Smart Explorer for recognizing all objects in dense clutters. Our Smart Explorer physically interacts with the clutter to maximize the recognition performance while minimize the number of motions, where the false positives and negatives can be alleviated effectively with the optimal accuracy-efficiency trade-offs. Specifically, we first collect the multi-view RGB-D images of the clutter and reconstruct the corresponding point cloud. By aggregating the instance segmentation of RGB images across views, we acquire the instance-wise point cloud partition of the clutter through which the existed classes and the number of objects for each class are predicted. The pushing actions for effective physical interaction are generated to sizably reduce the recognition uncertainty that consists of the instance segmentation entropy and multi-view object disagreement. Therefore, the optimal accuracy-efficiency trade-off of object recognition in dense clutter is achieved via iterative instance prediction and physical interaction. Extensive experiments demonstrate that our Smart Explorer acquires promising recognition accuracy with only a few actions, which also outperforms the random pushing by a large margin.
翻译:精确地确认密片中的物体,准确地确认密片中的物体,对于各种各样的机器人操纵任务,包括抓取、包装、重新排列和其他许多任务,都起着重要作用。然而,常规视觉识别模型通常会错失物体,因为各种情况之间有显著的隔离,并由于高物体拥挤的视觉模糊性而造成不正确的预测。在本文中,我们提议了一个互动探索框架,称为智能探索器,以识别密片中的所有物体。我们聪明的探索者与杂片进行物理互动,以最大限度地提高识别性能,同时尽量减少动作的数量,通过最佳的准确性交易交换,可以有效地减少假正反的正反两面。具体地说,我们首先收集结晶的多视图 RGB-D 图像,并重建相应的点云层云。通过对 RGB 图像的分解,我们获得了以实例为根据的点的云层隔断断层。因此,我们所存在的类别和每类对象的数量都得到预测。为有效物理互动的推动行动只能令人分辨地减少由实例分割和多视角目标构成的不确定性。我们通过高精度实验获得的深度精确度的精确度的精确度反应,从而展示了我们所实现的精确度的精确度反应。