Retail scenes usually contain densely packed high number of objects in each image. Standard object detection techniques use fully supervised training methodology. This is highly costly as annotating a large dense retail object detection dataset involves an order of magnitude more effort compared to standard datasets. Hence, we propose semi-supervised learning to effectively use the large amount of unlabeled data available in the retail domain. We adapt a popular self supervised method called noisy student initially proposed for object classification to the task of dense object detection. We show that using unlabeled data with the noisy student training methodology, we can improve the state of the art on precise detection of objects in densely packed retail scenes. We also show that performance of the model increases as you increase the amount of unlabeled data.
翻译:零售场通常包含每张图像中大量密集包装的物体。标准物体探测技术使用完全监督的培训方法。这是非常昂贵的,因为与标准数据集相比,大量密集的零售物体探测数据集要付出更大的努力。因此,我们提议采用半监督的学习方法,以便有效利用零售域中现有的大量无标签数据。我们调整了一种大众自监督的方法,即最初提议对物体进行分类的吵闹学生,以适应密集物体探测任务。我们显示,使用吵闹学生培训方法的无标签数据,我们可以改进在密集包装零售场准确探测物体的先进水平。我们还表明,随着你增加无标签数据的数量,模型的性能会提高。