We investigate the problem of classifying - from a single image - the level of content in a cup or a drinking glass. This problem is made challenging by several ambiguities caused by transparencies, shape variations and partial occlusions, and by the availability of only small training datasets. In this paper, we tackle this problem with an appropriate strategy for transfer learning. Specifically, we use adversarial training in a generic source dataset and then refine the training with a task-specific dataset. We also discuss and experimentally evaluate several training strategies and their combination on a range of container types of the CORSMAL Containers Manipulation dataset. We show that transfer learning with adversarial training in the source domain consistently improves the classification accuracy on the test set and limits the overfitting of the classifier to specific features of the training data.
翻译:我们从单一的图像中调查对杯子或饮料杯中含量的分类问题,这个问题由于移动、形状变异和部分隔离造成的若干模糊不清以及只有少量培训数据集的存在而具有挑战性。我们在本文件中通过适当的转移学习战略来解决这个问题。具体地说,我们使用通用源数据集中的对抗性培训,然后用具体任务数据集来改进培训。我们还讨论并实验性地评价若干培训战略及其结合,将CORMSAL集装箱操纵数据集的各类集装箱组合起来。我们表明,在源域通过对抗性培训进行的学习不断提高测试数据集的分类准确性,并将分类器的尺寸限制在培训数据的具体特性上。