A part-based object understanding facilitates efficient compositional learning and knowledge transfer, robustness to occlusion, and has the potential to increase the performance on general recognition and localization tasks. However, research on part-based models is hindered due to the lack of datasets with part annotations, which is caused by the extreme difficulty and high cost of annotating object parts in images. In this paper, we propose PartImageNet, a large, high-quality dataset with part segmentation annotations. It consists of 158 classes from ImageNet with approximately 24000 images. PartImageNet is unique because it offers part-level annotations on a general set of classes with non-rigid, articulated objects, while having an order of magnitude larger size compared to existing datasets. It can be utilized in multiple vision tasks including but not limited to: Part Discovery, Semantic Segmentation, Few-shot Learning. Comprehensive experiments are conducted to set up a set of baselines on PartImageNet and we find that existing works on part discovery can not always produce satisfactory results during complex variations. The exploit of parts on downstream tasks also remains insufficient. We believe that our PartImageNet will greatly facilitate the research on part-based models and their applications. The dataset and scripts will soon be released at https://github.com/TACJu/PartImageNet.
翻译:部分对象理解有助于高效的构成学习和知识转让,对封闭性的理解强健,并有可能提高一般识别和本地化任务的业绩。然而,部分基础模型的研究由于缺少带有部分说明的数据集而受阻,原因是图像中说明对象部分极其困难且成本高昂。在本文中,我们提议PartImageNet,是一个包含部分说明的大型高质量数据集。它由图像网的158个类组成,大约有24000个图像。PartImageNet是独一无二的,因为它为一组非硬化、清晰的普通类提供部分说明,但与现有数据集相比,其规模更大。它可用于多种愿景任务,包括但不限于:部分隐蔽、语义剖析、微小的学习。我们进行了全面试验,以在PartimageNet上建立一套基线。我们发现部分发现的现有部分发现工作在复杂的变异中不能总是产生令人满意的结果。在下游任务上对部分任务进行探索,同时比现有数据集大得多。我们相信,在下游任务上对部分的探索也将大大促进。