A significant bottleneck in training deep networks for part segmentation is the cost of obtaining detailed annotations. We propose a framework to exploit coarse labels such as figure-ground masks and keypoint locations that are readily available for some categories to improve part segmentation models. A key challenge is that these annotations were collected for different tasks and with different labeling styles and cannot be readily mapped to the part labels. To this end, we propose to jointly learn the dependencies between labeling styles and the part segmentation model, allowing us to utilize supervision from diverse labels. To evaluate our approach we develop a benchmark on the Caltech-UCSD birds and OID Aircraft dataset. Our approach outperforms baselines based on multi-task learning, semi-supervised learning, and competitive methods relying on loss functions manually designed to exploit sparse-supervision.
翻译:在培训深度分离网络方面,一个重大的瓶颈是获取详细说明的成本。我们提出了一个框架,以利用一些类别可以随时获得的粗糙标签,如地表面罩和关键点位置,改进分离模型。一个关键挑战是这些说明是为不同任务和不同标签风格收集的,无法轻易绘制到部分标签上。为此,我们提议共同了解标签样式和部分分割模型之间的依赖性,使我们能够利用不同标签的监管。为了评估我们的方法,我们制定了Caltech-UCSD鸟类和OID航空器数据集的基准。我们的方法超越了基于多任务学习、半监督学习和竞争性方法的基线,这些基准依靠人工设计来利用稀有监督的损耗功能。