Multi-instance multi-label (MIML) learning is widely applicated in numerous domains, such as the image classification where one image contains multiple instances correlated with multiple logic labels simultaneously. The related labels in existing MIML are all assumed as logical labels with equal significance. However, in practical applications in MIML, significance of each label for multiple instances per bag (such as an image) is significant different. Ignoring labeling significance will greatly lose the semantic information of the object, so that MIML is not applicable in complex scenes with a poor learning performance. To this end, this paper proposed a novel MIML framework based on graph label enhancement, namely GLEMIML, to improve the classification performance of MIML by leveraging label significance. GLEMIML first recognizes the correlations among instances by establishing the graph and then migrates the implicit information mined from the feature space to the label space via nonlinear mapping, thus recovering the label significance. Finally, GLEMIML is trained on the enhanced data through matching and interaction mechanisms. GLEMIML (AvgRank: 1.44) can effectively improve the performance of MIML by mining the label distribution mechanism and show better results than the SOTA method (AvgRank: 2.92) on multiple benchmark datasets.
翻译:多实例多标签(MIML)学习广泛应用于众多领域,例如图像分类,其中一张图像同时包含多个实例与多个逻辑标签相关。现有的 MIML 中所有相关标签都被假定为具有相等意义的逻辑标签。然而,在 MIML 的实际应用中,每个标签对于每个包(例如图像)中的多个实例的重要性不同。忽略标签重要性将极大地损失对象的语义信息,因此 MIML 在具有较差学习性能的复杂场景中不可应用。为此,本文提出了一种基于图形标签增强的新型 MIML 框架,即 GLEMIML,通过利用标签重要性来提高 MIML 的分类性能。GLEMIML 首先通过建立图形识别实例之间的关系,然后通过非线性映射将从特征空间挖掘的隐式信息迁移至标签空间,从而恢复标签的重要性。最后,通过匹配和交互机制在增强的数据上训练 GLEMIML。GLEMIML(AvgRank: 1.44)可以通过挖掘标签分布机制有效地提高 MIML 的性能,并且在多个基准数据集上表现出比 SOTA 方法(AvgRank: 2.92)更好的结果。