This paper proposes an adaptive graph-based approach for multi-label image classification. Graph-based methods have been largely exploited in the field of multi-label classification, given their ability to model label correlations. Specifically, their effectiveness has been proven not only when considering a single domain but also when taking into account multiple domains. However, the topology of the used graph is not optimal as it is pre-defined heuristically. In addition, consecutive Graph Convolutional Network (GCN) aggregations tend to destroy the feature similarity. To overcome these issues, an architecture for learning the graph connectivity in an end-to-end fashion is introduced. This is done by integrating an attention-based mechanism and a similarity-preserving strategy. The proposed framework is then extended to multiple domains using an adversarial training scheme. Numerous experiments are reported on well-known single-domain and multi-domain benchmarks. The results demonstrate that our approach outperforms the state-of-the-art in terms of mean Average Precision (mAP) and model size.
翻译:本文提出了多标签图像分类的适应性图表法方法。基于图表的方法在多标签分类领域得到了广泛利用,因为它们有能力建模标签关联性。具体地说,这些方法不仅在考虑单一领域时被证明是有效的,而且在考虑多个领域时也被证明是有效的。然而,使用过的图表的地形学并不是最佳的,因为它是预先定义的超自然学的。此外,连续的图表革命网络(GCN)集合往往摧毁其特征的相似性。为了克服这些问题,采用了一种结构,以端到端的方式学习图形连接性。这是通过集成关注机制和类似保留战略来完成的。然后,拟议的框架利用对抗性培训计划扩展到多个领域。许多实验都报告在著名的单域和多域基准上。结果表明,我们的方法在平均精度和模型大小方面超越了最新水平。