Compared with single-label image classification, multi-label image classification is more practical and challenging. Some recent studies attempted to leverage the semantic information of categories for improving multi-label image classification performance. However, these semantic-based methods only take semantic information as type of complements for visual representation without further exploitation. In this paper, we present an innovative path towards the solution of the multi-label image classification which considers it as a dictionary learning task. A novel end-to-end model named Deep Semantic Dictionary Learning (DSDL) is designed. In DSDL, an auto-encoder is applied to generate the semantic dictionary from class-level semantics and then such dictionary is utilized for representing the visual features extracted by Convolutional Neural Network (CNN) with label embeddings. The DSDL provides a simple but elegant way to exploit and reconcile the label, semantic and visual spaces simultaneously via conducting the dictionary learning among them. Moreover, inspired by iterative optimization of traditional dictionary learning, we further devise a novel training strategy named Alternately Parameters Update Strategy (APUS) for optimizing DSDL, which alternately optimizes the representation coefficients and the semantic dictionary in forward and backward propagation. Extensive experimental results on three popular benchmarks demonstrate that our method achieves promising performances in comparison with the state-of-the-arts. Our codes and models have been released at {https://github.com/ZFT-CQU/DSDL}.
翻译:与单标签图像分类相比,多标签图像分类更实际,更具挑战性。最近的一些研究试图利用分类的语义信息来提高多标签图像分类性能。然而,这些语义法只将语义信息作为视觉表达方式的补充,而无需进一步加以利用。在本文中,我们提出了一个创新的路径,以解决多标签图像分类,认为它是一种字典学习任务。设计了一个名为深语义词典学习(DSDL)的新颖端到端模式。在DSDL中,应用了自动编码来从类级语义分类中生成语义词典词典。然而,这些语义学方法仅将语义信息作为视觉表达方式作为视觉表达方式,而无需进一步加以利用。DSDL提供了一种简单但优美的方法,通过在它们之间进行词典学习,同时开发一个名为深语义词典学习(DSDDL) 的端系词典更新战略(APUS),然后用来代表进化DSDC/SDRMR 和在前的SDRMR 中以最有希望的SDL 的SDL 和SDRDL 格式模型展示我们最优化的SUDML 和SL 的SDML 和SDADR 的SDL 和SUDML 的SDR 的SA 的SA 和SDR 的SA 格式展示的SA 和SDR 和SUDFSOFA 的SDL 的SDL 。