Cross-modal hashing is an important approach for multimodal data management and application. Existing unsupervised cross-modal hashing algorithms mainly rely on data features in pre-trained models to mine their similarity relationships. However, their optimization objectives are based on the static metric between the original uni-modal features, without further exploring data correlations during the training. In addition, most of them mainly focus on association mining and alignment among pairwise instances in continuous space but ignore the latent structural correlations contained in the semantic hashing space. In this paper, we propose an unsupervised hash learning framework, namely Adaptive Structural Similarity Preservation Hashing (ASSPH), to solve the above problems. Firstly, we propose an adaptive learning scheme, with limited data and training batches, to enrich semantic correlations of unlabeled instances during the training process and meanwhile to ensure a smooth convergence of the training process. Secondly, we present an asymmetric structural semantic representation learning scheme. We introduce structural semantic metrics based on graph adjacency relations during the semantic reconstruction and correlation mining stage and meanwhile align the structure semantics in the hash space with an asymmetric binary optimization process. Finally, we conduct extensive experiments to validate the enhancements of our work in comparison with existing works.
翻译:现有未经监督的跨模式散列算法主要依赖事先培训的模型中的数据特征,以探寻其相似关系。然而,优化目标的基础是原始单式特征之间的静态度量,而没有在培训过程中进一步探索数据相关性。此外,大多数这类算法主要侧重于连带采矿和连续空间中对等实例之间的匹配,但忽视了语义散列空间中含有的潜在结构相关性。在本文中,我们提出了一个未经监督的散列学习框架,即适应性结构相似性保护散列(ASSPH),以解决上述问题。首先,我们提出一个适应性学习计划,其中的数据和培训批量有限,以丰富培训过程中未加标实例的语义相关性,同时确保培训过程的顺利融合。第二,我们提出了一种不对称结构语义代表制代表制学习计划。我们根据语义重建阶段和关联性结构相似性保护散列(ASSPH) 引入了结构结构性衡量标准,最终将我们空间结构的升级工作与当前空间结构的升级工作进行升级,最终将我们目前的空间结构的升级工作与目前的升级工作进行升级。