In recent years, Cross-Modal Hashing (CMH) has aroused much attention due to its fast query speed and efficient storage. Previous literatures have achieved promising results for Cross-Modal Retrieval (CMR) by discovering discriminative hash codes and modality-specific hash functions. Nonetheless, most existing CMR works are subjected to some restrictions: 1) It is assumed that data of different modalities are fully paired, which is impractical in real applications due to sample missing and false data alignment, and 2) binary regression targets including the label matrix and binary codes are too rigid to effectively learn semantic-preserving hash codes and hash functions. To address these problems, this paper proposes an Adaptive Marginalized Semantic Hashing (AMSH) method which not only enhances the discrimination of latent representations and hash codes by adaptive margins, but also can be used for both paired and unpaired CMR. As a two-step method, in the first step, AMSH generates semantic-aware modality-specific latent representations with adaptively marginalized labels, which enlarges the distances between different classes, and exploits the labels to preserve the inter-modal and intra-modal semantic similarities into latent representations and hash codes. In the second step, adaptive margin matrices are embedded into the hash codes, and enlarge the gaps between positive and negative bits, which improves the discrimination and robustness of hash functions. On this basis, AMSH generates similarity-preserving hash codes and robust hash functions without strict one-to-one data correspondence requirement. Experiments are conducted on several benchmark datasets to demonstrate the superiority and flexibility of AMSH over some state-of-the-art CMR methods.
翻译:近些年来,跨模量散列(CMH)因其快速查询速度和高效存储而引起了人们的极大关注。以前的文献通过发现歧视性散列代码和特定模式散列功能,为跨模量检索(CMR)取得了令人乐观的成果。然而,大多数现有的遗留集束弹药工作受到一些限制:(1) 假设不同模式的数据完全配对,由于抽样缺失和虚假数据对齐,这些数据在实际应用中是不切实际的;(2) 包括标签矩阵和二进制代码在内的二进制回归目标过于僵硬,无法有效地学习语义保存的散列代码和散列函数。为了解决这些问题,本文建议采用适应性边际混杂码(AMSH)方法,这种方法不仅增加了潜在表达和散列的偏差,而且还可用于配对和不相容的 CMRMR。 作为两步方法,AMSH生成了超标和超标定模式特定的潜在表达方式,在适应性边缘标签上,在不扩大等级之间对等的移动性功能之间,从而保持了内位和内位变变式。