Unsupervised hashing has attracted much attention for binary representation learning due to the requirement of economical storage and efficiency of binary codes. It aims to encode high-dimensional features in the Hamming space with similarity preservation between instances. However, most existing methods learn hash functions in manifold-based approaches. Those methods capture the local geometric structures (i.e., pairwise relationships) of data, and lack satisfactory performance in dealing with real-world scenarios that produce similar features (e.g. color and shape) with different semantic information. To address this challenge, in this work, we propose an effective unsupervised method, namely Jointly Personalized Sparse Hashing (JPSH), for binary representation learning. To be specific, firstly, we propose a novel personalized hashing module, i.e., Personalized Sparse Hashing (PSH). Different personalized subspaces are constructed to reflect category-specific attributes for different clusters, adaptively mapping instances within the same cluster to the same Hamming space. In addition, we deploy sparse constraints for different personalized subspaces to select important features. We also collect the strengths of the other clusters to build the PSH module with avoiding over-fitting. Then, to simultaneously preserve semantic and pairwise similarities in our JPSH, we incorporate the PSH and manifold-based hash learning into the seamless formulation. As such, JPSH not only distinguishes the instances from different clusters, but also preserves local neighborhood structures within the cluster. Finally, an alternating optimization algorithm is adopted to iteratively capture analytical solutions of the JPSH model. Extensive experiments on four benchmark datasets verify that the JPSH outperforms several hashing algorithms on the similarity search task.
翻译:由于需要节约存储和二进制代码的效率,未经监督的散列已经吸引了对二进制代表学习的注意。它旨在将哈姆林空间的高维特征编码为二进制空间的类似保存。然而,大多数现有方法在多基方法中学习散列功能。这些方法捕捉了数据的本地几何结构(即双向关系),在处理产生不同语义信息的类似特征(如颜色和形状)的真实世界情景时缺乏令人满意的性能。为了应对这一挑战,我们在这项工作中提出了一种有效的不受监督的方法,即:同时将二进制空间的双进制 Sprass Hashing (JPSH),用于二进制演示。首先,我们提出一个新的个性化的散列功能,即个人化的散列(即双向关系)数据。不同的个化子空间构建了不同组的分类模式,在同一组群中进行适应性地绘图。此外,我们在不同的个化的分层S的分层中设置了隐隐性限制,在不同的个人化的分层中,S 也选择了其他的代代代号的代号。