With the rapid development of social websites, recent years have witnessed an explosive growth of social images with user-provided tags which continuously arrive in a streaming fashion. Due to the fast query speed and low storage cost, hashing-based methods for image search have attracted increasing attention. However, existing hashing methods for social image retrieval are based on batch mode which violates the nature of social images, i.e., social images are usually generated periodically or collected in a stream fashion. Although there exist many online image hashing methods, they either adopt unsupervised learning which ignore the relevant tags, or are designed in the supervised manner which needs high-quality labels. In this paper, to overcome the above limitations, we propose a new method named Weakly-supervised Online Hashing (WOH). In order to learn high-quality hash codes, WOH exploits the weak supervision by considering the semantics of tags and removing the noise. Besides, We develop a discrete online optimization algorithm for WOH, which is efficient and scalable. Extensive experiments conducted on two real-world datasets demonstrate the superiority of WOH compared with several state-of-the-art hashing baselines.
翻译:随着社会网站的迅速发展,近年来社会形象的急剧增长,用户提供的标签不断以不断流传的方式到达。由于快速查询速度和低存储成本,基于散列的图像搜索方法引起了越来越多的关注。然而,现有的社会图像检索散列方法是基于违反社会图像性质的批量模式,即社会图像通常是定期生成或以流方式收集的。尽管存在许多在线图像散列方法,但它们要么采用不受监督的学习方法,忽视相关标签,要么以监督的方式设计,需要高质量的标签。在本文中,为了克服上述限制,我们提出了名为Waakly- Supered Online Hashing(WOH)的新方法。为了学习高质量的散列代码,WOH利用薄弱的监管手段,考虑标签的语义学和消除噪音。此外,我们为WOH开发了一个高效和可缩放的离子在线优化算法。在两个真实世界数据集上进行的广泛实验,展示了WOH的优越性,与几个州基准相比,展示了WHA的优越性。