Although sketch-to-photo retrieval has a wide range of applications, paired and richly labeled ground truth is costly to obtain. Photo retrieval data, in contrast, is easier to acquire. Previous works therefore pre-train their models on richly labeled photo retrieval data (i.e., the source domain) and then fine-tune them on sketch-to-photo retrieval data with limited labels (i.e., the target domain). However, if source and target data are not co-trained, source domain knowledge may be forgotten during fine-tuning, while simply co-training them may cause negative transfer due to domain gaps. Moreover, the identity label spaces of source and target data are generally disjoint, so conventional category-level Domain Adaptation (DA) is not directly applicable. To address these issues, we propose an Instance-level Heterogeneous Domain Adaptation (IHDA) framework. We apply the fine-tuning strategy to identity label learning, aiming to transfer instance-level knowledge in an inductive manner. Meanwhile, labeled attributes from the source data are selected to form a shared label space for the source and target domains. Guided by these shared attributes, DA is used to bridge both cross-dataset and heterogeneous domain gaps, which transfers instance-level knowledge in a transductive manner. Experiments show that our method sets a new state of the art on three sketch-to-photo image retrieval benchmarks without extra annotations, which opens the door to training more effective models for heterogeneous image retrieval tasks with limited labels. The code is available at https://github.com/fandulu/IHDA.