Zero-Shot Sketch-Based Image Retrieval (ZSSBIR) is an emerging task. The pioneering work focused on the modal gap but ignored inter-class information. Although recent work has begun to consider the triplet-based or contrast-based loss to mine inter-class information, positive and negative samples need to be carefully selected, or the model is prone to lose modality-specific information. To respond to these issues, an Ontology-Aware Network (OAN) is proposed. Specifically, the smooth inter-class independence learning mechanism is put forward to maintain inter-class peculiarity. Meanwhile, distillation-based consistency preservation is utilized to keep modality-specific information. Extensive experiments have demonstrated the superior performance of our algorithm on two challenging Sketchy and Tu-Berlin datasets.
翻译:零热相片图像检索(ZSSBIR)是一项新兴任务,其开拓性工作侧重于模式差距,但忽视了跨类信息。尽管最近的工作已经开始考虑对矿藏阶级间信息造成三重或反差损失的问题,但需要仔细选择正反抽样,或者模型容易丢失特定模式的信息。为了解决这些问题,提议建立本体-软件网络(OAN),具体地说,提出顺利的跨级独立学习机制,以保持不同类别间的特殊性。与此同时,利用蒸馏法的一致性保护来保存特定模式的信息。广泛的实验表明,我们在两个具有挑战性的Sketschy和Tu-Berlin数据集方面的算法表现优异。