Many efforts have been devoted to designing sampling, mining, and weighting strategies in high-level deep metric learning (DML) loss objectives. However, little attention has been paid to low-level but essential data transformation. In this paper, we develop a novel mechanism, the independent domain embedding augmentation learning ({IDEAL}) method. It can simultaneously learn multiple independent embedding spaces for multiple domains generated by predefined data transformations. Our IDEAL is orthogonal to existing DML techniques and can be seamlessly combined with prior DML approaches for enhanced performance. Empirical results on visual retrieval tasks demonstrate the superiority of the proposed method. For example, the IDEAL improves the performance of MS loss by a large margin, 84.5\% $\rightarrow$ 87.1\% on Cars-196, and 65.8\% $\rightarrow$ 69.5\% on CUB-200 at Recall$@1$. Our IDEAL with MS loss also achieves the new state-of-the-art performance on three image retrieval benchmarks, \ie, \emph{Cars-196}, \emph{CUB-200}, and \emph{SOP}. It outperforms the most recent DML approaches, such as Circle loss and XBM, significantly. The source code and pre-trained models of our method will be available at\emph{\url{https://github.com/emdata-ailab/IDEAL}}.
翻译:在高层次深度学习(DML)损失目标中,设计取样、采矿和加权战略的工作已经投入很多,但很少注意低层次但必要的数据转换。在本文中,我们开发了一个新的机制,即独立域嵌入增强学习({IDEAL})的方法。它可以同时学习由预先定义的数据转换产生的多个域的多个独立嵌入空间。我们的DIDAL与现有的DML技术是正统的,可以与以前的DML提高性能的方法密切结合。视觉检索任务的经验结果显示了拟议方法的优越性。例如,IDEAL通过大差额改进MS损失的性能,84.5 $\righturrowle$87.1 ⁇ (Cars-196)和65.8美元(CUB-200)的69.5 ⁇ (retrightrowrow $$@1$)。我们的DIDOL与MS损失还实现了三种图像检索基准的新状态表现, \ \ \ \ \ \ \ \ \ \ \ \ \ { Carph {Cars- 196}, } IMF {ML} 和 main road_BAR_ 这样的 方法将大大的系统模式, 。