Real-world visual search systems involve deployments on multiple platforms with different computing and storage resources. Deploying a unified model that suits the minimal-constrain platforms leads to limited accuracy. It is expected to deploy models with different capacities adapting to the resource constraints, which requires features extracted by these models to be aligned in the metric space. The method to achieve feature alignments is called ``compatible learning''. Existing research mainly focuses on the one-to-one compatible paradigm, which is limited in learning compatibility among multiple models. We propose a Switchable representation learning Framework with Self-Compatibility (SFSC). SFSC generates a series of compatible sub-models with different capacities through one training process. The optimization of sub-models faces gradients conflict, and we mitigate this problem from the perspective of the magnitude and direction. We adjust the priorities of sub-models dynamically through uncertainty estimation to co-optimize sub-models properly. Besides, the gradients with conflicting directions are projected to avoid mutual interference. SFSC achieves state-of-the-art performance on the evaluated datasets.
翻译:现实世界的视觉搜索系统涉及在多个具有不同计算和存储资源的平台上部署。部署适合最小约束平台的统一模型会导致精度受限。因此,需要部署适应资源约束的不同能力模型,这需要在度量空间中对这些模型提取的特征进行对齐。实现特征对齐的方法称为“兼容性学习”。现有研究主要关注一对一兼容性范式,限制了在多个模型之间学习兼容性。我们提出了一种具有自相容性的可切换表示学习框架(SFSC)。SFSC通过一个训练过程生成一系列不同容量的兼容子模型。子模型优化面临梯度冲突,我们从大小和方向的角度缓解了这个问题。我们通过不确定性估计动态调整子模型的优先级,以适当地合并子模型。此外,具有冲突方向的梯度进行投影以避免互相干扰。SFSC在评估数据集上实现了最先进的性能。