We present a novel self-taught framework for unsupervised metric learning, which alternates between predicting class-equivalence relations between data through a moving average of an embedding model and learning the model with the predicted relations as pseudo labels. At the heart of our framework lies an algorithm that investigates contexts of data on the embedding space to predict their class-equivalence relations as pseudo labels. The algorithm enables efficient end-to-end training since it demands no off-the-shelf module for pseudo labeling. Also, the class-equivalence relations provide rich supervisory signals for learning an embedding space. On standard benchmarks for metric learning, it clearly outperforms existing unsupervised learning methods and sometimes even beats supervised learning models using the same backbone network. It is also applied to semi-supervised metric learning as a way of exploiting additional unlabeled data, and achieves the state of the art by boosting performance of supervised learning substantially.
翻译:我们为不受监督的计量学习提出了一个新的自我教学框架,它通过嵌入模型的移动平均数来预测数据之间的类等关系,然后以假标签的形式学习模型。我们框架的核心是一种算法,它调查嵌入空间的数据背景,以预测其类等关系为假标签。算法使得高效端对端培训成为可能,因为它要求不要求为假标签提供现成模块。此外,类对等关系为学习嵌入空间提供了丰富的监督信号。关于标准学习基准,它明显优于现有的未经监督的学习方法,有时甚至用同一个主干网击打受监督的学习模式。它也用于半监督的计量学习,作为利用额外的无标签数据的一种方式,并通过大力提高监督学习的绩效来达到艺术状态。