Joint-Embedding Self Supervised Learning (JE-SSL) has seen a rapid development, with the emergence of many method variations but only few principled guidelines that would help practitioners to successfully deploy them. The main reason for that pitfall comes from JE-SSL's core principle of not employing any input reconstruction therefore lacking visual cues of unsuccessful training. Adding non informative loss values to that, it becomes difficult to deploy SSL on a new dataset for which no labels can help to judge the quality of the learned representation. In this study, we develop a simple unsupervised criterion that is indicative of the quality of the learned JE-SSL representations: their effective rank. Albeit simple and computationally friendly, this method -- coined RankMe -- allows one to assess the performance of JE-SSL representations, even on different downstream datasets, without requiring any labels. A further benefit of RankMe is that it does not have any training or hyper-parameters to tune. Through thorough empirical experiments involving hundreds of training episodes, we demonstrate how RankMe can be used for hyperparameter selection with nearly no reduction in final performance compared to the current selection method that involve a dataset's labels. We hope that RankMe will facilitate the deployment of JE-SSL towards domains that do not have the opportunity to rely on labels for representations' quality assessment.
翻译:联合自我监督学习(JE-SSL)出现许多方法变异,但只有少数原则性准则可以帮助从业者成功部署这些变异,因此出现了快速发展。这种陷阱的主要原因是JE-SSL的核心原则,即不使用任何投入重建,因此缺乏不成功培训的视觉提示。加上不提供信息的损失值,很难在一个新的数据集上部署SSL,因为没有任何标签可以帮助判断所学代表性的质量。在这项研究中,我们制定了一个简单的、不受监督的标准,表明所学的JE-SSL代表的质量:有效级别。尽管这种方法简单且在计算上是友好的,但这种方法 -- -- 硬体中Me -- -- 使得人们可以评估JE-SSL代表的性能,甚至在不同的下游数据集上,而不需要任何标签。RangMe的另一项好处是,它没有任何培训或超参数可以调和。通过涉及数百个培训事件的彻底实验,我们展示了RankMe如何在超标准性能选择JME的质量表现方面,而几乎没有降低JE标准性能,比我们最终选择标准性能选择标准,我们不会降低标准标准。