Co: 有效学习自我培训的消极对立派无人监督的代表的反向对比 (AdCo: Adversarial Contrast for Efficient Learning of Unsupervised Representations from Self-Trained Negative Adversaries)

from arxiv, Appendices with more results on symmetric loss, different numbers of negative samples, computing costs is presented. We also discuss "whether we still need negative examples" in Appendix C, a question emerging from the comparison with the BYOL

Contrastive learning relies on constructing a collection of negative examples that are sufficiently hard to discriminate against positive queries when their representations are self-trained. Existing contrastive learning methods either maintain a queue of negative samples over minibatches while only a small portion of them are updated in an iteration, or only use the other examples from the current minibatch as negatives. They could not closely track the change of the learned representation over iterations by updating the entire queue as a whole, or discard the useful information from the past minibatches. Alternatively, we present to directly learn a set of negative adversaries playing against the self-trained representation. Two players, the representation network and negative adversaries, are alternately updated to obtain the most challenging negative examples against which the representation of positive queries will be trained to discriminate. We further show that the negative adversaries are updated towards a weighted combination of positive queries by maximizing the adversarial contrastive loss, thereby allowing them to closely track the change of representations over time. Experiment results demonstrate the proposed Adversarial Contrastive (AdCo) model not only achieves superior performances (a top-1 accuracy of 73.2\% over 200 epochs and 75.7\% over 800 epochs with linear evaluation on ImageNet), but also can be pre-trained more efficiently with fewer epochs.

翻译：对比式学习依赖于建立一系列在自我培训时很难歧视正面询问的负面实例。现有的对比式学习方法要么在小型公文箱上保留排队的负面样本,而其中只有一小部分在迭代中更新,要么只是将目前小型公文箱中的其他实例作为反面。它们无法通过更新整个整队更新来密切跟踪在迭代中学习到的表述变化,或者放弃过去小字盒提供的有用信息。或者,我们直接学习一组与自我培训的代表竞争的负面对手。两个玩家,即代表网和负对手,被交替更新,以获得最具挑战性的负面例子,而积极询问的表述将受到歧视培训。我们还表明,负对手通过最大限度地增加对抗性对比性损失,从而无法密切跟踪反复的表述变化。实验结果显示,拟议的反向对立(Adversari Contratitional (AdCo) 模式不仅能取得优异性业绩(上至上至上一个精确度为732°Q), 更短于800个前的图像和75°7 7,还可以超过100个版本。