Contrastive learning has recently demonstrated superior performance to supervised learning, despite requiring no training labels. We explore how contrastive learning can be applied to hundreds of thousands of unlabeled Mars terrain images, collected from the Mars rovers Curiosity and Perseverance, and from the Mars Reconnaissance Orbiter. Such methods are appealing since the vast majority of Mars images are unlabeled as manual annotation is labor intensive and requires extensive domain knowledge. Contrastive learning, however, assumes that any given pair of distinct images contain distinct semantic content. This is an issue for Mars image datasets, as any two pairs of Mars images are far more likely to be semantically similar due to the lack of visual diversity on the planet's surface. Making the assumption that pairs of images will be in visual contrast - when they are in fact not - results in pairs that are falsely considered as negatives, impacting training performance. In this study, we propose two approaches to resolve this: 1) an unsupervised deep clustering step on the Mars datasets, which identifies clusters of images containing similar semantic content and corrects false negative errors during training, and 2) a simple approach which mixes data from different domains to increase visual diversity of the total training dataset. Both cases reduce the rate of false negative pairs, thus minimizing the rate in which the model is incorrectly penalized during contrastive training. These modified approaches remain fully unsupervised end-to-end. To evaluate their performance, we add a single linear layer trained to generate class predictions based on these contrastively-learned features and demonstrate increased performance compared to supervised models; observing an improvement in classification accuracy of 3.06% using only 10% of the labeled data.
翻译:对比性学习最近显示,尽管不需要任何培训标签,但监督性学习的性能优于监督性学习。 我们探索了如何将对比性学习应用于数十万无标签的火星地形图象,这些图象来自火星变异器Curiosity and Perseverance, 以及火星侦察轨道。 这些方法具有吸引力,因为绝大多数火星图象没有标签,因为人工注解是劳动密集型的,需要广泛的域域知识。 对比性学习假设,任何一对不同的图象含有不同的语义内容。这是火星图象数据集的一个问题,因为任何两对火星图象的特征都极有可能在语义上相似,因为火星表面缺乏视觉多样性。 假设成对图像的配对将具有视觉对比性――当它们事实上不是时,这些成对配对的结果被错误地认为是负面的,影响培训绩效。 在这次研究中,我们提出了两种方法来解决这个问题:1) 火星图象数据集上一个不超前的深度组合,用来识别含有类似精度的图像的群状, 3) 用来显示这些直观性数据在模拟的模型中产生不真实性的数据率率率率分析中,因此显示, 数据在模拟中产生一种不精确的模型中产生反比重的数据。