Our planet is viewed by satellites through multiple sensors (e.g., multi-spectral, Lidar and SAR) and at different times. Multi-view observations bring us complementary information than the single one. Alternatively, there are common features shared between different views, such as geometry and semantics. Recently, contrastive learning methods have been proposed for the alignment of multi-view remote sensing images and improving the feature representation of single sensor images by modeling view-invariant factors. However, these methods are based on the pretraining of the predefined tasks or just focus on image-level classification. Moreover, these methods lack research on uncertainty estimation. In this work, a pixel-wise contrastive approach based on an unlabeled multi-view setting is proposed to overcome this limitation. This is achieved by the use of contrastive loss in the feature alignment and uniformity between multi-view images. In this approach, a pseudo-Siamese ResUnet is trained to learn a representation that aims to align features from the shifted positive pairs and uniform the induced distribution of the features on the hypersphere. The learned features of multi-view remote sensing images are evaluated on a liner protocol evaluation and an unsupervised change detection task. We analyze key properties of the approach that make it work, finding that the requirement of shift equivariance ensured the success of the proposed approach and the uncertainty estimation of representations leads to performance improvements. Moreover, the performance of multi-view contrastive learning is affected by the choice of different sensors. Results demonstrate both improvements in efficiency and accuracy over the state-of-the-art multi-view contrastive methods.
翻译:卫星通过多个传感器(例如多光谱、里达尔和萨尔)在不同的时间观察我们的地球。多视图观测给我们带来比单一的互补信息。 或者,不同观点之间有共同的特征,例如几何和语义学。最近,提出了对比式学习方法,以通过模拟视觉变化因素调整多视图遥感图像,改进单个传感器图像的特征表现。但是,这些方法基于预先界定任务前的训练,或者仅仅侧重于图像等级分类。此外,这些方法缺乏关于不确定性估计的研究。在这项工作中,提出了一种基于无标签多视图设置的具有比喻性的多比喻式对比性方法,以克服这一局限性。最近,提出了对比式学习方法,通过模拟视觉变化来改进单个传感器图像的特征,通过模拟对比式对比式对图像进行对比性评估,并统一高图像的诱导性分布。多视角遥感图像的学习特点,以未标注性能的精确性能分析方式,通过对动态的稳定性评估,通过测试性能评估,通过测试性能测定性能评估,通过测测测测测程的进度法,通过测测测测测测测测测测的性性性性性性测结果,通过测测测测测测测测测测测结果的性结果的性测结果,通过测测测测测测测测测测测测测测测结果的性测测测测测测测测测测测测测测结果的性测结果。 。