Synthetic Aperture Radar (SAR) data and Interferometric SAR (InSAR) products in particular, are one of the largest sources of Earth Observation data. InSAR provides unique information on diverse geophysical processes and geology, and on the geotechnical properties of man-made structures. However, there are only a limited number of applications that exploit the abundance of InSAR data and deep learning methods to extract such knowledge. The main barrier has been the lack of a large curated and annotated InSAR dataset, which would be costly to create and would require an interdisciplinary team of experts experienced on InSAR data interpretation. In this work, we put the effort to create and make available the first of its kind, manually annotated dataset that consists of 19,919 individual Sentinel-1 interferograms acquired over 44 different volcanoes globally, which are split into 216,106 InSAR patches. The annotated dataset is designed to address different computer vision problems, including volcano state classification, semantic segmentation of ground deformation, detection and classification of atmospheric signals in InSAR imagery, interferogram captioning, text to InSAR generation, and InSAR image quality assessment.
翻译:合成孔径雷达(SAR)数据和特别是干涉测量合成孔径雷达(InSAR)产品是地球观测数据的最大来源之一; 合成孔径雷达(ISAR)提供关于各种地球物理过程和地质学以及人造结构的地质技术特性的独特信息; 然而,利用InSAR大量数据和深层学习方法来获取这种知识的应用有限; 主要障碍是缺乏一个庞大的分类和附加说明的InSAR数据集,而建立这种数据集的费用很高,需要一支在InSAR数据解释方面有经验的跨学科专家小组; 在这项工作中,我们努力创建和提供第一组这类、手动附加说明的数据集,其中包括在全球44个不同火山上获得的19,919个单个哨兵-1个干涉图,该数据集被分成216,106个InSAR补丁。