Image classification is one of the main drivers of the rapid developments in deep learning with convolutional neural networks for computer vision. So is the analogous task of scene classification in remote sensing. However, in contrast to the computer vision community that has long been using well-established, large-scale standard datasets to train and benchmark high-capacity models, the remote sensing community still largely relies on relatively small and often application-dependend datasets, thus lacking comparability. With this letter, we present a classification-oriented conversion of the SEN12MS dataset. Using that, we provide results for several baseline models based on two standard CNN architectures and different input data configurations. Our results support the benchmarking of remote sensing image classification and provide insights to the benefit of multi-spectral data and multi-sensor data fusion over conventional RGB imagery.
翻译:图像分类是与计算机视觉的进化神经网络进行深层学习的快速发展的主要驱动因素之一,遥感现场分类任务也与此相似,然而,与长期以来一直使用完善的大型标准数据集来培训和基准高容量模型的计算机视觉界形成对照的是,遥感界在很大程度上仍然依赖相对小的、往往依赖应用程序的数据集,因此缺乏可比性。我们用本信展示了SEN12MS数据集的分类转换。我们利用本信为基于两个标准的CNN架构和不同输入数据配置的若干基线模型提供了结果。我们的成果支持遥感图像分类的基准,并为多光谱数据和多传感器数据融合超过常规 RGB 图像提供了洞察力。