The availability of the sheer volume of Copernicus Sentinel-2 imagery has created new opportunities for exploiting deep learning (DL) methods for land use land cover (LULC) image classification. However, an extensive set of benchmark experiments is currently lacking, i.e. DL models tested on the same dataset, with a common and consistent set of metrics, and in the same hardware. In this work, we use the BigEarthNet Sentinel-2 dataset to benchmark for the first time different state-of-the-art DL models for the multi-label, multi-class LULC image classification problem, contributing with an exhaustive zoo of 60 trained models. Our benchmark includes standard CNNs, as well as non-convolutional methods. We put to the test EfficientNets and Wide Residual Networks (WRN) architectures, and leverage classification accuracy, training time and inference rate. Furthermore, we propose to use the EfficientNet framework for the compound scaling of a lightweight WRN. Enhanced with an Efficient Channel Attention mechanism, our scaled lightweight model emerged as the new state-of-the-art. It achieves 4.5% higher averaged F-Score classification accuracy for all 19 LULC classes compared to a standard ResNet50 baseline model, with an order of magnitude less trainable parameters. We provide access to all trained models, along with our code for distributed training on multiple GPU nodes. This model zoo of pre-trained encoders can be used for transfer learning and rapid prototyping in different remote sensing tasks that use Sentinel-2 data, instead of exploiting backbone models trained with data from a different domain, e.g., from ImageNet. We validate their suitability for transfer learning in different datasets of diverse volumes. Our top-performing WRN achieves state-of-the-art performance (71.1% F-Score) on the SEN12MS dataset while being exposed to only a small fraction of the training dataset.
翻译:Copernicus Sentinel-2 图像的纯量的可用性为利用关于土地利用土地覆盖(LULC)图像分类的深层次学习(DL)方法创造了新的机会。 但是,目前缺乏一系列广泛的基准实验, 即在同一数据集中测试的DL模型, 并有一套共同和一致的衡量标准, 以及在同一硬件中测试的DL模型。 在这项工作中, 我们使用大地球网 Sentinel-2 数据集作为基准, 首次为多标签、 多级 LULLC 图像分类问题制定不同的最先进的 DLLLM 模型, 促成60个远程模型的完整快速模型。 我们的基准包括标准的CNN, 以及非革命性的方法。 我们测试了高效的网络和宽度网络(WRMN) 结构, 以及工具分类的准确性、 培训时间和推断率。 此外, 我们提议使用高效的网络框架框架来测量较轻的 RWN 。 通过高效的频道关注机制, 我们规模较轻的LU 模型作为新的州级的快速智能模型出现, 。