Change detection is one of the most active research areas in Remote Sensing (RS). Most of the recently developed change detection methods are based on deep learning (DL) algorithms. This kind of algorithms is generally focused on generating two-dimensional (2D) change maps, thus only identifying planimetric changes in land use/land cover (LULC) and not considering nor returning any information on the corresponding elevation changes. Our work goes one step further, proposing two novel networks, able to solve simultaneously the 2D and 3D CD tasks, and the 3DCD dataset, a novel and freely available dataset precisely designed for this multitask. Particularly, the aim of this work is to lay the foundations for the development of DL algorithms able to automatically infer an elevation (3D) CD map -- together with a standard 2D CD map --, starting only from a pair of bitemporal optical images. The proposed architectures, to perform the task described before, consist of a transformer-based network, the MultiTask Bitemporal Images Transformer (MTBIT), and a deep convolutional network, the Siamese ResUNet (SUNet). Particularly, MTBIT is a transformer-based architecture, based on a semantic tokenizer. SUNet instead combines, in a siamese encoder, skip connections and residual layers to learn rich features, capable to solve efficiently the proposed task. These models are, thus, able to obtain 3D CD maps from two optical images taken at different time instants, without the need to rely directly on elevation data during the inference step. Encouraging results, obtained on the novel 3DCD dataset, are shown. The code and the 3DCD dataset are available at \url{https://sites.google.com/uniroma1.it/3dchangedetection/home-page}.
翻译:更改检测是遥感(RS)中最活跃的研究领域之一 。 最近开发的更改检测方法大多基于深层次学习( DL) 算法。 这种算法一般侧重于生成二维(2D) 变化地图, 从而只能识别土地利用/土地覆盖( LULC) 的平面变化, 并且不考虑或返回相应的海拔变化的任何信息 。 我们的工作更进一步, 提议两个新网络, 能够同时解决 2D 和 3D CD 任务, 3D 数据集, 一个新颖和可自由获取的数据集。 特别是, 这项工作的目的是为 DL 算法的开发奠定基础, 能够自动推导升( 2D) CD 变化图( LULULLC) ), 并且仅仅从一对一对一对一的光学光学图像开始。 拟议的结构包括基于变压的网络、 多塔斯克 Bitoporal 图像转换 (MTITIT), 以及一个深变动的网络, Resal 3UNNet (SUD 3Net ) 和 Smartalde Stabilick 显示的 数据结构。