Modern optical satellite sensors enable high-resolution stereo reconstruction from space. But the challenging imaging conditions when observing the Earth from space push stereo matching to its limits. In practice, the resulting digital surface models (DSMs) are fairly noisy and often do not attain the accuracy needed for high-resolution applications such as 3D city modeling. Arguably, stereo correspondence based on low-level image similarity is insufficient and should be complemented with a-priori knowledge about the expected surface geometry beyond basic local smoothness. To that end, we introduce ResDepth, a convolutional neural network that learns such an expressive geometric prior from example data. ResDepth refines an initial, raw stereo DSM while conditioning the refinement on the images. I.e., it acts as a smart, learned post-processing filter and can seamlessly complement any stereo matching pipeline. In a series of experiments, we find that the proposed method consistently improves stereo DSMs both quantitatively and qualitatively. We show that the prior encoded in the network weights captures meaningful geometric characteristics of urban design, which also generalize across different districts and even from one city to another. Moreover, we demonstrate that, by training on a variety of stereo pairs, ResDepth can acquire a sufficient degree of invariance against variations in imaging conditions and acquisition geometry.
翻译:现代光学卫星传感器能够从空间进行高分辨率立体重建。但从空间对地球进行观测时,具有挑战性的成像条件从空间推动立体与其极限相匹配。实际上,由此产生的数字表面模型(DSM)相当吵闹,往往没有达到高分辨率应用所需的精确度,例如3D城市建模。可以说,基于低水平图像相似性的立体通信不够充分,应当辅之以关于预期表面几何的优先知识,超越基本的当地光滑。为此,我们引入了Res Depth,这是一个革命性神经网络,在从实例数据中先学到这样的直观几何测量特征。Res Depth在调整图像的精度时,对初始、原始立体DSM进行了精细的改进。I.e.e.,它作为智能的加工后过滤器,可以无缝地补充任何立体匹配管道。在一系列实验中,我们发现拟议方法始终在定量和定性上改进立体DSMMs。我们显示,先前在网络中编码的神经中收集了城市设计中有意义的几何特征特征特征特征特征特征特征特征特征特征特征,这也在不同的地区甚至从一个城市里获得。