Depth is a vital piece of information for autonomous vehicles to perceive obstacles. Due to the relatively low price and small size of monocular cameras, depth estimation from a single RGB image has attracted great interest in the research community. In recent years, the application of Deep Neural Networks (DNNs) has significantly boosted the accuracy of monocular depth estimation (MDE). State-of-the-art methods are usually designed on top of complex and extremely deep network architectures, which require more computational resources and cannot run in real-time without using high-end GPUs. Although some researchers tried to accelerate the running speed, the accuracy of depth estimation is degraded because the compressed model does not represent images well. In addition, the inherent characteristic of the feature extractor used by the existing approaches results in severe spatial information loss in the produced feature maps, which also impairs the accuracy of depth estimation on small sized images. In this study, we are motivated to design a novel and efficient Convolutional Neural Network (CNN) that assembles two shallow encoder-decoder style subnetworks in succession to address these problems. In particular, we place our emphasis on the trade-off between the accuracy and speed of MDE. Extensive experiments have been conducted on the NYU depth v2, KITTI, Make3D and Unreal data sets. Compared with the state-of-the-art approaches which have an extremely deep and complex architecture, the proposed network not only achieves comparable performance but also runs at a much faster speed on a single, less powerful GPU.
翻译:深度是自发车辆感知障碍的重要信息。 由于价格相对较低,单筒相机规模较小,单筒摄像机的深度估计在研究界引起了极大的兴趣。近年来,深神经网络(DNNS)的应用大大提高了单眼深度估计的准确性。最先进的方法通常设计在复杂和极其深厚的网络结构之上,这需要更多的计算资源,无法实时运行,而不使用高端GPU。虽然一些研究人员试图加快运行速度,但是由于压缩模型并不代表图像,深度估计的准确性会降低。此外,现有方法所使用的地貌提取器的固有特征使得生成的地貌图中空间信息严重损失,这也损害了对小型图像的深度估计的准确性。在本研究中,我们有动力设计一个新型和高效的进化神经网络(CNN),它只将两个浅度的电极分解式的子网络风格组合起来,接续续解决这些问题,因此深度估算的深度的深度估计值会降低。我们用深度的GIT3 深度的深度和深度的深度模型来进行深度的深度实验。我们用最深层的GIT网络进行深度的深度的实验。