The advanced performance of depth estimation is achieved by the employment of large and complex neural networks. While the performance has still been continuously improved, we argue that the depth estimation has to be accurate and efficient. It's a preliminary requirement for real-world applications. However, fast depth estimation tends to lower the performance as the trade-off between the model's capacity and accuracy. In this paper, we attempt to archive highly accurate depth estimation with a light-weight network. To this end, we first introduce a compact network that can estimate a depth map in real-time. We then technically show two complementary and necessary strategies to improve the performance of the light-weight network. As the number of real-world scenes is infinite, the first is the employment of auxiliary data that increases the diversity of training data. The second is the use of knowledge distillation to further boost the performance. Through extensive and rigorous experiments, we show that our method outperforms previous light-weight methods in terms of inference accuracy, computational efficiency and generalization. We can achieve comparable performance compared to state-of-the-of-art methods with only 1% parameters, on the other hand, our method outperforms other light-weight methods by a significant margin.
翻译:深度估计的先进性能是通过使用大型和复杂的神经网络来实现的。 虽然这种性能仍然不断改进, 但我们认为深度估计必须是准确和高效的。 这是真实世界应用的初步要求。 但是, 快速深度估计往往会降低性能, 因为模型的能力和准确性之间的权衡。 在本文中, 我们试图将高度准确的深度估计归档成一个轻量网络。 为此, 我们首先引入一个能够实时估计深度地图的紧凑网络。 然后, 我们从技术上展示了两种互补和必要的战略来改进轻量网络的性能。 由于真实世界场景的数量是无限的, 第一个是使用辅助数据来增加培训数据的多样性。 第二个是使用知识蒸馏来进一步提升性能。 我们通过广泛和严格的实验, 我们证明我们的方法在推算准确性、 计算效率和 概括化方面比照以前的轻量方法。 我们可以比照目前最先进的方法取得可比的性能, 而在另一边上只有1%的参数。