The panorama image can simultaneously demonstrate complete information of the surrounding environment and has many advantages in virtual tourism, games, robotics, etc. However, the progress of panorama depth estimation cannot completely solve the problems of distortion and discontinuity caused by the commonly used projection methods. This paper proposes SphereDepth, a novel panorama depth estimation method that predicts the depth directly on the spherical mesh without projection preprocessing. The core idea is to establish the relationship between the panorama image and the spherical mesh and then use a deep neural network to extract features on the spherical domain to predict depth. To address the efficiency challenges brought by the high-resolution panorama data, we introduce two hyper-parameters for the proposed spherical mesh processing framework to balance the inference speed and accuracy. Validated on three public panorama datasets, SphereDepth achieves comparable results with the state-of-the-art methods of panorama depth estimation. Benefiting from the spherical domain setting, SphereDepth can generate a high-quality point cloud and significantly alleviate the issues of distortion and discontinuity.
翻译:全景图像可以同时展示周围环境的完整信息,在虚拟旅游、游戏、机器人等方面有许多优势。然而,全景深度估计的进展无法完全解决由常用投影方法造成的扭曲和不连续问题。本文提议Sphere Depth,这是全景深度估计方法,在不进行投影预处理的情况下直接预测球状网格的深度。核心想法是建立全景图像与球状网块之间的关系,然后利用深神经网络提取球状域的特征以预测深度。为了应对高分辨率全景数据带来的效率挑战,我们为拟议的球状网格处理框架引入了两个超参数,以平衡推断速度和准确性。在三个公共全景数据集上验证,Sphere Depeh取得了与全景深度估计的最新方法相近的结果。从球域设置中受益,SphereDepeh能够产生高品质的云,并大大缓解扭曲和不精确性的问题。