In this work we address the problem of autonomous 3D exploration of an unknown indoor environment using a depth camera. We cast the problem as the estimation of the Next Best View (NBV) that maximises the coverage of the unknown area. We do this by re-formulating NBV estimation as a classification problem and we propose a novel learning-based metric that encodes both, the current 3D observation (a depth frame) and the history of the ongoing reconstruction. One of the major contributions of this work is about introducing a new representation for the 3D reconstruction history as an auxiliary utility map which is efficiently coupled with the current depth observation. With both pieces of information, we train a light-weight CNN, named ExHistCNN, that estimates the NBV as a set of directions towards which the depth sensor finds most unexplored areas. We perform extensive evaluation on both synthetic and real room scans demonstrating that the proposed ExHistCNN is able to approach the exploration performance of an oracle using the complete knowledge of the 3D environment.
翻译:在这项工作中,我们用深度摄像头自主探索未知室内环境的问题。我们把问题作为“下一个最佳视图”(NBV)的估算问题,该视图最大限度地扩大了未知区域的覆盖范围。我们通过将NBV的估算作为分类问题进行重新制定,并提出一种新的基于学习的衡量标准,将目前的3D观测(深度框架)和正在进行的重建的历史都编码起来。这项工作的主要贡献之一是将3D重建历史的新表述作为辅助实用地图,与目前的深度观测有效结合。我们用这两部分信息培训了名为ExHistCNN的轻量型CNNCNN, 将NBV作为深度传感器发现最未勘探地区的一套方向。我们对合成和真实房间扫描进行广泛评价,表明拟议的ExHistCNN能够利用对3D环境的完整了解来接近一个神器的勘探性表现。