Room layout estimation is a long-existing robotic vision task that benefits both environment sensing and motion planning. However, layout estimation using point clouds (PCs) still suffers from data scarcity due to annotation difficulty. As such, we address the semi-supervised setting of this task based upon the idea of model exponential moving averaging. But adapting this scheme to the state-of-the-art (SOTA) solution for PC-based layout estimation is not straightforward. To this end, we define a quad set matching strategy and several consistency losses based upon metrics tailored for layout quads. Besides, we propose a new online pseudo-label harvesting algorithm that decomposes the distribution of a hybrid distance measure between quads and PC into two components. This technique does not need manual threshold selection and intuitively encourages quads to align with reliable layout points. Surprisingly, this framework also works for the fully-supervised setting, achieving a new SOTA on the ScanNet benchmark. Last but not least, we also push the semi-supervised setting to the realistic omni-supervised setting, demonstrating significantly promoted performance on a newly annotated ARKitScenes testing set. Our codes, data and models are released in this repository.
翻译:室内布局估计是一项长期存在的机器人视野任务,既有利于环境感测,也有利于运动规划。然而,由于注释困难,使用点云(PCs)的布局估计仍然缺乏数据。因此,我们处理基于模型指数平均移动概念的半监督任务设置。但是,根据基于个人计算机的布局估计,将这一办法调整为最先进的(SOTA)解决方案并非直截了当。为此,我们根据为布局四分制而定制的量度标准,定义了四组匹配战略和若干一致性损失。此外,我们提议了一个新的在线假标签采集算法,将四和PC之间的混合距离测量分布分解成两个部分。这一技术不需要人工阈值选择,直觉地鼓励四分解与可靠的布局点保持一致。令人惊讶的是,这一框架也用于完全监督的设置,在扫描网基准上实现了一个新的SOTA。最后但并非最不重要的一点是,我们还将半超级定置的设置推向现实的omni监督设置,显示我们新加注的存储器的模型。