Category-level object pose estimation aims to predict the 6D pose as well as the 3D metric size of arbitrary objects from a known set of categories. Recent methods harness shape prior adaptation to map the observed point cloud into the canonical space and apply Umeyama algorithm to recover the pose and size. However, their shape prior integration strategy boosts pose estimation indirectly, which leads to insufficient pose-sensitive feature extraction and slow inference speed. To tackle this problem, in this paper, we propose a novel geometry-guided Residual Object Bounding Box Projection network RBP-Pose that jointly predicts object pose and residual vectors describing the displacements from the shape-prior-indicated object surface projections on the bounding box towards the real surface projections. Such definition of residual vectors is inherently zero-mean and relatively small, and explicitly encapsulates spatial cues of the 3D object for robust and accurate pose regression. We enforce geometry-aware consistency terms to align the predicted pose and residual vectors to further boost performance.
翻译:类级天体的估算旨在预测6D构成以及一组已知类别中任意物体的3D度大小。最近采用的方法在适应前先形状,将观测到的点云映射到运河空间,并应用Umeyama算法来恢复其形状和大小。然而,其形状先前的集成战略会间接地显示估计,从而导致对表面敏感特征的提取不足和缓慢的推导速度。为了解决这一问题,我们在本文件中提出了一个新的几何制-制导残余物体测深框投影网络RBP-Pose,它共同预测物体构成和残余矢量,说明从形状优先对象的物体表面预测到实际地表预测的边框的移位。这种残余矢量的定义本质上是零度和相对小的,并明确封存了3D天体的空间线索,以稳健和准确的剖面回归。我们实施了几何测量-感一致性术语,以调整预测的形状和残余矢量,以进一步增强性能。