We introduce the concept of geometric stability to the problem of 6D object pose estimation and propose to learn pose inference based on geometrically stable patches extracted from observed 3D point clouds. According to the theory of geometric stability analysis, a minimal set of three planar/cylindrical patches are geometrically stable and determine the full 6DoFs of the object pose. We train a deep neural network to regress 6D object pose based on geometrically stable patch groups via learning both intra-patch geometric features and inter-patch contextual features. A subnetwork is jointly trained to predict per-patch poses. This auxiliary task is a relaxation of the group pose prediction: A single patch cannot determine the full 6DoFs but is able to improve pose accuracy in its corresponding DoFs. Working with patch groups makes our method generalize well for random occlusion and unseen instances. The method is easily amenable to resolve symmetry ambiguities. Our method achieves the state-of-the-art results on public benchmarks compared not only to depth-only but also to RGBD methods. It also performs well in category-level pose estimation.
翻译:我们为6D天体的问题引入了几何稳定性概念,提出估计,并提议根据从观察到的3D点云中提取的几何稳定点来学习外形推断。根据几何稳定性分析理论,三个平面/圆柱形的最小部分具有几何稳定性,并确定物体的完整 6DoFs 。我们通过学习分数内部几何特征和相接背景特征来训练一个深神经网络以几何稳定点构成。一个子网络经过联合训练,可以预测每批的外形。这一辅助任务是:一个单一部分不能确定全部 6DoFs,但能够提高相应的DoFs 的准确性。与修补组合作,使我们的方法对随机封闭和不可见的事例具有总体性。这个方法很容易解决对称性模糊性。我们的方法不仅达到深度,而且达到RGBD 方法,而且还在类别中做了很好的估计。