In this paper, we present a generalizable model-free 6-DoF object pose estimator called Gen6D. Existing generalizable pose estimators either need high-quality object models or require additional depth maps or object masks in test time, which significantly limits their application scope. In contrast, our pose estimator only requires some posed images of the unseen object and is able to accurately predict the poses of the object in arbitrary environments. Gen6D consists of an object detector, a viewpoint selector and a pose refiner, all of which do not require the 3D object model and can generalize to unseen objects. Experiments show that Gen6D achieves state-of-the-art results on two model-free datasets: the MOPED dataset and a new GenMOP dataset collected by us. In addition, on the LINEMOD dataset, Gen6D achieves competitive results compared with instance-specific pose estimators. Project page: https://liuyuan-pal.github.io/Gen6D/.
翻译:在本文中,我们展示了一个通用的无模型 6-DoF 对象,即Gen6D。 现有的通用显示显示显示器要么需要高品质的物体模型,要么在试验时间需要额外的深度地图或对象面罩,从而大大限制其应用范围。 相反,我们的显示显示器只需要一些隐形物体的图像,并且能够准确预测该物体在任意环境中的构成。 Gen6D 包含一个物体探测器、一个视图选择器和一个变形精细器,所有这些都不需要3D 对象模型,可以概括到看不见的物体。 实验显示, Gen6D 在两个无型数据集(MOED数据集和我们收集的一个新的GenMOP数据集)上取得了最先进的结果。 此外,在 LINEMOD 数据集上, Gen6D 取得了与具体实例的构成验证器相比的竞争结果。 项目网页: https://liuyuuan-pal.github.io/Gen6D/。