While object semantic understanding is essential for most service robotic tasks, 3D object classification is still an open problem. Learning from artificial 3D models alleviates the cost of annotation necessary to approach this problem, but most methods still struggle with the differences existing between artificial and real 3D data. We conjecture that the cause of those issue is the fact that many methods learn directly from point coordinates, instead of the shape, as the former is hard to center and to scale under variable occlusions reliably. We introduce spherical kernel point convolutions that directly exploit the object surface, represented as a graph, and a voting scheme to limit the impact of poor segmentation on the classification results. Our proposed approach improves upon state-of-the-art methods by up to 36% when transferring from artificial objects to real objects.
翻译:虽然对象语义理解对于多数服务机器人任务来说至关重要, 3D对象分类仍然是一个尚未解决的问题。 从人工的 3D 模型中学习可以减轻解决这一问题所需的说明成本, 但大多数方法仍然与人工和真实的 3D 数据之间存在的差异相冲突。 我们推测,这些问题的原因是许多方法直接从点坐标而不是形状中学习, 因为前者很难在可变封闭状态下进行中枢和缩放。 我们引入直接利用物体表面( 以图表形式表示 ) 的球心点组合, 以及限制对分类结果的分解不力的投票计划。 我们建议的方法在从人造物体向真实物体转移时, 将最先进的方法改进到36% 。