Three dimensional (3D) object recognition is becoming a key desired capability for many computer vision systems such as autonomous vehicles, service robots and surveillance drones to operate more effectively in unstructured environments. These real-time systems require effective classification methods that are robust to various sampling resolutions, noisy measurements, and unconstrained pose configurations. Previous research has shown that points' sparsity, rotation and positional inherent variance can lead to a significant drop in the performance of point cloud based classification techniques. However, neither of them is sufficiently robust to multifactorial variance and significant sparsity. In this regard, we propose a novel approach for 3D classification that can simultaneously achieve invariance towards rotation, positional shift, scaling, and is robust to point sparsity. To this end, we introduce a new feature that utilizes graph structure of point clouds, which can be learned end-to-end with our proposed neural network to acquire a robust latent representation of the 3D object. We show that such latent representations can significantly improve the performance of object classification and retrieval tasks when points are sparse. Further, we show that our approach outperforms PointNet and 3DmFV by 35.0% and 28.1% respectively in ModelNet 40 classification tasks using sparse point clouds of only 16 points under arbitrary SO(3) rotation.
翻译:三维(3D)天体识别正在成为许多计算机视觉系统,如自主飞行器、服务机器人和无人机监视无人机等的关键理想能力,以便在非结构化环境中更有效地运行。这些实时系统需要有效的分类方法,这些方法对各种抽样分辨率、噪音测量和不受限制的配置具有很强的力度。以前的研究表明,点的宽度、旋转和位置固有的差异可能导致基于点云分类技术的性能显著下降。然而,这两个系统都不足以应对多因素差异和显著宽度。在这方面,我们提出了三维分类的新办法,可以同时实现旋转、定位转换、缩放,并能够发现偏斜度。为此,我们引入了一个新的特征,利用点云的图形结构,可以通过我们提议的神经网络来学习点的端到端,以获得3D天体的强势潜值代表。我们表明,当点稀少时,这种暗层显示能够显著改善对象分类和检索任务的性能。此外,我们还表明,在模型定位点上,只有40-3%的SOMLODO和 28-FO级之间,我们显示我们的方法在40%的模型定位上,只有40个OD-rblearrantalal-10。