In 2D image processing, some attempts decompose images into high and low frequency components for describing edge and smooth parts respectively. Similarly, the contour and flat area of 3D objects, such as the boundary and seat area of a chair, describe different but also complementary geometries. However, such investigation is lost in previous deep networks that understand point clouds by directly treating all points or local patches equally. To solve this problem, we propose Geometry-Disentangled Attention Network (GDANet). GDANet introduces Geometry-Disentangle Module to dynamically disentangle point clouds into the contour and flat part of 3D objects, respectively denoted by sharp and gentle variation components. Then GDANet exploits Sharp-Gentle Complementary Attention Module that regards the features from sharp and gentle variation components as two holistic representations, and pays different attentions to them while fusing them respectively with original point cloud features. In this way, our method captures and refines the holistic and complementary 3D geometric semantics from two distinct disentangled components to supplement the local information. Extensive experiments on 3D object classification and segmentation benchmarks demonstrate that GDANet achieves the state-of-the-arts with fewer parameters.
翻译:在 2D 图像处理中,有些尝试将图像分解成高频和低频部分,用于描述边缘和平滑部分。同样, 3D 对象的轮廓和平方区域, 如椅子的边界和座位面积, 描述不同但又相互补充的几何。 但是, 在以前通过直接处理所有点或本地补丁来理解点云的深深网络中, 这样的调查丢失了。 为了解决这个问题, 我们提议了几何分分角注意网络( GDANet ) 。 Gdanet 引入了几何分辨分解模块, 将点云动态分解成3D 对象的轮廓和平方部分, 分别用尖度和温度变异部件表示。 Gdanet 开发了Sharp- Gentle C 补充注意模块, 将尖度和温和度变异的部件的特性作为两个整体表示, 并用原始的点云度特征分别显示它们。 我们的方法从两个不同的分解的部件中捕获和完善了3D 3D 地理测量结构, 补充当地信息。