At the heart of all automated driving systems is the ability to sense the surroundings, e.g., through semantic segmentation of LiDAR sequences, which has seen remarkable progress due to the release of large datasets such as SemanticKITTI and nuScenes-LidarSeg. While most previous works focus on sparse segmentation of the LiDAR input, dense output masks provide self-driving cars with almost complete environment information. In this paper, we introduce MASS - a Multi-Attentional Semantic Segmentation model built specifically for dense top-view understanding of driving scenes. Our framework operates on pillar and occupancy features and comprises three attention-based building blocks: (1) a keypoint-driven graph attention, (2) an LSTM-based attention computed from a vector embedding of the spatial input, and (3) a pillar-based attention, resulting in a dense 360-degree segmentation mask. With extensive experiments on both SemanticKITTI and nuScenes-LidarSeg, we quantitatively demonstrate the effectiveness of our model, outperforming the state of the art by 19.0% on SemanticKITTI and reaching 30.4% mIoU on nuScenes-LidarSeg, where MASS is the first work addressing the dense segmentation task. Furthermore, our multi-attention model proves highly effective for 3D object detection, as validated on the KITTI-3D dataset, showcasing its generalizability to other tasks related to 3D vision.
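To make the described architecture more concrete, below is a minimal PyTorch sketch of how the three attention branches named in the abstract could be fused into a dense top-view segmentation mask. This is not the authors' implementation: the module names, channel sizes, the graph attention approximated with multi-head attention over keypoints, and the concatenation-based fusion are all illustrative assumptions.

```python
# Hedged sketch (assumed design, not the paper's code): three attention branches
# over pillar/occupancy bird's-eye-view (BEV) features, fused into a dense mask.
import torch
import torch.nn as nn


class MultiAttentionSegHead(nn.Module):
    def __init__(self, feat_channels=64, num_classes=20):
        super().__init__()
        # (1) keypoint-driven attention, approximated here with multi-head attention
        self.graph_attn = nn.MultiheadAttention(feat_channels, num_heads=4, batch_first=True)
        # (2) LSTM-based attention over a vector embedding of the spatial input
        self.lstm = nn.LSTM(feat_channels, feat_channels, batch_first=True)
        # (3) pillar-based attention as per-cell channel reweighting
        self.pillar_attn = nn.Sequential(nn.Conv2d(feat_channels, feat_channels, 1), nn.Sigmoid())
        # segmentation head producing one logit map per class
        self.head = nn.Conv2d(3 * feat_channels, num_classes, kernel_size=1)

    def forward(self, bev_feats, keypoint_feats):
        # bev_feats: (B, C, H, W) pillar/occupancy features on the top-view grid
        # keypoint_feats: (B, K, C) features of sampled keypoints
        B, C, H, W = bev_feats.shape
        # (1) attention among keypoints, pooled and broadcast back onto the grid
        kp, _ = self.graph_attn(keypoint_feats, keypoint_feats, keypoint_feats)
        kp_map = kp.mean(dim=1).view(B, C, 1, 1).expand(B, C, H, W)
        # (2) LSTM over the row-wise spatial embedding, broadcast along columns
        seq = bev_feats.mean(dim=3).permute(0, 2, 1)           # (B, H, C)
        lstm_out, _ = self.lstm(seq)                           # (B, H, C)
        lstm_map = lstm_out.permute(0, 2, 1).unsqueeze(3).expand(B, C, H, W)
        # (3) pillar attention reweights the BEV features per cell
        pillar_map = bev_feats * self.pillar_attn(bev_feats)
        fused = torch.cat([kp_map, lstm_map, pillar_map], dim=1)
        return self.head(fused)                                # (B, num_classes, H, W)


if __name__ == "__main__":
    model = MultiAttentionSegHead()
    logits = model(torch.randn(2, 64, 200, 200), torch.randn(2, 128, 64))
    print(logits.shape)  # torch.Size([2, 20, 200, 200])
```

The dense 360-degree output arises because the head predicts a class logit for every cell of the top-view grid rather than only for cells containing LiDAR returns; the grid resolution and fusion strategy above are placeholders chosen for readability.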