Utilizing the latest advances in Artificial Intelligence (AI), the computer vision community is witnessing an unprecedented evolution in all kinds of perception tasks, particularly in object detection. Based on multiple spatially separated perception nodes, Cooperative Perception (CP) has emerged to significantly advance the perception of automated driving. However, current cooperative object detection methods mainly focus on ego-vehicle efficiency without considering the practical issue of system-wide costs. In this paper, we introduce VINet, a unified deep learning-based CP network for scalable, lightweight, and heterogeneous cooperative 3D object detection. VINet is the first CP method designed from the standpoint of large-scale system-level implementation and can be divided into three main phases: 1) Global Pre-Processing and Lightweight Feature Extraction, which transform the data into a global representation and extract features for cooperation in a lightweight manner; 2) Two-Stream Fusion, which fuses the features from scalable and heterogeneous perception nodes; and 3) Central Feature Backbone and 3D Detection Head, which further process the fused features and generate cooperative detection results. A cooperative perception platform is designed and developed for CP dataset acquisition, and several baselines are compared in the experiments. The experimental analysis shows that VINet achieves remarkable detection improvements for pedestrians and cars with 2x lower system-wide computational costs and 12x lower system-wide communication costs.
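At a high level, the three phases above form a per-node-then-central pipeline. The sketch below illustrates that data flow only; all names, the trivial pooling stand-ins for learned feature extraction and fusion, and the toy detection head are assumptions for illustration, not the paper's actual architecture.

```python
# Hypothetical sketch of a three-phase cooperative perception pipeline
# in the spirit of VINet. All operators here are illustrative placeholders.
import numpy as np

def lightweight_extract(points):
    # Phase 1: each node pre-processes its points into a shared global
    # frame (identity here) and extracts a compact feature in a
    # lightweight manner (mean pooling stands in for a learned extractor).
    return points.mean(axis=0)

def two_stream_fusion(vehicle_feats, infra_feats):
    # Phase 2: fuse features from a scalable, heterogeneous set of nodes.
    # One stream per node type; averaging stands in for learned fusion.
    v = np.mean(vehicle_feats, axis=0)
    i = np.mean(infra_feats, axis=0)
    return np.concatenate([v, i])

def central_backbone_and_head(fused):
    # Phase 3: central processing of the fused feature plus a toy "head"
    # that emits a single detection confidence score.
    return float(1.0 / (1.0 + np.exp(-fused.sum())))

# Toy usage: two vehicle nodes and one infrastructure node, 3-D points each.
rng = np.random.default_rng(0)
vehicle_feats = [lightweight_extract(rng.normal(size=(8, 3))) for _ in range(2)]
infra_feats = [lightweight_extract(rng.normal(size=(8, 3)))]
score = central_backbone_and_head(two_stream_fusion(vehicle_feats, infra_feats))
```

Note that only the compact per-node features cross the network in Phase 2, which is what keeps the system-wide communication cost low relative to sharing raw sensor data.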