Streaming perception is a fundamental task in autonomous driving that requires a careful balance between the latency and accuracy of the driving system. However, current methods for streaming perception are limited in that they rely only on the current frame and its adjacent frame to learn movement patterns, which restricts their ability to model complex scenes and often leads to poor detection results. To address this limitation, we propose LongShortNet, a novel dual-path network that captures long-term temporal motion and integrates it with short-term spatial semantics for real-time perception. To our knowledge, LongShortNet is the first work to extend long-term temporal modeling to streaming perception, enabling spatiotemporal feature fusion. We evaluate LongShortNet on the challenging Argoverse-HD dataset and demonstrate that it outperforms existing state-of-the-art methods with almost no additional computational cost.
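To make the dual-path idea concrete, below is a minimal PyTorch-style sketch of how a long-term temporal path over several past frames could be fused with a short-term spatial path over the current frame. All module names, channel sizes, and the 1x1-convolution fusion are illustrative assumptions, not the paper's exact LongShortNet components.

```python
import torch
import torch.nn as nn


class DualPathFusion(nn.Module):
    """Sketch of a long/short dual-path fusion block (hypothetical names).

    The short path keeps the spatial semantics of the current frame, while
    the long path aggregates backbone features from several past frames to
    capture longer-term motion; the two are fused channel-wise.
    """

    def __init__(self, channels: int = 256, num_past_frames: int = 3):
        super().__init__()
        self.num_past_frames = num_past_frames
        # Short path: lightweight projection of the current-frame features.
        self.short_proj = nn.Conv2d(channels, channels, kernel_size=1)
        # Long path: compress the stacked past-frame features back to `channels`.
        self.long_proj = nn.Conv2d(channels * num_past_frames, channels, kernel_size=1)
        # Fusion: merge the two paths into a single feature map.
        self.fuse = nn.Conv2d(channels * 2, channels, kernel_size=1)

    def forward(self, current_feat: torch.Tensor, past_feats: list) -> torch.Tensor:
        # current_feat: (B, C, H, W) features of the latest frame.
        # past_feats: list of num_past_frames tensors, each (B, C, H, W).
        short = self.short_proj(current_feat)
        long = self.long_proj(torch.cat(past_feats, dim=1))
        return self.fuse(torch.cat([short, long], dim=1))


if __name__ == "__main__":
    # Toy usage: one current frame plus three past frames of backbone features.
    block = DualPathFusion(channels=256, num_past_frames=3)
    cur = torch.randn(1, 256, 20, 32)
    past = [torch.randn(1, 256, 20, 32) for _ in range(3)]
    print(block(cur, past).shape)  # torch.Size([1, 256, 20, 32])
```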