FESSTA:通过空间-时时注意对景点云进行流动估计 (FESTA: Flow Estimation via Spatial-Temporal Attention for Scene Point Clouds) - 专知论文

会员服务 ·

0

估计/估计量 · 点云 · 注意力机制 · Extensibility · 层 ·

2021 年 12 月 6 日

FESTA: Flow Estimation via Spatial-Temporal Attention for Scene Point Clouds

翻译：FESSTA:通过空间-时时注意对景点云进行流动估计

Haiyan Wang,Jiahao Pang,Muhammad A. Lodhi,Yingli Tian,Dong Tian

from arxiv, Accepted at CVPR 2021 (Oral Presentation)

Scene flow depicts the dynamics of a 3D scene, which is critical for various applications such as autonomous driving, robot navigation, AR/VR, etc. Conventionally, scene flow is estimated from dense/regular RGB video frames. With the development of depth-sensing technologies, precise 3D measurements are available via point clouds which have sparked new research in 3D scene flow. Nevertheless, it remains challenging to extract scene flow from point clouds due to the sparsity and irregularity in typical point cloud sampling patterns. One major issue related to irregular sampling is identified as the randomness during point set abstraction/feature extraction -- an elementary process in many flow estimation scenarios. A novel Spatial Abstraction with Attention (SA^2) layer is accordingly proposed to alleviate the unstable abstraction problem. Moreover, a Temporal Abstraction with Attention (TA^2) layer is proposed to rectify attention in temporal domain, leading to benefits with motions scaled in a larger range. Extensive analysis and experiments verified the motivation and significant performance gains of our method, dubbed as Flow Estimation via Spatial-Temporal Attention (FESTA), when compared to several state-of-the-art benchmarks of scene flow estimation.

翻译：3D场景的动态显示3D场景的动态。 3D场景对于自主驱动、机器人导航、AR/VR等各种应用至关重要。从公约角度讲,现场流动是从密集/常规RGB视频框中估计出来的。随着深度遥感技术的发展,可以通过点云进行精确的3D场景测量,这在3D场景流中引发了新的研究。然而,由于典型点云采样模式的广度和不规则性,从点云中提取场景流仍然具有挑战性。与非常规取样相关的一个主要问题是定点抽取/速度提取过程中的随机性 -- -- 在许多流量估计情景中,这是一个基本过程。因此,提出了新的关注空间摘要(SA2)层,以缓解不稳定的抽取问题。此外,还提出了关注的时空抽象层(TA2),以纠正时空空间空间-时空注意(FESTA)的动态基准,从而在更大范围上缩小了运动的好处。广泛的分析和实验验证了我们方法的动机和显著的绩效收益。

0

相关内容

估计/估计量

估计/估计量

“CVPR 2021 接受论文列表 1663篇论文都在这了

专知会员服务

32+阅读 · 2021年6月12日

CVPR 2021 Oral | 室内动态场景中的相机重定位

CVPR 2021 Oral | 室内动态场景中的相机重定位

专知会员服务

16+阅读 · 2021年4月12日

【KDD2020】动态图的拉普拉斯变换点检测，Laplacian Change Point Detection for Dynamic Graphs

【KDD2020】动态图的拉普拉斯变换点检测，Laplacian Change Point Detection for Dynamic Graphs

专知会员服务

38+阅读 · 2020年7月3日

【三维物体和手部姿态估计】综述论文最新进展，Recent Advances in 3D Object and Hand Pose Estimation

【三维物体和手部姿态估计】综述论文最新进展，Recent Advances in 3D Object and Hand Pose Estimation

专知会员服务

21+阅读 · 2020年6月13日

CVPR 2020 论文开源项目合集

专知会员服务

110+阅读 · 2020年3月12日

基于动态时空图CNNs的交通流预测，Dynamic Spatio-temporal Graph-based CNNs for Traffic Flow Prediction

基于动态时空图CNNs的交通流预测，Dynamic Spatio-temporal Graph-based CNNs for Traffic Flow Prediction

专知会员服务

136+阅读 · 2020年3月8日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

【新书】Python编程基础，669页pdf

【新书】Python编程基础，669页pdf

专知会员服务

197+阅读 · 2019年10月10日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【泡泡汇总】CVPR2019 SLAM Paperlist

【泡泡汇总】CVPR2019 SLAM Paperlist

泡泡机器人SLAM

14+阅读 · 2019年6月12日

【泡泡一分钟】高动态环境的语义单目SLAM

【泡泡一分钟】高动态环境的语义单目SLAM

泡泡机器人SLAM

5+阅读 · 2019年3月27日

【泡泡一分钟】LIMO：激光和单目相机融合的视觉里程计

【泡泡一分钟】LIMO：激光和单目相机融合的视觉里程计

泡泡机器人SLAM

12+阅读 · 2019年1月16日

TCN v2 + 3Dconv 运动信息

TCN v2 + 3Dconv 运动信息

CreateAMind

4+阅读 · 2019年1月8日

【泡泡一分钟】RNFNet: 用于室内语义分割的RGB-D多层级残差特征融合（ICCV2017-523）

【泡泡一分钟】RNFNet: 用于室内语义分割的RGB-D多层级残差特征融合（ICCV2017-523）

泡泡机器人SLAM

10+阅读 · 2018年12月21日

【泡泡一分钟】一种利用点云数据建模城市场景的方法(ICCV2017-403)

【泡泡一分钟】一种利用点云数据建模城市场景的方法(ICCV2017-403)

泡泡机器人SLAM

3+阅读 · 2018年11月6日

【泡泡一分钟】ProbFlow:联合光流和不确定性估计

【泡泡一分钟】ProbFlow:联合光流和不确定性估计

泡泡机器人SLAM

3+阅读 · 2018年10月26日

【泡泡点云时空】基于增量分割的3D点云定位方法（ICRA2018-4）

【泡泡点云时空】基于增量分割的3D点云定位方法（ICRA2018-4）

泡泡机器人SLAM

13+阅读 · 2018年10月7日

人体姿态估计资源大列表（Human Pose Estimation）

人体姿态估计资源大列表（Human Pose Estimation）

专知

9+阅读 · 2018年10月6日

视觉机械臂 visual-pushing-grasping

视觉机械臂 visual-pushing-grasping

CreateAMind

3+阅读 · 2018年5月25日

Geometric Transformer for Fast and Robust Point Cloud Registration

Arxiv

1+阅读 · 2022年2月14日

Gravity Estimation at Small Bodies via Optical Tracking of Hopping Artificial Probes

Arxiv

0+阅读 · 2022年2月13日

Video Autoencoder: self-supervised disentanglement of static 3D structure and motion

Arxiv

5+阅读 · 2021年10月6日

MultiBodySync: Multi-Body Segmentation and Motion Estimation via 3D Scan Synchronization

Arxiv

4+阅读 · 2021年1月17日

ASLFeat: Learning Local Features of Accurate Shape and Localization

Arxiv

6+阅读 · 2020年3月23日

Review: deep learning on 3D point clouds

Review: deep learning on 3D point clouds

Arxiv

5+阅读 · 2020年1月17日

Deep Learning for 3D Point Clouds: A Survey

Deep Learning for 3D Point Clouds: A Survey

Arxiv

3+阅读 · 2019年12月27日

SelfVIO: Self-Supervised Deep Monocular Visual-Inertial Odometry and Depth Estimation

Arxiv

5+阅读 · 2019年11月22日

Progressive Sparse Local Attention for Video object detection

Arxiv

4+阅读 · 2019年3月21日

3D Hand Shape and Pose Estimation from a Single RGB Image

3D Hand Shape and Pose Estimation from a Single RGB Image

Arxiv

17+阅读 · 2019年3月3日

VIP会员

文章信息

相关主题

估计/估计量

注意力机制

相关VIP内容

“CVPR 2021 接受论文列表 1663篇论文都在这了

专知会员服务

32+阅读 · 2021年6月12日

CVPR 2021 Oral | 室内动态场景中的相机重定位

CVPR 2021 Oral | 室内动态场景中的相机重定位

专知会员服务

16+阅读 · 2021年4月12日

【KDD2020】动态图的拉普拉斯变换点检测，Laplacian Change Point Detection for Dynamic Graphs

【KDD2020】动态图的拉普拉斯变换点检测，Laplacian Change Point Detection for Dynamic Graphs

专知会员服务

38+阅读 · 2020年7月3日

【三维物体和手部姿态估计】综述论文最新进展，Recent Advances in 3D Object and Hand Pose Estimation

【三维物体和手部姿态估计】综述论文最新进展，Recent Advances in 3D Object and Hand Pose Estimation

专知会员服务

21+阅读 · 2020年6月13日

CVPR 2020 论文开源项目合集

专知会员服务

110+阅读 · 2020年3月12日

基于动态时空图CNNs的交通流预测，Dynamic Spatio-temporal Graph-based CNNs for Traffic Flow Prediction

基于动态时空图CNNs的交通流预测，Dynamic Spatio-temporal Graph-based CNNs for Traffic Flow Prediction

专知会员服务

136+阅读 · 2020年3月8日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

【新书】Python编程基础，669页pdf

【新书】Python编程基础，669页pdf

专知会员服务

197+阅读 · 2019年10月10日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

大语言模型幻觉：系统综述

《分析与预测陆军战斗体能测试表现：统计与机器学习方法》2025最新137页

【博士论文】数据与任务的物理学：深度学习中的局部性与组合性理论

代理式人工智能时代的决策优势

相关资讯

【泡泡汇总】CVPR2019 SLAM Paperlist

【泡泡汇总】CVPR2019 SLAM Paperlist

泡泡机器人SLAM

14+阅读 · 2019年6月12日

【泡泡一分钟】高动态环境的语义单目SLAM

【泡泡一分钟】高动态环境的语义单目SLAM

泡泡机器人SLAM

5+阅读 · 2019年3月27日

【泡泡一分钟】LIMO：激光和单目相机融合的视觉里程计

【泡泡一分钟】LIMO：激光和单目相机融合的视觉里程计

泡泡机器人SLAM

12+阅读 · 2019年1月16日

TCN v2 + 3Dconv 运动信息

TCN v2 + 3Dconv 运动信息

CreateAMind

4+阅读 · 2019年1月8日

【泡泡一分钟】RNFNet: 用于室内语义分割的RGB-D多层级残差特征融合（ICCV2017-523）

【泡泡一分钟】RNFNet: 用于室内语义分割的RGB-D多层级残差特征融合（ICCV2017-523）

泡泡机器人SLAM

10+阅读 · 2018年12月21日

【泡泡一分钟】一种利用点云数据建模城市场景的方法(ICCV2017-403)

【泡泡一分钟】一种利用点云数据建模城市场景的方法(ICCV2017-403)

泡泡机器人SLAM

3+阅读 · 2018年11月6日

【泡泡一分钟】ProbFlow:联合光流和不确定性估计

【泡泡一分钟】ProbFlow:联合光流和不确定性估计

泡泡机器人SLAM

3+阅读 · 2018年10月26日

【泡泡点云时空】基于增量分割的3D点云定位方法（ICRA2018-4）

【泡泡点云时空】基于增量分割的3D点云定位方法（ICRA2018-4）

泡泡机器人SLAM

13+阅读 · 2018年10月7日

人体姿态估计资源大列表（Human Pose Estimation）

人体姿态估计资源大列表（Human Pose Estimation）

专知

9+阅读 · 2018年10月6日

视觉机械臂 visual-pushing-grasping

视觉机械臂 visual-pushing-grasping

CreateAMind

3+阅读 · 2018年5月25日

相关论文

Geometric Transformer for Fast and Robust Point Cloud Registration

Arxiv

1+阅读 · 2022年2月14日

Gravity Estimation at Small Bodies via Optical Tracking of Hopping Artificial Probes

Arxiv

0+阅读 · 2022年2月13日

Video Autoencoder: self-supervised disentanglement of static 3D structure and motion

Arxiv

5+阅读 · 2021年10月6日

MultiBodySync: Multi-Body Segmentation and Motion Estimation via 3D Scan Synchronization

Arxiv

4+阅读 · 2021年1月17日

ASLFeat: Learning Local Features of Accurate Shape and Localization

Arxiv

6+阅读 · 2020年3月23日

Review: deep learning on 3D point clouds

Review: deep learning on 3D point clouds

Arxiv

5+阅读 · 2020年1月17日

Deep Learning for 3D Point Clouds: A Survey

Deep Learning for 3D Point Clouds: A Survey

Arxiv

3+阅读 · 2019年12月27日

SelfVIO: Self-Supervised Deep Monocular Visual-Inertial Odometry and Depth Estimation

Arxiv

5+阅读 · 2019年11月22日

Progressive Sparse Local Attention for Video object detection

Arxiv

4+阅读 · 2019年3月21日

3D Hand Shape and Pose Estimation from a Single RGB Image

3D Hand Shape and Pose Estimation from a Single RGB Image

Arxiv

17+阅读 · 2019年3月3日

微信扫码咨询专知VIP会员