AFT-VO: Asynchronous Fusion Transformers for Multi-View Visual Odometry Estimation - 专知论文


Estimation/Estimator · Mixture Density Network · Sensor · Transform · Performer

16 September 2022

AFT-VO: Asynchronous Fusion Transformers for Multi-View Visual Odometry Estimation


Nimet Kaygusuz, Oscar Mendez, Richard Bowden

Motion estimation approaches typically employ sensor fusion techniques, such as the Kalman Filter, to handle individual sensor failures. More recently, deep learning-based fusion approaches have been proposed, improving performance and requiring fewer model-specific implementations. However, current deep fusion approaches often assume that sensors are synchronised, which is not always practical, especially for low-cost hardware. To address this limitation, in this work, we propose AFT-VO, a novel transformer-based sensor fusion architecture to estimate VO from multiple sensors. Our framework combines predictions from asynchronous multi-view cameras and accounts for the time discrepancies of measurements coming from different sources. Our approach first employs a Mixture Density Network (MDN) to estimate the probability distributions of the 6-DoF poses for every camera in the system. Then a novel transformer-based fusion module, AFT-VO, is introduced, which combines these asynchronous pose estimations, along with their confidences. More specifically, we introduce Discretiser and Source Encoding techniques which enable the fusion of multi-source asynchronous signals. We evaluate our approach on the popular nuScenes and KITTI datasets. Our experiments demonstrate that multi-view fusion for VO estimation provides robust and accurate trajectories, outperforming the state of the art in both challenging weather and lighting conditions.
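
The abstract says each camera's pose is estimated as a probability distribution by a Mixture Density Network rather than as a single point. The following is a minimal sketch of what such an output could look like numerically, assuming a Gaussian mixture with diagonal covariance over a 6-DoF pose vector. The abstract does not specify the parameterisation, so this assumption, the function names, and the example values are illustrative only and are not the authors' implementation.

```python
import math

def gaussian_log_pdf(x, mean, sigma):
    """Log density of a diagonal-covariance Gaussian evaluated at vector x."""
    return sum(
        -0.5 * math.log(2.0 * math.pi) - math.log(s) - 0.5 * ((xi - m) / s) ** 2
        for xi, m, s in zip(x, mean, sigma)
    )

def mixture_nll(x, weights, means, sigmas):
    """Negative log-likelihood of x under a Gaussian mixture.

    Uses the log-sum-exp trick so that very small component
    likelihoods do not underflow to zero.
    """
    logs = [
        math.log(w) + gaussian_log_pdf(x, m, s)
        for w, m, s in zip(weights, means, sigmas)
    ]
    top = max(logs)
    return -(top + math.log(sum(math.exp(v - top) for v in logs)))

def mixture_mean(weights, means):
    """Expected pose: the weight-averaged component means."""
    dim = len(means[0])
    return [sum(w * m[d] for w, m in zip(weights, means)) for d in range(dim)]

# Hypothetical two-component mixture over a 6-DoF pose
# (3 translation + 3 rotation parameters); values are made up.
weights = [0.75, 0.25]
means = [[0.0] * 6, [4.0] * 6]
sigmas = [[1.0] * 6, [2.0] * 6]

expected_pose = mixture_mean(weights, means)
nll_at_origin = mixture_nll([0.0] * 6, weights, means, sigmas)
```

In a setup like this, the component spreads (`sigmas`) and mixture weights are what would carry the per-camera confidence that a downstream fusion stage could take into account.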

