There is a growing interest in more intelligent natural user interaction with the car. Hand gestures and speech are already being applied for driver-car interaction. Moreover, multimodal approaches are also showing promise in the automotive industry. In this paper, we utilize deep learning in a multimodal fusion network for referencing objects outside the vehicle. We use features from gaze, head pose, and finger pointing simultaneously to precisely predict the referenced objects in different car poses. We demonstrate the practical limitations of each modality when used for a natural form of referencing, specifically inside the car. As evident from our results, we overcome the modality-specific limitations, to a large extent, by adding other modalities. This work highlights the importance of multimodal sensing, especially when moving towards natural user interaction. Furthermore, our user-based analysis shows noteworthy differences in the recognition of user behavior depending upon the vehicle pose.
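The abstract does not specify the fusion architecture itself; as a rough illustration of the general idea, the following is a minimal late-fusion sketch in PyTorch, in which the per-modality encoders, all layer sizes, the feature dimensions, and the number of candidate objects are hypothetical placeholders rather than the paper's actual network.

```python
# Minimal sketch of late fusion over gaze, head pose, and finger pointing.
# All dimensions and layer choices below are assumptions for illustration.
import torch
import torch.nn as nn

class MultimodalFusionNet(nn.Module):
    def __init__(self, gaze_dim=3, head_dim=3, pointing_dim=3,
                 hidden_dim=64, num_objects=10):
        super().__init__()
        # One small encoder per modality.
        self.gaze_enc = nn.Sequential(nn.Linear(gaze_dim, hidden_dim), nn.ReLU())
        self.head_enc = nn.Sequential(nn.Linear(head_dim, hidden_dim), nn.ReLU())
        self.point_enc = nn.Sequential(nn.Linear(pointing_dim, hidden_dim), nn.ReLU())
        # Fusion by concatenation, then a classification head over the
        # candidate outside-vehicle objects.
        self.classifier = nn.Sequential(
            nn.Linear(3 * hidden_dim, hidden_dim),
            nn.ReLU(),
            nn.Linear(hidden_dim, num_objects),
        )

    def forward(self, gaze, head, pointing):
        fused = torch.cat(
            [self.gaze_enc(gaze), self.head_enc(head), self.point_enc(pointing)],
            dim=-1,
        )
        return self.classifier(fused)  # logits over referenced objects

# Example: a batch of 4 frames, each modality given as a 3-D direction vector.
net = MultimodalFusionNet()
logits = net(torch.randn(4, 3), torch.randn(4, 3), torch.randn(4, 3))
print(logits.shape)  # torch.Size([4, 10])
```

Because the modalities are encoded separately before fusion, a sketch like this can still produce a prediction when one modality is degraded, which is consistent with the abstract's claim that adding modalities compensates for modality-specific limitations.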