Direct source and early reflections localization using deep deconvolution network under reverberant environment - 专知论文


Deconvolution network · Directed · Networking · Round · Covariance matrix

October 22, 2021


Shan Gao, Xihong Wu, Tianshu Qu

This paper proposes a deconvolution-based network (DCNN) model for direction-of-arrival (DOA) estimation of the direct source and early reflections in reverberant scenarios. Because the first-order reflections of a sound source carry spatial directivity just as the direct source does, we treat both as sources in the learning process. The input feature of the network is the covariance matrix of higher-order Ambisonics (HOA) signals in the time domain, which is concise yet retains precise spatial information under reverberation. The deconvolution-based network reconstructs the spatial pseudo-spectrum (SPS) in 2D polar space, so that the spatial relationship between elevation and azimuth can be depicted. A series of experiments on simulated and measured data under different reverberant scenarios demonstrates the robustness and accuracy of the proposed DCNN model.
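The two ingredients named in the abstract, a time-domain covariance matrix of Ambisonics signals and a pseudo-spectrum over azimuth and elevation, can be illustrated with a minimal sketch. It rests on several assumptions that are not taken from the paper: first-order Ambisonics (4 channels, ordered W, X, Y, Z, unnormalized) instead of higher-order signals, a single white-noise plane-wave source, and a plain steered-power map standing in for the SPS that the paper reconstructs with a network. All function names are hypothetical.

```python
import math
import random


def foa_encoding(az, el):
    """First-order Ambisonics encoding vector [W, X, Y, Z] for a plane wave.

    Assumption: unnormalized components; angles are in radians.
    """
    return [
        1.0,
        math.cos(az) * math.cos(el),
        math.sin(az) * math.cos(el),
        math.sin(el),
    ]


def simulate(sources, num_samples=500, seed=0):
    """Mix plane waves; each source is (azimuth, elevation) with its own white noise."""
    rng = random.Random(seed)
    frames = []
    for _ in range(num_samples):
        x = [0.0] * 4
        for az, el in sources:
            s = rng.gauss(0.0, 1.0)
            y = foa_encoding(az, el)
            for c in range(4):
                x[c] += s * y[c]
        frames.append(x)
    return frames


def covariance(frames):
    """Time-domain covariance R = (1/T) * sum_t x_t x_t^T over multichannel samples."""
    n = len(frames[0])
    total = len(frames)
    R = [[0.0] * n for _ in range(n)]
    for x in frames:
        for i in range(n):
            for j in range(n):
                R[i][j] += x[i] * x[j] / total
    return R


def steered_power(R, az, el):
    """Quadratic form y^T R y: a conventional beamforming map, not the learned SPS."""
    y = foa_encoding(az, el)
    return sum(y[i] * R[i][j] * y[j] for i in range(4) for j in range(4))


def peak_direction(R, step_deg=10):
    """Scan an azimuth-elevation grid and return the (az, el) of maximum power in degrees."""
    best = None
    for az_deg in range(-180, 180, step_deg):
        for el_deg in range(-90, 91, step_deg):
            p = steered_power(R, math.radians(az_deg), math.radians(el_deg))
            if best is None or p > best[0]:
                best = (p, az_deg, el_deg)
    return best[1], best[2]


if __name__ == "__main__":
    frames = simulate([(math.radians(40), math.radians(20))])
    R = covariance(frames)
    print(peak_direction(R))  # (40, 20)
```

The covariance matrix has a fixed size (channels by channels) regardless of the frame length, which is consistent with the abstract's description of the feature as concise. For a single source the matrix is rank one, so the steered-power map peaks at the source direction.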

