使用多多边通道语音记录器进行盲人房间参数估计 (Blind Room Parameter Estimation Using Multiple-Multichannel Speech Recordings) - 专知论文

会员服务 ·

0

估计/估计量 · Microsoft Surface · 可约的 · 估计误差 · Neural Networks ·

2021 年 7 月 29 日

Blind Room Parameter Estimation Using Multiple-Multichannel Speech Recordings

翻译：使用多多边通道语音记录器进行盲人房间参数估计

Prerak Srivastava,Antoine Deleforge,Emmanuel Vincent

from arxiv, Accepted In WASPAA 2021 ( IEEE Workshop on Applications of Signal Processing to Audio and Acoustics )

Knowing the geometrical and acoustical parameters of a room may benefit applications such as audio augmented reality, speech dereverberation or audio forensics. In this paper, we study the problem of jointly estimating the total surface area, the volume, as well as the frequency-dependent reverberation time and mean surface absorption of a room in a blind fashion, based on two-channel noisy speech recordings from multiple, unknown source-receiver positions. A novel convolutional neural network architecture leveraging both single- and inter-channel cues is proposed and trained on a large, realistic simulated dataset. Results on both simulated and real data show that using multiple observations in one room significantly reduces estimation errors and variances on all target quantities, and that using two channels helps the estimation of surface and volume. The proposed model outperforms a recently proposed blind volume estimation method on the considered datasets.

翻译：了解一个房间的几何和声学参数可能有益于应用,如音频增强现实、语音偏差或音频法证等。在本文件中,我们研究了根据多个未知源接收器位置的双声道噪音录音,以盲目方式共同估计一个房间的总面积、体积、以及视频率而异的时间和平均表面吸收率的问题。提出了一个新的利用单一和跨频道信号的神经神经网络结构,并在一个大型、现实的模拟数据集方面进行了培训。模拟和真实数据的结果表明,使用一个房间的多次观测可以大大减少所有目标数量的估计误差和差异,并且使用两个渠道有助于估计表层和体积。拟议的模型比最近提议的关于考虑的数据集的盲体估计方法要强。

0

相关内容

估计/估计量

估计/估计量

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

54+阅读 · 2020年9月7日

【文献综述】Text Detection and Recognition in the Wild: A Review 自然文本检测与识别

【文献综述】Text Detection and Recognition in the Wild: A Review 自然文本检测与识别

专知会员服务

46+阅读 · 2020年6月11日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

【Yoshua Bengio新论文】多任务自监督学习语音识别，MULTI-TASK SELF-SUPERVISED LEARNING FOR ROBUST SPEECH RECOGNITION

【Yoshua Bengio新论文】多任务自监督学习语音识别，MULTI-TASK SELF-SUPERVISED LEARNING FOR ROBUST SPEECH RECOGNITION

专知会员服务

39+阅读 · 2020年1月30日

【深度估计| 2019最新综述】单目深度估计方法综述（Monocular Depth Estimation: A Survey）

专知会员服务

69+阅读 · 2019年11月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

MIT新书《强化学习与最优控制》

MIT新书《强化学习与最优控制》

专知会员服务

281+阅读 · 2019年10月9日

IEEE | DSC 2019诚邀稿件 (EI检索)

IEEE | DSC 2019诚邀稿件 (EI检索)

Call4Papers

10+阅读 · 2019年2月25日

【泡泡一分钟】LIMO：激光和单目相机融合的视觉里程计

【泡泡一分钟】LIMO：激光和单目相机融合的视觉里程计

泡泡机器人SLAM

13+阅读 · 2019年1月16日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

【泡泡前沿追踪】跟踪SLAM前沿动态系列之IROS2018

【泡泡前沿追踪】跟踪SLAM前沿动态系列之IROS2018

泡泡机器人SLAM

29+阅读 · 2018年10月28日

人工智能 | ICAPS 2019等国际会议信息3条

人工智能 | ICAPS 2019等国际会议信息3条

Call4Papers

3+阅读 · 2018年9月28日

【泡泡机器人】ECCV2018之SLAM最新前沿动态（附文章链接和代码链接）

【泡泡机器人】ECCV2018之SLAM最新前沿动态（附文章链接和代码链接）

泡泡机器人SLAM

38+阅读 · 2018年9月23日

【泡泡一分钟】SfM-Net：从视频中学习结构和运动

【泡泡一分钟】SfM-Net：从视频中学习结构和运动

泡泡机器人SLAM

9+阅读 · 2018年5月29日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【CV-Pose estimation】王晓刚教授团队论文PyraNet阅读笔记

【CV-Pose estimation】王晓刚教授团队论文PyraNet阅读笔记

极市平台

6+阅读 · 2017年12月16日

【计算机类】期刊专刊/国际会议截稿信息6条

【计算机类】期刊专刊/国际会议截稿信息6条

Call4Papers

3+阅读 · 2017年10月13日

A Universal Deep Room Acoustics Estimator

Arxiv

0+阅读 · 2021年9月29日

FastCorrect 2: Fast Error Correction on Multiple Candidates for Automatic Speech Recognition

FastCorrect 2: Fast Error Correction on Multiple Candidates for Automatic Speech Recognition

Arxiv

0+阅读 · 2021年9月29日

Joint magnitude estimation and phase recovery using Cyle-in-cycle GAN for non-parallel speech enhancement

Arxiv

0+阅读 · 2021年9月26日

Efficient Force Estimation for Continuum Robot

Arxiv

0+阅读 · 2021年9月26日

Parameterized Channel Normalization for Far-field Deep Speaker Verification

Parameterized Channel Normalization for Far-field Deep Speaker Verification

Arxiv

0+阅读 · 2021年9月24日

Multi-View Video-Based 3D Hand Pose Estimation

Arxiv

0+阅读 · 2021年9月24日

Self-supervised Monocular Depth Estimation for All Day Images using Domain Separation

Arxiv

7+阅读 · 2021年8月17日

Learning to Estimate Pose and Shape of Hand-Held Objects from RGB Images

Learning to Estimate Pose and Shape of Hand-Held Objects from RGB Images

Arxiv

5+阅读 · 2019年3月8日

3D Hand Shape and Pose Estimation from a Single RGB Image

3D Hand Shape and Pose Estimation from a Single RGB Image

Arxiv

17+阅读 · 2019年3月3日

Improved Speech Enhancement with the Wave-U-Net

Arxiv

8+阅读 · 2018年11月27日

VIP会员

文章信息

相关主题

估计/估计量

Microsoft Surface

Neural Networks

相关VIP内容

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

54+阅读 · 2020年9月7日

【文献综述】Text Detection and Recognition in the Wild: A Review 自然文本检测与识别

【文献综述】Text Detection and Recognition in the Wild: A Review 自然文本检测与识别

专知会员服务

46+阅读 · 2020年6月11日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

【Yoshua Bengio新论文】多任务自监督学习语音识别，MULTI-TASK SELF-SUPERVISED LEARNING FOR ROBUST SPEECH RECOGNITION

【Yoshua Bengio新论文】多任务自监督学习语音识别，MULTI-TASK SELF-SUPERVISED LEARNING FOR ROBUST SPEECH RECOGNITION

专知会员服务

39+阅读 · 2020年1月30日

【深度估计| 2019最新综述】单目深度估计方法综述（Monocular Depth Estimation: A Survey）

专知会员服务

69+阅读 · 2019年11月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

MIT新书《强化学习与最优控制》

MIT新书《强化学习与最优控制》

专知会员服务

281+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【新书】《知识图谱与大语言模型的协同应用》，544页pdf

军事通信系统：安全行动的支柱

《缓解大语言模型（LLMs）幻觉：面向应用的检索增强生成（RAG）、推理与智能体系统综述》

【新书】机器学习系统，2620页pdf

相关资讯

IEEE | DSC 2019诚邀稿件 (EI检索)

IEEE | DSC 2019诚邀稿件 (EI检索)

Call4Papers

10+阅读 · 2019年2月25日

【泡泡一分钟】LIMO：激光和单目相机融合的视觉里程计

【泡泡一分钟】LIMO：激光和单目相机融合的视觉里程计

泡泡机器人SLAM

13+阅读 · 2019年1月16日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

【泡泡前沿追踪】跟踪SLAM前沿动态系列之IROS2018

【泡泡前沿追踪】跟踪SLAM前沿动态系列之IROS2018

泡泡机器人SLAM

29+阅读 · 2018年10月28日

人工智能 | ICAPS 2019等国际会议信息3条

人工智能 | ICAPS 2019等国际会议信息3条

Call4Papers

3+阅读 · 2018年9月28日

【泡泡机器人】ECCV2018之SLAM最新前沿动态（附文章链接和代码链接）

【泡泡机器人】ECCV2018之SLAM最新前沿动态（附文章链接和代码链接）

泡泡机器人SLAM

38+阅读 · 2018年9月23日

【泡泡一分钟】SfM-Net：从视频中学习结构和运动

【泡泡一分钟】SfM-Net：从视频中学习结构和运动

泡泡机器人SLAM

9+阅读 · 2018年5月29日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【CV-Pose estimation】王晓刚教授团队论文PyraNet阅读笔记

【CV-Pose estimation】王晓刚教授团队论文PyraNet阅读笔记

极市平台

6+阅读 · 2017年12月16日

【计算机类】期刊专刊/国际会议截稿信息6条

【计算机类】期刊专刊/国际会议截稿信息6条

Call4Papers

3+阅读 · 2017年10月13日

相关论文

A Universal Deep Room Acoustics Estimator

Arxiv

0+阅读 · 2021年9月29日

FastCorrect 2: Fast Error Correction on Multiple Candidates for Automatic Speech Recognition

FastCorrect 2: Fast Error Correction on Multiple Candidates for Automatic Speech Recognition

Arxiv

0+阅读 · 2021年9月29日

Joint magnitude estimation and phase recovery using Cyle-in-cycle GAN for non-parallel speech enhancement

Arxiv

0+阅读 · 2021年9月26日

Efficient Force Estimation for Continuum Robot

Arxiv

0+阅读 · 2021年9月26日

Parameterized Channel Normalization for Far-field Deep Speaker Verification

Parameterized Channel Normalization for Far-field Deep Speaker Verification

Arxiv

0+阅读 · 2021年9月24日

Multi-View Video-Based 3D Hand Pose Estimation

Arxiv

0+阅读 · 2021年9月24日

Self-supervised Monocular Depth Estimation for All Day Images using Domain Separation

Arxiv

7+阅读 · 2021年8月17日

Learning to Estimate Pose and Shape of Hand-Held Objects from RGB Images

Learning to Estimate Pose and Shape of Hand-Held Objects from RGB Images

Arxiv

5+阅读 · 2019年3月8日

3D Hand Shape and Pose Estimation from a Single RGB Image

3D Hand Shape and Pose Estimation from a Single RGB Image

Arxiv

17+阅读 · 2019年3月3日

Improved Speech Enhancement with the Wave-U-Net

Arxiv

8+阅读 · 2018年11月27日

微信扫码咨询专知VIP会员