使用参数化二进制 CDR 估计器改进听音器的空间提示 (Improving spatial cues for hearables using a parameterized binaural CDR estimator) - 专知论文

会员服务 ·

0

估计/估计量 · 回合 · 语音增强 · INFORMS · Performer ·

2022 年 7 月 17 日

Improving spatial cues for hearables using a parameterized binaural CDR estimator

翻译：使用参数化二进制 CDR 估计器改进听音器的空间提示

Reza Ghanavi,Craig Jin

from arxiv, Accepted by ICA2022. An Australian provisional patent application based on this manuscript has been filed by the University of Sydney

We investigate a speech enhancement method based on the binaural coherence-to-diffuse power ratio (CDR), which preserves auditory spatial cues for maskers and a broadside target. Conventional CDR estimators typically rely on a mathematical coherence model of the desired signal and/or diffuse noise field in their formulation, which may influence their accuracy in natural environments. This work proposes a new robust and parameterized directional binaural CDR estimator. The estimator is calculated in the time-frequency domain and is based on a geometrical interpretation of the spatial coherence function between the binaural microphone signals. The binaural performance of the new CDR estimator is compared with three state-of-the-art CDR estimators in cocktail-party-like environments and has shown improvements in terms of several objective speech quality metrics such as PESQ and SRMR. We also discuss the benefits of the parameterizable CDR estimator for varying sound environments and briefly reflect on several informal subjective evaluations using a low-latency real-time framework.

翻译：我们调查了一种基于二进制一致性到阻断功率比(CDR)的语音增强方法,这种方法为掩码器和宽边目标保留了听觉空间提示。常规CDR估计器通常依赖一个预想信号和/或扩散噪音场的数学一致性模型,这可能会影响其在自然环境中的准确性。这项工作提出了一个新的稳健和参数化方向性双进制CDR估计器。估计器是在时频域内计算出来的,并且基于对双进式麦克风信号之间的空间一致性功能的几何学解释。新的CDR估计器的二进制性能与三个在像鸡尾酒党的环境中最先进的CDR估计器相比较,在诸如PESQ和SRMR等若干客观的语音质量指标方面显示出改进。我们还讨论了可参数化的CDR估计器对不同声音环境的好处,并简要思考了使用低时实时框架进行的若干非正式的主观评价。

0

相关内容

估计/估计量

估计/估计量

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

《DeepGCNs: Making GCNs Go as Deep as CNNs》

《DeepGCNs: Making GCNs Go as Deep as CNNs》

专知会员服务

31+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

中国图象图形学学会CSIG

1+阅读 · 2021年11月11日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

内质网Ca2+感受器STIM1调控糖尿病冠状动脉平滑肌细胞表型转化的机制

国家自然科学基金

0+阅读 · 2014年12月31日

NatD调节Slug基因表达促进肺癌细胞上皮间质转化

国家自然科学基金

0+阅读 · 2014年12月31日

动脉粥样硬化中oxLDL/CD36受体介导VSMC炎症表型转化的作用和机制

国家自然科学基金

0+阅读 · 2014年12月31日

蛋白激酶ERK1/2介导GLP-1改善胰岛β细胞功能障碍作用研究

国家自然科学基金

0+阅读 · 2013年12月31日

液相还原法制备Heusler合金纳米颗粒及其结构和性能研究

国家自然科学基金

0+阅读 · 2012年12月31日

柽柳Dof转录因子的耐盐调控机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

Co基磁性Heusler合金相关体系相图与化合物的结构与性能研究

国家自然科学基金

0+阅读 · 2012年12月31日

Apelin减轻糖尿病缺血心肌易损性的作用及机制：改善心肌胰岛素敏感性

国家自然科学基金

0+阅读 · 2012年12月31日

新型稀土金属硼杂苯化合物化学

国家自然科学基金

0+阅读 · 2012年12月31日

Treg细胞对Th1、Th2、Th17细胞介导的眼内炎症的调节作用

国家自然科学基金

0+阅读 · 2009年12月31日

Indian Legal Text Summarization: A Text Normalisation-based Approach

Arxiv

0+阅读 · 2022年9月13日

Self-supervised motion descriptor for cardiac phase detection in 4D CMR based on discrete vector field estimations

Arxiv

0+阅读 · 2022年9月13日

Model interpretation using improved local regression with variable importance

Arxiv

0+阅读 · 2022年9月12日

Covariance-based rational approximations of fractional SPDEs for computationally efficient Bayesian inference

Arxiv

0+阅读 · 2022年9月10日

Support Recovery in Mixture Models with Sparse Parameters

Arxiv

0+阅读 · 2022年9月10日

Slice Weighted Average Regression

Arxiv

0+阅读 · 2022年9月10日

Alignment-based conformance checking over probabilistic events

Alignment-based conformance checking over probabilistic events

Arxiv

0+阅读 · 2022年9月9日

On the Asymptotic Properties of a Certain Class of Goodness-of-Fit Tests Associated with Multinomial Distributions

Arxiv

0+阅读 · 2022年9月9日

Explanation Method for Anomaly Detection on Mixed Numerical and Categorical Spaces

Arxiv

0+阅读 · 2022年9月9日

Bridging the Gap Between Spectral and Spatial Domains in Graph Neural Networks

Bridging the Gap Between Spectral and Spatial Domains in Graph Neural Networks

Arxiv

15+阅读 · 2020年3月26日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

《DeepGCNs: Making GCNs Go as Deep as CNNs》

《DeepGCNs: Making GCNs Go as Deep as CNNs》

专知会员服务

31+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

（干货书）军事人工智能技术：社会学、文化与伦理视角 | 2025最新258页书籍

数字战场：保护军用无人机免受网络攻击

反无人机：关于“无人机墙”的讨论及反无人机系统时讯更新

日本防卫省下一代信息通信战略

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

中国图象图形学学会CSIG

1+阅读 · 2021年11月11日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Indian Legal Text Summarization: A Text Normalisation-based Approach

Arxiv

0+阅读 · 2022年9月13日

Self-supervised motion descriptor for cardiac phase detection in 4D CMR based on discrete vector field estimations

Arxiv

0+阅读 · 2022年9月13日

Model interpretation using improved local regression with variable importance

Arxiv

0+阅读 · 2022年9月12日

Covariance-based rational approximations of fractional SPDEs for computationally efficient Bayesian inference

Arxiv

0+阅读 · 2022年9月10日

Support Recovery in Mixture Models with Sparse Parameters

Arxiv

0+阅读 · 2022年9月10日

Slice Weighted Average Regression

Arxiv

0+阅读 · 2022年9月10日

Alignment-based conformance checking over probabilistic events

Alignment-based conformance checking over probabilistic events

Arxiv

0+阅读 · 2022年9月9日

On the Asymptotic Properties of a Certain Class of Goodness-of-Fit Tests Associated with Multinomial Distributions

Arxiv

0+阅读 · 2022年9月9日

Explanation Method for Anomaly Detection on Mixed Numerical and Categorical Spaces

Arxiv

0+阅读 · 2022年9月9日

Bridging the Gap Between Spectral and Spatial Domains in Graph Neural Networks

Bridging the Gap Between Spectral and Spatial Domains in Graph Neural Networks

Arxiv

15+阅读 · 2020年3月26日

相关基金

内质网Ca2+感受器STIM1调控糖尿病冠状动脉平滑肌细胞表型转化的机制

国家自然科学基金

0+阅读 · 2014年12月31日

NatD调节Slug基因表达促进肺癌细胞上皮间质转化

国家自然科学基金

0+阅读 · 2014年12月31日

动脉粥样硬化中oxLDL/CD36受体介导VSMC炎症表型转化的作用和机制

国家自然科学基金

0+阅读 · 2014年12月31日

蛋白激酶ERK1/2介导GLP-1改善胰岛β细胞功能障碍作用研究

国家自然科学基金

0+阅读 · 2013年12月31日

液相还原法制备Heusler合金纳米颗粒及其结构和性能研究

国家自然科学基金

0+阅读 · 2012年12月31日

柽柳Dof转录因子的耐盐调控机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

Co基磁性Heusler合金相关体系相图与化合物的结构与性能研究

国家自然科学基金

0+阅读 · 2012年12月31日

Apelin减轻糖尿病缺血心肌易损性的作用及机制：改善心肌胰岛素敏感性

国家自然科学基金

0+阅读 · 2012年12月31日

新型稀土金属硼杂苯化合物化学

国家自然科学基金

0+阅读 · 2012年12月31日

Treg细胞对Th1、Th2、Th17细胞介导的眼内炎症的调节作用

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员