Self-Supervised Speech Denoising Using Only Noisy Audio Signals - 专知论文

January 19, 2023

Self-Supervised Speech Denoising Using Only Noisy Audio Signals

Jiasong Wu, Qingchun Li, Guanyu Yang, Lei Li, Lotfi Senhadji, Huazhong Shu

From arXiv; 11 pages, 4 figures, 6 tables

In traditional speech denoising tasks, clean audio signals are typically used as the training target, but truly clean signals must be collected with expensive recording equipment or in studios with strictly controlled environments. To overcome this drawback, we propose an end-to-end self-supervised speech denoising training scheme that uses only noisy audio signals, named Only-Noisy Training (ONT), with no extra training conditions. The proposed ONT strategy constructs training pairs from each single noisy audio signal alone, and it contains two modules: a training audio pair generation module and a speech denoising module. The first module applies a random audio sub-sampler to each noisy audio signal to generate training pairs. The sub-sampled pairs are then fed into a novel complex-valued speech denoising module. Experimental results show that the proposed method not only eliminates the heavy dependence on clean targets in traditional audio denoising tasks, but also achieves on-par or better performance than other training strategies. Availability: ONT is available at https://github.com/liqingchunnnn/Only-Noisy-Training
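The abstract does not spell out how the random audio sub-sampler works, so the following is only a minimal sketch of one plausible reading, not the authors' implementation. It assumes the noisy waveform is cut into non-overlapping blocks of `k` samples and that two different samples are drawn at random from each block, one for the network input and one for the training target. The function name `random_subsample_pair` and the parameter `k` are hypothetical and do not come from the paper or its repository.

```python
import numpy as np


def random_subsample_pair(noisy, k=2, rng=None):
    """Split one noisy 1-D waveform into two sub-sampled signals.

    Hypothetical helper (assumption, not the paper's code): the waveform is
    cut into non-overlapping blocks of k samples, and two distinct positions
    are drawn at random inside every block.
    """
    if k < 2:
        raise ValueError("k must be at least 2 to draw two distinct samples")
    noisy = np.asarray(noisy)
    if noisy.ndim != 1:
        raise ValueError("expected a 1-D waveform")
    rng = np.random.default_rng() if rng is None else rng

    # Drop the tail that does not fill a whole block, then reshape to blocks.
    n_blocks = noisy.shape[0] // k
    blocks = noisy[: n_blocks * k].reshape(n_blocks, k)

    # First position is uniform in [0, k); the second is shifted by a
    # non-zero offset modulo k, so it always differs from the first.
    first = rng.integers(0, k, size=n_blocks)
    offset = rng.integers(1, k, size=n_blocks)
    second = (first + offset) % k

    rows = np.arange(n_blocks)
    return blocks[rows, first], blocks[rows, second]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    noisy = rng.standard_normal(16000)  # stand-in for one second at 16 kHz
    sub_input, sub_target = random_subsample_pair(noisy, k=2, rng=rng)
    print(sub_input.shape, sub_target.shape)
```

Under this assumption, both outputs are `k` times shorter than the original signal, and a denoising network would be trained to map one to the other. Because the two samples in a block are adjacent in time, they share nearly the same underlying speech content while carrying different noise realizations, which is the usual motivation for neighbor-style sub-sampling.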
