HarmoF0: 用于切片估计的对数缩放分解裂变 (HarmoF0: Logarithmic Scale Dilated Convolution For Pitch Estimation) - 专知论文

会员服务 ·

0

膨胀卷积 · 对数尺度 · 卷积 · 估计/估计量 · 缩放 ·

2022 年 6 月 20 日

HarmoF0: Logarithmic Scale Dilated Convolution For Pitch Estimation

翻译：HarmoF0: 用于切片估计的对数缩放分解裂变

Weixing Wei,Peilin Li,Yi Yu,Wei Li

from arxiv, This paper is accepted by ICME2022

Sounds, especially music, contain various harmonic components scattered in the frequency dimension. It is difficult for normal convolutional neural networks to observe these overtones. This paper introduces a multiple rates dilated causal convolution (MRDC-Conv) method to capture the harmonic structure in logarithmic scale spectrograms efficiently. The harmonic is helpful for pitch estimation, which is important for many sound processing applications. We propose HarmoF0, a fully convolutional network, to evaluate the MRDC-Conv and other dilated convolutions in pitch estimation. The results show that this model outperforms the DeepF0, yields state-of-the-art performance in three datasets, and simultaneously reduces more than 90% parameters. We also find that it has stronger noise resistance and fewer octave errors. The code and pre-trained model are available at https://github.com/WX-Wei/HarmoF0.

翻译：声音, 特别是音乐, 包含在频率维度中分散的各种调音元件。正常的进化神经网络很难观察这些表面。本文引入了一种多重速率膨胀因果共变( MRDC- Conv) 方法, 以有效捕捉对数比例谱光谱中的调和结构。调音有助于定位估计, 这对于许多音频处理应用程序非常重要。我们提议建立完全进化的网络 HarmoF0, 以评价MRDC- Conv 和投影中的其他变异。结果显示, 这个模型在三个数据集中比 DeepF0, 产生最先进的性能, 同时减少超过 90%的参数。我们还发现, 它有更强的噪音阻力, 更少的八度错误。代码和预先训练的模型可以在 https://github. com/ WX- Wei/ HarmoF0 上查阅。

0

相关内容

膨胀卷积

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

基于OC-seislet变换的三维叠前复杂地震波场迭代数据插值方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

PET/SPECT同机同时成像方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

神经型戈谢氏病发病机制及诱导性干细胞治疗策略探讨

国家自然科学基金

0+阅读 · 2012年12月31日

仿生复眼大视场立体视觉系统的基础理论研究

国家自然科学基金

0+阅读 · 2011年12月31日

长循环、空间稳定、亲和素－PEG标记的SPIO脂质体对微小肺癌灶的MRI靶向增强成像研究

国家自然科学基金

0+阅读 · 2008年12月31日

AggPose: Deep Aggregation Vision Transformer for Infant Pose Estimation

Arxiv

0+阅读 · 2022年8月10日

Fast and High-Quality Image Denoising via Malleable Convolutions

Arxiv

0+阅读 · 2022年8月9日

Gaze Estimation Approach Using Deep Differential Residual Network

Arxiv

0+阅读 · 2022年8月8日

Neighborhood Collective Estimation for Noisy Label Identification and Correction

Arxiv

0+阅读 · 2022年8月5日

RandLA-Net: Efficient Semantic Segmentation of Large-Scale Point Clouds

Arxiv

11+阅读 · 2019年11月25日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【ACML2025教程】迈向鲁棒且可信的大语言模型：问题与缓解策略

《利用人工智能改善军事警察行动：当下现状探索》最新95页报告

Google《AI智能体企业应用手册报告》，46页pdf

面向现代武装力量的高级AI驱动军事模拟与训练软件

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

AggPose: Deep Aggregation Vision Transformer for Infant Pose Estimation

Arxiv

0+阅读 · 2022年8月10日

Fast and High-Quality Image Denoising via Malleable Convolutions

Arxiv

0+阅读 · 2022年8月9日

Gaze Estimation Approach Using Deep Differential Residual Network

Arxiv

0+阅读 · 2022年8月8日

Neighborhood Collective Estimation for Noisy Label Identification and Correction

Arxiv

0+阅读 · 2022年8月5日

RandLA-Net: Efficient Semantic Segmentation of Large-Scale Point Clouds

Arxiv

11+阅读 · 2019年11月25日

相关基金

基于OC-seislet变换的三维叠前复杂地震波场迭代数据插值方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

PET/SPECT同机同时成像方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

神经型戈谢氏病发病机制及诱导性干细胞治疗策略探讨

国家自然科学基金

0+阅读 · 2012年12月31日

仿生复眼大视场立体视觉系统的基础理论研究

国家自然科学基金

0+阅读 · 2011年12月31日

长循环、空间稳定、亲和素－PEG标记的SPIO脂质体对微小肺癌灶的MRI靶向增强成像研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员