轻量度仪器- 多声调笔记纹理和倍增切分估测的不可知模型 (A Lightweight Instrument-Agnostic Model for Polyphonic Note Transcription and Multipitch Estimation) - 专知论文

会员服务 ·

0

估计/估计量 · 模型评估 · MoDELS · Networking · 约束 ·

2022 年 5 月 12 日

A Lightweight Instrument-Agnostic Model for Polyphonic Note Transcription and Multipitch Estimation

翻译：轻量度仪器- 多声调笔记纹理和倍增切分估测的不可知模型

Rachel M. Bittner,Juan José Bosch,David Rubinstein,Gabriel Meseguer-Brocal,Sebastian Ewert

Automatic Music Transcription (AMT) has been recognized as a key enabling technology with a wide range of applications. Given the task's complexity, best results have typically been reported for systems focusing on specific settings, e.g. instrument-specific systems tend to yield improved results over instrument-agnostic methods. Similarly, higher accuracy can be obtained when only estimating frame-wise $f_0$ values and neglecting the harder note event detection. Despite their high accuracy, such specialized systems often cannot be deployed in the real-world. Storage and network constraints prohibit the use of multiple specialized models, while memory and run-time constraints limit their complexity. In this paper, we propose a lightweight neural network for musical instrument transcription, which supports polyphonic outputs and generalizes to a wide variety of instruments (including vocals). Our model is trained to jointly predict frame-wise onsets, multipitch and note activations, and we experimentally show that this multi-output structure improves the resulting frame-level note accuracy. Despite its simplicity, benchmark results show our system's note estimation to be substantially better than a comparable baseline, and its frame-level accuracy to be only marginally below those of specialized state-of-the-art AMT systems. With this work we hope to encourage the community to further investigate low-resource, instrument-agnostic AMT systems.

翻译：自动音乐追踪(AMT)已被公认为具有广泛应用范围的关键赋能技术(AMT) 。鉴于任务的复杂性,对侧重于特定环境的系统来说,报告的最佳结果通常都是最佳的,例如,特定仪器的系统在仪器识别方法方面往往能产生更好的效果。同样,仅仅根据框架估计值(f_0美元)和忽略较难的备注事件检测,就可以获得更高的准确性。尽管这种专门系统非常精确,但往往无法在现实世界中部署。储存和网络限制禁止使用多种专门模型,而记忆和运行时间限制则限制了这些模型的复杂性。在本文件中,我们建议为乐器笔录建立一个轻量的神经网络,支持多功能产出,并概括广泛的工具(包括声音)。我们的模型经过培训,可以共同预测以框架判断的起始、多功能和笔记的启动。我们实验性地表明,这种多功能结构提高了由此产生的框架级注释的准确性。尽管其简单性,但基准结果显示我们的系统注释估计比可比较的基线要好得多。我们提出的是,其框架级的精确性工具只能进一步调查那些低的系统。

0

相关内容

估计/估计量

估计/估计量

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Copine VII在阿尔茨海默病中的作用机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

Sestrin2/AMPK信号通路调控新生鼠缺氧缺血脑损伤细胞自噬的新机制

国家自然科学基金

0+阅读 · 2015年12月31日

ECoG,EEG-fMRI多模态癫痫监测与病灶定位研究

国家自然科学基金

0+阅读 · 2014年12月31日

免疫性肝损伤过程中细胞色素P450酶系下调的转录水平调节和翻译后蛋白修饰研究

国家自然科学基金

0+阅读 · 2014年12月31日

miR-124靶向TRAF6在骨肉瘤中的作用

国家自然科学基金

0+阅读 · 2013年12月31日

靶向LDH-A能量代谢对T细胞急性淋巴细胞白血病的抗白血病效应及机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

多目标图像分割的稀疏表示方法

国家自然科学基金

0+阅读 · 2012年12月31日

基于Rho/ROCK信号通路的双黄连注射液致过敏样反应机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

从巨噬细胞中LXR-CCR7交互作用探讨丹参素抗动脉粥样硬化机制

国家自然科学基金

0+阅读 · 2011年12月31日

全反式维甲酸对T细胞谱系选择和变应性鼻炎的干预作用

国家自然科学基金

0+阅读 · 2008年12月31日

Towards an Architecture-centric Methodology for Migrating to Microservices

Towards an Architecture-centric Methodology for Migrating to Microservices

Arxiv

0+阅读 · 2022年7月1日

Neural Moving Horizon Estimation for Robust Flight Control

Arxiv

0+阅读 · 2022年7月1日

Identifying and Combating Bias in Segmentation Networks by leveraging multiple resolutions

Arxiv

0+阅读 · 2022年6月29日

Numerical Smoothing with Hierarchical Adaptive Sparse Grids and Quasi-Monte Carlo Methods for Efficient Option Pricing

Arxiv

0+阅读 · 2022年6月29日

Acoustics-specific Piano Velocity Estimation

Arxiv

0+阅读 · 2022年6月29日

Comparing Conventional Pitch Detection Algorithms with a Neural Network Approach

Arxiv

0+阅读 · 2022年6月29日

State space model multiple imputation for missing data in non-stationary multivariate time series with application in digital Psychiatry

Arxiv

0+阅读 · 2022年6月29日

Jointist: Joint Learning for Multi-instrument Transcription and Its Applications

Arxiv

0+阅读 · 2022年6月28日

A Temporal-Difference Approach to Policy Gradient Estimation

Arxiv

0+阅读 · 2022年6月28日

Smart Application for Fall Detection Using Wearable ECG & Accelerometer Sensors

Arxiv

0+阅读 · 2022年6月28日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《"半人马"训练计划：美国陆军北方司令部兵棋推演与Scale AI系统集成》最新报告

《飞行自组织网络通信协议评估体系：三维高斯-马尔科夫移动模型的创新升级》172页

地面无人作战平台：现代战争中的机器士兵

《面向边缘智能应用的AI模型优化技术研究》139页

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Towards an Architecture-centric Methodology for Migrating to Microservices

Towards an Architecture-centric Methodology for Migrating to Microservices

Arxiv

0+阅读 · 2022年7月1日

Neural Moving Horizon Estimation for Robust Flight Control

Arxiv

0+阅读 · 2022年7月1日

Identifying and Combating Bias in Segmentation Networks by leveraging multiple resolutions

Arxiv

0+阅读 · 2022年6月29日

Numerical Smoothing with Hierarchical Adaptive Sparse Grids and Quasi-Monte Carlo Methods for Efficient Option Pricing

Arxiv

0+阅读 · 2022年6月29日

Acoustics-specific Piano Velocity Estimation

Arxiv

0+阅读 · 2022年6月29日

Comparing Conventional Pitch Detection Algorithms with a Neural Network Approach

Arxiv

0+阅读 · 2022年6月29日

State space model multiple imputation for missing data in non-stationary multivariate time series with application in digital Psychiatry

Arxiv

0+阅读 · 2022年6月29日

Jointist: Joint Learning for Multi-instrument Transcription and Its Applications

Arxiv

0+阅读 · 2022年6月28日

A Temporal-Difference Approach to Policy Gradient Estimation

Arxiv

0+阅读 · 2022年6月28日

Smart Application for Fall Detection Using Wearable ECG & Accelerometer Sensors

Arxiv

0+阅读 · 2022年6月28日

相关基金

Copine VII在阿尔茨海默病中的作用机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

Sestrin2/AMPK信号通路调控新生鼠缺氧缺血脑损伤细胞自噬的新机制

国家自然科学基金

0+阅读 · 2015年12月31日

ECoG,EEG-fMRI多模态癫痫监测与病灶定位研究

国家自然科学基金

0+阅读 · 2014年12月31日

免疫性肝损伤过程中细胞色素P450酶系下调的转录水平调节和翻译后蛋白修饰研究

国家自然科学基金

0+阅读 · 2014年12月31日

miR-124靶向TRAF6在骨肉瘤中的作用

国家自然科学基金

0+阅读 · 2013年12月31日

靶向LDH-A能量代谢对T细胞急性淋巴细胞白血病的抗白血病效应及机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

多目标图像分割的稀疏表示方法

国家自然科学基金

0+阅读 · 2012年12月31日

基于Rho/ROCK信号通路的双黄连注射液致过敏样反应机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

从巨噬细胞中LXR-CCR7交互作用探讨丹参素抗动脉粥样硬化机制

国家自然科学基金

0+阅读 · 2011年12月31日

全反式维甲酸对T细胞谱系选择和变应性鼻炎的干预作用

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员