Music demixing with the sliCQ transform - Zhuanzhi Papers

December 9, 2021

Music demixing with the sliCQ transform

From arXiv; 2 pages, 3 figures. Published in the MDX21 workshop (a satellite event of ISMIR 2021): https://mdx-workshop.github.io/proceedings/hanssian.pdf

Music source separation is the task of extracting an estimate of one or more isolated sources or instruments (for example, drums or vocals) from musical audio. The task of music demixing or unmixing considers the case where the musical audio is separated into an estimate of all of its constituent sources that can be summed back to the original mixture. The Music Demixing Challenge was created to inspire new demixing research. Open-Unmix (UMX), and the improved variant CrossNet-Open-Unmix (X-UMX), were included in the challenge as the baselines. Both models use the Short-Time Fourier Transform (STFT) as the representation of music signals. The time-frequency uncertainty principle states that the STFT of a signal cannot have maximal resolution in both time and frequency. The tradeoff in time-frequency resolution can significantly affect music demixing results. Our proposed adaptation of UMX replaced the STFT with the sliCQT, a time-frequency transform with varying time-frequency resolution. Unfortunately, our model xumx-sliCQ achieved lower demixing scores than UMX.
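
The time-frequency trade-off mentioned in the abstract can be made concrete with a little arithmetic. The sketch below is not taken from the paper: it assumes a 44.1 kHz sample rate and a handful of illustrative window sizes, and the helper name `stft_resolution` is hypothetical. For a fixed window of N samples at sample rate fs, the nominal frequency bins sit fs/N Hz apart while each frame spans N/fs seconds, so the product of the two is always 1 and improving one necessarily worsens the other.

```python
from typing import Tuple


def stft_resolution(sample_rate: int, window_size: int) -> Tuple[float, float]:
    """Nominal STFT resolution for one fixed window size.

    Returns (frequency bin spacing in Hz, window duration in seconds).
    The effective resolution also depends on the window shape, which
    this sketch ignores.
    """
    bin_spacing_hz = sample_rate / window_size
    window_duration_s = window_size / sample_rate
    return bin_spacing_hz, window_duration_s


SAMPLE_RATE = 44100  # assumption: CD-quality audio, not stated in the abstract

for window_size in (256, 1024, 4096, 16384):  # illustrative sizes only
    bin_hz, dur_s = stft_resolution(SAMPLE_RATE, window_size)
    print(
        f"N={window_size:>5}: bins {bin_hz:8.2f} Hz apart, "
        f"frame {dur_s * 1000:7.2f} ms long"
    )
```

A short window localizes transients such as drum hits in time but spreads nearby pitches across the same coarse bins; a long window does the opposite. A transform whose resolution varies across the time-frequency plane, as the abstract describes for the sliCQT, is one way to avoid committing to a single N.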
