An empirical study of weakly supervised audio tagging embeddings for general audio representations - 专知论文


September 30, 2022


Heinrich Dinkel, Zhiyong Yan, Yongqing Wang, Junbo Zhang, Yujun Wang

Source: arXiv, Odyssey 2022

We study the usability of pre-trained weakly supervised audio tagging (AT) models as feature extractors for general audio representations. We mainly analyze the feasibility of transferring those embeddings to other tasks within the speech and sound domains. Specifically, we benchmark weakly supervised pre-trained models (MobileNetV2 and EfficientNet-B0) against modern self-supervised learning methods (BYOL-A) as feature extractors. Fourteen downstream tasks are used for evaluation ranging from music instrument classification to language classification. Our results indicate that AT pre-trained models are an excellent transfer learning choice for music, event, and emotion recognition tasks. Further, finetuning AT models can also benefit speech-related tasks such as keyword spotting and intent classification.
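The "pre-trained model as feature extractor" setup described in the abstract can be illustrated with a small sketch. Everything here is an assumption made for illustration: `frozen_embed` is a hypothetical stand-in for a frozen encoder (a band-pooled log spectrum, not MobileNetV2, EfficientNet-B0, or BYOL-A), the clips are synthetic noisy sines, and the least-squares linear classifier is just one simple way to train a probe on fixed embeddings. It is not the evaluation protocol of the paper.

```python
import numpy as np


def make_clips(n, freq_bins, length=256, noise=0.1, rng=None):
    """Synthetic 'audio': one noisy sine per clip; the label says which frequency was used."""
    rng = np.random.default_rng() if rng is None else rng
    labels = rng.integers(0, len(freq_bins), size=n)
    t = np.arange(length)
    phase = rng.uniform(0, 2 * np.pi, size=(n, 1))
    freqs = np.asarray(freq_bins)[labels][:, None]
    clips = np.sin(2 * np.pi * freqs * t / length + phase)
    return clips + noise * rng.standard_normal((n, length)), labels


def frozen_embed(clips, n_bands=16):
    """Hypothetical frozen encoder: maps each clip to a fixed-size embedding.

    It has no trainable parameters, mirroring a pre-trained model whose
    weights are not updated on the downstream task.
    """
    # Log-magnitude spectrum; drop the Nyquist bin so 128 bins split evenly into bands.
    spec = np.log1p(np.abs(np.fft.rfft(clips, axis=1)))[:, :-1]
    return spec.reshape(len(clips), n_bands, -1).mean(axis=2)


def fit_linear_probe(emb, labels, n_classes):
    """Fit a linear classifier (with bias) on the embeddings by least squares."""
    X = np.hstack([emb, np.ones((len(emb), 1))])
    Y = np.eye(n_classes)[labels]  # one-hot targets
    W, *_ = np.linalg.lstsq(X, Y, rcond=None)
    return W


def predict(W, emb):
    """Predict the class with the largest linear score."""
    X = np.hstack([emb, np.ones((len(emb), 1))])
    return (X @ W).argmax(axis=1)


rng = np.random.default_rng(0)
train_x, train_y = make_clips(200, [10, 40], rng=rng)
test_x, test_y = make_clips(100, [10, 40], rng=rng)

# Only the linear layer is fitted; the encoder stays fixed.
W = fit_linear_probe(frozen_embed(train_x), train_y, n_classes=2)
accuracy = (predict(W, frozen_embed(test_x)) == test_y).mean()
print(f"linear-probe accuracy: {accuracy:.2f}")
```

Finetuning, which the abstract mentions for speech-related tasks, would differ in that the encoder's own weights are updated on the downstream data rather than kept fixed.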

