DCASE 2022 挑战中基于语言的音频检索任务 (Language-based Audio Retrieval Task in DCASE 2022 Challenge) - 专知论文

会员服务 ·

0

Performer · Analysis · 秩 · 基准 · 讲稿 ·

2022 年 10 月 4 日

Language-based Audio Retrieval Task in DCASE 2022 Challenge

翻译：DCASE 2022 挑战中基于语言的音频检索任务

Huang Xie,Samuel Lipping,Tuomas Virtanen

from arxiv, Update for arXiv:2206.06108 mistakenly submitted as a new article

Language-based audio retrieval is a task, where natural language textual captions are used as queries to retrieve audio signals from a dataset. It has been first introduced into DCASE 2022 Challenge as Subtask 6B of task 6, which aims at developing computational systems to model relationships between audio signals and free-form textual descriptions. Compared with audio captioning (Subtask 6A), which is about generating audio captions for audio signals, language-based audio retrieval (Subtask 6B) focuses on ranking audio signals according to their relevance to natural language textual captions. In DCASE 2022 Challenge, the provided baseline system for Subtask 6B was significantly outperformed, with top performance being 0.276 in mAP@10. This paper presents the outcome of Subtask 6B in terms of submitted systems' performance and analysis.

翻译：以语言为基础的音频检索是一项任务,其中自然语言文本字幕被用作查询从数据集中检索音频信号的查询工具,它首先作为任务6的子任务6B引入了DCASE 2022 挑战,作为任务6的子任务6B,目的是开发计算系统,以模拟音频信号和自由形式文字描述之间的关系。与音频字幕(Subtask 6A)相比,这是为音频信号制作音频字幕,基于语言的音频检索(Subtask 6B)侧重于根据与自然语言文本字幕的相关性排列音频信号的顺序。在DCASE 2022 挑战中,为 Subtask 6B提供的基线系统明显地超过功能,最大性能为 mAP@10中的0.276。本文介绍了Subtask 6B在提交的系统性能和分析方面的结果。

0

相关内容

Performer

NeurlPS 2022 | 自然语言处理相关论文分类整理

NeurlPS 2022 | 自然语言处理相关论文分类整理

专知会员服务

51+阅读 · 2022年10月2日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【Google ICLR2020论文】嵌入式大规模检索的预训练任务，Pre-training Tasks for Embedding-based Large-scale Retrieval

【Google ICLR2020论文】嵌入式大规模检索的预训练任务，Pre-training Tasks for Embedding-based Large-scale Retrieval

专知会员服务

28+阅读 · 2020年2月12日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

RecSys Challenge 历年推荐赛题汇总

RecSys Challenge 历年推荐赛题汇总

机器学习与推荐算法

0+阅读 · 2022年2月21日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk2

【ICIG2021】Latest News & Announcements of the Industry Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年7月29日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

溶剂热法FeSe基超导材料制备和物性研究

国家自然科学基金

0+阅读 · 2014年12月31日

Anderson型多酸的不对称修饰及可控组装研究

国家自然科学基金

1+阅读 · 2014年12月31日

电动汽车高功率密度变换器拓扑、控制与能量管理研究

国家自然科学基金

0+阅读 · 2013年12月31日

新型量子点生物探针与肿瘤细胞传感平台的研究

国家自然科学基金

0+阅读 · 2013年12月31日

虫草素及甘草素定向合成产物促人肝癌细胞凋亡调控机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

肿瘤源性热休克蛋白70激活肝癌射频消融术后肿瘤免疫反应及其对抗肿瘤作用影响的研究

国家自然科学基金

0+阅读 · 2013年12月31日

离子液体溶解甲烷机理及影响因素研究

国家自然科学基金

0+阅读 · 2013年12月31日

功能化离子液体修饰石墨烯构筑的生物传感界面及其在有机磷农药中的应用研究

国家自然科学基金

0+阅读 · 2012年12月31日

Jagged2high CD11bhigh 调节性树突状细胞防治cGVHD的实验研究

国家自然科学基金

0+阅读 · 2009年12月31日

基于本体的Deep Web搜索技术

国家自然科学基金

2+阅读 · 2009年12月31日

DiffPhase: Generative Diffusion-based STFT Phase Retrieval

Arxiv

0+阅读 · 2022年11月8日

Semantic Information Retrieval in Wireless Networks

Arxiv

0+阅读 · 2022年11月8日

Unified Loss of Pair Similarity Optimization for Vision-Language Retrieval

Arxiv

0+阅读 · 2022年11月7日

CgAT: Center-Guided Adversarial Training for Deep Hashing-Based Retrieval

Arxiv

0+阅读 · 2022年11月7日

Integrated Parameter-Efficient Tuning for General-Purpose Audio Models

Arxiv

0+阅读 · 2022年11月4日

A Survey on Vision Transformer

Arxiv

17+阅读 · 2022年2月23日

Pre-training Methods in Information Retrieval

Arxiv

16+阅读 · 2021年11月27日

A Survey of Visual Transformers

Arxiv

39+阅读 · 2021年11月11日

Embedding-based Retrieval in Facebook Search

Arxiv

12+阅读 · 2020年6月20日

DeepSeek: Content Based Image Search & Retrieval

Arxiv

13+阅读 · 2018年1月11日

VIP会员

文章信息

相关主题

相关VIP内容

NeurlPS 2022 | 自然语言处理相关论文分类整理

NeurlPS 2022 | 自然语言处理相关论文分类整理

专知会员服务

51+阅读 · 2022年10月2日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【Google ICLR2020论文】嵌入式大规模检索的预训练任务，Pre-training Tasks for Embedding-based Large-scale Retrieval

【Google ICLR2020论文】嵌入式大规模检索的预训练任务，Pre-training Tasks for Embedding-based Large-scale Retrieval

专知会员服务

28+阅读 · 2020年2月12日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《为多域数字战场变革装甲力量》报告

《多域训练：利用开放标准将太空与网络域同陆、海、空域训练相整合》报告

面向城市战：欧美徒步作战新装备

《人工智能增强监视分析：利用跨网络、陆地、空中及海上领域的威胁向量实时建模》

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

RecSys Challenge 历年推荐赛题汇总

RecSys Challenge 历年推荐赛题汇总

机器学习与推荐算法

0+阅读 · 2022年2月21日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk2

【ICIG2021】Latest News & Announcements of the Industry Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年7月29日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

相关论文

DiffPhase: Generative Diffusion-based STFT Phase Retrieval

Arxiv

0+阅读 · 2022年11月8日

Semantic Information Retrieval in Wireless Networks

Arxiv

0+阅读 · 2022年11月8日

Unified Loss of Pair Similarity Optimization for Vision-Language Retrieval

Arxiv

0+阅读 · 2022年11月7日

CgAT: Center-Guided Adversarial Training for Deep Hashing-Based Retrieval

Arxiv

0+阅读 · 2022年11月7日

Integrated Parameter-Efficient Tuning for General-Purpose Audio Models

Arxiv

0+阅读 · 2022年11月4日

A Survey on Vision Transformer

Arxiv

17+阅读 · 2022年2月23日

Pre-training Methods in Information Retrieval

Arxiv

16+阅读 · 2021年11月27日

A Survey of Visual Transformers

Arxiv

39+阅读 · 2021年11月11日

Embedding-based Retrieval in Facebook Search

Arxiv

12+阅读 · 2020年6月20日

DeepSeek: Content Based Image Search & Retrieval

Arxiv

13+阅读 · 2018年1月11日

相关基金

溶剂热法FeSe基超导材料制备和物性研究

国家自然科学基金

0+阅读 · 2014年12月31日

Anderson型多酸的不对称修饰及可控组装研究

国家自然科学基金

1+阅读 · 2014年12月31日

电动汽车高功率密度变换器拓扑、控制与能量管理研究

国家自然科学基金

0+阅读 · 2013年12月31日

新型量子点生物探针与肿瘤细胞传感平台的研究

国家自然科学基金

0+阅读 · 2013年12月31日

虫草素及甘草素定向合成产物促人肝癌细胞凋亡调控机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

肿瘤源性热休克蛋白70激活肝癌射频消融术后肿瘤免疫反应及其对抗肿瘤作用影响的研究

国家自然科学基金

0+阅读 · 2013年12月31日

离子液体溶解甲烷机理及影响因素研究

国家自然科学基金

0+阅读 · 2013年12月31日

功能化离子液体修饰石墨烯构筑的生物传感界面及其在有机磷农药中的应用研究

国家自然科学基金

0+阅读 · 2012年12月31日

Jagged2high CD11bhigh 调节性树突状细胞防治cGVHD的实验研究

国家自然科学基金

0+阅读 · 2009年12月31日

基于本体的Deep Web搜索技术

国家自然科学基金

2+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员