VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge - 专知 (Zhuanzhi) Papers


Speaker recognition · INTERSPEECH · state-of-the-art · ground truth · YouTube

6 March 2023

VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge


Jaesung Huh, Andrew Brown, Jee-weon Jung, Joon Son Chung, Arsha Nagrani, Daniel Garcia-Romero, Andrew Zisserman

This paper summarises the findings from the VoxCeleb Speaker Recognition Challenge 2022 (VoxSRC-22), which was held in conjunction with INTERSPEECH 2022. The goal of this challenge was to evaluate how well state-of-the-art speaker recognition systems can diarise and recognise speakers from speech obtained "in the wild". The challenge consisted of: (i) the provision of publicly available speaker recognition and diarisation data from YouTube videos together with ground truth annotation and standardised evaluation software; and (ii) a public challenge and hybrid workshop held at INTERSPEECH 2022. We describe the four tracks of our challenge along with the baselines, methods, and results. We conclude with a discussion on the new domain-transfer focus of VoxSRC-22, and on the progression of the challenge from the previous three editions.



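Related topic

Speaker recognition

Speaker recognition, also called voiceprint recognition (VPR), is a biometric technique that automatically identifies a speaker from the speaker-specific information contained in speech, using computers and modern information recognition technology. The aim of speaker recognition research is to extract features from speech that characterise the speaker, and to build effective models and systems that identify speakers automatically and accurately.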

