Accent variation poses a major challenge to speech recognition. The Accented English Speech Recognition Challenge (AESRC2020) is designed to provide a common testbed and promote accent-related research. The challenge has two tracks: English accent recognition (track 1) and accented English speech recognition (track 2). A labeled training set of 160 hours of accented English speech collected from 8 countries is released. A further 20 hours of unlabeled speech is later released as the test set; it includes two unseen accents from two other countries, used to test model generalization in track 2. We also provide baseline systems for the participants. This paper first reviews the released dataset, track setups, and baselines, and then summarizes the challenge results and the major techniques used in the submissions.