Past research has identified discriminatory automatic speech recognition (ASR) performance as a function of the racial group and nationality of the speaker. In this paper, we expand the discussion beyond bias as a function of the individual national origin of the speaker to look for bias as a function of the geopolitical orientation of their nation of origin. We audit some of the most popular English language ASR services using a large and global data set of speech from The Speech Accent Archive, which includes over 2,700 speakers of English born in 171 different countries. We show that, even when controlling for multiple linguistic covariates, ASR service performance has a statistically significant relationship to the political alignment of the speaker's birth country with respect to the United States' geopolitical power. This holds for all ASR services tested. We discuss this bias in the context of the historical use of language to maintain global and political power.
翻译:过去的研究发现,歧视性的自动语音识别(ASR)功能是该发言者的种族群体和国籍的一种功能,在本文中,我们扩大讨论范围,不再把偏见作为该发言者的个别民族的功能,而把偏见视为其原籍国地缘政治取向的一种功能;我们利用《演讲中心档案馆》的大型全球数据集对一些最受欢迎的英语ASR服务进行审计,其中包括在171个不同国家出生的2 700多名英语发言者;我们表明,即使控制多种语言共变,ASR服务绩效也与该发言者的出生国与美国地缘政治力量的政治一致性有着具有重要统计意义的关系,这适用于所有经过测试的ASR服务;我们从历史使用语言维持全球和政治权力的角度讨论这一偏见。