Despite inextricable ties between race and language, little work has considered race in NLP research and development. In this work, we survey 79 papers from the ACL anthology that mention race. These papers reveal various types of race-related bias in all stages of NLP model development, highlighting the need for proactive consideration of how NLP systems can uphold racial hierarchies. However, persistent gaps in research on race and NLP remain: race has been siloed as a niche topic and remains ignored in many NLP tasks; most work operationalizes race as a fixed single-dimensional variable with a ground-truth label, which risks reinforcing differences produced by historical racism; and the voices of historically marginalized people are nearly absent in NLP literature. By identifying where and how NLP literature has and has not considered race, especially in comparison to related fields, our work calls for inclusion and racial justice in NLP research practices.
翻译:尽管种族和语言之间有着千丝万缕的联系,但在国家劳工政策研究与发展中很少考虑种族问题。在这项工作中,我们调查了全国劳工政策研究与开发中的79篇提到种族问题的论文。这些文件揭示了国家劳工政策模式发展的所有阶段中与种族有关的各类偏见,突出表明需要积极考虑国家劳工政策系统如何维护种族等级制度。然而,种族问题和全国劳工政策研究中持续存在的差距:种族问题作为一个特殊专题被搁置,在许多国家劳工政策研究任务中仍然被忽视;大多数工作将种族作为一个固定的单维变量进行,并贴上地面真相标签,这有可能加剧历史种族主义造成的差异;国家劳工政策文献中几乎不存在历史上处于边缘地位的人的声音。通过查明国家劳工政策文献在哪些地方和如何考虑种族问题,特别是在相关领域,我们的工作要求将种族公正纳入国家劳工政策研究实践。