机器翻译：一种表征拉丁美洲自然语言处理中偏见和有害刻板印象的方法论 (A methodology to characterize bias and harmful stereotypes in natural language processing in Latin America)

Laura Alonso Alemany,Luciana Benotti,Hernán Maina,Lucía González,Mariela Rajngewerc,Lautaro Martínez,Jorge Sánchez,Mauro Schilman,Guido Ivetta,Alexia Halvorsen,Amanda Mata Rojo,Matías Bordone,Beatriz Busaniche

Automated decision-making systems, especially those based on natural language processing, are pervasive in our lives. They are not only behind the internet search engines we use daily, but also take more critical roles: selecting candidates for a job, determining suspects of a crime, diagnosing autism and more. Such automated systems make errors, which may be harmful in many ways, be it because of the severity of the consequences (as in health issues) or because of the sheer number of people they affect. When errors made by an automated system affect a population more than others, we call the system \textit{biased}. Most modern natural language technologies are based on artifacts obtained from enormous volumes of text using machine learning, namely language models and word embeddings. Since they are created by applying subsymbolic machine learning, mostly artificial neural networks, they are opaque and practically uninterpretable by direct inspection, thus making it very difficult to audit them. In this paper, we present a methodology that spells out how social scientists, domain experts, and machine learning experts can collaboratively explore biases and harmful stereotypes in word embeddings and large language models. Our methodology is based on the following principles: * focus on the linguistic manifestations of discrimination on word embeddings and language models, not on the mathematical properties of the models * reduce the technical barrier for discrimination experts%, be it social scientists, domain experts or other * characterize through a qualitative exploratory process in addition to a metric-based approach * address mitigation as part of the training process, not as an afterthought

翻译：自动决策系统，特别是基于自然语言处理的系统，已经渗透到我们生活的方方面面。它们不仅仅是我们每天使用的互联网搜索引擎背后的原因，而且还承担更加关键的角色：选择工作候选人，确定犯罪嫌疑人，诊断自闭症等等。这样的自动化系统会出现错误，这些错误可能在很多方面都具有害处，无论是因为后果的严重性（如健康问题）还是因为影响的人数之多。当自动化系统的错误影响到某个族群多于其他族群时，我们称该系统存在“偏见”。大多数现代自然语言技术是基于使用机器学习从大量文本中获取的人工智能，即语言模型和单词嵌入。由于它们是通过应用机器学习，主要是人工神经网络，来创建的，因此它们不透明，而且直接检查实际上不可解释，从而使审计它们变得非常困难。本文提出了一种方法论，阐述了社会科学家、领域专家和机器学习专家如何合作探索单词嵌入和大型语言模型中的偏见和有害刻板印象。我们的方法论基于以下原则：*关注言语上偏见和有害刻板印象的表现，而不是模型的数学属性*降低歧视专家，无论是社会科学家还是其他领域专家的技术障碍*通过定性的探索过程以及度量为基础的方法进行表征*将缓解作为训练过程的一部分，而不是作为事后思考