The word embedding association test (WEAT) is an important method for measuring linguistic biases against social groups such as ethnic minorities in large text corpora. It does so by comparing the semantic relatedness of words prototypical of the groups (e.g., names unique to those groups) and attribute words (e.g., 'pleasant' and 'unpleasant' words). We show that anti-black WEAT estimates from geo-tagged social media data at the level of metropolitan statistical areas strongly correlate with several measures of racial animus--even when controlling for sociodemographic covariates. However, we also show that every one of these correlations is explained by a third variable: the frequency of Black names in the underlying corpora relative to White names. This occurs because word embeddings tend to group positive (negative) words and frequent (rare) words together in the estimated semantic space. As the frequency of Black names on social media is strongly correlated with Black Americans' prevalence in the population, this results in spurious anti-Black WEAT estimates wherever few Black Americans live. This suggests that research using the WEAT to measure bias should consider term frequency, and also demonstrates the potential consequences of using black-box models like word embeddings to study human cognition and behavior.
翻译:嵌入协会测试(WEAT)是衡量针对社会群体的语言偏见的一个重要方法,比如在大型文本公司中少数民族(WEAT),它通过比较各群体原型词(如这些群体独有的名称)和属性单词(如“喜悦”和“不愉快”单词)的语义关联性关联性(WEAT)和属性单词(例如,“喜悦”和“不愉快”单词)。我们表明,都市统计地区地理标记的社交媒体数据中反黑人WEAT估计值与若干种族隐性衡量值(即使在控制社会人口变异性时也是如此)密切相关。然而,我们也表明,其中每一种关联性都由第三个变量来解释:黑名相对于白名的内在社团中的频度(如这些群名)和属性单词(如“喜悦”单词)的频度关联性关联性关系。我们发现,由于社会媒体上的黑名与美黑人在人口中的流行程度密切相关,因此,在少数黑人居住的地区,反美籍的WEAT估计结果都是虚假的。 这表明,使用“WAT”的频度研究,例如“人类“实验”等,将“错误”的模型进行。