Numerous works use word embedding-based metrics to quantify societal biases and stereotypes in texts. Recent studies have found that word embeddings can capture semantic similarity but may be affected by word frequency. In this work we study the effect of frequency when measuring female vs. male gender bias with word embedding-based bias quantification methods. We find that Skip-gram with negative sampling and GloVe tend to detect male bias in high frequency words, while GloVe tends to return female bias in low frequency words. We show these behaviors still exist when words are randomly shuffled. This proves that the frequency-based effect observed in unshuffled corpora stems from properties of the metric rather than from word associations. The effect is spurious and problematic since bias metrics should depend exclusively on word co-occurrences and not individual word frequencies. Finally, we compare these results with the ones obtained with an alternative metric based on Pointwise Mutual Information. We find that this metric does not show a clear dependence on frequency, even though it is slightly skewed towards male bias across all frequencies.
翻译:许多工作都使用基于嵌入的文字衡量标准来量化教科书中的社会偏见和陈规定型观念。最近的研究发现,嵌入的文字可以捕捉语义相似性,但可能会受到文字频度的影响。在这项工作中,我们研究用基于文字嵌入的偏见量化方法来测量女性对男性的性别偏见的频率效应。我们发现,带有负面抽样和GloVe的Skipp-gram往往在高频字眼中发现男性的偏见,而GloVe倾向于在低频字眼中返回女性的偏见。当单词被随机打乱时,我们就会发现这些行为仍然存在。这证明,在未打乱的复合体中观察到的基于频率的影响源于度量的特性,而不是词系的关联性。效果是虚假的,因为偏差指标应完全取决于单词共发,而不是单词频率。最后,我们把这些结果与基于点共信的替代指标进行比较。我们发现,这一指标没有显示对频率的明确依赖性,尽管它在所有频率上对男性的偏差。