Contemporary debates on filter bubbles and polarization in public and social media raise the question to what extent news media of the past exhibited biases. This paper specifically examines bias related to gender in six Dutch national newspapers between 1950 and 1990. We measure bias related to gender by comparing local changes in word embedding models trained on newspapers with divergent ideological backgrounds. We demonstrate clear differences in gender bias and changes within and between newspapers over time. In relation to themes such as sexuality and leisure, we see the bias moving toward women, whereas, generally, the bias shifts in the direction of men, despite growing female employment number and feminist movements. Even though Dutch society became less stratified ideologically (depillarization), we found an increasing divergence in gender bias between religious and social-democratic on the one hand and liberal newspapers on the other. Methodologically, this paper illustrates how word embeddings can be used to examine historical language change. Future work will investigate how fine-tuning deep contextualized embedding models, such as ELMO, might be used for similar tasks with greater contextual information.
翻译:关于公共和社交媒体过滤泡沫和两极分化的当代辩论提出了这样一个问题,即过去新闻媒体表现出偏见的程度;本文具体审查了1950年至1990年期间荷兰六家全国性报纸中与性别有关的偏见;我们通过比较当地在具有不同意识形态背景的报纸上培训的文字嵌入模式的变化,衡量与性别有关的偏见;我们显示了性别偏见以及报纸内部和不同时间之间在性别偏见和变化方面的明显差异;关于性与休闲等主题,我们看到偏向妇女,而一般而言,尽管女性就业人数和女权运动不断增加,偏见却向男子的方向转移。尽管荷兰社会在意识形态上已变得不那么分化(分化),但我们发现宗教和社会民主与自由报纸之间的性别偏见日益不同。从方法上讲,本文说明了如何用文字嵌入来审查历史语言变化。未来的工作将研究如何精细调整深背景化的嵌入模式,例如ELMO,用于与更多背景信息的类似任务。