Mental health stigma prevents many individuals from receiving the appropriate care, and social psychology studies have shown that mental health tends to be overlooked in men. In this work, we investigate gendered mental health stigma in masked language models. In doing so, we operationalize mental health stigma by developing a framework grounded in psychology research: we use clinical psychology literature to curate prompts, then evaluate the models' propensity to generate gendered words. We find that masked language models capture societal stigma about gender in mental health: models are consistently more likely to predict female subjects than male in sentences about having a mental health condition (32% vs. 19%), and this disparity is exacerbated for sentences that indicate treatment-seeking behavior. Furthermore, we find that different models capture dimensions of stigma differently for men and women, associating stereotypes like anger, blame, and pity more with women with mental health conditions than with men. In showing the complex nuances of models' gendered mental health stigma, we demonstrate that context and overlapping dimensions of identity are important considerations when assessing computational models' social biases.
翻译:心理健康污名使许多个人无法获得适当的护理,社会心理学研究显示,心理健康往往在男性中被忽视。在这项工作中,我们调查了隐形语言模型中的性别心理健康污名。在这样做时,我们通过制定基于心理学研究的框架来实施心理健康污名:我们利用临床心理学文献来翻译催化器,然后评估模型产生性别文字的倾向。我们发现,隐形语言模型反映了心理健康中的性别污名的社会污名:模型总是比男性更可能预测女性在患有心理健康状况(32%对19%)的判刑中,而这种差异在表明寻求治疗行为的判决中更加严重。此外,我们发现,不同的模型反映了对男子和妇女不同程度的污名,将愤怒、指责和怜悯等陈规定型观念与有心理健康条件的妇女联系在一起,而不是与男性联系起来。在展示模型的性别心理健康污名的复杂细微差别时,我们证明,在评估计算模型的社会偏见时,身份的背景和重叠层面是重要的考虑因素。