Mental health stigma prevents many individuals from receiving the appropriate care, and social psychology studies have shown that mental health tends to be overlooked in men. In this work, we investigate gendered mental health stigma in masked language models. In doing so, we operationalize mental health stigma by developing a framework grounded in psychology research: we use clinical psychology literature to curate prompts, then evaluate the models' propensity to generate gendered words. We find that masked language models capture societal stigma about gender in mental health: models are consistently more likely to predict female subjects than male in sentences about having a mental health condition (32% vs. 19%), and this disparity is exacerbated for sentences that indicate treatment-seeking behavior. Furthermore, we find that different models capture dimensions of stigma differently for men and women, associating stereotypes like anger, blame, and pity more with women with mental health conditions than with men. In showing the complex nuances of models' gendered mental health stigma, we demonstrate that context and overlapping dimensions of identity are important considerations when assessing computational models' social biases.
翻译:心理健康污名阻止许多人获得适当的护理,社会心理学研究表明,在男性中普遍忽视心理健康。在这项工作中,我们研究了蒙面语言模型中的性别化心理健康污名。为此,我们运用临床心理学文献策划提示性呈现,然后评估模型生成性别化词汇的倾向。我们发现,蒙面语言模型捕捉了心理健康的社会污名:在关于患有心理健康状况的句子中,模型通常更有可能预测女性主体而不是男性主体(32%和19%),“求诊”行为的句子加剧了这种差异。此外,我们发现不同的模型以不同的方式捕捉男女心理健康方面的污名维度,将愤怒、责备和怜悯等刻板印象更多地与患有心理健康问题的女性联系起来,而与男性联系较少。通过展示模型性别化的心理健康污名的复杂细微差别,我们表明在评估计算模型的社会偏见时,上下文和身份重叠维度是重要的考虑因素。