The COVID-19 pandemic has disproportionately impacted the lives of minorities, such as members of the LGBTQ community (lesbian, gay, bisexual, transgender, and queer) due to pre-existing social disadvantages and health disparities. Although extensive research has been carried out on the impact of the COVID-19 pandemic on different aspects of the general population's lives, few studies are focused on the LGBTQ population. In this paper, we identify a group of Twitter users who self-disclose to belong to the LGBTQ community. We develop and evaluate two sets of machine learning classifiers using a pre-pandemic and a during pandemic dataset to identify Twitter posts exhibiting minority stress, which is a unique pressure faced by the members of the LGBTQ population due to their sexual and gender identities. For this task, we collect a set of 20,593,823 posts by 7,241 self-disclosed LGBTQ users and annotate a randomly selected subset of 2800 posts. We demonstrate that our best pre-pandemic and during pandemic models show strong and stable performance for detecting posts that contain minority stress. We investigate the linguistic differences in minority stress posts across pre- and during-pandemic periods. We find that anger words are strongly associated with minority stress during the COVID-19 pandemic. We explore the impact of the pandemic on the emotional states of the LGBTQ population by conducting controlled comparisons with the general population. We adopt propensity score-based matching to perform a causal analysis. The results show that the LBGTQ population have a greater increase in the usage of cognitive words and worsened observable attribute in the usage of positive emotion words than the group of the general population with similar pre-pandemic behavioral attributes.
翻译:虽然对COVID-19流行病对一般人口生活不同方面的影响进行了广泛研究,但很少研究侧重于LGBTQ人群。在本文中,我们确定了一组自我披露属于LGBTQ社群的Twitter用户。我们开发并评估了两组机器学习分类师,使用流行病前和大流行病数据集来识别显示少数群体压力的Twitter文章,这是LGBTQ人群因其性和性别认同而面临的独特压力。为此,我们收集了一组20 593 823个员额,7 241个自我披露的LGBTQ用户,以及一个随机随机选择的2 800个员额。我们发现,我们的最佳前和大流行病模型显示,在检测含有少数群体压力的岗位方面,我们最有力和稳定地表现了超常的汇率。我们调查了LGBTF人群在性和性别认同方面所面临的语言差异。我们发现,GLEVQ在前和COV期间,少数群体在语言压力方面表现了比常年高的汇率变化。我们发现,在人群中,我们发现,与GLEV相关的语言压力表现了比前和高的汇率变化。我们在研究中,我们发现,在少数群体群体中,在人口流行中,我们发现,在研究中,在人口方面表现了比高的汇率压力中,在研究了比比。