During the COVID-19 pandemic, people began discussing pandemic-related topics on social media. On the subreddit \textit{r/COVID19positive}, users discuss and share a range of topics, including the experiences of those who received a positive test result, stories of those who presumably got infected, and questions about the pandemic and the disease. In this study, we examine the nature of the discussions on the subreddit from a linguistic perspective. We found differences in linguistic characteristics (e.g., psychological, emotional, and reasoning) across three categories of topics. We also classified posts into these categories using state-of-the-art (SOTA) pre-trained language models. Such a classification model can be used for pandemic-related research on social media.