The health mention classification (HMC) task is the process of identifying and classifying mentions of health-related concepts in text. This can be useful for identifying and tracking the spread of diseases through social media posts. However, this is a non-trivial task. Here we build on recent studies suggesting that using emotional information may improve upon this task. Our study results in a framework for health mention classification that incorporates affective features. We present two methods, an intermediate task fine-tuning approach (implicit) and a multi-feature fusion approach (explicit) to incorporate emotions into our target task of HMC. We evaluated our approach on 5 HMC-related datasets from different social media platforms including three from Twitter, one from Reddit and another from a combination of social media sources. Extensive experiments demonstrate that our approach results in statistically significant performance gains on HMC tasks. By using the multi-feature fusion approach, we achieve at least a 3% improvement in F1 score over BERT baselines across all datasets. We also show that considering only negative emotions does not significantly affect performance on the HMC task. Additionally, our results indicate that HMC models infused with emotional knowledge are an effective alternative, especially when other HMC datasets are unavailable for domain-specific fine-tuning. The source code for our models is freely available at https://github.com/tahirlanre/Emotion_PHM.
翻译:健康参考分类(HMC)是确定和分类在文本中提及与健康有关的概念的过程。这可能有益于通过社交媒体站点识别和跟踪疾病传播情况。然而,这是一个非三重任务。我们以最近的研究表明,使用情感信息可以改进这项任务。我们在包含情感特征的健康分类框架内的研究结果中提及健康分类。我们提出了两种方法,一种中间任务微调方法(隐含)和一种多功能融合方法(明确),将情感纳入我们的健康信息中心的目标任务。我们评估了我们在不同社交媒体平台,包括Twitter、Reddit和社交媒体来源组合的5个与HMC有关的数据集的方法。我们的广泛实验表明,我们的方法在包含情感特征特征特征的卫生分类任务中取得了重要的业绩成果。我们采用多功能融合方法,在所有数据集中,在BERT基线(隐含)上至少提高了3%的F1分。我们还表明,仅考虑负面情感并不显著影响HMC任务的业绩。此外,我们的成果表明,在可选择的HMC模式中,在可自由应用的HMC模型领域是有效的情感源。