PolyHope: 从 Tweets 创建双层希望语音探测任务数据集 (PolyHope: Dataset Creation for a Two-Level Hope Speech Detection Task from Tweets)

Hope is characterized as openness of spirit toward the future, a desire, expectation, and wish for something to happen or to be true that remarkably affects human's state of mind, emotions, behaviors, and decisions. Hope is usually associated with concepts of desired expectations and possibility/probability concerning the future. Despite its importance, hope has rarely been studied as a social media analysis task. This paper presents a hope speech dataset that classifies each tweet first into "Hope" and "Not Hope", then into three fine-grained hope categories: "Generalized Hope", "Realistic Hope", and "Unrealistic Hope" (along with "Not Hope"). English tweets in the first half of 2022 were collected to build this dataset. Furthermore, we describe our annotation process and guidelines in detail and discuss the challenges of classifying hope and the limitations of the existing hope speech detection corpora. In addition, we reported several baselines based on different learning approaches, such as traditional machine learning, deep learning, and transformers, to benchmark our dataset. We evaluated our baselines using weighted-averaged and macro-averaged F1-scores. Observations show that a strict process for annotator selection and detailed annotation guidelines enhanced the dataset's quality. This strict annotation process resulted in promising performance for simple machine learning classifiers with only bi-grams; however, binary and multiclass hope speech detection results reveal that contextual embedding models have higher performance in this dataset.

翻译：希望被描述为对未来的开放精神、希望、期望和希望的希望,希望的希望被描述为对未来的开放性,希望或希望的希望会发生或发生一些明显影响人类的心态、情感、行为和决定的事物。希望通常与期望的预期和对未来的可能/概率的概念相联系。尽管它很重要,但希望很少作为社交媒体分析任务来研究。本文展示了希望演讲数据集,将每条推文首先归为“希望”和“不是希望”,然后分为三个细微的希望类别:“普遍希望”、“现实希望”和“非现实希望”(连同“非希望”一起)。2022年上半年收集的英语推文通常与对未来的期望和可能性/可能性/概率概念的概念相联系。此外,我们详细描述了我们的批注过程和指导方针,并讨论了将希望言论探测现有希望感官的肉团的局限性。此外,我们报告了若干基于不同学习方法的基线,例如传统机器学习、深层次学习和变异,以衡量我们的数据集为基准。我们用加权和宏观的更深层次的语音模型来评估我们的基线。我们用加权和高层次的精确的精确的语音测量结果,结果显示一个精确的成绩。观察结果,然后显示一个精确的精确的精确的成绩。观察显示过程,结果显示一个精确的精确的精确的精确的成绩。一个过程,但结果显示一个精确的精确的精确的精确的精确的精确的精确的精确的成绩。