Understanding who blames or supports whom in news text is a critical research question in computational social science. Traditional methods and datasets for sentiment analysis are, however, not suitable for the domain of political text as they do not consider the direction of sentiments expressed between entities. In this paper, we propose a novel NLP task of identifying directed sentiment relationship between political entities from a given news document, which we call directed sentiment extraction. From a million-scale news corpus, we construct a dataset of news sentences where sentiment relations of political entities are manually annotated. We present a simple but effective approach for utilizing a pretrained transformer, which infers the target class by predicting multiple question-answering tasks and combining the outcomes. We demonstrate the utility of our proposed method for social science research questions by analyzing positive and negative opinions between political entities in two major events: 2016 U.S. presidential election and COVID-19. The newly proposed problem, data, and method will facilitate future studies on interdisciplinary NLP methods and applications.
翻译:了解谁在新闻文本中是谁是计算社会科学中的关键研究问题。但是,传统的方法和数据组用于情绪分析,并不适合政治文本领域,因为它们不考虑各实体之间表达的情绪方向。在本文中,我们提议一个新的国家语言方案任务,即从我们称之为直接情绪提取的某一新闻文件中查明政治实体之间的直接情绪关系。我们从100万个规模的新闻资料库中,建立一个新闻句子数据集,其中对政治实体的情绪关系进行人工注释。我们提出了一个简单而有效的方法,用于使用预先训练的变压器,通过预测多答问题任务和合并结果来推断目标类别。我们通过分析政治实体在两个重大活动中的正面和负面观点,展示了我们拟议的社会科学研究方法的效用:2016年美国总统选举和COVID-19。新提出的问题、数据和方法将促进未来对跨学科国家语言方案方法和应用程序的研究。