Many computational argumentation tasks, like stance classification, are topic-dependent: the effectiveness of approaches to these tasks significantly depends on whether the approaches were trained on arguments from the same topics as those they are tested on. So, which are these topics that researchers train approaches on? This paper contributes the first comprehensive survey of topic coverage, assessing 45 argument corpora. For the assessment, we take the first step towards building an argument topic ontology, consulting three diverse authoritative sources: the World Economic Forum, the Wikipedia list of controversial topics, and Debatepedia. Comparing the topic sets between the authoritative sources and corpora, our analysis shows that the corpora topics-which are mostly those frequently discussed in public online fora - are covered well by the sources. However, other topics from the sources are less extensively covered by the corpora of today, revealing interesting future directions for corpus construction.
翻译:许多计算论证任务,如立场分类,都取决于专题:这些任务的方法的有效性在很大程度上取决于这些方法是否按照与所测试的相同主题的论据进行了培训。因此,这些是研究人员所培训的课题吗?本文对第一次专题涵盖范围的全面调查作了贡献,评估了45个论点公司。关于评估,我们迈出了第一步,以建立一个辩论主题本体学,咨询了三个不同的权威来源:世界经济论坛、维基百科有争议的议题清单和辩论。比较了权威来源与公司之间的专题组合,我们的分析表明,来源很好地涵盖了大部分在公共在线论坛中经常讨论的社团专题。然而,今天的社团对来源的另一些专题的涵盖范围较小,揭示了构建实体的有趣未来方向。