Discourse information is difficult to represent and annotate. Among the major frameworks for annotating discourse information, RST, PDTB and SDRT are widely discussed and used, each having its own theoretical foundation and focus. Corpora annotated under different frameworks vary considerably. To make better use of the existing discourse corpora and achieve the possible synergy of different frameworks, it is worthwhile to investigate the systematic relations between different frameworks and devise methods of unifying the frameworks. Although the issue of framework unification has been a topic of discussion for a long time, there is currently no comprehensive approach which considers unifying both discourse structure and discourse relations and evaluates the unified framework intrinsically and extrinsically. We plan to use automatic means for the unification task and evaluate the result with structural complexity and downstream tasks. We will also explore the application of the unified framework in multi-task learning and graphical models.
翻译:在叙述性讨论信息的主要框架中,对讨论信息进行广泛讨论和使用,每个框架都有自己的理论基础和重点,Cordora在不同框架内的说明有很大差异,为了更好地利用现有讨论社团,实现不同框架可能的协同作用,有必要调查不同框架之间的系统关系,并设计统一框架的方法。虽然框架统一问题长期以来一直是讨论的议题,但目前没有考虑统一讨论结构和讨论关系并评估统一框架的综合办法,我们计划使用自动手段执行统一任务,并以结构复杂和下游任务来评估结果。我们还将探讨在多任务学习和图形模型中应用统一框架的问题。