Scientific press briefings are a valuable information source. They consist of alternating expert speeches, questions from the audience and their answers. Therefore, they can contribute to scientific and fact-based media coverage. Even though press briefings are highly informative, extracting statements relevant to individual journalistic tasks is challenging and time-consuming. To support this task, an automated statement extraction system is proposed. Claims are used as the main feature to identify statements in press briefing transcripts. The statement extraction task is formulated as a four-step procedure. First, the press briefings are split into sentences and passages, then claim sentences are identified through sequence classification. Subsequently, topics are detected, and the sentences are filtered to improve the coherence and assess the length of the statements. The results indicate that claim detection can be used to identify statements in press briefings. While many statements can be extracted automatically with this system, they are not always as coherent as needed to be understood without context and may need further review by knowledgeable persons.
翻译:科学新闻简报是宝贵的信息来源,包括专家演讲、听众提问和答复,因此,它们有助于科学的和基于事实的媒体报道。即使新闻简报内容丰富,但摘录与个别新闻工作有关的言论具有挑战性和耗时性。为了支持这项任务,提议采用自动声明提取系统,将索赔作为确定新闻简报记录稿中声明的主要特征。声明提取任务作为一个四步程序拟订。首先,新闻简报分为句子和段落,然后通过顺序分类确定判决。随后,发现主题,并过滤判决,以提高发言的连贯性和评估长度。结果显示,在新闻简报中,可以使用检测声明来识别声明。虽然许多声明可以自动提取,但是,它们并非始终像需要的那样一致,无需了解背景,可能需要有知识的人进一步审查。</s>