跨语言摘要调查 (A Survey on Cross-Lingual Summarization)

Cross-lingual summarization is the task of generating a summary in one language (e.g., English) for the given document(s) in a different language (e.g., Chinese). Under the globalization background, this task has attracted increasing attention of the computational linguistics community. Nevertheless, there still remains a lack of comprehensive review for this task. Therefore, we present the first systematic critical review on the datasets, approaches, and challenges in this field. Specifically, we carefully organize existing datasets and approaches according to different construction methods and solution paradigms, respectively. For each type of datasets or approaches, we thoroughly introduce and summarize previous efforts and further compare them with each other to provide deeper analyses. In the end, we also discuss promising directions and offer our thoughts to facilitate future research. This survey is for both beginners and experts in cross-lingual summarization, and we hope it will serve as a starting point as well as a source of new ideas for researchers and engineers interested in this area.

翻译：以一种语文(如英文)编写不同语文(如中文)的某一文件摘要(如英文)是一项任务。在全球化背景下,这项任务已引起计算语言界越来越多的注意,然而,仍缺乏对这项任务的全面审查,因此,我们首次对这一领域的数据集、方法和挑战进行系统的严格审查。具体地说,我们分别按照不同的构建方法和解决方案模式,仔细组织现有的数据集和方法。对于每一类数据集或方法,我们全面介绍和总结以往的努力,并进一步相互比较,以提供更深入的分析。最后,我们还讨论有希望的方向,提出我们的想法,以促进今后的研究。这项调查既针对初学者,也针对跨语言的拼图化专家,我们希望它将成为对这一领域感兴趣的研究人员和工程师的新想法的起点和来源。

相关内容

Computational Linguistics

关注 843

计算语言学(Computational Linguistics)是历史最悠久的出版物，专门研究语言的计算和数学特性以及自然语言处理系统的设计和分析。这本备受推崇的季刊为大学和工业界的语言学家、计算语言学家、人工智能和机器学习研究者、认知科学家、语言专家和哲学家提供有关语言研究各个方面的计算方面的最新信息。官网地址：http://dblp.uni-trier.de/db/journals/coling/

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日