Archive collections are nowadays mostly available through search engines interfaces, which allow a user to retrieve documents by issuing queries. The study of these collections may be, however, impaired by some aspects of search engines, such as the overwhelming number of documents returned or the lack of contextual knowledge provided. New methods that could work independently or in combination with search engines are then required to access these collections. In this position paper, we propose to extend TimeLine Summarization (TLS) methods on archive collections to assist in their studies. We provide an overview of existing TLS methods and we describe a conceptual framework for an Archive TimeLine Summarization (ATLS) system, which aims to generate informative, readable and interpretable timelines.
翻译:目前,档案收藏大多通过搜索引擎接口提供,用户可以通过这些接口通过查询检索文件,但是,这些收藏的研究可能因搜索引擎的某些方面而受到影响,例如,大量文件被退回或缺乏背景知识,然后需要采用可以独立工作或与搜索引擎相结合的新方法来访问这些收藏。在本立场文件中,我们提议扩大关于档案收藏的时代缩略图(TLS)方法,以协助其研究。我们概述了现有的TLS方法,并描述了档案缩略图(ATLS)系统的概念框架,该系统旨在产生信息、可读和可解释的时间表。