As web archives' holdings grow, archivists subdivide them into collections so they are easier to understand and manage. In this work, we review the collection structures of eight web archive platforms: : Archive-It, Conifer, the Croatian Web Archive (HAW), the Internet Archive's user account web archives, Library of Congress (LC), PANDORA, Trove, and the UK Web Archive (UKWA). We note a plethora of different approaches to web archive collection structures. Some web archive collections support sub-collections and some permit embargoes. Curatorial decisions may be attributed to a single organization or many. Archived web pages are known by many names: mementos, copies, captures, or snapshots. Some platforms restrict a memento to a single collection and others allow mementos to cross collections. Knowledge of collection structures has implications for many different applications and users. Visitors will need to understand how to navigate collections. Future archivists will need to understand what options are available for designing collections. Platform designers need it to know what possibilities exist. The developers of tools that consume collections need to understand collection structures so they can meet the needs of their users.
翻译:随着网络档案库的积累,档案管理员将档案归并到收藏库中,这样他们更容易理解和管理。在这项工作中,我们审查了八个网络档案平台的收集结构:档案-IT、Conifer、克罗地亚网络档案馆(HAW)、互联网档案馆用户账户网络档案、国会图书馆(LC)、PANDORA、Trove和英国网络档案馆(UKWA)。我们注意到对网络档案收集结构的多种不同方法。一些网络档案收藏支持分集和一些许可禁运。一些网络档案收藏支持分集和一些许可禁运。判断性决定可能归属于一个组织或多个组织。档案网页有许多名称:纪念品、复制品、捕捉或快照。一些平台将一个纪念品限制为单一收藏,而另一些则允许纪念品交叉收藏。收集结构的知识对许多不同的应用程序和用户都有影响。访问者需要了解如何浏览收藏。未来档案管理员需要了解设计收藏的选项。平台设计师需要知道存在哪些可能性。使用收藏工具的开发者需要了解收藏结构,以便了解它们的用户的需要。