Digitization of historical records has produced a significant amount of data for analysis and interpretation. A critical challenge is relating historical information across different archives so that the data can be framed in the appropriate historical context. This paper presents a real-world case study on historical information integration and record matching, with the goal of improving the historical value of archives containing data from the period 1800 to 1920. The archives contain unique information about Métis and Indigenous people in Canada and their interactions with European settlers. They hold thousands of records whose relevance increases when relationships and interconnections among them are discovered. The contribution is a record linkage approach suitable for historical archives and an evaluation of its effectiveness. Experimental results demonstrate the potential to discover historical linkages with high precision, enabling new historical discoveries.