Mordecai3 is a new end-to-end text geoparser and event geolocation system. The system performs toponym resolution using a new neural ranking model to resolve a place name extracted from a document to its entry in the Geonames gazetteer. It also performs event geocoding, the process of linking events reported in text with the place names where they are reported to occur, using an off-the-shelf question-answering model. The toponym resolution model is trained on a diverse set of existing training data, along with several thousand newly annotated examples. The paper describes the model, its training process, and performance comparisons with existing geoparsers. The system is available as an open source Python library, Mordecai 3, and replaces an earlier geoparser, Mordecai v2, one of the most widely used text geoparsers (Halterman 2017).
翻译:Mordecai 3 是一款新的端到端文本地名解析与事件地理编码系统。该系统使用一种新的神经排序模型进行地名解析,将从文档中提取的地名解析为 Geonames 加泽提尔中的条目。它还使用一个现成的问答模型进行事件地理编码,即将在文本中报道的事件与其发生的地点联系起来。地名解析模型在现有训练数据的基础上,训练了一组多样化的注释示例。这篇论文描述了这个模型、其训练过程以及与现有地名解析器的性能比较。该系统作为一个开源的 Python 库 Mordecai 3 提供,并取代了一个广泛使用的文本地名解析器 Mordecai v2(Halterman 2017)。