Address parsing consists of identifying the segments that make up an address, such as a street name or a postal code. Because of its importance for tasks like record linkage, address parsing has been approached with many techniques, the latest relying on neural networks. While these models yield notable results, previous work on neural networks has only focused on parsing addresses from a single source country. This paper explores the possibility of transferring the address parsing knowledge acquired by training deep learning models on some countries' addresses to others with no further training in a zero-shot transfer learning setting. We also experiment using an attention mechanism and a domain adversarial training algorithm in the same zero-shot transfer setting to improve performance. Both methods yield state-of-the-art performance for most of the tested countries while giving good results to the remaining countries. We also explore the effect of incomplete addresses on our best model, and we evaluate the impact of using incomplete addresses during training. In addition, we propose an open-source Python implementation of some of our trained models.
翻译:地址分割包括确定构成地址的部分,如街道名称或邮政编码。由于地址分割对于记录链接等任务的重要性,我们用许多技术,即最近依赖神经网络的方法,处理地址分割问题。虽然这些模型产生了显著的成果,但以前关于神经网络的工作只侧重于单一来源国家的地址分割。本文探讨了将通过培训深度学习模式获得的地址划分知识转移到一些国家地址上的可能性,而一些国家的地址没有在零发转让学习环境中接受进一步培训。我们还在相同的零发转让环境中使用关注机制和域对域对抗性培训算法进行实验,以提高绩效。两种方法都为大多数受测试的国家带来最新业绩,同时给其余国家带来良好结果。我们还探讨了不完整地址对我们最佳模式的影响,我们评估在培训期间使用不完整地址的影响。此外,我们建议对一些经过培训的模型进行开放源 Python 实施。