多国地址地址划分:零热评价 (Multinational Address Parsing: A Zero-Shot Evaluation)

Address parsing consists of identifying the segments that make up an address, such as a street name or a postal code. Because of its importance for tasks like record linkage, address parsing has been approached with many techniques, the latest relying on neural networks. While these models yield notable results, previous work on neural networks has only focused on parsing addresses from a single source country. This paper explores the possibility of transferring the address parsing knowledge acquired by training deep learning models on some countries' addresses to others with no further training in a zero-shot transfer learning setting. We also experiment using an attention mechanism and a domain adversarial training algorithm in the same zero-shot transfer setting to improve performance. Both methods yield state-of-the-art performance for most of the tested countries while giving good results to the remaining countries. We also explore the effect of incomplete addresses on our best model, and we evaluate the impact of using incomplete addresses during training. In addition, we propose an open-source Python implementation of some of our trained models.

翻译：地址分割包括确定构成地址的部分,如街道名称或邮政编码。由于地址分割对于记录链接等任务的重要性,我们用许多技术,即最近依赖神经网络的方法,处理地址分割问题。虽然这些模型产生了显著的成果,但以前关于神经网络的工作只侧重于单一来源国家的地址分割。本文探讨了将通过培训深度学习模式获得的地址划分知识转移到一些国家地址上的可能性,而一些国家的地址没有在零发转让学习环境中接受进一步培训。我们还在相同的零发转让环境中使用关注机制和域对域对抗性培训算法进行实验,以提高绩效。两种方法都为大多数受测试的国家带来最新业绩,同时给其余国家带来良好结果。我们还探讨了不完整地址对我们最佳模式的影响,我们评估在培训期间使用不完整地址的影响。此外,我们建议对一些经过培训的模型进行开放源 Python 实施。

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【领域对抗学习的低资源文本分类】Low-Resource Text Classification using Domain-Adversarial Learning

专知会员服务

23+阅读 · 2020年4月22日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

元迁移学习的小样本学习，Meta-transfer Learning for Few-shot Learning

专知会员服务

159+阅读 · 2020年2月29日