Character linking, the task of linking mentioned people in conversations to the real world, is crucial for understanding the conversations. For the efficiency of communication, humans often choose to use pronouns (e.g., "she") or normal phrases (e.g., "that girl") rather than named entities (e.g., "Rachel") in the spoken language, which makes linking those mentions to real people a much more challenging than a regular entity linking task. To address this challenge, we propose to incorporate the richer context from the coreference relations among different mentions to help the linking. On the other hand, considering that finding coreference clusters itself is not a trivial task and could benefit from the global character information, we propose to jointly solve these two tasks. Specifically, we propose C$^2$, the joint learning model of Coreference resolution and Character linking. The experimental results demonstrate that C$^2$ can significantly outperform previous works on both tasks. Further analyses are conducted to analyze the contribution of all modules in the proposed model and the effect of all hyper-parameters.
翻译:将对话中提及的人与真实世界连接起来的任务,是理解对话的关键。为了提高交流的效率,人类往往选择使用口语中的代名词(例如“女孩”)或普通词组(例如“女孩”),而不是名称实体(例如“女孩”),这使得将提及的人与真实人连接起来比将任务与正常实体连接起来要困难得多。为了应对这一挑战,我们提议纳入从不同提及之间的共同参照关系中获得的更丰富背景,以帮助连接。另一方面,考虑到找到共同参照组本身不是一件微不足道的任务,而且可能受益于全球特性信息,我们提议共同解决这两项任务。具体地说,我们提议2美元,即共同参照分辨率和字符连接的联合学习模式。实验结果表明,2美元可以大大超过先前关于这两项任务的工作。我们进行了进一步分析,以分析拟议模式中所有模块的贡献以及所有超参数的效果。