Chinese word segmentation and dependency parsing are two fundamental tasks for Chinese natural language processing. The dependency parsing is defined on word-level, therefore word segmentation is the precondition of dependency parsing, which makes dependency parsing suffers from error propagation. In this paper, we propose a unified model to integrate Chinese word segmentation and dependency parsing. Different from previous joint models, our proposed model is a graph-based model and more concise, which results in fewer efforts of feature engineering. Our joint model achieves better performance than previous joint models. Our joint model achieves the state-of-the-art results in both Chinese word segmentation and dependency parsing.
翻译:中文文字分割和依赖性分割是中国自然语言处理的两大基本任务。 依赖性分割是根据字级定义的,因此,文字分割是依赖性分割的先决条件,这使得依赖性分割受到错误传播的影响。 在本文中,我们提出了一个统一模式,将中文文字分割和依赖性分割结合起来。 与以前的联合模式不同,我们提议的模式是一个基于图表的模型,更加简洁,从而减少了特征工程的力度。 我们的联合模型比以前的联合模型取得了更好的性能。 我们的联合模型在中文文字分割和依赖性分割上都取得了最先进的成果。