We propose a transition-based approach that, by training a single model, can efficiently parse any input sentence with both constituent and dependency trees, supporting both continuous/projective and discontinuous/non-projective syntactic structures. To that end, we develop a Pointer Network architecture with two separate task-specific decoders and a common encoder, and follow a multitask learning strategy to jointly train them. The resulting quadratic system, not only becomes the first parser that can jointly produce both unrestricted constituent and dependency trees from a single model, but also proves that both syntactic formalisms can benefit from each other during training, achieving state-of-the-art accuracies in several widely-used benchmarks such as the continuous English and Chinese Penn Treebanks, as well as the discontinuous German NEGRA and TIGER datasets.
翻译:我们建议一种过渡性方法,通过培训单一模式,可以有效地分析任何含有成份和依赖性树木的输入句,同时支持连续/预测和非连续/非预测综合结构。 为此,我们开发了一个指点网络结构,配有两个单独的任务解码器和一个共同的编码器,并遵循一个多任务学习战略来联合培训它们。 由此形成的二次系统不仅成为第一个能够从单一模式中联合生产无限制成份和依赖性树木的剖析器,而且还证明,在培训、在诸如连续的英语和中国彭氏树库以及不连续的德国国家地理和地理研究所和TIGER数据集等一些广泛使用的基准中,实现最新技术的精密性,以及在连续的德国国家地理和TIGER数据集中,两者都能相互受益。