We present graph-based translation models which translate source graphs into target strings. Source graphs are constructed from dependency trees with extra links so that non-syntactic phrases are connected. Inspired by phrase-based models, we first introduce a translation model which segments a graph into a sequence of disjoint subgraphs and generates a translation by combining subgraph translations left-to-right using beam search. However, similar to phrase-based models, this model is weak at phrase reordering. Therefore, we further introduce a model based on a synchronous node replacement grammar which learns recursive translation rules. We provide two implementations of the model with different restrictions so that source graphs can be parsed efficiently. Experiments on Chinese--English and German--English show that our graph-based models are significantly better than corresponding sequence- and tree-based baselines.
翻译:我们提出基于图形的翻译模型,将源图转换成目标字符串。源图是用具有额外链接的有依赖性的树来构造的,这样可以连接非合成的词组。受基于语句的模型的启发,我们首先引入一个翻译模型,将图表分成一个断开的子集,通过使用光束搜索将左向右翻译合并产生翻译。然而,与基于语句的模型类似,该模型在重新排序短语方面软弱无力。因此,我们进一步引入一个基于同步节点替换语法的模型,以学习循环翻译规则。我们提供了两种模型的实施,但有不同的限制,以便源图能够有效地被分割。对中英和德文英语的实验显示,我们的基于图形的模型比相应的序列和树基基线要好得多。