The connection between the maximum spanning tree in a directed graph and the best dependency tree of a sentence has been exploited by the NLP community. However, for many dependency parsing schemes, an important detail of this approach is that the spanning tree must have exactly one edge emanating from the root. While work has been done to efficiently solve this problem for finding the one-best dependency tree, no research has attempted to extend this solution to finding the $K$-best dependency trees. This is arguably a more important extension as a larger proportion of decoded trees will not be subject to the root constraint of dependency trees. Indeed, we show that the rate of root constraint violations increases by an average of $13$ times when decoding with $K\!=\!50$ as opposed to $K\!=\!1$. In this paper, we provide a simplification of the $K$-best spanning tree algorithm of Camerini et al. (1980). Our simplification allows us to obtain a constant time speed-up over the original algorithm. Furthermore, we present a novel extension of the algorithm for decoding the $K$-best dependency trees of a graph which are subject to a root constraint.
翻译:在方向图中,最大树宽幅与最有依赖性的长树之间的连接被国家劳工局社区所利用。然而,在许多依赖性剖析办法中,这一方法的一个重要细节是,横幅树必须有一个完全的根边缘。虽然已经为找到最佳依赖性树做了有效解决该问题的工作,但没有研究试图将这一解决办法扩大到寻找最佳的K$-最佳依赖性树。这可以说是一个更重要的延伸,因为更大比例的已解码树不会受到依赖性树的根限制。事实上,我们表明,在用$K\@ ⁇.50美元解码时,根限制违反率平均增加了13美元,而用$K\\\\\\\\\\\\\\\!1美元解码时是1美元。在本文中,我们简化了卡梅里尼等人(1980年)最好的树宽幅算法。我们的简化使我们得以在原算法上获得一个固定的时间加速速度。此外,我们提出了解码法的新扩展法,以解码法将1美元最佳依赖性树的底部树木解。