The new and growing field of Quantitative Dependency Syntax has emerged at the crossroads between Dependency Syntax and Quantitative Linguistics. One of the main concerns in this field is the statistical patterns of syntactic dependency structures. These structures, grouped in treebanks, are the source for statistical analyses in these and related areas; dozens of scores devised over the years are the tools of a new industry to search for patterns and perform other sorts of analyses. The plethora of such metrics and their increasing complexity require sharing the source code of the programs used to perform such analyses. However, such code is not often shared with the scientific community or is tested following unknown standards. Here we present a new open-source tool, the Linear Arrangement Library (LAL), which caters to the needs of, especially, inexperienced programmers. This tool enables the calculation of these metrics on single syntactic dependency structures, treebanks, and collection of treebanks, grounded on ease of use and yet with great flexibility. LAL has been designed to be efficient, easy to use (while satisfying the needs of all levels of programming expertise), reliable (thanks to thorough testing), and to unite research from different traditions, geographic areas, and research fields.
翻译:新的和不断增长的量化依赖性语法领域出现在依赖性语法和定量语言之间的十字路口。这一领域的主要关切之一是综合依赖性结构的统计模式。这些结构以树库分类,是这些领域及相关领域统计分析的来源;多年来设计了几十个分数,是新产业寻找模式和进行其他分析的工具;这类指标过多及其日益复杂,需要分享用于进行这种分析的程序的源代码;然而,这种代码往往没有与科学界共享,也没有按照未知的标准进行测试。我们在这里提出了一个新的开放源工具,即线性安排图书馆(LAL),它满足特别是缺乏经验的程序员的需要。这一工具使得能够根据易于使用和具有极大灵活性的单一同步依赖性结构、树库和树库收集这些计量数据。LAL的设计效率高,便于使用(同时满足所有水平的专业知识需求),也便于使用。我们在这里展示了一个新的开放源工具,即线性安排图书馆(LAL),它满足了特别是缺乏经验的程序员的需要。这一工具有助于计算单项同步依赖性依赖性结构、树库和树库的收集。这些基准基于易于使用,但又具有极大的灵活性。LLLAL一直被设计为高效率、易于使用(同时满足所有水平的专门知识、从不同研究领域、可靠和不同领域的研究领域),可以进行彻底测试。