差异感化感化歌声 (Differentiable Generative Phonology)

The goal of generative phonology, as formulated by Chomsky and Halle (1968), is to specify a formal system that explains the set of attested phonological strings in a language. Traditionally, a collection of rules (or constraints, in the case of optimality theory) and underlying forms (UF) are posited to work in tandem to generate phonological strings. However, the degree of abstraction of UFs with respect to their concrete realizations is contentious. As the main contribution of our work, we implement the phonological generative system as a neural model differentiable end-to-end, rather than as a set of rules or constraints. Contrary to traditional phonology, in our model, UFs are continuous vectors in $\mathbb{R}^d$, rather than discrete strings. As a consequence, UFs are discovered automatically rather than posited by linguists, and the model can scale to the size of a realistic vocabulary. Moreover, we compare several modes of the generative process, contemplating: i) the presence or absence of an underlying representation in between morphemes and surface forms (SFs); and ii) the conditional dependence or independence of UFs with respect to SFs. We evaluate the ability of each mode to predict attested phonological strings on 2 datasets covering 5 and 28 languages, respectively. The results corroborate two tenets of generative phonology, viz. the necessity for UFs and their independence from SFs. In general, our neural model of generative phonology learns both UFs and SFs automatically and on a large-scale.

翻译：由Chomsky和Halle(1968年)所制定的基因质谱学的目标是指定一个正式系统,用一种语言来解释一套经证明的声乐字符串。传统上,一套规则(或约束,在最佳理论方面)和基本形式(UF)被假定为协同工作,以产生声乐字符串。然而,在具体实现方面,UF的抽象程度是有争议的。作为我们工作的主要贡献,我们把声调基因质谱系统作为一种神经模型,而不是一套规则或约束。与传统的声调学相反,在我们的模式中,UF是美元(或制约,在最佳理论理论方面)和基本形式(UF)的连续矢量。结果,UF是自动发现的,而不是语言学家的假设,而模型可以缩放到现实词汇的大小。此外,我们比较了几种配方的语谱化过程,我们思考了:是否存在或没有在表层和表层规则或约束下一套规则或约束。与我们模式的货币质变的货币系的可靠性、我们货币系的可靠性和货币质变的可靠性分别评估了我们货币质变的可靠性的可靠性和货币系的可靠性的可靠性的可靠性,我们对货币系的可靠性的可靠性的可靠性的可靠性的可靠性的可靠性的可靠性和稳定性的可靠性和稳定性的可靠性的可靠性的可靠性的可靠性的可靠性和稳定性的可靠性和可靠性,我们性能性能性能的可靠性的可靠性的可靠性的可靠性的可靠性的可靠性的可靠性的可靠性的可靠性。