Schrödinger 树 - 语法和神经语言模型 (Schrödinger's Tree -- On Syntax and Neural Language Models)

from arxiv, preprint, submitted to Frontiers in Artificial Intelligence: Perspectives for Natural Language Processing between AI, Linguistics and Cognitive Science

In the last half-decade, the field of natural language processing (NLP) has undergone two major transitions: the switch to neural networks as the primary modeling paradigm and the homogenization of the training regime (pre-train, then fine-tune). Amidst this process, language models have emerged as NLP's workhorse, displaying increasingly fluent generation capabilities and proving to be an indispensable means of knowledge transfer downstream. Due to the otherwise opaque, black-box nature of such models, researchers have employed aspects of linguistic theory in order to characterize their behavior. Questions central to syntax -- the study of the hierarchical structure of language -- have factored heavily into such work, shedding invaluable insights about models' inherent biases and their ability to make human-like generalizations. In this paper, we attempt to take stock of this growing body of literature. In doing so, we observe a lack of clarity across numerous dimensions, which influences the hypotheses that researchers form, as well as the conclusions they draw from their findings. To remedy this, we urge researchers make careful considerations when investigating coding properties, selecting representations, and evaluating via downstream tasks. Furthermore, we outline the implications of the different types of research questions exhibited in studies on syntax, as well as the inherent pitfalls of aggregate metrics. Ultimately, we hope that our discussion adds nuance to the prospect of studying language models and paves the way for a less monolithic perspective on syntax in this context.

翻译：在过去的半个十年中,自然语言处理领域经历了两个主要转变:向神经网络的转换,作为主要示范模式,以及培训制度的同质化(培训前,然后是微调)。在这一过程中,语言模型作为国家语言处理领域的工作马出现,表现出日益流畅的生成能力,并证明是下游知识转让不可或缺的手段。由于这些模型的不透明、黑盒性质,研究人员利用语言理论的各个方面来描述其行为特征。对语法至关重要的问题 -- -- 语言等级结构的研究 -- -- 已经在很大程度上纳入到这项工作中,对模型的内在偏见及其作出类似人性概括的能力提出了宝贵的洞见。在本文件中,我们试图评估这一不断增长的文献体系。我们发现,许多方面缺乏清晰度,这影响到研究人员的假设,以及他们从调查结果中得出的结论。为了纠正这一点,我们敦促研究人员在研究数学特性、选择陈述和通过下游研究的深度影响时,要仔细考虑如何研究其本质,我们作为研究的底部研究的底部问题,我们作为研究的底部研究的底部和底部研究的底部问题。此外,我们还注意到,我们作为研究的底部研究的底部研究的底部研究的底部研究的底部研究的底部研究的底部问题。