The long-distance agreement, evidence for syntactic structure, is increasingly used to assess the syntactic generalization of Neural Language Models. Much work has shown that transformers are capable of high accuracy in varied agreement tasks, but the mechanisms by which the models accomplish this behavior are still not well understood. To better understand transformers' internal working, this work contrasts how they handle two superficially similar but theoretically distinct agreement phenomena: subject-verb and object-past participle agreement in French. Using probing and counterfactual analysis methods, our experiments show that i) the agreement task suffers from several confounders which partially question the conclusions drawn so far and ii) transformers handle subject-verb and object-past participle agreements in a way that is consistent with their modeling in theoretical linguistics.
翻译:长距离协议,即综合结构的证据,正越来越多地被用来评估神经语言模型的综合性概括性。许多工作表明变压器在各种不同的协议任务中能够具有高度准确性,但模型完成这种行为的机制仍然不很清楚。为了更好地了解变压器的内部工作,这项工作对比了它们如何处理两种表面上相似但理论上截然不同的协议现象:法语的主题动词和对象偏向部分的参与者协议。 我们的实验利用调查和反事实分析方法表明,(一) 协议任务受到一些同级者的影响,他们对迄今得出的结论有部分质疑,(二) 变压器处理主题动词和对象偏向部分协议的方式与其理论语言模式一致。