Probing classifiers have emerged as one of the prominent methodologies for interpreting and analyzing deep neural network models of natural language processing. The basic idea is simple---a classifier is trained to predict some linguistic property from a model's representations---and has been used to examine a wide variety of models and properties. However, recent studies have demonstrated various methodological weaknesses of this approach. This article critically reviews the probing classifiers framework, highlighting their promises, shortcomings, and advances.
翻译:检验分类器已成为解释和分析天然语言处理的深神经网络模型的突出方法之一,其基本思想是简单化的分类器经过培训,从模型的表述中预测一些语言属性,并用来审查各种模型和属性,然而,最近的研究表明了这一方法在方法上的各种弱点,这一条批判性地审查了检验分类器框架,突出了它们的许诺、缺点和进步。