自然语言中的纯净地物是否都像自然语言中的原始地物? (Are All Spurious Features in Natural Language Alike? An Analysis through a Causal Lens)

The term `spurious correlations' has been used in NLP to informally denote any undesirable feature-label correlations. However, a correlation can be undesirable because (i) the feature is irrelevant to the label (e.g. punctuation in a review), or (ii) the feature's effect on the label depends on the context (e.g. negation words in a review), which is ubiquitous in language tasks. In case (i), we want the model to be invariant to the feature, which is neither necessary nor sufficient for prediction. But in case (ii), even an ideal model (e.g. humans) must rely on the feature, since it is necessary (but not sufficient) for prediction. Therefore, a more fine-grained treatment of spurious features is needed to specify the desired model behavior. We formalize this distinction using a causal model and probabilities of necessity and sufficiency, which delineates the causal relations between a feature and a label. We then show that this distinction helps explain results of existing debiasing methods on different spurious features, and demystifies surprising results such as the encoding of spurious features in model representations after debiasing.

翻译：在《国家劳工政策》中,“纯正的关联”一词被用来非正式地表示任何不可取的特征标签相关关系,但是,这种关联可能是不可取的,因为(一) 特征与标签无关(例如审查中的标点),或(二) 特征对标签的影响取决于上下文(例如审查中的否定词),语言任务无处不在。在(一) 情况下,我们希望模型对特征不起作用,而这对预测来说既不必要,也不足够。但是,在(二) 情况下,即使是理想模型(例如人类)也必须依赖特征,因为预测中有必要(但不够充分),因此,需要更精细地处理虚假特征,以具体说明理想的模型行为。我们用因果关系模型和必要性和充足性的概率将这种区分正规化,以描述特征和标签之间的因果关系。我们然后表明,这种区分有助于解释关于不同特征的模型不偏差方法的结果,在不同的表面特征上,以及令人惊讶的图像性之后,这些特征是令人惊讶的。

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日