Connecting Attributions and QA Model Behavior on Realistic Counterfactuals

When a model attribution technique highlights a particular part of the input, a user might understand this highlight as making a statement about counterfactuals (Miller, 2019): if that part of the input were to change, the model's prediction might change as well. This paper investigates how well different attribution techniques align with this assumption on realistic counterfactuals in the case of reading comprehension (RC). RC is a particularly challenging test case, because the token-level attributions that have been studied extensively in other NLP tasks, such as sentiment analysis, are less suited to representing the reasoning that RC models perform. We construct counterfactual sets for three different RC settings, and, using heuristics that connect attribution methods' outputs to high-level model behavior, we evaluate how useful different attribution methods, and even different attribution formats, are for understanding counterfactuals. We find that pairwise attributions are better suited to RC than token-level attributions across these RC settings, with our best performance coming from a modification that we propose to an existing pairwise attribution method.

