校准与解释相匹配: 一种简单有效的方法,用于模拟信任估计 (Calibration Meets Explanation: A Simple and Effective Approach for Model Confidence Estimates)

Calibration strengthens the trustworthiness of black-box models by producing better accurate confidence estimates on given examples. However, little is known about if model explanations can help confidence calibration. Intuitively, humans look at important features attributions and decide whether the model is trustworthy. Similarly, the explanations can tell us when the model may or may not know. Inspired by this, we propose a method named CME that leverages model explanations to make the model less confident with non-inductive attributions. The idea is that when the model is not highly confident, it is difficult to identify strong indications of any class, and the tokens accordingly do not have high attribution scores for any class and vice versa. We conduct extensive experiments on six datasets with two popular pre-trained language models in the in-domain and out-of-domain settings. The results show that CME improves calibration performance in all settings. The expected calibration errors are further reduced when combined with temperature scaling. Our findings highlight that model explanations can help calibrate posterior estimates.

翻译：校准加强了黑盒模型的可信度, 因为它对特定实例提出了更准确的可信度估计。但是, 模型解释是否有助于信任校准, 却鲜为人知。直观地说, 人类看重要的特征属性, 并决定模型是否可信。同样, 解释可以告诉我们模型可能何时知道。受此启发, 我们提议了一个名为 CME 的方法, 利用模型解释来使模型与非感应属性相比更不可信。设想是, 当模型不甚自信时, 很难找到任何类别的强烈迹象, 因此符号不会给任何类别带来高分数, 反之亦然。我们在六个数据集上进行了广泛的实验, 实验中有两个广受欢迎的预先训练的语言模型, 在主域内外设置中。结果显示, CME 改善了所有环境的校准性能。与温度缩放相结合时, 预期校准错误会进一步减少。我们的发现, 模型解释可以帮助校准海边的估计数。

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/