Explaining the predictions of neural black-box models is an important problem, especially when such models are used in applications where user trust is crucial. Estimating the influence of training examples on a learned neural model's behavior allows us to identify the training examples most responsible for a given prediction and, therefore, to faithfully explain the output of a black-box model. The most generally applicable existing method is based on influence functions, which scale poorly with sample size and model size. We propose gradient rollback, a general approach to influence estimation that is applicable to neural models in which each parameter update step during gradient descent touches only a small number of parameters, even if the overall number of parameters is large. Neural matrix factorization models trained with gradient descent belong to this model class. These models are popular and have a wide range of applications in industry; knowledge graph embedding methods, which also belong to this class, are used especially extensively. We show that gradient rollback is highly efficient at both training and test time. Moreover, we show theoretically that the difference between gradient rollback's influence approximation and the true influence on a model's behavior is smaller than known bounds on the stability of stochastic gradient descent. This establishes that gradient rollback robustly estimates example influence. We also conduct experiments that show gradient rollback provides faithful explanations on knowledge base completion and recommender datasets.
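To make the mechanism concrete, the following is a minimal sketch (not the authors' reference implementation) of gradient rollback for a DistMult-style matrix factorization model trained with plain SGD. The key property the abstract relies on is that one update step on a triple touches only three embedding rows, so the update each training example causes can be recorded cheaply and later "rolled back" to estimate that example's influence on a test prediction. All names here (`sgd_step`, `rollback_influence`, the toy triples, the logistic loss) are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
n_entities, n_relations, dim = 50, 5, 16
E = rng.normal(scale=0.1, size=(n_entities, dim))   # entity embeddings
R = rng.normal(scale=0.1, size=(n_relations, dim))  # relation embeddings

def score(h, r, t):
    """DistMult score of triple (h, r, t)."""
    return float(np.sum(E[h] * R[r] * E[t]))

lr = 0.1
influence = {}  # training triple -> accumulated parameter deltas per row

def sgd_step(triple):
    """One SGD step on a positive triple with loss -log sigmoid(score).
    Only three embedding rows are touched; record their deltas."""
    h, r, t = triple
    g = -1.0 / (1.0 + np.exp(score(h, r, t)))  # d loss / d score
    # Compute all deltas before applying them (a simultaneous step).
    updates = [("E", h, -lr * g * R[r] * E[t]),
               ("R", r, -lr * g * E[h] * E[t]),
               ("E", t, -lr * g * E[h] * R[r])]
    rec = influence.setdefault(triple, {})
    for mat, i, delta in updates:
        (E if mat == "E" else R)[i] += delta
        rec[(mat, i)] = rec.get((mat, i), 0.0) + delta

def rollback_influence(train_triple, test_triple):
    """Estimate train_triple's influence on test_triple's score by
    temporarily undoing the parameter updates train_triple caused."""
    before = score(*test_triple)
    rec = influence.get(train_triple, {})
    for (mat, i), delta in rec.items():   # roll back
        (E if mat == "E" else R)[i] -= delta
    after = score(*test_triple)
    for (mat, i), delta in rec.items():   # restore the model
        (E if mat == "E" else R)[i] += delta
    return before - after                 # >0: example raised the score

train = [(0, 1, 2), (0, 1, 3), (4, 2, 2)]
for _ in range(100):
    for triple in train:
        sgd_step(triple)

test = (0, 1, 2)
for tr in train:
    print(tr, rollback_influence(tr, test))
```

The rollback avoids retraining the model once per training example: because each example's footprint on the parameters is small and recorded during training, its counterfactual removal can be approximated in time proportional to that footprint.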