Towards Faithful Model Explanation in NLP: A Survey

End-to-end neural NLP architectures are notoriously difficult to understand, which has given rise to numerous efforts towards model explainability in recent years. An essential principle of model explanation is Faithfulness, i.e., an explanation should accurately represent the reasoning process behind the model's prediction. This survey first discusses the definition and evaluation of Faithfulness, as well as its significance for explainability. We then introduce recent advances in faithful explanation by grouping approaches into five categories: similarity methods, analysis of model-internal structures, backpropagation-based methods, counterfactual intervention, and self-explanatory models. Each category is illustrated with its representative studies, advantages, and shortcomings. Finally, we discuss all of the above methods in terms of their common virtues and limitations, and reflect on future work directions towards faithful explainability. For researchers interested in studying interpretability, this survey offers an accessible and comprehensive overview of the area, laying the basis for further exploration. For users hoping to better understand their own models, this survey serves as an introductory manual to help choose the most suitable explanation method(s).
