Modern machine learning models are opaque, and as a result there is a burgeoning academic subfield on methods that explain these models' behavior. However, what is the precise goal of providing such explanations, and how can we demonstrate that explanations achieve this goal? Some research argues that explanations should help teach a student (either human or machine) to simulate the model being explained, and that the quality of explanations can be measured by the simulation accuracy of students on unexplained examples. In this work, leveraging meta-learning techniques, we extend this idea to improve the quality of the explanations themselves, specifically by optimizing explanations such that student models more effectively learn to simulate the original model. We train models on three natural language processing and computer vision tasks, and find that students trained with explanations extracted with our framework are able to simulate the teacher significantly more effectively than ones produced with previous methods. Through human annotations and a user study, we further find that these learned explanations more closely align with how humans would explain the required decisions in these tasks. Our code is available at https://github.com/coderpat/learning-scaffold
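To make the idea of "optimizing explanations for student simulability" concrete, here is a minimal sketch (not the authors' implementation) of the meta-learning loop described above. All models and losses are toy assumptions for illustration: the teacher is a fixed random linear model, the explainer is a learned gating over input features (standing in for, e.g., a learned combination of attention heads), and the student is regularized to align its input-gradient saliency with the explanation. The explainer parameters are then updated by differentiating through the student's training so that the trained student simulates the teacher better on held-out inputs.

```python
# Minimal sketch of the scaffold idea: meta-learn explainer parameters phi so
# that a student trained with the teacher's explanations simulates the teacher
# better on held-out data. Toy shapes and models; illustrative only.

import jax
import jax.numpy as jnp

rng = jax.random.PRNGKey(0)
D_IN, N_CLASSES = 16, 3

# --- Toy "teacher": a fixed random linear model the student must simulate.
k1, k2, k3 = jax.random.split(rng, 3)
W_teacher = jax.random.normal(k1, (D_IN, N_CLASSES))

def teacher_logits(x):
    return x @ W_teacher

# --- Explainer: per-feature importance scores, parameterized by phi.
def explain(phi, x):
    return jax.nn.softmax(phi) * jnp.abs(x)          # (batch, D_IN) saliency

# --- Student: small linear model trained to match teacher predictions,
# with a regularizer aligning its input-gradient saliency to the explanation.
def student_logits(theta, x):
    return x @ theta["W"] + theta["b"]

def student_loss(theta, phi, x):
    t_probs = jax.nn.softmax(teacher_logits(x))
    s_logp = jax.nn.log_softmax(student_logits(theta, x))
    sim_loss = -jnp.mean(jnp.sum(t_probs * s_logp, axis=-1))   # simulate teacher

    def max_logit(xi):                                # student saliency
        return student_logits(theta, xi[None]).max()
    s_sal = jnp.abs(jax.vmap(jax.grad(max_logit))(x))
    expl_loss = jnp.mean((s_sal - explain(phi, x)) ** 2)        # match explanation
    return sim_loss + 0.1 * expl_loss

# --- Inner loop: a few SGD steps of student training (differentiable in phi).
def train_student(phi, x_train, steps=5, lr=0.5):
    theta = {"W": jnp.zeros((D_IN, N_CLASSES)), "b": jnp.zeros(N_CLASSES)}
    for _ in range(steps):
        grads = jax.grad(student_loss)(theta, phi, x_train)
        theta = jax.tree_util.tree_map(lambda p, g: p - lr * g, theta, grads)
    return theta

# --- Meta-objective: simulation loss of the trained student on held-out data.
def meta_loss(phi, x_train, x_val):
    theta = train_student(phi, x_train)
    t_probs = jax.nn.softmax(teacher_logits(x_val))
    s_logp = jax.nn.log_softmax(student_logits(theta, x_val))
    return -jnp.mean(jnp.sum(t_probs * s_logp, axis=-1))

# --- Meta-update of the explainer parameters.
phi = jnp.zeros(D_IN)
x_train = jax.random.normal(k2, (64, D_IN))
x_val = jax.random.normal(k3, (64, D_IN))
for step in range(20):
    phi = phi - 0.1 * jax.grad(meta_loss)(phi, x_train, x_val)
print("meta simulation loss:", meta_loss(phi, x_train, x_val))
```

The design point the sketch tries to capture is the bilevel structure: explanation quality is never scored directly, but only through its downstream effect on how well a freshly trained student reproduces the teacher's predictions on unexplained examples.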