示范蒸馏统计稳定通用办法 (A Generic Approach for Statistical Stability in Model Distillation)

Model distillation has been a popular method for producing interpretable machine learning. It uses an interpretable "student" model to mimic the predictions made by the black box "teacher" model. However, when the student model is sensitive to the variability of the data sets used for training, the corresponded interpretation is not reliable. Existing strategies stabilize model distillation by checking whether a large enough corpus of pseudo-data is generated to reliably reproduce student models, but methods to do so have so far been developed for a specific student model. In this paper, we develop a generic approach for stable model distillation based on central limit theorem for the average loss. We start with a collection of candidate student models and search for candidates that reasonably agree with the teacher. Then we construct a multiple testing framework to select a corpus size such that the consistent student model would be selected under different pseudo sample. We demonstrate the application of our proposed approach on three commonly used intelligible models: decision trees, falling rule lists and symbolic regression. Finally, we conduct simulation experiments on Mammographic Mass and Breast Cancer datasets and illustrate the testing procedure throughout a theoretical analysis with Markov process.

翻译：模型蒸馏是生成可解释的机器学习的一种流行方法。它使用一种可解释的“ 学生” 模型来模仿黑盒“ 教师” 模型所作的预测。但是, 当学生模型对用于培训的数据集的变异性敏感时, 对应的解释是不可靠的。现有的战略通过检查是否生成了足够多的伪数据来可靠复制学生模型来稳定模型蒸馏, 但迄今为止已经为特定学生模型开发了这样做的方法。在本文中, 我们开发了一种基于平均损失中央限值的稳定模型蒸馏的通用方法。我们从收集候选学生模型开始, 并寻找与教师合理一致的候选人。然后我们建立一个多重测试框架, 以选择一个体积大小, 这样一致的学生模型就可以在不同伪样本中选择。我们展示了我们提出的方法在三种常用的智能模型上的应用情况: 决策树、规则列表的下降和象征性回归。最后, 我们用Memgraphic质量和乳腺癌数据集进行模拟实验, 并在与Markov 过程的理论分析过程中说明测试程序。

相关内容

MoDELS

关注 0

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日