Selective classification is the task of rejecting inputs on which a model would predict incorrectly, trading off input-space coverage against model accuracy. Current methods for selective classification impose constraints on either the model architecture or the loss function, which inhibits their use in practice. In contrast to prior work, we show that state-of-the-art selective classification performance can be attained solely by studying the (discretized) training dynamics of a model. We propose a general framework that, for a given test input, monitors metrics capturing disagreement with the final predicted label over the intermediate models obtained during training; we then reject data points that exhibit too much disagreement at late stages of training. In particular, we instantiate a method that tracks when the label predicted during training stops disagreeing with the final predicted label. Our experimental evaluation shows that our method achieves state-of-the-art accuracy/coverage trade-offs on typical selective classification benchmarks. For example, we improve coverage on CIFAR-10/SVHN by 10.1%/1.5% respectively at a fixed target error of 0.5%.
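To make the instantiated method concrete, below is a minimal sketch (not the authors' exact implementation) of the idea described in the abstract: given the labels predicted for a test input at each saved intermediate checkpoint, score the input by how late its prediction last disagreed with the final predicted label, and abstain on inputs whose prediction settles too late in training. The function names `settling_score` and `selective_predict` and the threshold `tau` are illustrative assumptions, not terms from the paper.

```python
# Hedged sketch of checkpoint-disagreement-based selective classification.
# Assumes predictions at T intermediate checkpoints are already available.
import numpy as np

def settling_score(checkpoint_preds: np.ndarray) -> float:
    """checkpoint_preds: shape (T,), the label predicted for one input at
    each of T intermediate checkpoints (last entry = final prediction).
    Returns the fraction of training after which the prediction stopped
    disagreeing with the final predicted label (0.0 = agreed throughout,
    close to 1.0 = still disagreeing near the end of training)."""
    final_label = checkpoint_preds[-1]
    disagree = np.flatnonzero(checkpoint_preds != final_label)
    if disagree.size == 0:
        return 0.0
    # Normalize the index of the last disagreeing checkpoint to [0, 1).
    return (disagree[-1] + 1) / len(checkpoint_preds)

def selective_predict(checkpoint_preds: np.ndarray, tau: float):
    """Accept (return the final label) if the prediction settled early
    enough; otherwise abstain (return None). tau is a rejection threshold
    tuned on held-out data to hit a target accuracy/coverage trade-off."""
    if settling_score(checkpoint_preds) <= tau:
        return int(checkpoint_preds[-1])
    return None

# Example: a prediction that flips until checkpoint 3, then stays at label 2.
preds = np.array([0, 1, 1, 2, 2, 2, 2, 2])
print(settling_score(preds))          # 0.375 (last disagreement at index 2)
print(selective_predict(preds, 0.5))  # 2 (accepted)
```

In this sketch the settling score plays the role of the disagreement metric: lowering `tau` shrinks coverage but should raise accuracy on the accepted inputs, which is the trade-off the abstract describes.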