对边缘的预测:确定一个更大的模型在哪些方面更好 (Predicting on the Edge: Identifying Where a Larger Model Does Better)

Much effort has been devoted to making large and more accurate models, but relatively little has been put into understanding which examples are benefiting from the added complexity. In this paper, we demonstrate and analyze the surprisingly tight link between a model's predictive uncertainty on individual examples and the likelihood that larger models will improve prediction on them. Through extensive numerical studies on the T5 encoder-decoder architecture, we show that large models have the largest improvement on examples where the small model is most uncertain. On more certain examples, even those where the small model is not particularly accurate, large models are often unable to improve at all, and can even perform worse than the smaller model. Based on these findings, we show that a switcher model which defers examples to a larger model when a small model is uncertain can achieve striking improvements in performance and resource usage. We also explore committee-based uncertainty metrics that can be more effective but less practical.

翻译：已经为制作大型和更为准确的模型做出了很大的努力,但相对而言,对于哪些实例从增加的复杂性中受益的理解却很少。在本文件中,我们展示和分析了模型单个实例的预测不确定性与较大模型改进预测的可能性之间的令人惊讶的紧密联系。我们通过对T5编码器脱coder-decoder结构的广泛数字研究,表明大型模型在小型模型最不确定的示例方面有最大的改进。在更某些实例中,即使小型模型不特别准确,大型模型往往根本无法改进,甚至能够比小型模型更差。根据这些调查结果,我们表明,在小型模型不确定的情况下,将实例放入更大模型的开关模型,可以在性能和资源使用方面实现显著的改进。我们还探讨了基于委员会的不确定性指标,这些指标可能更加有效,但不太实用。

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

2020数据工程师成长路线图

专知会员服务

41+阅读 · 2020年9月6日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日