Rashomon曲线和卷卷的研究:关于机械学习中一般化和模式简化的新观点 (A study in Rashomon curves and volumes: A new perspective on generalization and model simplicity in machine learning)

The Rashomon effect occurs when many different explanations exist for the same phenomenon. In machine learning, Leo Breiman used this term to characterize problems where many accurate-but-different models exist to describe the same data. In this work, we study how the Rashomon effect can be useful for understanding the relationship between training and test performance, and the possibility that simple-yet-accurate models exist for many problems. We consider the Rashomon set - the set of almost-equally-accurate models for a given problem - and study its properties and the types of models it could contain. We present the Rashomon ratio as a new measure related to simplicity of model classes, which is the ratio of the volume of the set of accurate models to the volume of the hypothesis space; the Rashomon ratio is different from standard complexity measures from statistical learning theory. For a hierarchy of hypothesis spaces, the Rashomon ratio can help modelers to navigate the trade-off between simplicity and accuracy. In particular, we find empirically that a plot of empirical risk vs. Rashomon ratio forms a characteristic $\Gamma$-shaped Rashomon curve, whose elbow seems to be a reliable model selection criterion. When the Rashomon set is large, models that are accurate - but that also have various other useful properties - can often be obtained. These models might obey various constraints such as interpretability, fairness, or monotonicity.

翻译：Rashomon效应发生于对同一现象存在多种不同解释时。在机器学习中, Leo Breiman 使用这个术语来描述存在许多准确但不同模型的问题,以描述相同数据。在这项工作中,我们研究Rashomon效应如何有助于理解培训和测试性能之间的关系,以及存在简单但非准确模型的可能性。我们认为,Rashomon 集 — — 即对某个特定问题来说几乎是平等的精确模型集 — — 并研究其属性和它可能包含的模型类型。我们提出Rashomon比率作为与模型类的简单性相关的新衡量标准,即精确模型数量与假设空间数量之比;Rashomon 比率与统计学习理论的标准复杂度不同。对于假设空间的等级,Rashomon比率可以帮助模型在简单和准确性之间实现交易。我们特别发现,根据经验,Rashomon 比率构成一个特征 $Gammam$-smaility syality experformation 标准,这些模型可能具有其他的可靠性。

相关内容

MoDELS

关注 44

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

【ETH】最新《几何数据分析》2020课程，附PPT下载

专知会员服务

45+阅读 · 2020年12月18日

专知会员服务

39+阅读 · 2020年11月3日

元迁移学习的小样本学习，Meta-transfer Learning for Few-shot Learning

专知会员服务

159+阅读 · 2020年2月29日

【牛津大学Yee Whye Teh 】论深度学习中的统计思维（On Statistical Thinking in Deep Learning），附49页ppt