Occam's razor is a guiding principle that models should be simple enough to describe observed data. While Bayesian model selection (BMS) embodies it by the intrinsic regularization effect (IRE), how observed data scale the IRE has not been fully understood. In the nonlinear regression with conditionally independent observations, we show that the IRE is scaled by observations' fineness, defined by the amount and quality of observed data. We introduce an observable that quantifies the IRE, referred to as the Bayes specific heat, inspired by the correspondence between statistical inference and statistical physics. We derive its scaling relation to observations' fineness. We demonstrate that the optimal model chosen by the BMS changes at critical values of observations' fineness, accompanying the IRE's variation. The changes are from choosing a coarse-grained model to a fine-grained one as observations' fineness increases. Our findings expand an understanding of BMS's typicality when observed data are insufficient.
翻译:Occam的剃刀是一个指导原则,模型应该简单到足以描述观察到的数据。 Baysian 模型选择(BMS)通过内在的正规化效果(IRE)体现出来,而观察到的数据规模却没有被完全理解。在非线性回归中,通过有条件的独立观测,我们显示IRE是通过观测的细微度来缩放的,根据观测数据的数量和质量来定义的。我们引入了一种可观察性,根据统计推理和统计物理学之间的对应关系,将IRE(称为Bayes)量化为特定热量。我们从它与观测的细微度之间的比例关系中得出。我们展示了BMS所选择的最佳模型在观测的精度关键值“细度”上的变化,与IRE的变异。这些变化是从选择粗微的模型到细微微的模型,随着观测的精度的增加。我们发现在观察到的数据不足时,对BMS的典型性能增加了了解。