In statistical inference, uncertainty is unknown and all models are wrong. A person who makes a statistical model and a prior distribution is simultaneously aware that they are fictional and virtual candidates. In order to study such cases, several statistical measures have been constructed, such as cross validation, information criteria, and marginal likelihood, however, their mathematical properties have not yet been completely clarified when statistical models are under- and over- parametrized. In this paper, we introduce a place of mathematical theory of Bayesian statistics for unknown uncertainty, on which we show general properties of cross validation, information criteria, and marginal likelihood. The derived theory holds even if an unknown uncertainty is unrealizable by a statistical morel or even if the posterior distribution cannot be approximated by any normal distribution, hence it gives a helpful standpoint for a person who cannot believe in any specific model and prior. The results are followings. (1) There exists a more precise statistical measure of the generalization loss than leave-one-out cross validation and information criterion based on the mathematical properties of them. (2) There exists a more efficient approximation method of the free energy, which is the minus log marginal likelihood, even if the posterior distribution cannot be approximated by any normal distribution. (3) And the prior distributions optimized by the cross validation and the widely applicable information criterion are asymptotically equivalent to each other, which are different from that by the marginal likelihood.
翻译:在统计推论中,不确定是未知的,所有模型都是错的。一个人,如果统计模型和先前的分布同时意识到它们都是虚构的和虚拟的候选者。为了研究这类案例,已经制定了若干统计措施,例如交叉验证、信息标准和可能性很小,但是,当统计模型的不对称和超称化时,它们的数学特性还没有完全澄清。在本文中,我们采用了一个数学理论,即巴伊西亚统计的数学理论,以未知的不确定性为准,我们根据这些理论显示了交叉验证、信息标准和可能性很小的一般特性。衍生的理论认为,即使一个未知的不确定性无法被统计更多或虚拟化。为了研究这类案例,已经制定了若干统计措施,例如交叉验证、信息标准和可能性不大,因此,对于无法相信任何具体模型和之前的模型的人来说,其数学特性还没有完全澄清。在本文中,我们采用了一种比较精确的统计标准,即泛泛度损失,而不是基于其数学特性的交叉验证、信息标准和信息标准。(2) 自由能源的近似法是非边际的近似可能性,这种近似可能性是比亚的,即使先前的分布标准不能以任何最优化的比正正的分布,也是其他的准的,即使通过应用的分布为不同的标准,也是任何最差的分布为不同的,也不可能的。