Statistical modeling is a key component in the extraction of physical results from lattice field theory calculations. Although the general models used are often strongly motivated by physics, many model variations can frequently be considered for the same lattice data. Model averaging, which amounts to a probability-weighted average over all model variations, can incorporate systematic errors associated with model choice without being overly conservative. We discuss the framework of model averaging from the perspective of Bayesian statistics, and give useful formulae and approximations for the particular case of least-squares fitting, commonly used in modeling lattice results. In addition, we frame the common problem of data subset selection (e.g. choice of minimum and maximum time separation for fitting a two-point correlation function) as a model selection problem and study model averaging as a straightforward alternative to manual selection of fit ranges. Numerical examples involving both mock and real lattice data are given.
翻译:统计建模是从拉蒂斯实地理论计算中提取物理结果的一个关键组成部分。虽然所使用的一般模型往往以物理学为强烈动机,但许多模型变异往往可以考虑用于同一数据。模型平均,相当于所有模型变异的概率加权平均数,可以纳入与模型选择有关的系统错误,而不过分保守。我们从贝叶斯统计的角度讨论平均模型框架,并为最不整齐的特例提供有用的公式和近似值,这些特例通常用于模拟拉蒂斯结果。此外,我们把数据子集选择(例如,选择最低和最长时间分离以适应两点相关功能)这一共同问题作为模型选择问题和研究模型,平均作为人工选择合适范围的简单替代。我们给出了模拟和真实拉蒂斯数据的数字示例。