There has been increasing interest on summary-free versions of approximate Bayesian computation (ABC), which replace distances among summaries with discrepancies between the whole empirical distributions of the observed data and the synthetic samples generated under the proposed parameter values. The success of these solutions has motivated theoretical studies on the concentration properties of the induced posteriors. However, current results are often specific to the selected discrepancy, and mostly rely on existence arguments which are typically difficult to verify and provide bounds not readily interpretable. We address these issues via a novel bridge between the concept of Rademacher complexity and recent concentration theory for discrepancy-based ABC. This perspective yields a unified and interpretable theoretical framework that relates the concentration of ABC posteriors to the behavior of the Rademacher complexity associated to the chosen discrepancy in the broad class of integral probability semimetrics. This class extends summary-based ABC, and includes the widely-implemented Wasserstein distance and maximum mean discrepancy (MMD), which admit interpretable bounds for the corresponding Rademacher complexity along with constructive sufficient conditions for the existence of such bounds. Therefore, this unique bridge crucially contributes towards an improved understanding of ABC, as further clarified through a focus of this theory on the MMD setting and via an illustrative simulation.
翻译:对无摘要版本的巴伊西亚计算(ABC)的兴趣日益浓厚,这种计算取代了摘要之间的距离,而根据所观察到的数据和根据拟议参数值产生的合成样品的整个实证分布与所观察到的数据和合成样品之间的差异之间存在差异。这些解决办法的成功促使对引致的子孙的浓度特性进行了理论研究;然而,目前的结果往往是特定的差异所特有的,而且大多依赖通常难以核实和提供不易解释的界限的存在论据。我们通过Rademacher复杂程度概念和最近基于差异的ABC集中理论之间的新桥梁来解决这些问题。这种观点产生了一个统一和可解释的理论框架,将ABC后部的集中与拉迪马赫复杂程度的行为联系起来,这与所选择的整体概率半量值大类的差异有关。这一类别扩展了以摘要为基础的ABC,包括广泛执行的Wacerstein距离和最大中值差异(MD),它承认Rademacher复杂性的可解释界限,以及这种界限存在的建设性充分条件。因此,这一独特的理论有助于通过MBC进一步阐明对ABC的模拟理论的改进。