Stein importance sampling is a widely applicable technique based on kernelized Stein discrepancy, which corrects the output of approximate sampling algorithms by reweighting the empirical distribution of the samples. A general analysis of this technique is conducted for the previously unconsidered setting where samples are obtained via the simulation of a Markov chain, and applies to an arbitrary underlying Polish space. We prove that Stein importance sampling yields consistent estimators for quantities related to a target distribution of interest by using samples obtained from a geometrically ergodic Markov chain with a possibly unknown invariant measure that differs from the desired target. The approach is shown to be valid under conditions that are satisfied for a large number of unadjusted samplers, and is capable of retaining consistency when data subsampling is used. Along the way, a universal theory of reproducing Stein kernels is established, which enables the construction of kernelized Stein discrepancy on general Polish spaces, and provides sufficient conditions for kernels to be convergence-determining on such spaces. These results are of independent interest for the development of future methodology based on kernelized Stein discrepancies.
翻译:Stein 重要性取样是一种广泛应用的技术,其依据是内核化 Stein 差异,它通过对样品的经验分布进行重新加权,纠正了近似抽样算法的输出。对这一技术的一般分析是针对通过模拟Markov 链子获得样品的先前未考虑过的环境进行的,并适用于波兰任意的内在空间。我们证明,Stein 重要性取样通过使用从几何ERGodic Markov 链条取得的样品,对目标分布的相关数量得出一致的估计值,这些样品可能与预期目标不同,但可能未知的变异性测量值。在满足大量未经调整的采样器的条件下,该方法证明是有效的,在使用数据再抽样时能够保持一致性。此外,还建立了一种生产Stein内核的普遍理论,从而能够在波兰一般空间上建造内核化的石内核差异,并为内核内核在这种空间上形成趋同式确定足够条件。这些结果对于今后根据内核化的石质差异制定方法具有独立的兴趣。