高维多变量推断中的挑战和机会 (Challenges and Opportunities in High-dimensional Variational Inference)

Current black-box variational inference (BBVI) methods require the user to make numerous design choices -- such as the selection of variational objective and approximating family -- yet there is little principled guidance on how to do so. We develop a conceptual framework and set of experimental tools to understand the effects of these choices, which we leverage to propose best practices for maximizing posterior approximation accuracy. Our approach is based on studying the pre-asymptotic tail behavior of the density ratios between the joint distribution and the variational approximation, then exploiting insights and tools from the importance sampling literature. Our framework and supporting experiments help to distinguish between the behavior of BBVI methods for approximating low-dimensional versus moderate-to-high-dimensional posteriors. In the latter case, we show that mass-covering variational objectives are difficult to optimize and do not improve accuracy, but flexible variational families can improve accuracy and the effectiveness of importance sampling -- at the cost of additional optimization challenges. Therefore, for moderate-to-high-dimensional posteriors we recommend using the (mode-seeking) exclusive KL divergence since it is the easiest to optimize, and improving the variational family or using model parameter transformations to make the posterior and optimal variational approximation more similar. On the other hand, in low-dimensional settings, we show that heavy-tailed variational families and mass-covering divergences are effective and can increase the chances that the approximation can be improved by importance sampling.

翻译：目前黑箱变异推断方法(BBVI)要求用户做出许多设计选择,例如选择变异目标和近似家庭,但对于如何做到这一点,我们没有多少原则性指导。我们开发了一个概念框架和一套实验工具来理解这些选择的影响,我们利用这些框架和一系列实验工具来提出最大限度地提高后游近准确性的最佳做法。我们的方法是研究联合分布和变差近似之间密度比率的预防性尾部行为,然后利用重要抽样文献的洞察和工具。我们的框架和支持实验有助于区分BBBVI在接近低维相对于中高度后代后代后代环境方面的行为。在后一种情况下,我们表明大规模扩大变异性目标很难优化,不会提高准确性,但灵活的变异性家庭可以提高重要取样的准确性和有效性 -- -- 其代价是额外的优化挑战。因此,我们建议使用(模式寻求)独家可分立的KL质量定比值定值方法,从而区分BBBVI在接近低维度和中位后代后代为最容易优化的变异性,通过最优化的或最优化的更优化的更优化的更优化的更优化的更优化的更优化的更优化的更近光度显示。