用于简化 BART 版本的混合时间下端圆环 (A Mixing Time Lower Bound for a Simplified Version of BART)

Bayesian Additive Regression Trees (BART) is a popular Bayesian non-parametric regression algorithm. The posterior is a distribution over sums of decision trees, and predictions are made by averaging approximate samples from the posterior. The combination of strong predictive performance and the ability to provide uncertainty measures has led BART to be commonly used in the social sciences, biostatistics, and causal inference. BART uses Markov Chain Monte Carlo (MCMC) to obtain approximate posterior samples over a parameterized space of sums of trees, but it has often been observed that the chains are slow to mix. In this paper, we provide the first lower bound on the mixing time for a simplified version of BART in which we reduce the sum to a single tree and use a subset of the possible moves for the MCMC proposal distribution. Our lower bound for the mixing time grows exponentially with the number of data points. Inspired by this new connection between the mixing time and the number of data points, we perform rigorous simulations on BART. We show qualitatively that BART's mixing time increases with the number of data points. The slow mixing time of the simplified BART suggests a large variation between different runs of the simplified BART algorithm and a similar large variation is known for BART in the literature. This large variation could result in a lack of stability in the models, predictions, and posterior intervals obtained from the BART MCMC samples. Our lower bound and simulations suggest increasing the number of chains with the number of data points.

翻译：Bayesian Additive Regrestition 树(BART)是一种流行的Bayesian非参数回归法(BART ) 。后端是决定树数量之间的分布, 预测是通过平均从后端采集的近似样本作出的。强大的预测性能和提供不确定性措施的能力相结合, 使BART在社会科学、生物统计学和因果关系方面普遍使用。 BART 使用Markov 链链 Monte Carlo( MC ) 在树的参数化比例范围内获取近似后方样本, 但经常观察到链链的混合速度缓慢。在本文中, 我们为BART的简化版本提供了第一个更低的混合时间框架, 简化后一树的组合, 并使用一系列可能的组合来进行 MMC 提案的分布。我们的混合时间范围随着数据点的混合时间与数据数量之间的这种新联系, 我们对BARRT 进行了严格的模拟。我们从质量上显示, BAR 将时间与ART 的快速的模型和已知的模型之间的大量数据变异变。