We revisit the Bayesian Context Trees (BCT) modelling framework for discrete time series, which was recently found to be very effective in numerous tasks including model selection, estimation and prediction. A novel representation of the induced posterior distribution on model space is derived in terms of a simple branching process, and several consequences of this are explored in theory and in practice. First, it is shown that the branching process representation leads to a simple variable-dimensional Monte Carlo sampler for the joint posterior distribution on models and parameters, which can efficiently produce independent samples. This sampler is found to be more efficient than earlier MCMC samplers for the same tasks. Then, the branching process representation is used to establish the asymptotic consistency of the BCT posterior, including the derivation of an almost-sure convergence rate. Finally, an extensive study is carried out on the performance of the induced Bayesian entropy estimator. Its utility is illustrated through both simulation experiments and real-world applications, where it is found to outperform several state-of-the-art methods.
翻译:我们重新审视了最近在多个任务中包括模型选择,估计和预测方面被发现非常有效的离散时间序列贝叶斯上下文树(BCT)建模框架。我们推导了引起的模型空间后验分布的新颖表示形式,用一个简单的分枝过程的形式进行表示,并在理论和实践中探讨了这个表示形式的几个结果。首先,我们证明了分枝过程表示形式可以导出一个简单的可变维度蒙特卡洛采样器,用于生成模型和参数的联合后验分布,可以高效地产生独立样本。我们发现,与早期的MCMC采样器相比,这个采样器更加高效。然后,我们使用分支过程表示法来建立BCT后验的渐近一致性,包括导出几乎肯定的收敛速率。最后,我们对引起的贝叶斯熵估计器的性能进行了广泛的研究。通过模拟实验和实际应用程序,我们证明其实用性,并且发现其优于一些最先进的方法。