Concentration inequality for U-statistics of order two for uniformly ergodic Markov chains, and applications

From arXiv. In this first revised version, we improve our main result by requiring only uniform ergodicity of the Markov chain (i.e., we drop the strong aperiodicity assumption), and we correct an error in the previous version regarding the use of the Talagrand inequality.

We prove a new concentration inequality for U-statistics of order two for uniformly ergodic Markov chains. Working with bounded $\pi$-canonical kernels, we show that we can recover the convergence rate of Arcones and Giné (1993), who proved a concentration result for U-statistics of independent random variables and canonical kernels. Our proof relies on an inductive analysis that uses martingale techniques, uniform ergodicity, Nummelin splitting, and a Bernstein-type inequality in which the spectral gap of the chain emerges. Our result supports three applications. First, we establish a new exponential inequality for the estimation of spectra of trace-class integral operators with MCMC methods. As far as we know, this is the first such result that holds for kernels with both positive and negative eigenvalues. Second, we investigate the generalization performance of online algorithms working with pairwise loss functions and Markov chain samples. We provide an online-to-batch conversion result by showing how to extract a low-risk hypothesis from the sequence of hypotheses generated by any online learner. Finally, we give a non-asymptotic analysis of a goodness-of-fit test on the density of the invariant measure of a Markov chain. We identify the classes of alternatives over which our test, based on the $L_2$ distance, has a prescribed power.
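
For orientation, the objects named in the abstract can be written out in standard textbook notation. This is a sketch only: the symbols $h$, $U_n$ and the normalization are assumptions made here, and the paper itself may use a different normalization or index-dependent kernels. Given a chain $X_1, \dots, X_n$ with invariant measure $\pi$, a U-statistic of order two with kernel $h$ is

$$
U_n = \frac{2}{n(n-1)} \sum_{1 \le i < j \le n} h(X_i, X_j),
$$

and a kernel is usually called $\pi$-canonical when it integrates to zero against $\pi$ in each argument separately:

$$
\int h(x, y)\, d\pi(y) = 0 \ \text{ for all } x, \qquad \int h(x, y)\, d\pi(x) = 0 \ \text{ for all } y.
$$

For a symmetric kernel the two conditions coincide. As a simple instance, $h(x, y) = f(x) f(y)$ is $\pi$-canonical whenever $\int f \, d\pi = 0$.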



Markov chain


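A Markov chain, named after Andrey Markov (A. A. Markov, 1856–1922), is a discrete-event stochastic process with the Markov property. In such a process, given the present knowledge or information, the past (the history of states before the present) is irrelevant for predicting the future (the states after the present). At each step of a Markov chain, the system either moves from one state to another according to a probability distribution or stays in its current state. A change of state is called a transition, and the probabilities associated with the different state changes are called transition probabilities. A random walk is an example of a Markov chain: the state at each step is a vertex of a graph, and each step may move to any adjacent vertex, with every adjacent vertex equally likely regardless of the path taken so far.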
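
The random walk described above can be sketched in a few lines of Python. This is a minimal illustration, not code from the page or the paper: the function names `random_walk` and `transition_probabilities`, the 4-cycle graph, and the seed are all hypothetical choices, and the graph is assumed to be simple (no repeated neighbors).

```python
import random


def transition_probabilities(adjacency):
    """Uniform transition probabilities: each neighbor is equally likely."""
    return {
        state: {nbr: 1.0 / len(nbrs) for nbr in nbrs}
        for state, nbrs in adjacency.items()
    }


def random_walk(adjacency, start, steps, seed=None):
    """Simulate a random walk; the next state depends only on the current one."""
    rng = random.Random(seed)
    state = start
    path = [state]
    for _ in range(steps):
        # Markov property: only `state` is consulted, never the earlier path.
        state = rng.choice(adjacency[state])
        path.append(state)
    return path


# Hypothetical example graph: a cycle on four vertices.
cycle = {0: [1, 3], 1: [0, 2], 2: [1, 3], 3: [2, 0]}
probs = transition_probabilities(cycle)
path = random_walk(cycle, start=0, steps=10, seed=0)
print(path)
```

Each row of `probs` sums to one, and every consecutive pair in `path` is an edge of the graph, which is exactly the transition structure the paragraph describes.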
