Despite the ubiquity of U-statistics in modern Probability and Statistics, their non-asymptotic analysis in a dependent framework may have been overlooked. In a recent work, a new concentration inequality for U-statistics of order two for uniformly ergodic Markov chains has been proved. In this paper, we put this theoretical breakthrough into action by pushing further the current state of knowledge in three different active fields of research. First, we establish a new exponential inequality for the estimation of spectra of trace class integral operators with MCMC methods. The novelty is that this result holds for kernels with positive and negative eigenvalues, which is new as far as we know. In addition, we investigate generalization performance of online algorithms working with pairwise loss functions and Markov chain samples. We provide an online-to-batch conversion result by showing how we can extract a low risk hypothesis from the sequence of hypotheses generated by any online learner. We finally give a non-asymptotic analysis of a goodness-of-fit test on the density of the invariant measure of a Markov chain. We identify some classes of alternatives over which our test based on the $L_2$ distance has a prescribed power.
翻译:尽管在现代概率和统计中,U-统计学是普遍存在的,但在现代概率和统计中,U-统计学在依赖性框架内的非抽取性分析可能被忽视。在最近的一项工作中,U-统计学在统一ERgodidic Markov 链条方面出现了新的二号顺序的集中不平等。在本文件中,我们将这一理论突破付诸行动,在三个不同的活跃研究领域进一步推进目前的知识状态。首先,我们为利用MCMCM方法估计微量级整体操作者的频谱建立了新的指数性不平等。新颖之处是,这一结果对具有正值和负值的内核,就我们所知,这是新的。此外,我们还调查了双向损失函数和Markov 链样的在线算法的通用性表现。我们通过展示我们如何从任何在线学习者产生的假设序列中获取低风险假设。我们最后对基于马可夫2号远程电路段的耐久度度度测量值的内核值测试进行了非抽取性测试。