The problem of testing the equality of the generating processes of two categorical time series is addressed in this work. To this aim, we propose three tests relying on a dissimilarity measure between categorical processes. Particular versions of these tests are constructed by considering three specific distances evaluating discrepancy between the marginal distributions and the serial dependence patterns of both processes. Proper estimates of these dissimilarities are an essential element of the constructed tests, which are based on the bootstrap. Specifically, a parametric bootstrap method assuming the true generating models and extensions of the moving blocks bootstrap and the stationary bootstrap are considered. The approaches are assessed in a broad simulation study including several types of categorical models with different degrees of complexity. Advantages and disadvantages of each one of the methods are properly discussed according to their behavior under the null and the alternative hypothesis. The impact that some important input parameters have on the results of the tests is also analyzed. An application involving biological sequences highlights the usefulness of the proposed techniques.
翻译:本文探讨了两个分类时间序列生成过程相等性检验的问题。为此,我们提出了三种基于分类过程之间的距离度量的检验方法。文中分别考虑了三个特定的距离度量版本,用于评估两个过程的边缘分布和串行依赖模式之间的差异。这些检验方法的构建需要正确估计这些差异度量,并以自助法为基础。具体而言,我们考虑了一个基于真实生成模型的参数自助法,以及移动块自助法和稳定自助法的扩展版本。通过广泛的模拟研究,包括不同复杂度的多种分类模型,来评估这些方法的表现。根据这些方法在零假设和备择假设下的行为,适当讨论了各自的优缺点。还分析了某些重要输入参数对测试结果的影响。最后,通过涉及生物序列的应用,展示了所提出的技术的实用性。