In many fields of biomedical sciences, it is common that random variables are measured repeatedly across different subjects. In such a repeated measurement setting, dependence structures among random variables that are between subjects and within a subject may be different, and should be estimated differently. Ignoring this fact may lead to questionable or even erroneous scientific conclusions. In this paper, we study the problem of sparse and positive-definite estimation of between-subject and within-subject covariance matrices for high-dimensional repeated measurements. Our estimators are defined as solutions to convex optimization problems, which can be solved efficiently. We establish estimation error rate for our proposed estimators of the two target matrices, and demonstrate their favorable performance through theoretical analysis and comprehensive simulation studies. We further apply our methods to recover two covariance graphs of clinical variables from hemodialysis patients.
翻译:在许多生物医学领域中,随机变量常常在不同的受试者之间和同一受试者之间重复测量。在这种重复测量的情况下,不同受试者之间和同一受试者之间的随机变量之间的依赖结构可能是不同的,应该分别进行估计。忽略这样的事实可能导致可疑甚至错误的科学结论。在本文中,我们研究了对于高维重复测量的受试者间和同一受试者间协方差矩阵进行稀疏而正定估计的问题。我们的估计器被定义为凸优化问题的解,可以高效地求解。我们为两个目标矩阵的提议估计器建立了估计误差率,并通过理论分析和全面的模拟研究展示了它们的良好性能。我们进一步应用了我们的方法从血透患者的临床变量中恢复了两个协方差图。