In genome-wide epigenetic studies, exposures (e.g., Single Nucleotide Polymorphisms) affect outcomes (e.g., gene expression) through intermediate variables such as DNA methylation. Mediation analysis offers a way to study these intermediate variables and identify the presence or absence of causal mediation effects. Testing for mediation effects lead to a composite null hypothesis. Existing methods like the Sobel's test or the Max-P test are often underpowered because 1) statistical inference is often conducted based on distributions determined under a subset of the null and 2) they are not designed to shoulder the multiple testing burden. To tackle these issues, we introduce a technique called MLFDR (Mediation Analysis using Local False Discovery Rates) for high dimensional mediation analysis, which uses the local False Discovery Rates based on the coefficients of the structural equation model specifying the mediation relationship to construct a rejection region. We have shown theoretically as well as through simulation studies that in the high-dimensional setting, the new method of identifying the mediating variables controls the FDR asymptotically and performs better with respect to power than several existing methods such as DACT (Liu et al.)and JS-mixture (Dai et al).
翻译:在全基因组表观遗传学研究中,暴露因素(如单核苷酸多态性)通过DNA甲基化等中介变量影响结局变量(如基因表达)。中介分析为研究这些中介变量并识别因果中介效应的存在与否提供了方法。对中介效应的检验引出了一个复合零假设。现有方法如Sobel检验或Max-P检验往往检验效能不足,原因在于:1)统计推断通常基于零假设子集确定的分布进行;2)这些方法并非为承担多重检验负担而设计。为解决这些问题,我们提出了一种称为MLFDR(基于局部错误发现率的中介分析)的高维中介分析技术。该方法利用基于指定中介关系的结构方程模型系数的局部错误发现率来构建拒绝域。我们通过理论证明和模拟研究表明,在高维设定下,这种识别中介变量的新方法能渐近控制错误发现率,并且在检验效能方面优于DACT(Liu等人提出)和JS-mixture(Dai等人提出)等现有方法。