Mendelian randomization (MR) is a statistical method exploiting genetic variants as instrumental variables to estimate the causal effect of modifiable risk factors on an outcome of interest. Despite wide uses of various popular two-sample MR methods based on genome-wide association study summary level data, however, those methods could suffer from potential power loss or/and biased inference when the chosen genetic variants are in linkage disequilibrium (LD), and have relatively large direct effects on the outcome whose distribution might be heavy-tailed which is commonly referred to as the idiosyncratic pleiotropy. To resolve those two issues, we propose a novel Robust Bayesian Mendelian Randomization (RBMR) model that uses the more robust multivariate generalized $t$-distribution to model such direct effects in a probabilistic model framework which can also incorporate the LD structure explicitly. The generalized $t$-distribution can be represented as a Gaussian scaled mixture so that our model parameters can be estimated by the EM-type algorithms. We compute the standard errors by calibrating the evidence lower bound (ELBO) using the likelihood ratio test. Through extensive simulation studies, we show that our RBMR has robust performance compared to other competing methods. We also apply our RBMR method to two benchmark data sets and find that RBMR has smaller bias and standard errors. Using our proposed RBMR method, we found that coronary artery disease (CAD) is associated with increased risk of coronavirus disease 2019 (COVID-19). We also develop a user-friendly R package RBMR for public use.
翻译:Mendelian随机化(MR)是一种统计方法,它利用基因变异体作为工具变量,评估可变风险因素对某种利益结果的因果关系。尽管广泛使用基于全基因组协会研究摘要水平数据的各种流行的双模MR方法,但当所选遗传变体处于联系不均度(LD)时,这些方法可能会受到潜在的权力损失或/和偏颇的推断。 通用美元分配制可以被表现为高尔斯比例化的混合物,这样我们的模型参数通常被称为特殊合成性脾脏。为了解决这两个问题,我们建议采用新型Robust Bayesian Mendelian 随机化(RBMR)模式,采用更强的多变通用美元分配模式,以模拟这种直接效应,这种模型也可以明确纳入LD结构。通用美元分配制可以被表现为高尔斯比例化的混合物,这样,我们的模式参数可以被通常称为EM型算法。我们通过校正标准错误来计算标准错误。我们通过校正的 RBMR(EBR) 也使用较强的测试方法,我们使用另一种方法来测试我们的标准性测试我们的标准。