Computational efficient evaluation of penalized estimators of multivariate exponential family distributions is sought. These distributions encompass among others Markov random fields with variates of mixed type (e.g. binary and continuous) as special case of interest. The model parameter is estimated by maximization of the pseudo-likelihood augmented with a convex penalty. The estimator is shown to be consistent. With a world of multi-core computers in mind, a computationally efficient parallel Newton-Raphson algorithm is presented for numerical evaluation of the estimator alongside conditions for its convergence. Parallelization comprises the division of the parameter vector into subvectors that are estimated simultaneously and subsequently aggregated to form an estimate of the original parameter. This approach may also enable efficient numerical evaluation of other high-dimensional estimators. The performance of the proposed estimator and algorithm are evaluated and compared in a simulation study. Finally, the paper concludes with an illustration of the presented methodology in the reconstruction of the conditional independence network from data of an integrative omics study.
翻译:对多变量指数家庭分布的受罚估计值进行高效的计算性评估,这些分布包括作为特殊感兴趣的特例的混合类型(例如二进制和连续)的Markov随机字段,其中含有不同变量(例如二进制和连续型),模型参数通过最大限度地增加假象加封状法来估计。估计值显示是一致的。考虑到一个多核心计算机的世界,为了对估计值进行数字评估,提出了一种具有计算效率的平行牛顿-拉夫森算法,同时提供其趋同的条件。平行化包括将参数矢量分解为子变量,同时估算并随后汇总,以形成原始参数的估计数。这个方法还有助于对其他高维估计值者进行有效的数字评估。在模拟研究中评估和比较了拟议的估计值和算法的性能。最后,文件最后用一个说明,说明在从综合生态研究的数据中重建有条件的独立网络时所提出的方法。