The direct Gaussian copula model with discrete marginal distributions is an appealing data-analytic tool but poses difficult computational challenges due to its intractable likelihood. A number of approximations/surrogates for the likelihood have been proposed, including the continuous extension-based approximation (CE) and the distributional transform-based approximation (DT). The continuous extension approach is exact up to Monte Carlo error but does not scale well computationally. The distributional transform approach permits efficient computation but offers no theoretical guarantee that it is exact. In practice, though, the distributional transform-based approximate likelihood is so very nearly exact for some variants of the model as to permit genuine maximum likelihood or Bayesian inference. We demonstrate the exactness of the distributional transform-based objective function for two interesting variants of the model, and propose a quantity that can be used to assess exactness for experimentally observed datasets. Said diagnostic will permit practitioners to determine whether genuine Bayesian inference or ordinary maximum likelihood inference using the DT-based likelihood is possible for a given dataset.
翻译:带有离散边际分布的直接高山千叶模型是一个有吸引力的数据分析工具,但由于其难以捉摸的可能性而给计算带来困难的挑战。已经为这一可能性提出了一些近似值/代谢值,包括连续的扩展近似值(CE)和分布式变异近似值(DT)。连续的扩展方法精确到蒙特卡洛错误,但没有精确的计算。分布式变换方法允许有效计算,但没有提供准确性的理论保证。但实际上,对于模型的某些变异体而言,分布式变异的近似可能性非常接近于允许真正的最大可能性或巴耶斯推断。我们展示了该模型两个有趣的变异体基于分布式变异的精确性,并提出了可用于评估实验性观察数据集准确性的数量。诊断将使执业者能够确定在给定数据集时使用基于DT的可能性是否真实地推断或普通最大可能性。