Likelihood-free methods are useful for parameter estimation of complex models with intractable likelihood functions for which it is easy to simulate data. Such models are prevalent in many disciplines including genetics, biology, ecology and cosmology. Likelihood-free methods avoid explicit likelihood evaluation by finding parameter values of the model that generate data close to the observed data. The general consensus has been that it is most efficient to compare datasets on the basis of a low dimensional informative summary statistic, incurring information loss in favour of reduced dimensionality. More recently, researchers have explored various approaches for efficiently comparing empirical distributions in the likelihood-free context in an effort to avoid data summarisation. This article provides a review of these full data distance based approaches, and conducts the first comprehensive comparison of such methods, both qualitatively and empirically. We also conduct a substantive empirical comparison with summary statistic based likelihood-free methods. The discussion and results offer guidance to practitioners considering a likelihood-free approach. Whilst we find the best approach to be problem dependent, we also find that the full data distance based approaches are promising and warrant further development. We discuss some opportunities for future research in this space.
翻译:无隐性方法有助于对复杂模型进行参数估计,这些模型具有易于模拟数据的难以捉摸的可能功能。这些模型在许多学科中十分普遍,包括遗传学、生物学、生态学和宇宙学。无隐性方法通过寻找产生接近观察数据的数据模型的参数值而避免明显的可能性评估。一般的共识是,在低维信息性摘要统计的基础上比较数据集最为有效,造成信息损失,有利于减少维度。最近,研究人员探索了各种方法,有效比较无可能性环境中的经验分布,以努力避免数据对称。本文章审查了这些完全基于数据距离的方法,并且从质量上和实验上对此类方法进行了第一次全面比较。我们还对基于简要统计数据的无可能性方法进行了实质性经验比较。讨论和结果为考虑无可能性方法的从业人员提供了指导。我们发现,最佳方法取决于问题,但我们也发现,完全基于数据距离的方法很有希望,值得进一步发展。我们讨论了今后在空间进行研究的一些机会。