Exponential random graph models, or ERGMs, are a flexible and general class of models for modeling dependent data. While the early literature has shown them to be powerful in capturing many network features of interest, recent work highlights difficulties related to the models' ill behavior, such as most of the probability mass being concentrated on a very small subset of the parameter space. This behavior limits both the applicability of an ERGM as a model for real data and inference and parameter estimation via the usual Markov chain Monte Carlo algorithms. To address this problem, we propose a new exponential family of models for random graphs that build on the standard ERGM framework. Specifically, we solve the problem of computational intractability and `degenerate' model behavior by an interpretable support restriction. We introduce a new parameter based on the graph-theoretic notion of degeneracy, a measure of sparsity whose value is commonly low in real-worlds networks. The new model family is supported on the sample space of graphs with bounded degeneracy and is called degeneracy-restricted ERGMs, or DERGMs for short. Since DERGMs generalize ERGMs -- the latter is obtained from the former by setting the degeneracy parameter to be maximal -- they inherit good theoretical properties, while at the same time place their mass more uniformly over realistic graphs. The support restriction allows the use of new (and fast) Monte Carlo methods for inference, thus making the models scalable and computationally tractable. We study various theoretical properties of DERGMs and illustrate how the support restriction improves the model behavior. We also present a fast Monte Carlo algorithm for parameter estimation that avoids many issues faced by Markov Chain Monte Carlo algorithms used for inference in ERGMs.
翻译:ExGM 模型或ERGM 模型是模拟依赖数据的一种灵活和一般的模型类型。 虽然早期文献显示这些模型在捕捉许多引起兴趣的网络特征方面的力量很强, 但最近的工作凸显了与模型不良行为有关的困难, 比如大部分概率质量集中在参数空间的一个非常小的子集上。 这种行为限制了ERGM作为真实数据的模型的适用性, 也限制了通过通常的 Markov 链的 Monte Carlo 算法进行推断和参数估计。 为了解决这个问题, 我们提出了一个新的随机图表的指数系列。 在标准ERGM 框架的基础上建立随机图表。 具体来说, 我们通过可解释的支持限制计算性与“ 下降性” 模型行为之间的问题。 我们引入了一个新的参数, 以图表- 模型变异性化概念概念概念为基础, 测量在现实世界网络中的价值通常很低。 新的模型群群群在以可约束性、 可变异性化的图表样本空间支持, 并且被称为变异性能支持。 在 ERGMDR 中, 快速地, 将 快速地变变变变变变现 。