A class of models that have been widely used are the exponential random graph (ERG) models, which form a comprehensive family of models that include independent and dyadic edge models, Markov random graphs, and many other graph distributions, in addition to allow the inclusion of covariates that can lead to a better fit of the model. Another increasingly popular class of models in statistical network analysis are stochastic block models (SBMs). They can be used for the purpose of grouping nodes into communities or discovering and analyzing a latent structure of a network. The stochastic block model is a generative model for random graphs that tends to produce graphs containing subsets of nodes characterized by being connected to each other, called communities. Many researchers from various areas have been using computational tools to adjust these models without, however, analyzing their suitability for the data of the networks they are studying. The complexity involved in the estimation process and in the goodness-of-fit verification methodologies for these models can be factors that make the analysis of adequacy difficult and a possible discard of one model in favor of another. And it is clear that the results obtained through an inappropriate model can lead the researcher to very wrong conclusions about the phenomenon studied. The purpose of this work is to present a simple methodology, based on Hypothesis Tests, to verify if there is a model specification error for these two cases widely used in the literature to represent complex networks: the ERGM and the SBM. We believe that this tool can be very useful for those who want to use these models in a more careful way, verifying beforehand if the models are suitable for the data under study.
翻译:广泛使用的一组模型是指数随机图(ERG)模型,这些模型构成一个全面的模型组合,其中包括独立和双亚齐边缘模型、Markov随机图和许多其他图表分布,此外,还允许纳入可更好适应模型的共变模型。统计网络分析中日益受欢迎的另一类模型是随机区块模型(SBMs)。这些模型可用于将节点分组到社区或发现和分析网络的潜在结构。随机图块模型是一个随机图的基因化模型,这些模型往往产生含有以相互连接为特点的节点子组的图表集,这些社区被称作社区。来自不同地区的许多研究人员一直在使用计算工具来调整这些模型,而没有分析这些模型是否适合他们正在研究的网络数据。这些模型的复杂程度以及对这些模型的完善性核查方法可能是一些有用的因素,如果对是否充足性的分析更加困难,并且有可能对一种模型进行仔细研究,则有利于另一种模型。而且很显然,通过一种非常复杂的模型获得的结果,如果通过一种不适当的模型来进行核查,那么在一种基于模型的模型的研究中,这些模型中,这些是用于一种非常错误的模型研究的模型中的一种方法, 一种非常错误的模型研究。