We consider the problem of modelling high-dimensional distributions and generating new examples of data with complex relational feature structure coherent with a graph skeleton. The model we propose tackles the problem of generating the data features constrained by the specific graph structure of each data point by splitting the task into two phases. In the first it models the distribution of features associated with the nodes of the given graph, in the second it complements the edge features conditionally on the node features. We follow the strategy of implicit distribution modelling via generative adversarial network (GAN) combined with permutation equivariant message passing architecture operating over the sets of nodes and edges. This enables generating the feature vectors of all the graph objects in one go (in 2 phases) as opposed to a much slower one-by-one generations of sequential models, prevents the need for expensive graph matching procedures usually needed for likelihood-based generative models, and uses efficiently the network capacity by being insensitive to the particular node ordering in the graph representation. To the best of our knowledge, this is the first method that models the feature distribution along the graph skeleton allowing for generations of annotated graphs with user specified structures. Our experiments demonstrate the ability of our model to learn complex structured distributions through quantitative evaluation over three annotated graph datasets.
翻译:我们考虑了模拟高维分布和产生具有复杂关系特征结构的数据新实例的问题,与图形骨架相协调。我们提议的模型通过将任务分为两个阶段,解决生成受每个数据点具体图形结构制约的数据特征的问题。首先,它模拟与给定图节点相关的特征分布,其次,它补充以节点特征为条件的边缘特征。我们采用通过基因对抗网络(GAN)进行隐性分布模型的战略,同时结合对等电文传递结构,在节点和边缘各组之间运行。这样可以生成所有图形对象的特征矢量,一个行(分两个阶段),一个行生成速度慢得多的顺序模型各代的特征矢量,避免对基于概率的变色模型通常需要的昂贵的图形匹配程序进行分配,并高效使用网络能力,对图形表达中的特殊节点排序保持敏感。我们最了解的是,这是第一个在图表骨架上制作特征分布模型的方法,允许几代相形(分两个阶段)生成一个附加说明的图表,而不是一个慢得多的顺序模型,从而通过用户结构结构结构进行我们的数据分析。我们用三个图表的能力进行数据评估。