Low-dimensional embeddings, from classical spectral embeddings to modern neural-net-inspired methods, are a cornerstone in the modeling and analysis of complex networks. Recent work by Seshadhri et al. (PNAS 2020) suggests that such embeddings cannot capture local structure arising in complex networks. In particular, they show that any network generated from a natural low-dimensional model cannot be both sparse and have high triangle density (high clustering coefficient), two hallmark properties of many real-world networks. In this work we show that the results of Seshadhri et al. are intimately connected to the model they use rather than the low-dimensional structure of complex networks. Specifically, we prove that a minor relaxation of their model can generate sparse graphs with high triangle density. Surprisingly, we show that this same model leads to exact low-dimensional factorizations of many real-world networks. We give a simple algorithm based on logistic principal component analysis (LPCA) that succeeds in finding such exact embeddings. Finally, we perform a large number of experiments that verify the ability of very low-dimensional embeddings to capture local structure in real-world networks.
翻译:从古典光谱嵌入到现代神经网激励方法的低维嵌入器是复杂网络模型和分析的基石。Seshadhri等人(PNAS 2020)最近的工作表明,这种嵌入器无法捕捉复杂网络中产生的本地结构。特别是,它们表明,自然低维模型所产生的任何网络不可能既稀少,也不能同时具有高三角密度(高集聚系数),这是许多现实世界网络的两个标志性特征。在这项工作中,我们显示Seshadhri等人的结果与其使用的模型而不是复杂网络的低维结构密切相关。具体地说,我们证明,稍稍放松其模型可以生成高三角密度的稀薄图。令人惊讶的是,我们显示,这种模型可以导致许多真实世界网络的精确的低维因子化。我们给出了一个基于后勤本要素分析(LPCA)的简单算法,它能够成功找到这种精确嵌入器。最后,我们进行了大量实验,以核实非常低维嵌入器在现实世界网络中采集本地结构的能力。