对称网络主要构件分析的合并式CP分解 (A Coupled CP Decomposition for Principal Components Analysis of Symmetric Networks)

In a number of application domains, one observes a sequence of network data; for example, repeated measurements between users interactions in social media platforms, financial correlation networks over time, or across subjects, as in multi-subject studies of brain connectivity. One way to analyze such data is by stacking networks into a third-order array or tensor. We propose a principal components analysis (PCA) framework for sequence network data, based on a novel decomposition for semi-symmetric tensors. We derive efficient algorithms for computing our proposed "Coupled CP" decomposition and establish estimation consistency of our approach under an analogue of the spiked covariance model with rates the same as the matrix case up to a logarithmic term. Our framework inherits many of the strengths of classical PCA and is suitable for a wide range of unsupervised learning tasks, including identifying principal networks, isolating meaningful changepoints or outliers across observations, and for characterizing the "variability network" of the most varying edges. Finally, we demonstrate the effectiveness of our proposal on simulated data and on examples from political science and financial economics. The proof techniques used to establish our main consistency results are surprisingly straight-forward and may find use in a variety of other matrix and tensor decomposition problems.

翻译：在一系列应用领域,人们观察一系列网络数据;例如,在社交媒体平台、金融关联网络或不同学科用户之间互动的反复测量,如对大脑连接的多科目研究中,在时间上或不同学科之间反复测量用户在社交媒体平台、金融关联网络中的相互作用。分析这些数据的一种方法是将网络堆叠成三阶阵列或高压。我们建议了一个主要组成部分分析框架,用于对半对称温度进行序列数据分析。我们得出高效的算法,用于计算我们提议的“混合式CP”分解,并估算我们方法的一致性,在快速变异模型模拟模型中,其比率与矩阵案例相同,直至对数术语。我们的框架继承了经典五氯苯的许多长处,适合广泛的非超常的学习任务,包括确定主要网络,将有意义的变异点或异点隔开,以及确定最不同边缘的“变异性网络”的特性。最后,我们展示我们关于模拟数据的建议的有效性,以及政治学和金融学和经济学问题实例的相似性。我们使用的证据性矩阵可以令人惊讶地确定我们的主要一致性。

相关内容

PCA

关注 3

在统计中，主成分分析（PCA）是一种通过最大化每个维度的方差来将较高维度空间中的数据投影到较低维度空间中的方法。给定二维，三维或更高维空间中的点集合，可以将“最佳拟合”线定义为最小化从点到线的平均平方距离的线。可以从垂直于第一条直线的方向类似地选择下一条最佳拟合线。重复此过程会产生一个正交的基础，其中数据的不同单个维度是不相关的。这些基向量称为主成分。

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日