Network data are increasingly available in various research fields, motivating statistical analysis for populations of networks where a network as a whole is viewed as a data point. Due to the non-Euclidean nature of networks, basic statistical tools available for scalar and vector data are no longer applicable when one aims to relate networks as outcomes to Euclidean covariates, while the study of how a network changes in dependence on covariates is often of paramount interest. This motivates to extend the notion of regression to the case of responses that are network data. Here we propose to adopt conditional Fr\'{e}chet means implemented with both global least squares regression and local weighted least squares smoothing, extending the Fr\'{e}chet regression concept to networks that are quantified by their graph Laplacians. The challenge is to characterize the space of graph Laplacians so as to justify the application of Fr\'{e}chet regression. This characterization then leads to asymptotic rates of convergence for the corresponding M-estimators by applying empirical process methods. We demonstrate the usefulness and good practical performance of the proposed framework with simulations and with network data arising from NYC taxi records, as well as resting-state fMRI in neuroimaging.
翻译:各个研究领域都越来越多地提供网络数据,为整个网络被视为数据点的网络人口提供统计分析。由于网络的非欧化性质,在将网络与欧化共变结果联系起来时,对于将网络与欧化共变结果联系起来时,对于将网络依赖共变数依赖性变化的研究往往具有极大的兴趣。这促使将回归的概念扩大到对网络数据答复的回归情况。我们在这里建议采用有条件的Fr\'{{e}chet,这意味着既在全球最小正方回归和局部加权最不平坦的平方之间执行,也可以将卡路里和矢量器数据的现有基本统计工具不再适用,将Fr\{e}推回归概念扩展至其图解的网络。挑战在于如何描述拉平梯图的空间,从而证明应用Fr\'{{e}chetretracations的回归是最重要的理由。然后,通过应用实验过程方法,使相应的M-Servictors的一致率达到微调率。我们展示了拟议框架的有用性和良好实际表现,因为模拟和网络产生的数据正在形成。