Linear regression on network-linked observations has been an essential tool in modeling the relationship between response and covariates with additional network structures. Previous methods either lack inference tools or rely on restrictive assumptions on social effects and usually assume that networks are observed without errors. This paper proposes a regression model with nonparametric network effects. The model does not assume that the relational data or network structure is exactly observed and can be provably robust to network perturbations. Asymptotic inference framework is established under a general requirement of the network observational errors, and the robustness of this method is studied in the specific setting when the errors come from random network models. We discover a phase-transition phenomenon of the inference validity concerning the network density when no prior knowledge of the network model is available while also showing a significant improvement achieved by knowing the network model. Simulation studies are conducted to verify these theoretical results and demonstrate the advantage of the proposed method over existing work in terms of accuracy and computational efficiency under different data-generating models. The method is then applied to middle school students' network data to study the effectiveness of educational workshops in reducing school conflicts.
翻译:网络联系观测的线性回归是模拟反应和网络结构共同变化之间关系的一个必要工具。以前的方法要么缺乏推断工具,要么依赖关于社会效应的限制性假设,通常假设网络不受误差的观察。本文提出一个具有非参数网络效应的回归模型。模型并不假定关系数据或网络结构得到准确的观察,而且对网络扰动可能具有稳健性。根据网络观测错误的一般要求,建立了抗药性推断框架。当错误来自随机网络模型时,在特定环境中对这种方法的稳健性进行了研究。我们发现,当以前没有关于网络模型的知识时,网络密度的推断是分阶段过渡性的,同时也表明通过了解网络模型取得了显著的改进。进行了模拟研究,以核实这些理论结果,并表明拟议方法对不同数据生成模型下的现有工作在准确性和计算效率方面的优势。然后将这一方法应用于中学生网络数据,以研究教育讲习班在减少学校冲突方面的有效性。