Testing whether two graphs come from the same distribution is of interest in many real world scenarios, including brain network analysis. Under the random dot product graph model, the nonparametric hypothesis testing framework consists of embedding the graphs using the adjacency spectral embedding (ASE), followed by aligning the embeddings using the median flip heuristic, and finally applying the nonparametric maximum mean discrepancy (MMD) test to obtain a p-value. Using synthetic data generated from Drosophila brain networks, we show that the median flip heuristic results in an invalid test, and demonstrate that optimal transport Procrustes (OTP) for alignment resolves the invalidity. We further demonstrate that substituting the MMD test with multiscale graph correlation (MGC) test leads to a more powerful test both in synthetic and in simulated data. Lastly, we apply this valid and more powerful test to the right and left hemispheres of the larval Drosophila mushroom body brain networks, and conclude that there is not enough evidence to reject the null hypothesis that the two hemispheres are equally distributed.
翻译:在随机点数产品图表模型下,非参数假设测试框架包括使用相邻光谱嵌入(ASE)嵌入图形,然后使用中位翻动脂质调整嵌入,最后应用非参数最大平均值差异(MMD)测试以获得p-value。我们利用Drosophilla大脑网络生成的合成数据,显示中位翻转超热性结果无效测试,并证明最佳运输质谱(OTP)以对齐解决了无效性。我们进一步证明,以多尺度图形相关性(MGCC)测试取代MMD测试可以在合成和模拟数据中进行更强有力的测试。最后,我们将这一有效、更强大的测试应用到幼虫Droophilia蘑菇脑网络的右半球和左半球,并得出结论,没有足够证据否定两个半球分布均匀的假设。