We study the fundamental limits for reconstruction in weighted graph (or matrix) database alignment. We consider a model of two graphs where $\pi^*$ is a planted uniform permutation and all pairs of edge weights $(A_{i,j}, B_{\pi^*(i),\pi^*(j)})_{1 \leq i<j \leq n}$ are i.i.d. pairs of Gaussian variables with zero mean, unit variance and correlation parameter $\rho \in [0,1]$. We prove that there is a sharp threshold for exact recovery of $\pi^*$: if $n \rho^2 \geq (4+\epsilon) \log n + \omega(1)$ for some $\epsilon>0$, there is an estimator $\hat{\pi}$ -- namely the MAP estimator -- based on the observation of databases $A,B$ that achieves exact reconstruction with high probability. Conversely, if $n \rho^2 \leq 4 \log n - \log \log n - \omega(1)$, then any estimator $\hat{\pi}$ verifies $\hat{\pi}=\pi$ with probability $o(1)$. This result shows that the information-theoretic threshold for exact recovery is the same as the one obtained for detection in a recent work by Wu et al. (2020): in other words, for Gaussian weighted graph alignment, the problem of reconstruction is not more difficult than that of detection. Though the reconstruction task was already well understood for vector-shaped database alignment (that is taking signal of the form $(u_i, v_{\pi^*(i)})_{1 \leq i\leq n}$ where $(u_i, v_{\pi^*(i)})$ are i.i.d. pairs in $\mathbb{R}^{d_u} \times \mathbb{R}^{d_v}$), its formulation for graph (or matrix) databases brings a drastically different problem for which the hard phase is conjectured to be wide. The proofs build upon the analysis of the MAP estimator and the second moment method, together with the study of the correlation structure of energies of permutations.
翻译:我们用加权图形( 或矩阵) 数据库对齐来研究重建的基本限值 。 我们考虑两个图表的模型, 其中$\ p% 是一个固定的一致调整值, 而所有对齐的边权加权值 $( A ⁇ i, j}, B ⁇ pi ⁇ ( i),\\\ pi} (j)\\\\ leq i < i < j\\ leq n} i. d. 由零平均值、 单位差异和相关参数 $[ 0, 1美元] 。 我们证明有一个精确恢复$( pion) 的亮点 : 如果 $\ rh% 2\ g ( 4 ⁇ ) 和所有边端权重的亮度重量 $( 4\ eepsil) (log n= = 美元) 美元, 那么对于恢复结果的估测结果 $( 美元) 是相同的 。