Multi-reference alignment (MRA) is the problem of recovering a signal from its multiple noisy copies, each acted upon by a random group element. MRA is mainly motivated by single-particle cryo-electron microscopy (cryo-EM) that has recently joined X-ray crystallography as one of the two leading technologies to reconstruct biological molecular structures. Previous papers have shown that in the high noise regime, the sample complexity of MRA and cryo-EM is $n=\omega(\sigma^{2d})$, where $n$ is the number of observations, $\sigma^2$ is the variance of the noise, and $d$ is the lowest-order moment of the observations that uniquely determines the signal. In particular, it was shown that in many cases, $d=3$ for generic signals, and thus the sample complexity is $n=\omega(\sigma^6)$. In this paper, we analyze the second moment of the MRA and cryo-EM models. First, we show that in both models the second moment determines the signal up to a set of unitary matrices, whose dimension is governed by the decomposition of the space of signals into irreducible representations of the group. Second, we derive sparsity conditions under which a signal can be recovered from the second moment, implying sample complexity of $n=\omega(\sigma^4)$. Notably, we show that the sample complexity of cryo-EM is $n=\omega(\sigma^4)$ if at most one third of the coefficients representing the molecular structure are non-zero; this bound is near-optimal. The analysis is based on tools from representation theory and algebraic geometry. We also derive bounds on recovering a sparse signal from its power spectrum, which is the main computational problem of X-ray crystallography.
翻译:多坐标校正(MRA) 是一个问题, 是如何从多盘杂音复制件中恢复信号, 每份由随机组元素执行。 MRA主要受单粒冷冻- 电子显微镜(cryo- EM)的驱动, 后者最近加入了X射线晶体学, 作为重建生物分子结构的两大主要技术之一。 前几篇文章显示, 在高噪音系统中, MRA 和冷冻- EM 的样本复杂性是 $@ omga (\\ sigma=2d}) 。 其中, 美元是观测的数量, $\ sigma4, 美元是噪音的差异, $dald$是观测的最小端点。 特别是, 在许多案例中, 用于普通信号的, $d=3$( sigma), 因此样本的复杂度是 $nçoomga (\ gmama) 和 colo- EM 变异度模型的第二个时刻, 如果我们在两个模型中, 将信号的第二个瞬间点确定接近于统一基质基质矩阵结构的第二信号, 。 其尺寸的内, 的数值是来自正位的图像的内, 度, 的数值是来自正值的数值值 。