We study a class of orbit recovery problems in which we observe independent copies of an unknown element of $\mathbb{R}^p$, each linearly acted upon by a random element of some group (such as $\mathbb{Z}/p$ or $\mathrm{SO}(3)$) and then corrupted by additive Gaussian noise. We prove matching upper and lower bounds on the number of samples required to approximately recover the group orbit of this unknown element with high probability. These bounds, based on quantitative techniques in invariant theory, give a precise correspondence between the statistical difficulty of the estimation problem and algebraic properties of the group. Furthermore, we give computer-assisted procedures to certify these properties that are computationally efficient in many cases of interest. The model is motivated by geometric problems in signal processing, computer vision, and structural biology, and applies to the reconstruction problem in cryo-electron microscopy (cryo-EM), a problem of significant practical interest. Our results allow us to verify (for a given problem size) that if cryo-EM images are corrupted by noise with variance $\sigma^2$, the number of images required to recover the molecule structure scales as $\sigma^6$. We match this bound with a novel (albeit computationally expensive) algorithm for ab initio reconstruction in cryo-EM, based on invariant features of degree at most 3. We further discuss how to recover multiple molecular structures from mixed (or heterogeneous) cryo-EM samples.
翻译:我们研究了一个轨道回收问题类别,在这个类别中,我们观测一个未知元素($mathbb{R ⁇ p$)的独立副本,每个未知元素($mathbb{p$/p$或$mathrm{SO}(3)美元)由某组的随机元素(例如$mathbb}/p$或$gmathrm{SO}(3)美元)进行线性行动,然后被添加高森噪音腐蚀。我们证明,在大约恢复这一未知元素的组间轨道所需的样本数量上下限是相匹配的,概率很高。根据变数理论的定量技术,这些界限在估计问题的统计困难和该组的变数特性之间提供了精确的对应。此外,我们提供了计算机辅助程序来验证这些特性,这些特性在许多利益中具有计算效率。模型的动机是信号处理、计算机视觉和结构生物学中的几何测度问题,并适用于冷冻电子显微镜(cryo-EM)的重建问题。我们的结果使我们能够进一步核实(对于一个问题的规模来说)如果冷色-EM图像由于噪音与最大程度的变异度结构而损坏,需要如何重新进行。