We consider the problem of recovering the three-dimensional atomic structure of a flexible macromolecule from a heterogeneous cryo-EM dataset. The dataset contains noisy tomographic projections of the electrostatic potential of the macromolecule, taken from different viewing directions, and in the heterogeneous case, each image corresponds to a different conformation of the macromolecule. Under the assumption that the macromolecule can be modelled as a chain, or discrete curve (as it is for instance the case for a protein backbone with a single chain of amino-acids), we introduce a method to estimate the deformation of the atomic model with respect to a given conformation, which is assumed to be known a priori. Our method consists on estimating the torsion and bond angles of the atomic model in each conformation as a linear combination of the eigenfunctions of the Laplace operator in the manifold of conformations. These eigenfunctions can be approximated by means of a well-known technique in manifold learning, based on the construction of a graph Laplacian using the cryo-EM dataset. Finally, we test our approach with synthetic datasets, for which we recover the atomic model of two-dimensional and three-dimensional flexible structures from noisy tomographic projections.
翻译:我们考虑从一个混杂的冷冻-EM数据集中恢复一个灵活大型分子三维原子结构的问题。该数据集包含从不同查看方向对一个大型分子的电流潜力进行杂乱的图像预测,在各种情况中,每个图像都与一个大分子不同。根据一个假设,即大型分子可以仿制成一个链条,或离散曲线(例如,一个具有单一氨基酸链的蛋白脊柱),我们采用了一种方法来估计原子模型在某种特定符合性方面的变形,假定该符合性是已知的。我们的方法是估计每个符合性模型的振动和粘合角度,作为拉贝操作员在多个相符合性体中的脑功能的线性组合。这些二元元元元元元元元元元元元元元元元可以通过一种众所周知的多重学习技术加以比较,其基础是用加密-EM-光度模型构建一个图表模型,而该模型假定是一个前置的序。我们的方法是估算每个符合原子模型的原子模型的透度和连接角度数据,最后我们测试了三维的合成数据,我们用三维的合成数据,从合成的合成数据,测试了三维数据。