Recent breakthroughs in high resolution imaging of biomolecules in solution with cryo-electron microscopy (cryo-EM) have unlocked new doors for the reconstruction of molecular volumes, thereby promising further advances in biology, chemistry, and pharmacological research amongst others. Despite significant headway, the immense challenges in cryo-EM data analysis remain legion and intricately inter-disciplinary in nature, requiring insights from physicists, structural biologists, computer scientists, statisticians, and applied mathematicians. Meanwhile, recent next-generation volume reconstruction algorithms that combine generative modeling with end-to-end unsupervised deep learning techniques have shown promising results on simulated data, but still face considerable hurdles when applied to experimental cryo-EM images. In light of the proliferation of such methods and given the interdisciplinary nature of the task, we propose here a critical review of recent advances in the field of deep generative modeling for high resolution cryo-EM volume reconstruction. The present review aims to (i) compare and contrast these new methods, while (ii) presenting them from a perspective and using terminology familiar to scientists in each of the five aforementioned fields with no specific background in cryo-EM. The review begins with an introduction to the mathematical and computational challenges of deep generative models for cryo-EM volume reconstruction, along with an overview of the baseline methodology shared across this class of algorithms. Having established the common thread weaving through these different models, we provide a practical comparison of these state-of-the-art algorithms, highlighting their relative strengths and weaknesses, along with the assumptions that they rely on. This allows us to identify bottlenecks in current methods and avenues for future research.
翻译:在高分辨率生物分子成像中,最近在高分辨率生物分子成像与冷冻-电子显微镜(cryo-EM)的溶解方面出现的突破为分子量的重建打开了新的大门,从而在生物学、化学和药理学研究等方面有望取得进一步的进展。尽管取得了显著进展,但冷冻-EM数据分析中的巨大挑战仍然是分流和复杂的跨学科性质,需要物理学家、结构性生物学家、计算机科学家、统计家和应用数学家的深刻认识。与此同时,最近新一代的量重建算法,将归正模型与端到端的、不受监督的深层次学习技术相结合,为模拟数据的重建打开了新的门,但在应用实验性冷冻-EM图像时,仍然面临着巨大的障碍。鉴于这些方法的扩散和任务的跨学科性质,我们在这里建议对高分辨率冷冻-EM数量重建的深层次建模领域的最新进展进行一次批判性审查。本次审查的目的是(i)比较和对比这些新方法的瓶颈,同时(ii)从一个视角和术语上展示模拟模拟模拟数据模拟数据模拟的当前和每个模型的深度结构开始,并开始在5个具体的计算方法。