Cryo-Electron Tomography (cryo-ET) is a 3D imaging technology that enables the visualization of subcellular structures in situ at near-atomic resolution. Cellular cryo-ET images help in resolving the structures of macromolecules and determining their spatial relationship in a single cell, which has broad significance in cell and structural biology. Subtomogram classification and recognition constitute a primary step in the systematic recovery of these macromolecular structures. Supervised deep learning methods have been proven to be highly accurate and efficient for subtomogram classification, but suffer from limited applicability due to scarcity of annotated data. While generating simulated data for training supervised models is a potential solution, a sizeable difference in the image intensity distribution in generated data as compared to real experimental data will cause the trained models to perform poorly in predicting classes on real subtomograms. In this work, we present Cryo-Shift, a fully unsupervised domain adaptation and randomization framework for deep learning-based cross-domain subtomogram classification. We use unsupervised multi-adversarial domain adaption to reduce the domain shift between features of simulated and experimental data. We develop a network-driven domain randomization procedure with `warp' modules to alter the simulated data and help the classifier generalize better on experimental data. We do not use any labeled experimental data to train our model, whereas some of the existing alternative approaches require labeled experimental samples for cross-domain classification. Nevertheless, Cryo-Shift outperforms the existing alternative approaches in cross-domain subtomogram classification in extensive evaluation studies demonstrated herein using both simulated and experimental data.
翻译:心电感应成像技术(cryo-ET)是一种三维成像技术,它使近原子分辨率的亚细胞结构在现场可视化。 细胞冷凝- ET图像有助于解决大型分子结构的结构并确定其在单细胞中的空间关系,这在细胞和结构生物学中具有广泛意义。 子图的分类和识别是系统恢复这些大型分子结构的主要步骤。 监督的深层学习方法对于子图解分类来说是非常准确有效的,但由于缺少附加说明的数据,其适用性有限。 在为受监督的模型提供培训的模拟数据时,一个潜在的解决方案是模拟的。 与实际实验数据相比,生成的数据的图像强度分布差异很大,导致经过训练的模型在预测实际子图和结构图课课程方面表现不佳。 在这项工作中,我们提出了完全不超超超超超的域适应和随机化的域图框架,用于深度学习的跨数据分类,但是由于缺少附加说明的数据说明,我们使用不超超级的多对域域域数据进行变通的调整,用于减少实验性实验性实验性实验性实验模型数据模型数据模型的模型,同时进行。 在模拟数据模型中,我们用模拟数据模拟数据模拟和实验性模型进行中,需要中,需要,我们使用任何常规的模型进行某些的模拟的模拟的模拟的模型, 需要,在模拟的模拟的模型,在模拟的模型,在模拟的模型进行中,在实验性模型进行中,在模拟数据中,在模拟的模型进行。