Protein-RNA interactions are vital to a wide variety of cellular activities, and both experimental and computational techniques have been developed to study them. Owing to the limitations of earlier databases, especially the scarcity of protein structure data, most existing computational methods rely heavily on sequence data, and only a small fraction use structural information. AlphaFold has recently revolutionized protein science and biology more broadly, and protein-RNA interaction prediction can be expected to advance significantly in the coming years as a result. In this work, we give a thorough review of the field, surveying both the binding site and binding preference prediction problems and covering the commonly used datasets, features, and models. We also point out potential challenges and opportunities. This survey summarizes the past development of the RBP-RNA interaction field and anticipates its future development in the post-AlphaFold era.