Reversible Post-Translational Modifications (PTMs) have vital roles in extending the functional diversity of proteins and effect meaningfully the regulation of protein functions in prokaryotic and eukaryotic organisms. PTMs have happened as crucial molecular regulatory mechanisms that are utilized to regulate diverse cellular processes. Nevertheless, among the most well-studied PTMs can say mainly types of proteins are containing phosphorylation and significant roles in many biological processes. Disorder in this modification can be caused by multiple diseases including neurological disorders and cancers. Therefore, it is necessary to predict the phosphorylation of target residues in an uncharacterized amino acid sequence. Most experimental techniques for predicting phosphorylation are time-consuming, costly, and error-prone. By the way, computational methods have replaced these techniques. These days, a vast amount of phosphorylation data is publicly accessible through many online databases. In this study, at first, all datasets of PTMs that include phosphorylation sites (p-sites) were comprehensively reviewed. Furthermore, we showed that there are basically two main approaches for phosphorylation prediction by machine learning: End-to-End and conventional. We gave an overview for both of them. Also, we introduced 15 important feature extraction techniques which mostly have been used for conventional machine learning methods
翻译:在扩大蛋白质的功能多样性和对蛋白质和蛋白质生物中蛋白质功能的调控产生有意义的效果方面,可逆转的变异后变异作用具有关键作用。PTM是作为关键分子监管机制而发生的,用于调节不同的细胞过程。然而,在最受广泛研究的PTM中,可以说主要类型的蛋白质含有磷酸和许多生物过程的重要作用。这种变异可能由多种疾病(包括神经系统疾病和癌症)引起。因此,有必要预测在一种未说明的氨基酸序列中目标残留物的磷酸化作用。大多数预测磷酸化的实验技术都是耗时、昂贵和易出错的。顺便说,计算方法取代了这些技术。这些天来,大量磷酸化数据可以通过许多在线数据库公开查阅。在这项研究中,首先对包括磷酸化站(站点)在内的所有PTM数据集进行了全面审查。此外,我们表明,在磷酸化预测中基本上有两种主要方法,即用于常规提取方法的15号机学。我们还采用了一种重要方法。