Neoplasms (NPs) and neurological diseases and disorders (NDDs) are amongst the major classes of diseases underlying deaths of a disproportionate number of people worldwide. To determine if there exist some distinctive features in the local wiring patterns of protein interactions emerging at the onset of a disease belonging to either of these two classes, we examined 112 and 175 protein interaction networks belonging to NPs and NDDs, respectively. Orbit usage profiles (OUPs) for each of these networks were enumerated by investigating the networks' local topology. 56 non-redundant OUPs (nrOUPs) were derived and used as network features for classification between these two disease classes. Four machine learning classifiers, namely, k-nearest neighbour (KNN), support vector machine (SVM), deep neural network (DNN), random forest (RF) were trained on these data. DNN obtained the greatest average AUPRC (0.988) among these classifiers. DNNs developed on node2vec and the proposed nrOUPs embeddings were compared using 5-fold cross validation on the basis of average values of the six of performance measures, viz., AUPRC, Accuracy, Sensitivity, Specificity, Precision and MCC. It was found that nrOUPs based classifier performed better in all of these six performance measures.
翻译:肿瘤(NPs)和神经疾病和疾病(NDDs)是造成全世界死亡人数过多的人死亡的主要疾病类别之一。为了确定在这两种疾病中任何一种疾病开始时出现的当地蛋白互动线模式中是否存在某些显著特征,我们分别检查了属于NPs和NDDs的112和175个蛋白互动网络。这些网络的本地地形调查列出了其中每一种网络的轨道使用概况(OUPs)。56个非编辑的OUPs(nOUPs)被衍生出来并用作这两个疾病类别之间的网络特征。四个机器学习分类师,即K-近邻(KNNN)、支持矢量机(SVM)、深神经网络(DNNNN)、随机森林(RF)都接受了有关这些数据的培训。 DNNN获得这些分类员中最大平均数的AURC(0.988)。根据Nde2vec开发的DNNNNPs和拟议嵌入的NUPs(NUPs)是使用5倍的交叉验证,根据SENC的6级平均业绩,在SENCSBis和SLCSBisality的6度中发现。