Drug-target interaction (DTI) prediction plays a crucial role in drug discovery, and deep learning approaches have achieved state-of-the-art performance in this field. We introduce an ensemble of deep learning models (EnsembleDLM) for DTI prediction. EnsembleDLM only uses the sequence information of chemical compounds and proteins, and it aggregates the predictions from multiple deep neural networks. This approach not only achieves state-of-the-art performance in Davis and KIBA datasets but also reaches cutting-edge performance in the cross-domain applications across different bio-activity types and different protein classes. We also demonstrate that EnsembleDLM achieves a good performance (Pearson correlation coefficient and concordance index > 0.8) in the new domain with approximately 50% transfer learning data, i.e., the training set has twice as much data as the test set.
翻译:药物-目标互动(DTI)预测在药物发现中起着关键作用,深层次学习方法已经在这一领域取得了最先进的表现。我们为DTI预测引入了一套深层次学习模型(EntsbleDLM)的组合。聚合DLM只使用化学化合物和蛋白质的序列信息,并汇总了多个深层神经网络的预测。这种方法不仅在Davis和KIBA数据集中达到了最先进的性能,而且在跨生物活动类型和不同蛋白类的交叉应用中也达到了最先进的性能。我们还表明,在新领域,EnsembleDLM取得了良好的性能(Pearson相关系数和和谐指数 > 0.8),大约50%的转移学习数据,即,培训数据集拥有比测试数据集多一倍的数据。