Predicting drug side-effects before they occur is a key task in keeping the number of drug-related hospitalizations low and to improve drug discovery processes. Automatic predictors of side-effects generally are not able to process the structure of the drug, resulting in a loss of information. Graph neural networks have seen great success in recent years, thanks to their ability of exploiting the information conveyed by the graph structure and labels. These models have been used in a wide variety of biological applications, among which the prediction of drug side-effects on a large knowledge graph. Exploiting the molecular graph encoding the structure of the drug represents a novel approach, in which the problem is formulated as a multi-class multi-label graph-focused classification. We developed a methodology to carry out this task, using recurrent Graph Neural Networks, and building a dataset from freely accessible and well established data sources. The results show that our method has an improved classification capability, under many parameters and metrics, with respect to previously available predictors.
翻译:在药物住院治疗数量少和改善药物发现过程方面,一项关键任务是在出现药物前预测药物副作用。副作用的自动预测器一般无法处理药物的结构,从而导致信息丢失。图表神经网络近年来由于能够利用图形结构和标签传递的信息,取得了巨大成功。这些模型被用于各种各样的生物应用,其中包括在大型知识图上预测药物副作用。利用分子图将药物的结构编码是一种新颖的方法,其中将问题表述为多级多标签图表分类。我们开发了一种方法,利用经常性的图形神经网络来完成这项任务,并且从可自由获取和公认的数据源中建立数据集。结果显示,我们的方法在很多参数和指标下,在以前的预测器方面提高了分类能力。