Knowledge Graphs (KG) and associated Knowledge Graph Embedding (KGE) models have recently begun to be explored in the context of drug discovery and have the potential to assist in key challenges such as target identification. In the drug discovery domain, KGs can be employed as part of a process which can result in lab-based experiments being performed, or impact on other decisions, incurring significant time and financial costs and most importantly, ultimately influencing patient healthcare. For KGE models to have impact in this domain, a better understanding of not only of performance, but also the various factors which determine it, is required. In this study we investigate, over the course of many thousands of experiments, the predictive performance of five KGE models on two public drug discovery-oriented KGs. Our goal is not to focus on the best overall model or configuration, instead we take a deeper look at how performance can be affected by changes in the training setup, choice of hyperparameters, model parameter initialisation seed and different splits of the datasets. Our results highlight that these factors have significant impact on performance and can even affect the ranking of models. Indeed these factors should be reported along with model architectures to ensure complete reproducibility and fair comparisons of future work, and we argue this is critical for the acceptance of use, and impact of KGEs in a biomedical setting.
翻译:最近,在药物发现的背景下,开始探索知识图(KG)和相关的知识图嵌入(KGE)模型,这些模型最近在药物发现的背景下开始探索,并有可能协助应对关键挑战,如目标确定等。在药物发现领域,KGs可以被用作一个过程的一部分,这一过程可导致进行实验室实验,或影响其他决定,产生巨大的时间和财政成本,而且最重要的是,最终影响病人保健。KGE模型要在这方面产生影响,就需要更好地了解性能以及决定性能的各种因素。在本研究中,我们在数千项实验中调查了五个KGE模型在两种面向公众药物发现型KGS的预测性能。我们的目标不是侧重于最佳的总体模型或配置,而是更深入地研究如何能因培训设置的变化、选择超参数、模型参数初始化种子和数据集的不同分裂而影响业绩。我们的结果突出表明,这些因素对性能有重大影响,甚至可能影响模型的排序。我们报告这些因素是为了在将这种接受性能与模型的正确性能进行对比。我们报告这些因素,为了将这种接受性能与模型联系起来。