OBJECTIVE: Leverage existing biomedical NLP tools and DS domain terminology to produce a novel and comprehensive knowledge graph containing dietary supplement (DS) information for discovering interactions between DS and drugs, or Drug-Supplement Interactions (DSI). MATERIALS AND METHODS: We created SemRepDS (an extension of SemRep), capable of extracting semantic relations from abstracts by leveraging a DS-specific terminology (iDISK) containing 28,884 DS terms not found in the UMLS. PubMed abstracts were processed using SemRepDS to generate semantic relations, which were then filtered using a PubMedBERT-based model to remove incorrect relations before generating our knowledge graph (SuppKG). Two pathways are used to identify potential DS-Drug interactions which are then evaluated by medical professionals for mechanistic plausibility. RESULTS: Comparison analysis found that SemRepDS returned 206.9% more DS relations and 158.5% more DS entities than SemRep. The fine-tuned BERT model obtained an F1 score of 0.8605 and removed 43.86% of the relations, improving the precision of the relations by 26.4% compared to pre-filtering. SuppKG consists of 2,928 DS-specific nodes. Manual review of findings identified 44 (88%) proposed DS-Gene-Drug and 32 (64%) proposed DS-Gene1-Function-Gene2-Drug pathways to be mechanistically plausible. DISCUSSION: The additional relations extracted using SemRepDS generated SuppKG that was used to find plausible DSI not found in the current literature. By the nature of the SuppKG, these interactions are unlikely to have been found using SemRep without the expanded DS terminology. CONCLUSION: We successfully extend SemRep to include DS information and produce SuppKG which can be used to find potential DS-Drug interactions.
翻译:目标: 利用现有生物医学NLP工具和 DS 域名, 以生成包含饮食补充信息的新颖和全面的知识图表, 包括用于发现DS和药物之间互动的饮食补充( DS) 信息, 或药物补充互动( DSI) 。 材料和方法 : 我们创建了 SemrepDS (SemRepPDS 扩展版), 能够利用DS 特定术语( IDIS), 包含28, 884 DS 当前术语。 利用SemrepDDDS 的当前术语处理 PubMed摘要, 以生成语义关系。 精密的BERT( DMED) 使用基于 普麦德BERT 的模型进行过滤, 以消除不正确的关系 。 我们创建了DRepDDDS 的当前关系 206.9% 和158.5% DS 实体。 精密的BERTFS 模型, 使用 285 GGMS 的精确度分析结果, 找到了 285- GS 和删除的FMLOLODDR%