Background : Knowledge is evolving over time, often as a result of new discoveries or changes in the adopted methods of reasoning. Also, new facts or evidence may become available, leading to new understandings of complex phenomena. This is particularly true in the biomedical field, where scientists and physicians are constantly striving to find new methods of diagnosis, treatment and eventually cure. Knowledge Graphs (KGs) offer a real way of organizing and retrieving the massive and growing amount of biomedical knowledge. Objective : We propose an end-to-end approach for knowledge extraction and analysis from biomedical clinical notes using the Bidirectional Encoder Representations from Transformers (BERT) model and Conditional Random Field (CRF) layer. Methods : The approach is based on knowledge graphs, which can effectively process abstract biomedical concepts such as relationships and interactions between medical entities. Besides offering an intuitive way to visualize these concepts, KGs can solve more complex knowledge retrieval problems by simplifying them into simpler representations or by transforming the problems into representations from different perspectives. We created a biomedical Knowledge Graph using using Natural Language Processing models for named entity recognition and relation extraction. The generated biomedical knowledge graphs (KGs) are then used for question answering. Results : The proposed framework can successfully extract relevant structured information with high accuracy (90.7% for Named-entity recognition (NER), 88% for relation extraction (RE)), according to experimental findings based on real-world 505 patient biomedical unstructured clinical notes. Conclusions : In this paper, we propose a novel end-to-end system for the construction of a biomedical knowledge graph from clinical textual using a variation of BERT models.
翻译:背景:知识随时间而演变,通常是由于新的发现或理论推断方法的改变。此外,新的事实或证据可能会变得可用,从而导致对复杂现象的新理解。在生物医学领域,这尤其正确,科学家和医生不断努力寻找新的诊断、治疗和最终治愈方法。知识图谱(KGs)为组织和检索大量不断增长的生物医学知识提供了一种真实的方式。
目标:我们提出了一种基于Bidirectional Encoder Representations from Transformers(BERT)模型和Conditional Random Field(CRF)层的生物医学临床笔记知识提取和分析的端到端方法。
方法:该方法基于知识图谱,可以有效地处理生物医学概念,如医学实体之间的关系和相互作用。除了提供一种直观的方式来可视化这些概念外,KGs还可以通过将这些问题简化为更简单的表示或从不同的角度转换问题的表示来解决更复杂的知识检索问题。我们利用自然语言处理模型进行命名实体识别和关系提取,创建了生物医学知识图谱。利用生成的生物医疗知识图谱回答问题。
结果:根据基于真实生物医学临床笔记的实验结果,该框架可以成功地以高精度(命名实体识别(NER),准确率为90.7%;关系提取(RE),准确率为88%)提取相关的结构化信息。
结论:本文提出了一种基于BERT模型变化的从临床文本中构建生物医学知识图谱的新型端到端系统。