Automatic causal graph construction is of high importance in medical research. They have many applications, such as clinical trial criteria design, where identification of confounding variables is a crucial step. The quality bar for clinical applications is high, and the lack of public corpora is a barrier for such studies. Large language models (LLMs) have demonstrated impressive capabilities in natural language processing and understanding, so applying such models in clinical settings is an attractive direction, especially in clinical applications with complex relations between entities, such as diseases, symptoms and treatments. Whereas, relation extraction has already been studied using LLMs, here we present an end-to-end machine learning solution of causal relationship analysis between aforementioned entities using EMR notes. Additionally, in comparison to other studies, we demonstrate extensive evaluation of the method.
翻译:自动因果图的构造在医学研究中具有非常重要的意义,它们有许多应用,例如临床试验标准设计,其中查明混淆的变量是一个关键步骤。临床应用的质量标准很高,缺乏公共公司是这类研究的障碍。大型语言模型在自然语言处理和理解方面表现出令人印象深刻的能力,因此在临床环境中应用这种模型是一个有吸引力的方向,特别是在临床应用中,这些模型与实体之间复杂的关系,如疾病、症状和治疗等。虽然已经利用LLMS研究过关系提取,但在这里我们提出了上述实体使用EMR说明进行因果关系分析的端到端机学习解决方案。此外,与其他研究相比,我们展示了对方法的广泛评价。