Automatically summarizing patients' main problems from daily progress notes using natural language processing methods helps combat information and cognitive overload in hospital settings and can potentially assist providers with computerized diagnostic decision support. Problem list summarization requires a model to understand, abstract, and generate clinical documentation. In this work, we propose a new NLP task that aims to generate a list of problems in a patient's daily care plan using input from the provider's progress notes during hospitalization. We investigate the performance of T5 and BART, two state-of-the-art seq2seq transformer architectures, in solving this problem. We provide a corpus built on top of publicly available electronic health record progress notes in the Medical Information Mart for Intensive Care (MIMIC)-III. T5 and BART are pre-trained on general-domain text, and we experiment with a data augmentation method and a domain adaptation pre-training method to increase exposure to medical vocabulary and knowledge. Evaluation methods include ROUGE, BERTScore, cosine similarity on sentence embeddings, and F-score on medical concepts. Results show that T5 with domain adaptive pre-training achieves significant performance gains compared to a rule-based system and general-domain pre-trained language models, indicating a promising direction for tackling the problem summarization task.
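To make the concept-level F-score concrete, the following is a minimal sketch of set-based precision/recall/F1 over a predicted versus a reference problem list. It is illustrative only: the abstract does not specify how medical concepts are extracted or normalized, so here concepts are assumed to be plain lowercase strings rather than the output of a clinical concept extractor.

```python
def concept_f_score(predicted, reference):
    """Set-based F1 over normalized concept strings.

    Illustrative sketch: real evaluation would first map free text
    to medical concepts (e.g., via a clinical NER/normalization
    tool); here each list item is treated as one concept.
    """
    pred = {c.strip().lower() for c in predicted}
    ref = {c.strip().lower() for c in reference}
    if not pred or not ref:
        return 0.0
    true_pos = len(pred & ref)          # concepts found in both lists
    precision = true_pos / len(pred)    # fraction of predictions that are correct
    recall = true_pos / len(ref)        # fraction of reference concepts recovered
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)
```

For example, a prediction of ["Sepsis", "acute kidney injury"] against a reference of ["sepsis", "pneumonia"] gives precision 0.5 and recall 0.5, hence F1 = 0.5.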