Electronic health records (EHRs) store an extensive array of patient information, encompassing medical histories, diagnoses, treatments, and test outcomes. These records are crucial for enabling healthcare providers to make well-informed decisions regarding patient care. Summarizing clinical notes further assists healthcare professionals in pinpointing potential health risks and making better-informed decisions. This process contributes to reducing errors and enhancing patient outcomes by ensuring providers have access to the most pertinent and current patient data. Recent research has shown that incorporating prompts with large language models (LLMs) substantially boosts the efficacy of summarization tasks. However, we show that this approach also leads to increased output variance, resulting in notably divergent outputs even when prompts share similar meanings. To tackle this challenge, we introduce a model-agnostic Soft Prompt-Based Calibration (SPeC) pipeline that employs soft prompts to diminish variance while preserving the advantages of prompt-based summarization. Experimental findings on multiple clinical note tasks and LLMs indicate that our method not only bolsters performance but also effectively curbs variance for various LLMs, providing a more uniform and dependable solution for summarizing vital medical information.