ImpressionGPT:利用ChatGPT的迭代优化框架进行放射学报告摘要 (ImpressionGPT: An Iterative Optimizing Framework for Radiology Report Summarization with ChatGPT)

Chong Ma,Zihao Wu,Jiaqi Wang,Shaochen Xu,Yaonai Wei,Zhengliang Liu,Lei Guo,Xiaoyan Cai,Shu Zhang,Tuo Zhang,Dajiang Zhu,Dinggang Shen,Tianming Liu,Xiang Li

The 'Impression' section of a radiology report is a critical basis for communication between radiologists and other physicians, and it is typically written by radiologists based on the 'Findings' section. However, writing numerous impressions can be laborious and error-prone for radiologists. Although recent studies have achieved promising results in automatic impression generation using large-scale medical text data for pre-training and fine-tuning pre-trained language models, such models often require substantial amounts of medical text data and have poor generalization performance. While large language models (LLMs) like ChatGPT have shown strong generalization capabilities and performance, their performance in specific domains, such as radiology, remains under-investigated and potentially limited. To address this limitation, we propose ImpressionGPT, which leverages the in-context learning capability of LLMs by constructing dynamic contexts using domain-specific, individualized data. This dynamic prompt approach enables the model to learn contextual knowledge from semantically similar examples from existing data. Additionally, we design an iterative optimization algorithm that performs automatic evaluation on the generated impression results and composes the corresponding instruction prompts to further optimize the model. The proposed ImpressionGPT model achieves state-of-the-art performance on both MIMIC-CXR and OpenI datasets without requiring additional training data or fine-tuning the LLMs. This work presents a paradigm for localizing LLMs that can be applied in a wide range of similar application scenarios, bridging the gap between general-purpose LLMs and the specific language processing needs of various domains.

翻译：放射学报告的“结论”部分是放射科医生与其他医生之间通讯的重要基础，通常是基于“发现”部分编写的。然而，写作许多印象对于放射科医生来说可能是费力的且容易出错的。虽然最近的研究利用大规模医学文本数据进行预训练和微调预训练语言模型，在自动印象生成方面取得了有希望的结果，但这些模型通常需要大量医学文本数据，并且具有较差的泛化性能。虽然大型语言模型（LLMs）如ChatGPT已经表现出了较强的泛化能力和性能，但它们在特定领域，如放射学，的表现仍未得到充分研究，可能存在一定局限性。为了解决这个问题，我们提出了ImpressionGPT，它利用LLMs的上下文学习能力，通过使用领域特定的，个性化的数据构建动态上下文。这种动态提示方法使模型能够从现有数据的语义相似的示例中学习上下文知识。此外，我们设计了一种迭代优化算法，对生成印象结果进行自动评估，并组合相应的指令提示以进一步优化模型。所提出的ImpressionGPT模型在MIMIC-CXR和OpenI数据集上均取得了最先进的性能，而无需额外的训练数据或微调LLMs。本研究提供了一个定位LLMs的范例，可应用于各种类似的应用场景，弥补了通用LLMs和各种领域的特定语言处理需求之间的差距。