ChatDoctor: A Medical Chat Model Fine-tuned on LLaMA Model using Medical Domain Knowledge

Domain knowledge · Knowledge · Wikipedia · Language models · Autonomous knowledge

April 18, 2023

Yunxiang Li, Zihan Li, Kai Zhang, Ruilong Dan, You Zhang

Recent large language models (LLMs) in the general domain, such as ChatGPT, have shown remarkable success in following instructions and producing human-like responses. However, such language models have not yet been adapted for the medical domain, resulting in poor response accuracy and an inability to provide sound advice on medical diagnoses, medications, and related topics. To address this problem, we fine-tuned our ChatDoctor model on 100k real-world patient-physician conversations from an online medical consultation site. In addition, we added autonomous knowledge retrieval capabilities to ChatDoctor, using, for example, Wikipedia or a disease database as a knowledge brain. After fine-tuning on these 100k patient-physician conversations, our model showed significant improvements in understanding patients' needs and providing informed advice. The autonomous ChatDoctor model based on the Wikipedia and Database Brain can access real-time and authoritative information and answer patient questions based on this information, significantly improving the accuracy of the model's responses, which shows extraordinary potential for the medical field, where tolerance for error is low. To facilitate the further development of dialogue models in the medical field, we make all source code, datasets, and model weights available at: https://github.com/Kent0n-Li/ChatDoctor.
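
The knowledge retrieval idea in the abstract (look up relevant entries in a disease database, then answer from them) can be illustrated with a small sketch. This is not the authors' implementation: the database entries, the stopword list, the word-overlap scoring, and the names `DISEASE_DB`, `retrieve`, and `build_prompt` are all assumptions made for illustration, and the resulting prompt would still need to be passed to a language model.

```python
import re
from collections import Counter

# Hypothetical toy "disease database"; entries are illustrative placeholders,
# not data from the ChatDoctor project.
DISEASE_DB = {
    "influenza": "Influenza is a viral infection; common symptoms include "
                 "fever, cough, and muscle aches.",
    "migraine": "Migraine is a headache disorder; common symptoms include "
                "throbbing headache, nausea, and light sensitivity.",
    "gastroenteritis": "Gastroenteritis is an inflammation of the stomach and "
                       "intestines; common symptoms include diarrhea, "
                       "vomiting, and abdominal cramps.",
}

# Assumed minimal stopword list so that filler words do not count as matches.
STOPWORDS = {"a", "an", "and", "the", "i", "have", "is", "of", "my"}


def tokenize(text):
    """Lowercase the text and return its alphabetic words minus stopwords."""
    return [w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOPWORDS]


def retrieve(question, db, top_k=1):
    """Return up to top_k entry names ranked by word overlap with the question."""
    q = Counter(tokenize(question))
    scored = []
    for name, entry in db.items():
        e = Counter(tokenize(name + " " + entry))
        overlap = sum(min(q[t], e[t]) for t in q)
        scored.append((overlap, name))
    # Highest overlap first; ties broken alphabetically for determinism.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [name for score, name in scored[:top_k] if score > 0]


def build_prompt(question, db, top_k=1):
    """Combine retrieved entries and the question into a single prompt string."""
    names = retrieve(question, db, top_k)
    context = "\n".join(f"- {db[n]}" for n in names) or "- (no matching entry)"
    return (
        f"Reference information:\n{context}\n\n"
        f"Patient question: {question}\n"
        "Answer using only the reference information above."
    )


if __name__ == "__main__":
    print(build_prompt("I have a fever, a cough, and muscle aches", DISEASE_DB))
```

Word overlap is used here only because it keeps the sketch dependency-free; a real system would more likely use a search API or embedding-based retrieval.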
