ChatDoctor:使用药娘Fine-tuning的医疗聊天模型，采用医疗领域专业知识 (ChatDoctor: A Medical Chat Model Fine-tuned on LLaMA Model using Medical Domain Knowledge) - 专知论文

会员服务 ·

0

语言模型 · tuning · 医学诊断 · 药物推荐 · 知识 ·

2023 年 3 月 27 日

ChatDoctor: A Medical Chat Model Fine-tuned on LLaMA Model using Medical Domain Knowledge

翻译：ChatDoctor:使用药娘Fine-tuning的医疗聊天模型，采用医疗领域专业知识

Li Yunxiang,Li Zihan,Zhang Kai,Dan Ruilong,Zhang You

Recent large language models (LLMs) in the general domain, such as ChatGPT, have shown remarkable success in following instructions and producing human-like responses. However, such language models have not been tailored to the medical domain, resulting in poor answer accuracy and inability to give plausible recommendations for medical diagnosis, medications, etc. To address this issue, we collected more than 700 diseases and their corresponding symptoms, required medical tests, and recommended medications, from which we generated 5K doctor-patient conversations. By fine-tuning LLMs using these tailored doctor-patient conversations, the resulting models emerge with great potential to understand patients' needs, provide informed advice, and offer valuable assistance in a variety of medical-related fields. The integration of these advanced language models into healthcare can revolutionize the way healthcare professionals and patients communicate, ultimately improving the overall efficiency and quality of patient care and outcomes. In addition, we made public all the source codes, datasets, and model weights to facilitate the further development of dialogue models in the medical field. The training data, codes, and weights of this project are available at: https://github.com/Kent0n-Li/ChatDoctor.

翻译：---- 近期的大语言模型（LLMs）在通用领域，如ChatGPT，在遵循指令并产生类人回复方面表现出了非凡的成功。然而，这种语言模型并没有针对医疗领域进行定制，导致了答案准确度低下和无法提供合理的医学诊断、药物推荐等建议。为了解决这个问题，我们收集了700多种疾病及其相应的症状、需要的医学检查和推荐的药物，从中生成了5K个医生-患者对话。通过使用这些定制的医生-患者对话Fine-tuning LLMs，得到的模型具有非常强的潜力，可以理解患者的需求，提供权威建议，并在各种医疗相关领域提供有价值的帮助。将这些先进的语言模型整合到医疗保健中，可以彻底改革医疗专业人员和患者的沟通方式，最终提高患者护理和结果的整体效率和质量。此外，我们公开了所有源代码、数据集和模型权重，以便促进医疗领域对话模型的进一步发展。该项目的训练数据，代码和权重可在此处获得：https://github.com/Kent0n-Li/ChatDoctor。

1

相关内容

语言模型

ChatGPT懂常识吗？中科院等最新《ChatGPT是一个有知识但没有经验的求解器:大型语言模型常识问题的研究》论文，

ChatGPT懂常识吗？中科院等最新《ChatGPT是一个有知识但没有经验的求解器:大型语言模型常识问题的研究》论文，

专知会员服务

80+阅读 · 2023年4月5日

【ACL2022-华盛顿大学】生成知识促进常识推理，Generated Knowledge Prompting for Commonsense Reasoning

【ACL2022-华盛顿大学】生成知识促进常识推理，Generated Knowledge Prompting for Commonsense Reasoning

专知会员服务

26+阅读 · 2022年3月1日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【Amazon】使用预先训练的Transformer模型进行数据增强，Data Augmentation using Pre-trained Transformer Models

【Amazon】使用预先训练的Transformer模型进行数据增强，Data Augmentation using Pre-trained Transformer Models

专知会员服务

51+阅读 · 2020年3月7日

【WWW2020】学习上下文化文档表示用于医疗答案检索，Learning Contextualized Document Representations for Healthcare Answer Retrieval

【WWW2020】学习上下文化文档表示用于医疗答案检索，Learning Contextualized Document Representations for Healthcare Answer Retrieval

专知会员服务

26+阅读 · 2020年2月10日

【AAAI2020接受论文】利用图卷积网络将知识注入文本任务，Infusing Knowledge into the Textual Entailment Task Using Graph Convolutional Networks

【AAAI2020接受论文】利用图卷积网络将知识注入文本任务，Infusing Knowledge into the Textual Entailment Task Using Graph Convolutional Networks

专知会员服务

45+阅读 · 2019年11月11日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

微软开源DeepSpeed Chat，人人可快速训练百亿、千亿级ChatGPT大模型

微软开源DeepSpeed Chat，人人可快速训练百亿、千亿级ChatGPT大模型

机器之心

5+阅读 · 2023年4月13日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

RoBERTa for Chinese：大规模中文预训练RoBERTa模型

RoBERTa for Chinese：大规模中文预训练RoBERTa模型

AINLP

30+阅读 · 2019年9月8日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

上百种预训练中文词向量：Chinese-Word-Vectors

上百种预训练中文词向量：Chinese-Word-Vectors

AINLP

23+阅读 · 2019年2月26日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

谷歌发表的史上最强NLP模型BERT的官方代码和预训练模型可以下载了

谷歌发表的史上最强NLP模型BERT的官方代码和预训练模型可以下载了

AINLP

12+阅读 · 2018年11月1日

【论文推荐】最新五篇命名实体识别（NER）相关论文—对抗学习、语料库、深度多任务学习、先验知识、跨语言语义

【论文推荐】最新五篇命名实体识别（NER）相关论文—对抗学习、语料库、深度多任务学习、先验知识、跨语言语义

专知

37+阅读 · 2018年2月21日

基于深度学习的医疗影像论文汇总（Deep Learning Papers on Medical Image Analysis）

基于深度学习的医疗影像论文汇总（Deep Learning Papers on Medical Image Analysis）

AI研习社

17+阅读 · 2017年10月21日

桂皮醛干预糖尿病Hap1-Ahi1信号通路的分子机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

等离子体中分数阶微分方程求解的有限元方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

小波分析在R-L分数阶微分方程数值解中的应用

国家自然科学基金

0+阅读 · 2014年12月31日

分泌型金属蛋白酶CLCA在哮喘气道重塑中的作用及机制

国家自然科学基金

0+阅读 · 2013年12月31日

TNF-α诱导鼻咽癌淋巴管生成和淋巴结转移的机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

光相干层析成像研究血液凝固过程中的光学性质动态变化及特征参数

国家自然科学基金

0+阅读 · 2011年12月31日

基于机器翻译的汉-维哈蒙多语种电子病历的研究

国家自然科学基金

0+阅读 · 2011年12月31日

Wnt/βatenin信号通路在纳米拓扑结构诱导骨髓间充质干细胞向成骨细胞分化中的作用和机理研究

国家自然科学基金

0+阅读 · 2010年12月31日

中文医学文本中关联信息提取方法研究

国家自然科学基金

2+阅读 · 2009年12月31日

多源遥感数据反演农作物叶面积指数中的冠层模型改进与信息量评价方法研究

国家自然科学基金

0+阅读 · 2008年12月31日

M3KE: A Massive Multi-Level Multi-Subject Knowledge Evaluation Benchmark for Chinese Large Language Models

Arxiv

0+阅读 · 2023年5月17日

Qualifying Chinese Medical Licensing Examination with Knowledge Enhanced Generative Pre-training Model

Arxiv

0+阅读 · 2023年5月17日

Knowledge Graph Completion Models are Few-shot Learners: An Empirical Study of Relation Labeling in E-commerce with LLMs

Arxiv

0+阅读 · 2023年5月17日

GeneGPT: Augmenting Large Language Models with Domain Tools for Improved Access to Biomedical Information

Arxiv

0+阅读 · 2023年5月16日

Knowledge distillation with Segment Anything (SAM) model for Planetary Geological Mapping

Arxiv

0+阅读 · 2023年5月15日

Pre-trained Language Models for the Legal Domain: A Case Study on Indian Law

Arxiv

0+阅读 · 2023年5月15日

KALM: Knowledge-Aware Integration of Local, Document, and Global Contexts for Long Document Understanding

Arxiv

0+阅读 · 2023年5月14日

When Giant Language Brains Just Aren't Enough! Domain Pizzazz with Knowledge Sparkle Dust

Arxiv

0+阅读 · 2023年5月12日

K-AID: Enhancing Pre-trained Language Models with Domain Knowledge for Question Answering

Arxiv

15+阅读 · 2021年9月22日

Few-Shot Knowledge Graph Completion

Arxiv

15+阅读 · 2019年11月26日

VIP会员

文章信息

相关主题

相关VIP内容

ChatGPT懂常识吗？中科院等最新《ChatGPT是一个有知识但没有经验的求解器:大型语言模型常识问题的研究》论文，

ChatGPT懂常识吗？中科院等最新《ChatGPT是一个有知识但没有经验的求解器:大型语言模型常识问题的研究》论文，

专知会员服务

80+阅读 · 2023年4月5日

【ACL2022-华盛顿大学】生成知识促进常识推理，Generated Knowledge Prompting for Commonsense Reasoning

【ACL2022-华盛顿大学】生成知识促进常识推理，Generated Knowledge Prompting for Commonsense Reasoning

专知会员服务

26+阅读 · 2022年3月1日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【Amazon】使用预先训练的Transformer模型进行数据增强，Data Augmentation using Pre-trained Transformer Models

【Amazon】使用预先训练的Transformer模型进行数据增强，Data Augmentation using Pre-trained Transformer Models

专知会员服务

51+阅读 · 2020年3月7日

【WWW2020】学习上下文化文档表示用于医疗答案检索，Learning Contextualized Document Representations for Healthcare Answer Retrieval

【WWW2020】学习上下文化文档表示用于医疗答案检索，Learning Contextualized Document Representations for Healthcare Answer Retrieval

专知会员服务

26+阅读 · 2020年2月10日

【AAAI2020接受论文】利用图卷积网络将知识注入文本任务，Infusing Knowledge into the Textual Entailment Task Using Graph Convolutional Networks

【AAAI2020接受论文】利用图卷积网络将知识注入文本任务，Infusing Knowledge into the Textual Entailment Task Using Graph Convolutional Networks

专知会员服务

45+阅读 · 2019年11月11日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《城市滨海地区：理解复杂多变环境下的指挥控制框架》50页报告

《理解城市战及其在俄乌战争中的表现》报告

美空军“顶点2025”实验：推进AI在C2、动态目标锁定与联盟集成中的应用

《建设式兵棋模拟作为战术集群配置优化的关键组成部分》

相关资讯

微软开源DeepSpeed Chat，人人可快速训练百亿、千亿级ChatGPT大模型

微软开源DeepSpeed Chat，人人可快速训练百亿、千亿级ChatGPT大模型

机器之心

5+阅读 · 2023年4月13日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

RoBERTa for Chinese：大规模中文预训练RoBERTa模型

RoBERTa for Chinese：大规模中文预训练RoBERTa模型

AINLP

30+阅读 · 2019年9月8日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

上百种预训练中文词向量：Chinese-Word-Vectors

上百种预训练中文词向量：Chinese-Word-Vectors

AINLP

23+阅读 · 2019年2月26日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

谷歌发表的史上最强NLP模型BERT的官方代码和预训练模型可以下载了

谷歌发表的史上最强NLP模型BERT的官方代码和预训练模型可以下载了

AINLP

12+阅读 · 2018年11月1日

【论文推荐】最新五篇命名实体识别（NER）相关论文—对抗学习、语料库、深度多任务学习、先验知识、跨语言语义

【论文推荐】最新五篇命名实体识别（NER）相关论文—对抗学习、语料库、深度多任务学习、先验知识、跨语言语义

专知

37+阅读 · 2018年2月21日

基于深度学习的医疗影像论文汇总（Deep Learning Papers on Medical Image Analysis）

基于深度学习的医疗影像论文汇总（Deep Learning Papers on Medical Image Analysis）

AI研习社

17+阅读 · 2017年10月21日

相关论文

M3KE: A Massive Multi-Level Multi-Subject Knowledge Evaluation Benchmark for Chinese Large Language Models

Arxiv

0+阅读 · 2023年5月17日

Qualifying Chinese Medical Licensing Examination with Knowledge Enhanced Generative Pre-training Model

Arxiv

0+阅读 · 2023年5月17日

Knowledge Graph Completion Models are Few-shot Learners: An Empirical Study of Relation Labeling in E-commerce with LLMs

Arxiv

0+阅读 · 2023年5月17日

GeneGPT: Augmenting Large Language Models with Domain Tools for Improved Access to Biomedical Information

Arxiv

0+阅读 · 2023年5月16日

Knowledge distillation with Segment Anything (SAM) model for Planetary Geological Mapping

Arxiv

0+阅读 · 2023年5月15日

Pre-trained Language Models for the Legal Domain: A Case Study on Indian Law

Arxiv

0+阅读 · 2023年5月15日

KALM: Knowledge-Aware Integration of Local, Document, and Global Contexts for Long Document Understanding

Arxiv

0+阅读 · 2023年5月14日

When Giant Language Brains Just Aren't Enough! Domain Pizzazz with Knowledge Sparkle Dust

Arxiv

0+阅读 · 2023年5月12日

K-AID: Enhancing Pre-trained Language Models with Domain Knowledge for Question Answering

Arxiv

15+阅读 · 2021年9月22日

Few-Shot Knowledge Graph Completion

Arxiv

15+阅读 · 2019年11月26日

相关基金

桂皮醛干预糖尿病Hap1-Ahi1信号通路的分子机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

等离子体中分数阶微分方程求解的有限元方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

小波分析在R-L分数阶微分方程数值解中的应用

国家自然科学基金

0+阅读 · 2014年12月31日

分泌型金属蛋白酶CLCA在哮喘气道重塑中的作用及机制

国家自然科学基金

0+阅读 · 2013年12月31日

TNF-α诱导鼻咽癌淋巴管生成和淋巴结转移的机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

光相干层析成像研究血液凝固过程中的光学性质动态变化及特征参数

国家自然科学基金

0+阅读 · 2011年12月31日

基于机器翻译的汉-维哈蒙多语种电子病历的研究

国家自然科学基金

0+阅读 · 2011年12月31日

Wnt/βatenin信号通路在纳米拓扑结构诱导骨髓间充质干细胞向成骨细胞分化中的作用和机理研究

国家自然科学基金

0+阅读 · 2010年12月31日

中文医学文本中关联信息提取方法研究

国家自然科学基金

2+阅读 · 2009年12月31日

多源遥感数据反演农作物叶面积指数中的冠层模型改进与信息量评价方法研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员