BioGGPT:生物医学文本制作和采矿业培训前先导变异器 (BioGPT: Generative Pre-trained Transformer for Biomedical Text Generation and Mining) - 专知论文

会员服务 ·

0

语言模型化 · MoDELS · MINE · 变换 · Extensibility ·

2022 年 10 月 19 日

BioGPT: Generative Pre-trained Transformer for Biomedical Text Generation and Mining

翻译：BioGGPT:生物医学文本制作和采矿业培训前先导变异器

Renqian Luo,Liai Sun,Yingce Xia,Tao Qin,Sheng Zhang,Hoifung Poon,Tie-Yan Liu

from arxiv, Published at Briefings in Bioinformatics. Code is available at https://github.com/microsoft/BioGPT

Pre-trained language models have attracted increasing attention in the biomedical domain, inspired by their great success in the general natural language domain. Among the two main branches of pre-trained language models in the general language domain, i.e., BERT (and its variants) and GPT (and its variants), the first one has been extensively studied in the biomedical domain, such as BioBERT and PubMedBERT. While they have achieved great success on a variety of discriminative downstream biomedical tasks, the lack of generation ability constrains their application scope. In this paper, we propose BioGPT, a domain-specific generative Transformer language model pre-trained on large scale biomedical literature. We evaluate BioGPT on six biomedical NLP tasks and demonstrate that our model outperforms previous models on most tasks. Especially, we get 44.98%, 38.42% and 40.76% F1 score on BC5CDR, KD-DTI and DDI end-to-end relation extraction tasks respectively, and 78.2% accuracy on PubMedQA, creating a new record. Our case study on text generation further demonstrates the advantage of BioGPT on biomedical literature to generate fluent descriptions for biomedical terms. Code is available at https://github.com/microsoft/BioGPT.

翻译：在一般自然语言领域的伟大成功激励下,预先培训的语言模式在生物医学领域引起了越来越多的注意。在一般语言领域,即BERT(及其变种)和GPT(及其变种)这两个经过培训的通用语言模式两个主要分支中,第一个在生物医学领域,例如BioBERT和PubMedBERT,已经进行了广泛的研究。尽管在一系列歧视性下游生物医学任务方面取得了巨大成功,但缺乏发电能力限制了其应用范围。在本文中,我们提出BioGPT,这是在大规模生物医学文献方面预先培训的域性基因变异变异语言模式。我们评估生物基因变异语言模式的六个主要分支,并表明我们的模型在大多数任务上超越了以前的模式。特别是,我们在BC5CDR、KD-DTI和DDI端至端关系提取任务上分别取得了44.98%和40.76%的F1得分,而在PubMQA上提出了78.2%的精准度,创造了新的记录。我们对MISG/MIPB的版本进行案例研究,进一步展示了MISBE/MBBD的版本。

0

相关内容

语言模型化

语言模型化

NeurlPS 2022 | 自然语言处理相关论文分类整理

NeurlPS 2022 | 自然语言处理相关论文分类整理

专知会员服务

51+阅读 · 2022年10月2日

最新《Transformers模型》教程，64页ppt

最新《Transformers模型》教程，64页ppt

专知会员服务

325+阅读 · 2020年11月26日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【论文翻译】2020最新预训练语言模型综述：Pre-trained Models for Natural Language Processing: A Survey

【论文翻译】2020最新预训练语言模型综述：Pre-trained Models for Natural Language Processing: A Survey

专知会员服务

94+阅读 · 2020年4月13日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

BERT/Transformer/迁移学习NLP资源大列表

BERT/Transformer/迁移学习NLP资源大列表

专知

19+阅读 · 2019年6月9日

BERT/注意力机制/Transformer/迁移学习NLP资源大列表：awesome-bert-nlp

BERT/注意力机制/Transformer/迁移学习NLP资源大列表：awesome-bert-nlp

AINLP

40+阅读 · 2019年6月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

肿瘤抗原HCA587与STAT3的相互作用及其促进肿瘤转移的分子机制研究

国家自然科学基金

1+阅读 · 2014年12月31日

基于同步辐射X射线多元标记蛋白成像新方法的研究

国家自然科学基金

0+阅读 · 2014年12月31日

单分子乃至亚分子尺度的量子态研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于混合模型的多模态复杂工业过程监测研究

国家自然科学基金

8+阅读 · 2013年12月31日

长链非编码RNA HOTTIP参与小细胞肺癌耐药的分子机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

铁酸铋基多相复合材料高温电磁特性及微波响应机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

新型沸石分子筛的合成与结构

国家自然科学基金

0+阅读 · 2011年12月31日

先进树脂基透波复合材料界面结构的控制及介电性能的研究

国家自然科学基金

0+阅读 · 2011年12月31日

我国研究生教育规模结构与我国经济发展水平的适应性研究

国家自然科学基金

0+阅读 · 2009年12月31日

铜基复合材料中锂霞石的相变行为及其对复合材料热膨胀性能的影响机制

国家自然科学基金

0+阅读 · 2008年12月31日

DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models

Arxiv

0+阅读 · 2022年11月30日

Action-GPT: Leveraging Large-scale Language Models for Improved and Generalized Zero Shot Action Generation

Arxiv

0+阅读 · 2022年11月30日

BARTSmiles: Generative Masked Language Models for Molecular Representations

Arxiv

0+阅读 · 2022年11月29日

A Survey of Natural Language Generation

Arxiv

15+阅读 · 2021年12月22日

Adversarial Mutual Information for Text Generation

Adversarial Mutual Information for Text Generation

Arxiv

13+阅读 · 2020年6月30日

UniLMv2: Pseudo-Masked Language Models for Unified Language Model Pre-Training

Arxiv

15+阅读 · 2020年2月28日

Few-shot Natural Language Generation for Task-Oriented Dialog

Few-shot Natural Language Generation for Task-Oriented Dialog

Arxiv

30+阅读 · 2020年2月27日

Improving Knowledge-aware Dialogue Generation via Knowledge Base Question Answering

Arxiv

16+阅读 · 2019年12月16日

Text Generation from Knowledge Graphs with Graph Transformers

Arxiv

35+阅读 · 2019年4月4日

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Arxiv

15+阅读 · 2018年10月11日

VIP会员

文章信息

相关主题

语言模型化

相关VIP内容

NeurlPS 2022 | 自然语言处理相关论文分类整理

NeurlPS 2022 | 自然语言处理相关论文分类整理

专知会员服务

51+阅读 · 2022年10月2日

最新《Transformers模型》教程，64页ppt

最新《Transformers模型》教程，64页ppt

专知会员服务

325+阅读 · 2020年11月26日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【论文翻译】2020最新预训练语言模型综述：Pre-trained Models for Natural Language Processing: A Survey

【论文翻译】2020最新预训练语言模型综述：Pre-trained Models for Natural Language Processing: A Survey

专知会员服务

94+阅读 · 2020年4月13日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【牛津大学博士论文】将序列结构与几何结构融入深度神经网络

工程视角：影响战争进程的小型无人机

企业级AI应用开发：从技术选型到生产落地

AI生成代码缺陷综述

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

BERT/Transformer/迁移学习NLP资源大列表

BERT/Transformer/迁移学习NLP资源大列表

专知

19+阅读 · 2019年6月9日

BERT/注意力机制/Transformer/迁移学习NLP资源大列表：awesome-bert-nlp

BERT/注意力机制/Transformer/迁移学习NLP资源大列表：awesome-bert-nlp

AINLP

40+阅读 · 2019年6月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models

Arxiv

0+阅读 · 2022年11月30日

Action-GPT: Leveraging Large-scale Language Models for Improved and Generalized Zero Shot Action Generation

Arxiv

0+阅读 · 2022年11月30日

BARTSmiles: Generative Masked Language Models for Molecular Representations

Arxiv

0+阅读 · 2022年11月29日

A Survey of Natural Language Generation

Arxiv

15+阅读 · 2021年12月22日

Adversarial Mutual Information for Text Generation

Adversarial Mutual Information for Text Generation

Arxiv

13+阅读 · 2020年6月30日

UniLMv2: Pseudo-Masked Language Models for Unified Language Model Pre-Training

Arxiv

15+阅读 · 2020年2月28日

Few-shot Natural Language Generation for Task-Oriented Dialog

Few-shot Natural Language Generation for Task-Oriented Dialog

Arxiv

30+阅读 · 2020年2月27日

Improving Knowledge-aware Dialogue Generation via Knowledge Base Question Answering

Arxiv

16+阅读 · 2019年12月16日

Text Generation from Knowledge Graphs with Graph Transformers

Arxiv

35+阅读 · 2019年4月4日

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Arxiv

15+阅读 · 2018年10月11日

相关基金

肿瘤抗原HCA587与STAT3的相互作用及其促进肿瘤转移的分子机制研究

国家自然科学基金

1+阅读 · 2014年12月31日

基于同步辐射X射线多元标记蛋白成像新方法的研究

国家自然科学基金

0+阅读 · 2014年12月31日

单分子乃至亚分子尺度的量子态研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于混合模型的多模态复杂工业过程监测研究

国家自然科学基金

8+阅读 · 2013年12月31日

长链非编码RNA HOTTIP参与小细胞肺癌耐药的分子机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

铁酸铋基多相复合材料高温电磁特性及微波响应机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

新型沸石分子筛的合成与结构

国家自然科学基金

0+阅读 · 2011年12月31日

先进树脂基透波复合材料界面结构的控制及介电性能的研究

国家自然科学基金

0+阅读 · 2011年12月31日

我国研究生教育规模结构与我国经济发展水平的适应性研究

国家自然科学基金

0+阅读 · 2009年12月31日

铜基复合材料中锂霞石的相变行为及其对复合材料热膨胀性能的影响机制

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员