In Track-1 of the BioCreative VII Challenge, participants are asked to identify interactions between drugs/chemicals and proteins. In-context named entity annotations for each drug/chemical and protein are provided, and one of fourteen different interactions must be automatically predicted. For this relation extraction task, we attempt both a BERT-based sentence classification approach and a more novel text-to-text approach using a T5 model. We find that larger BERT-based models perform better in general, with our BioMegatron-based model achieving the highest scores across all metrics and an F1 score of 0.74. Though our novel T5 text-to-text method did not perform as well as most of our BERT-based models, it outperformed those trained on similar data, achieving an F1 score of 0.65 and showing promising results. We believe a text-to-text approach to relation extraction offers some competitive advantages and leaves considerable room for further research.