当几何深深学习与事先训练的蛋白语言模型相遇时 (When Geometric Deep Learning Meets Pretrained Protein Language Models) - 专知论文

会员服务 ·

0

语言模型化 · Learning · MoDELS · state-of-the-art · Networks ·

2022 年 12 月 7 日

When Geometric Deep Learning Meets Pretrained Protein Language Models

翻译：当几何深深学习与事先训练的蛋白语言模型相遇时

Fang Wu,Yu Tao,Dragomir Radev,Jinbo Xu

Geometric deep learning has recently achieved great success in non-Euclidean domains, and learning on 3D structures of large biomolecules is emerging as a distinct research area. However, its efficacy is largely constrained due to the limited quantity of structural data. Meanwhile, protein language models trained on substantial 1D sequences have shown burgeoning capabilities with scale in a broad range of applications. Nevertheless, no preceding studies consider combining these different protein modalities to promote the representation power of geometric neural networks. To address this gap, we make the foremost step to integrate the knowledge learned by well-trained protein language models into several state-of-the-art geometric networks. Experiments are evaluated on a variety of protein representation learning benchmarks, including protein-protein interface prediction, model quality assessment, protein-protein rigid-body docking, and binding affinity prediction, leading to an overall improvement of 20% over baselines and the new state-of-the-art performance. Strong evidence indicates that the incorporation of protein language models' knowledge enhances geometric networks' capacity by a significant margin and can be generalized to complex tasks.

翻译：最近,在非欧洲的深海领域,几何深学取得了巨大成功,大型生物分子的3D结构的学习正在作为一个独特的研究领域出现。然而,由于结构数据数量有限,其功效在很大程度上受到限制。与此同时,在大量1D序列方面受过培训的蛋白质语言模型显示,在广泛的应用中,有大量的1D序列能力正在迅速增强。然而,以前没有任何研究考虑将这些不同的蛋白模式结合起来,以促进几何神经网络的代表性。为弥补这一差距,我们采取了最首要的步骤,将受过良好训练的蛋白语言模型所学的知识纳入若干最先进的几何计量网络。对各种蛋白质代表学习基准进行了评估,包括蛋白质-蛋白接口预测、模型质量评估、蛋白质-蛋白硬体对接和结合性亲近性预测,导致在基线和新状态表现上总体改进了20%。有力的证据表明,蛋白语言模型的纳入知识将大大提升了几何网络的能力,可以推广到复杂的任务中。

0

相关内容

语言模型化

语言模型化

计算机科学课程与视频课件合集，Computer Science courses with video lectures

计算机科学课程与视频课件合集，Computer Science courses with video lectures

专知会员服务

37+阅读 · 2022年1月24日

GNN在几何深度学习有何进展？斯坦福CS224W《几何深度学习》课程报告，DeepMind大牛Petar主讲，附112页ppt

GNN在几何深度学习有何进展？斯坦福CS224W《几何深度学习》课程报告，DeepMind大牛Petar主讲，附112页ppt

专知会员服务

54+阅读 · 2021年12月4日

最新《Transformers模型》教程，64页ppt

最新《Transformers模型》教程，64页ppt

专知会员服务

319+阅读 · 2020年11月26日

神经网络序列数据建模，229页ppt，Modeling Sequential Data with Neural Nets

神经网络序列数据建模，229页ppt，Modeling Sequential Data with Neural Nets

专知会员服务

67+阅读 · 2020年7月25日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【深度学习架构、模型和技巧集合(TensorFlow/PyTorch)】’Deep Learning Models - A collection of various deep learning architectures, models, and tips'

【深度学习架构、模型和技巧集合(TensorFlow/PyTorch)】’Deep Learning Models - A collection of various deep learning architectures, models, and tips'

专知会员服务

58+阅读 · 2020年1月25日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

KDD2021 | 最新GNN官方教程

KDD2021 | 最新GNN官方教程

机器学习与推荐算法

2+阅读 · 2021年8月18日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

空间插值的微分几何方法研究

国家自然科学基金

0+阅读 · 2014年12月31日

激活PPARβ/δ通过GPR40对2型糖尿病大鼠胰岛β细胞抗脂毒性凋亡及机制的研究

国家自然科学基金

0+阅读 · 2014年12月31日

韧性城市卫生健康领域适应气候变化评价方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

TRAIL协同IER3调节NF-κB信号通路介导肝癌细胞凋亡的相关机制研究

国家自然科学基金

1+阅读 · 2012年12月31日

G-四链体靶向的新型天然端粒酶抑制剂发现及抗肿瘤作用机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

典型炸药爆炸机理的量子分子动力学研究

国家自然科学基金

0+阅读 · 2011年12月31日

靶向Bcl-2蛋白家族抗肿瘤活性有机小分子抑制剂的优化和研究

国家自然科学基金

0+阅读 · 2011年12月31日

行波感应加热的研究

国家自然科学基金

0+阅读 · 2008年12月31日

土壤氮素行为及其模拟模型不确定性的Monte-Carlo分析

国家自然科学基金

0+阅读 · 2008年12月31日

树、格及Hurwitz排列中的计数问题

国家自然科学基金

0+阅读 · 2008年12月31日

Structure-informed Language Models Are Protein Designers

Arxiv

0+阅读 · 2023年2月9日

CHiLS: Zero-Shot Image Classification with Hierarchical Label Sets

Arxiv

0+阅读 · 2023年2月7日

Continual Learning of Language Models

Arxiv

0+阅读 · 2023年2月7日

Exploring the Benefits of Training Expert Language Models over Instruction Tuning

Arxiv

0+阅读 · 2023年2月7日

Protecting Language Generation Models via Invisible Watermarking

Arxiv

0+阅读 · 2023年2月6日

Pretraining in Deep Reinforcement Learning: A Survey

Arxiv

21+阅读 · 2022年11月8日

Conditional Prompt Learning for Vision-Language Models

Conditional Prompt Learning for Vision-Language Models

Arxiv

13+阅读 · 2022年3月10日

A Survey on Green Deep Learning

Arxiv

10+阅读 · 2021年11月10日

Federated Learning Meets Natural Language Processing: A Survey

Arxiv

19+阅读 · 2021年7月27日

A Survey of Deep Learning for Scientific Discovery

A Survey of Deep Learning for Scientific Discovery

Arxiv

29+阅读 · 2020年3月26日

VIP会员

文章信息

相关主题

语言模型化

state-of-the-art

相关VIP内容

计算机科学课程与视频课件合集，Computer Science courses with video lectures

计算机科学课程与视频课件合集，Computer Science courses with video lectures

专知会员服务

37+阅读 · 2022年1月24日

GNN在几何深度学习有何进展？斯坦福CS224W《几何深度学习》课程报告，DeepMind大牛Petar主讲，附112页ppt

GNN在几何深度学习有何进展？斯坦福CS224W《几何深度学习》课程报告，DeepMind大牛Petar主讲，附112页ppt

专知会员服务

54+阅读 · 2021年12月4日

最新《Transformers模型》教程，64页ppt

最新《Transformers模型》教程，64页ppt

专知会员服务

319+阅读 · 2020年11月26日

神经网络序列数据建模，229页ppt，Modeling Sequential Data with Neural Nets

神经网络序列数据建模，229页ppt，Modeling Sequential Data with Neural Nets

专知会员服务

67+阅读 · 2020年7月25日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【深度学习架构、模型和技巧集合(TensorFlow/PyTorch)】’Deep Learning Models - A collection of various deep learning architectures, models, and tips'

【深度学习架构、模型和技巧集合(TensorFlow/PyTorch)】’Deep Learning Models - A collection of various deep learning architectures, models, and tips'

专知会员服务

58+阅读 · 2020年1月25日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

扩散模型中的 Transformer：图像生成及其延展应用询问 ChatGPT

281页pdf《神经网络设计入门》

【普林斯顿博士论文】以奖励推动生成式人工智能的发展：奖励引导生成的理论与方法

中文版 | 火力支援与巡飞弹药的未来（附原文）

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

KDD2021 | 最新GNN官方教程

KDD2021 | 最新GNN官方教程

机器学习与推荐算法

2+阅读 · 2021年8月18日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Structure-informed Language Models Are Protein Designers

Arxiv

0+阅读 · 2023年2月9日

CHiLS: Zero-Shot Image Classification with Hierarchical Label Sets

Arxiv

0+阅读 · 2023年2月7日

Continual Learning of Language Models

Arxiv

0+阅读 · 2023年2月7日

Exploring the Benefits of Training Expert Language Models over Instruction Tuning

Arxiv

0+阅读 · 2023年2月7日

Protecting Language Generation Models via Invisible Watermarking

Arxiv

0+阅读 · 2023年2月6日

Pretraining in Deep Reinforcement Learning: A Survey

Arxiv

21+阅读 · 2022年11月8日

Conditional Prompt Learning for Vision-Language Models

Conditional Prompt Learning for Vision-Language Models

Arxiv

13+阅读 · 2022年3月10日

A Survey on Green Deep Learning

Arxiv

10+阅读 · 2021年11月10日

Federated Learning Meets Natural Language Processing: A Survey

Arxiv

19+阅读 · 2021年7月27日

A Survey of Deep Learning for Scientific Discovery

A Survey of Deep Learning for Scientific Discovery

Arxiv

29+阅读 · 2020年3月26日

相关基金

空间插值的微分几何方法研究

国家自然科学基金

0+阅读 · 2014年12月31日

激活PPARβ/δ通过GPR40对2型糖尿病大鼠胰岛β细胞抗脂毒性凋亡及机制的研究

国家自然科学基金

0+阅读 · 2014年12月31日

韧性城市卫生健康领域适应气候变化评价方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

TRAIL协同IER3调节NF-κB信号通路介导肝癌细胞凋亡的相关机制研究

国家自然科学基金

1+阅读 · 2012年12月31日

G-四链体靶向的新型天然端粒酶抑制剂发现及抗肿瘤作用机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

典型炸药爆炸机理的量子分子动力学研究

国家自然科学基金

0+阅读 · 2011年12月31日

靶向Bcl-2蛋白家族抗肿瘤活性有机小分子抑制剂的优化和研究

国家自然科学基金

0+阅读 · 2011年12月31日

行波感应加热的研究

国家自然科学基金

0+阅读 · 2008年12月31日

土壤氮素行为及其模拟模型不确定性的Monte-Carlo分析

国家自然科学基金

0+阅读 · 2008年12月31日

树、格及Hurwitz排列中的计数问题

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员