通过 2 级 VAE 改进分子属性 (Improving Molecule Properties Through 2-Stage VAE) - 专知论文

会员服务 ·

0

变分自编码 · 流形 · 数据集 · 统计量 · Performance ·

2022 年 12 月 6 日

Improving Molecule Properties Through 2-Stage VAE

翻译：通过 2 级 VAE 改进分子属性

Chenghui Zhou,Barnabas Poczos

Variational autoencoder (VAE) is a popular method for drug discovery and there had been a great deal of architectures and pipelines proposed to improve its performance. But the VAE model itself suffers from deficiencies such as poor manifold recovery when data lie on low-dimensional manifold embedded in higher dimensional ambient space and they manifest themselves in each applications differently. The consequences of it in drug discovery is somewhat under-explored. In this paper, we study how to improve the similarity of the data generated via VAE and the training dataset by improving manifold recovery via a 2-stage VAE where the second stage VAE is trained on the latent space of the first one. We experimentally evaluated our approach using the ChEMBL dataset as well as a polymer datasets. In both dataset, the 2-stage VAE method is able to improve the property statistics significantly from a pre-existing method.

翻译：变化式自动编码器(VAE)是一种流行的药物发现方法,而且为了改进其性能,曾提议过许多建筑和管道,但VAE模型本身存在缺陷,例如当数据位于高维环境空间内,数据位于低维方位时,数据在高维环境空间内,数据在每种应用中都有不同的表现,而数据在药物发现中的后果是探索不足的。在本文中,我们研究如何通过二阶段VAE改进多元恢复,改进通过VAE生成的数据和培训数据集的相似性,第二阶段VAE在该阶段接受了关于第一个阶段潜在空间的培训。我们利用CHEMBL数据集和聚合数据集对我们的方法进行了实验性评估。在这两个数据集中,二阶段VAE方法能够从先前存在的方法中大大改进财产统计。

0

相关内容

变分自编码

变分自编码

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

【NUS-Xavier教授】生成模型VAE与GAN，69页ppt

【NUS-Xavier教授】生成模型VAE与GAN，69页ppt

专知会员服务

75+阅读 · 2022年4月6日

计算机科学课程与视频课件合集，Computer Science courses with video lectures

计算机科学课程与视频课件合集，Computer Science courses with video lectures

专知会员服务

37+阅读 · 2022年1月24日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

图论中的整数流与圆流

国家自然科学基金

0+阅读 · 2015年12月31日

靶向调节HDAC6增加t-PA静脉溶栓治疗的有效性及安全性研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于天然产物Drimenal的新型杀菌剂分子设计、合成及构效关系研究

国家自然科学基金

0+阅读 · 2013年12月31日

利用活性筛选-质谱特征引导的双示踪法快速发现海绵环肽类抗肿瘤先导化合物

国家自然科学基金

0+阅读 · 2013年12月31日

无机纳米药物载体影响血浆蛋白结构及功能的机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

函数域中的Vinogradov中值定理

国家自然科学基金

0+阅读 · 2012年12月31日

PE相关分子miR-18b的功能研究

国家自然科学基金

0+阅读 · 2011年12月31日

石菖蒲抗阿尔茨海默病(AD)的药效物质基础研究

国家自然科学基金

0+阅读 · 2011年12月31日

牙菌斑生物膜相容性基因的筛选

国家自然科学基金

0+阅读 · 2009年12月31日

用dsDNA微阵列筛选NF-κDNA靶点及靶基因

国家自然科学基金

0+阅读 · 2008年12月31日

Inverse Models for Estimating the Initial Condition of Spatio-Temporal Advection-Diffusion Processes

Arxiv

0+阅读 · 2023年2月8日

Towards Inferential Reproducibility of Machine Learning Research

Arxiv

0+阅读 · 2023年2月8日

Sample-efficient Multi-objective Molecular Optimization with GFlowNets

Arxiv

0+阅读 · 2023年2月8日

Recent advances in the Self-Referencing Embedding Strings (SELFIES) library

Arxiv

0+阅读 · 2023年2月7日

Modeling Human Driving Behavior through Generative Adversarial Imitation Learning

Arxiv

0+阅读 · 2023年2月7日

Structured variational approximations with skew normal decomposable graphical models

Arxiv

0+阅读 · 2023年2月7日

CRU: A Novel Neural Architecture for Improving the Predictive Performance of Time-Series Data

Arxiv

0+阅读 · 2023年2月7日

GPS++: Reviving the Art of Message Passing for Molecular Property Prediction

GPS++: Reviving the Art of Message Passing for Molecular Property Prediction

Arxiv

1+阅读 · 2023年2月6日

Dynamic CoVaR Modeling

Arxiv

0+阅读 · 2023年2月6日

Feature Denoising for Improving Adversarial Robustness

Feature Denoising for Improving Adversarial Robustness

Arxiv

15+阅读 · 2018年12月9日

VIP会员

文章信息

相关主题

变分自编码

相关VIP内容

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

【NUS-Xavier教授】生成模型VAE与GAN，69页ppt

【NUS-Xavier教授】生成模型VAE与GAN，69页ppt

专知会员服务

75+阅读 · 2022年4月6日

计算机科学课程与视频课件合集，Computer Science courses with video lectures

计算机科学课程与视频课件合集，Computer Science courses with video lectures

专知会员服务

37+阅读 · 2022年1月24日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

小规模训练指南：打造世界级大语言模型的关键方法

无人机编队飞行：复杂环境中作战的策略、挑战与应用

大模型APP，AI时代第一个爆款

从数据中心视角出发的高效大语言模型训练综述

相关资讯

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Inverse Models for Estimating the Initial Condition of Spatio-Temporal Advection-Diffusion Processes

Arxiv

0+阅读 · 2023年2月8日

Towards Inferential Reproducibility of Machine Learning Research

Arxiv

0+阅读 · 2023年2月8日

Sample-efficient Multi-objective Molecular Optimization with GFlowNets

Arxiv

0+阅读 · 2023年2月8日

Recent advances in the Self-Referencing Embedding Strings (SELFIES) library

Arxiv

0+阅读 · 2023年2月7日

Modeling Human Driving Behavior through Generative Adversarial Imitation Learning

Arxiv

0+阅读 · 2023年2月7日

Structured variational approximations with skew normal decomposable graphical models

Arxiv

0+阅读 · 2023年2月7日

CRU: A Novel Neural Architecture for Improving the Predictive Performance of Time-Series Data

Arxiv

0+阅读 · 2023年2月7日

GPS++: Reviving the Art of Message Passing for Molecular Property Prediction

GPS++: Reviving the Art of Message Passing for Molecular Property Prediction

Arxiv

1+阅读 · 2023年2月6日

Dynamic CoVaR Modeling

Arxiv

0+阅读 · 2023年2月6日

Feature Denoising for Improving Adversarial Robustness

Feature Denoising for Improving Adversarial Robustness

Arxiv

15+阅读 · 2018年12月9日

相关基金

图论中的整数流与圆流

国家自然科学基金

0+阅读 · 2015年12月31日

靶向调节HDAC6增加t-PA静脉溶栓治疗的有效性及安全性研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于天然产物Drimenal的新型杀菌剂分子设计、合成及构效关系研究

国家自然科学基金

0+阅读 · 2013年12月31日

利用活性筛选-质谱特征引导的双示踪法快速发现海绵环肽类抗肿瘤先导化合物

国家自然科学基金

0+阅读 · 2013年12月31日

无机纳米药物载体影响血浆蛋白结构及功能的机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

函数域中的Vinogradov中值定理

国家自然科学基金

0+阅读 · 2012年12月31日

PE相关分子miR-18b的功能研究

国家自然科学基金

0+阅读 · 2011年12月31日

石菖蒲抗阿尔茨海默病(AD)的药效物质基础研究

国家自然科学基金

0+阅读 · 2011年12月31日

牙菌斑生物膜相容性基因的筛选

国家自然科学基金

0+阅读 · 2009年12月31日

用dsDNA微阵列筛选NF-κDNA靶点及靶基因

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员