多节制分子生成,使用局部高集中电解稀释性静电筛选的粗略拉动培训数据 (Multi-Constraint Molecular Generation using Sparsely Labelled Training Data for Localized High-Concentration Electrolyte Diluent Screening) - 专知论文

会员服务 ·

0

训练数据 · 标注 · 稀疏 · MoDELS · 约束 ·

2023 年 1 月 12 日

Multi-Constraint Molecular Generation using Sparsely Labelled Training Data for Localized High-Concentration Electrolyte Diluent Screening

翻译：多节制分子生成,使用局部高集中电解稀释性静电筛选的粗略拉动培训数据

Jonathan P. Mailoa,Xin Li,Jiezhong Qiu,Shengyu Zhang

Recently, machine learning methods have been used to propose molecules with desired properties, which is especially useful for exploring large chemical spaces efficiently. However, these methods rely on fully labelled training data, and are not practical in situations where molecules with multiple property constraints are required. There is often insufficient training data for all those properties from publicly available databases, especially when ab-initio simulation or experimental property data is also desired for training the conditional molecular generative model. In this work, we show how to modify a semi-supervised variational auto-encoder (SSVAE) model which only works with fully labelled and fully unlabelled molecular property training data into the ConGen model, which also works on training data that have sparsely populated labels. We evaluate ConGen's performance in generating molecules with multiple constraints when trained on a dataset combined from multiple publicly available molecule property databases, and demonstrate an example application of building the virtual chemical space for potential Lithium-ion battery localized high-concentration electrolyte (LHCE) diluents.

翻译：最近,机器学习方法被用来提出具有预期特性的分子,这对有效探索大型化学空间特别有用,然而,这些方法依赖充分标记的培训数据,而在需要具有多种属性限制的分子的情况下,这些方法不切实际。从公共数据库中通常没有足够的关于所有这些特性的培训数据,特别是在AB-initio模拟或实验性属性数据也用于培训有条件分子基因化模型时。在这项工作中,我们展示了如何修改半监督的变异自动编码(SSVAE)模型,该模型仅对ConGen模型中完全标记和完全没有标记的分子特性培训数据起作用,该模型还用于培训有稀少人口特征的数据。我们评估ConGen在利用多种公开分子属性数据库的数据集进行训练时产生具有多重限制的分子的性能表现,并展示了为潜在的锂离子电池局部高浓缩电解(LHCHE)稀释剂建造虚拟化学空间的范例。

0

相关内容

训练数据

不可错过！杜克大学《因果推断》课程，全面讲述因果推理

不可错过！杜克大学《因果推断》课程，全面讲述因果推理

专知会员服务

52+阅读 · 2022年10月22日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

专知会员服务

77+阅读 · 2020年2月8日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Jacobi行列式和Hilbert变换中的若干问题及应用

国家自然科学基金

0+阅读 · 2014年12月31日

Cr3+:ABSi2O6(A=Na，K，Ca；B=Mg，Al)可调谐激光晶体的研制

国家自然科学基金

0+阅读 · 2014年12月31日

微驱动器用新型高介电低模量的聚硅氧烷介电弹性体的设计与制备

国家自然科学基金

0+阅读 · 2014年12月31日

基于氮氧化物高效催化脱除的分子筛拓扑结构调控研究

国家自然科学基金

0+阅读 · 2014年12月31日

Calreticulin突变在JAK2 V617F阴性的骨髓增殖性肿瘤中的研究

国家自然科学基金

0+阅读 · 2013年12月31日

Cu基类金刚石结构新型热电化合物的设计与优化

国家自然科学基金

0+阅读 · 2012年12月31日

膀胱癌特异性lncRNA-UCA1的结合分子及作用

国家自然科学基金

0+阅读 · 2011年12月31日

TR3相互作用新蛋白机理研究

国家自然科学基金

1+阅读 · 2008年12月31日

ClC-3氯通道蛋白在肿瘤转移中的功能研究

国家自然科学基金

0+阅读 · 2008年12月31日

树、格及Hurwitz排列中的计数问题

国家自然科学基金

0+阅读 · 2008年12月31日

Loss-Curvature Matching for Dataset Selection and Condensation

Arxiv

0+阅读 · 2023年3月8日

Exploiting Asymmetry for Synthetic Training Data Generation: SynthIE and the Case of Information Extraction

Arxiv

0+阅读 · 2023年3月7日

Efficient Large-scale Scene Representation with a Hybrid of High-resolution Grid and Plane Features

Arxiv

0+阅读 · 2023年3月7日

Manually Selecting The Data Function for Supervised Learning of small datasets

Arxiv

0+阅读 · 2023年3月7日

Data Valuation Without Training of a Model

Arxiv

0+阅读 · 2023年3月7日

Data-driven Modeling of Mach-Zehnder Interferometer-based Optical Matrix Multipliers

Arxiv

0+阅读 · 2023年3月6日

Multi-Order Networks for Action Unit Detection

Arxiv

0+阅读 · 2023年3月6日

Uncertainty Estimation by Fisher Information-based Evidential Deep Learning

Arxiv

0+阅读 · 2023年3月3日

Deterministic training of generative autoencoders using invertible layers

Arxiv

0+阅读 · 2023年3月3日

GeomCA: Geometric Evaluation of Data Representations

GeomCA: Geometric Evaluation of Data Representations

Arxiv

11+阅读 · 2021年5月26日

VIP会员

文章信息

相关主题

相关VIP内容

不可错过！杜克大学《因果推断》课程，全面讲述因果推理

不可错过！杜克大学《因果推断》课程，全面讲述因果推理

专知会员服务

52+阅读 · 2022年10月22日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

专知会员服务

77+阅读 · 2020年2月8日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

【普林斯顿博士论文】在线学习：优化、控制与学习理论

不确定环境下无人机三维路径规划研究 | 221页

【NeurIPS2025】《LeapFactual：基于条件流匹配的可靠视觉反事实解释》

大语言模型将如何改变军事指挥结构

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

相关论文

Loss-Curvature Matching for Dataset Selection and Condensation

Arxiv

0+阅读 · 2023年3月8日

Exploiting Asymmetry for Synthetic Training Data Generation: SynthIE and the Case of Information Extraction

Arxiv

0+阅读 · 2023年3月7日

Efficient Large-scale Scene Representation with a Hybrid of High-resolution Grid and Plane Features

Arxiv

0+阅读 · 2023年3月7日

Manually Selecting The Data Function for Supervised Learning of small datasets

Arxiv

0+阅读 · 2023年3月7日

Data Valuation Without Training of a Model

Arxiv

0+阅读 · 2023年3月7日

Data-driven Modeling of Mach-Zehnder Interferometer-based Optical Matrix Multipliers

Arxiv

0+阅读 · 2023年3月6日

Multi-Order Networks for Action Unit Detection

Arxiv

0+阅读 · 2023年3月6日

Uncertainty Estimation by Fisher Information-based Evidential Deep Learning

Arxiv

0+阅读 · 2023年3月3日

Deterministic training of generative autoencoders using invertible layers

Arxiv

0+阅读 · 2023年3月3日

GeomCA: Geometric Evaluation of Data Representations

GeomCA: Geometric Evaluation of Data Representations

Arxiv

11+阅读 · 2021年5月26日

相关基金

Jacobi行列式和Hilbert变换中的若干问题及应用

国家自然科学基金

0+阅读 · 2014年12月31日

Cr3+:ABSi2O6(A=Na，K，Ca；B=Mg，Al)可调谐激光晶体的研制

国家自然科学基金

0+阅读 · 2014年12月31日

微驱动器用新型高介电低模量的聚硅氧烷介电弹性体的设计与制备

国家自然科学基金

0+阅读 · 2014年12月31日

基于氮氧化物高效催化脱除的分子筛拓扑结构调控研究

国家自然科学基金

0+阅读 · 2014年12月31日

Calreticulin突变在JAK2 V617F阴性的骨髓增殖性肿瘤中的研究

国家自然科学基金

0+阅读 · 2013年12月31日

Cu基类金刚石结构新型热电化合物的设计与优化

国家自然科学基金

0+阅读 · 2012年12月31日

膀胱癌特异性lncRNA-UCA1的结合分子及作用

国家自然科学基金

0+阅读 · 2011年12月31日

TR3相互作用新蛋白机理研究

国家自然科学基金

1+阅读 · 2008年12月31日

ClC-3氯通道蛋白在肿瘤转移中的功能研究

国家自然科学基金

0+阅读 · 2008年12月31日

树、格及Hurwitz排列中的计数问题

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员