使用大型语言模型作为主钥匙: 使用GPT揭开材料科学的秘密 (Large Language Models as Master Key: Unlocking the Secrets of Materials Science with GPT) - 专知论文

会员服务 ·

0

大型语言模型 · 语言模型 · 数据集 · 钙钛矿太阳能电池 · 科学家 ·

2023 年 4 月 5 日

Large Language Models as Master Key: Unlocking the Secrets of Materials Science with GPT

翻译：使用大型语言模型作为主钥匙: 使用GPT揭开材料科学的秘密

Tong Xie,Yuwei Wa,Wei Huang,Yufei Zhou,Yixuan Liu,Qingyuan Linghu,Shaozhou Wang,Chunyu Kit,Clara Grazian,Bram Hoex

This article presents a new NLP task called structured information inference (SIS) to address the complexities of information extraction at the device level in materials science. We accomplished this task by finetuning GPT-3 on a exsiting perovskite solar cell FAIR dataset with 91.8 F1-score and we updated the dataset with all related scientific papers up to now. The produced dataset is formatted and normalized, enabling its direct utilization as input in subsequent data analysis. This feature will enable materials scientists to develop their own models by selecting high-quality review papers within their domain. Furthermore, we designed experiments to predict PCE and reverse-predict parameters and obtained comparable performance with DFT, which demonstrates the potential of large language models to judge materials and design new materials like a materials scientist.

翻译：本文提出了一种新的自然语言处理任务，称为结构化信息推理(SIS)，以应对材料科学设备级信息提取的复杂性。我们使用预训练语言模型GPT-3对现有的钙钛矿太阳能电池FAIR数据集进行了微调，并更新了该数据集到目前为止所有相关的科学论文。生成的数据集经过格式化和归一化处理，使其可以直接用作后续数据分析的输入。这一特点将使材料科学家能够通过选择自己领域内的高质量综述论文来开发自己的模型。此外，我们设计实验来预测PCE和反向预测参数，并获得了与DFT相当的性能，这展示了大型语言模型像材料科学家一样判断材料并设计新材料的潜力。

0

相关内容

大型语言模型

大型语言模型

【2022新书】Python数据科学导论，309页pdf

【2022新书】Python数据科学导论，309页pdf

专知会员服务

82+阅读 · 2022年8月6日

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

125+阅读 · 2022年4月21日

【教程】深度学习Keras与TensorFlow教程，Deep Learning with Keras and Tensorflow in R

【教程】深度学习Keras与TensorFlow教程，Deep Learning with Keras and Tensorflow in R

专知会员服务

32+阅读 · 2022年3月9日

计算机科学课程与视频课件合集，Computer Science courses with video lectures

计算机科学课程与视频课件合集，Computer Science courses with video lectures

专知会员服务

37+阅读 · 2022年1月24日

【新书】Python数据科学食谱（Python Data Science Cookbook）

【新书】Python数据科学食谱（Python Data Science Cookbook）

专知会员服务

117+阅读 · 2020年1月1日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

机器学习在材料科学中的应用综述，21页pdf

机器学习在材料科学中的应用综述，21页pdf

专知会员服务

50+阅读 · 2019年9月24日

【IJCAI 2019 | tutorial】材料学与AI AI for Materials Science , Lars Kotthof

【IJCAI 2019 | tutorial】材料学与AI AI for Materials Science , Lars Kotthof

专知会员服务

18+阅读 · 2019年8月12日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

使用BERT做文本摘要

使用BERT做文本摘要

专知

23+阅读 · 2019年12月7日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

pytorch-pretrained-BERT：BERT PyTorch实现，可加载Google BERT预训练模型

pytorch-pretrained-BERT：BERT PyTorch实现，可加载Google BERT预训练模型

AINLP

35+阅读 · 2018年11月6日

【论文推荐】最新六篇推荐系统相关论文—注意力机制、多任务、协同跨网络、非结构化文本、TransRev、章节推荐

【论文推荐】最新六篇推荐系统相关论文—注意力机制、多任务、协同跨网络、非结构化文本、TransRev、章节推荐

专知

12+阅读 · 2018年4月26日

【论文推荐】最新5篇情感分析相关论文—深度学习情感分析综述、情感分析语料库、情感预测性、上下文和位置感知的因子分解模型、LSTM

【论文推荐】最新5篇情感分析相关论文—深度学习情感分析综述、情感分析语料库、情感预测性、上下文和位置感知的因子分解模型、LSTM

专知

55+阅读 · 2018年1月28日

LDHs调控水泥混凝土水化硬化过程及抗氯离子-硫酸盐侵蚀的机理研究

国家自然科学基金

0+阅读 · 2015年12月31日

纳米材料性质定量分析中的反问题

国家自然科学基金

1+阅读 · 2014年12月31日

伽玛辐照诱发石墨烯结构的损伤及修复机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

Ce3+/Eu2+离子激活多格位基质的稀土发光材料设计、制备及其动力学行为研究

国家自然科学基金

0+阅读 · 2013年12月31日

点青霉葡萄糖氧化酶热稳定性关键氨基酸研究

国家自然科学基金

0+阅读 · 2012年12月31日

固态（量子点）量子计算的纠错研究

国家自然科学基金

0+阅读 · 2012年12月31日

人类线粒体DNA古老变异潜在致病性的功能验证

国家自然科学基金

0+阅读 · 2011年12月31日

无机纳米材料-聚合物复合结构高效率电致发光

国家自然科学基金

0+阅读 · 2009年12月31日

编码密码学中若干组合对象研究

国家自然科学基金

0+阅读 · 2009年12月31日

非晶稀土氧化物高k栅介质材料的制备及物理特性研究

国家自然科学基金

0+阅读 · 2008年12月31日

CREATOR: Disentangling Abstract and Concrete Reasonings of Large Language Models through Tool Creation

Arxiv

0+阅读 · 2023年5月23日

On Learning to Summarize with Large Language Models as References

Arxiv

0+阅读 · 2023年5月23日

Active Prompting with Chain-of-Thought for Large Language Models

Arxiv

0+阅读 · 2023年5月23日

ChatGPT: Jack of all trades, master of none

Arxiv

0+阅读 · 2023年5月23日

VideoLLM: Modeling Video Sequence with Large Language Models

Arxiv

0+阅读 · 2023年5月23日

Automatic Code Summarization via ChatGPT: How Far Are We?

Arxiv

0+阅读 · 2023年5月22日

Diving into the Inter-Consistency of Large Language Models: An Insightful Analysis through Debate

Arxiv

0+阅读 · 2023年5月19日

PersonaLLM: Investigating the Ability of GPT-3.5 to Express Personality Traits and Gender Differences

Arxiv

0+阅读 · 2023年5月18日

Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes

Arxiv

22+阅读 · 2023年5月3日

Making Pre-trained Language Models Better Few-shot Learners

Arxiv

14+阅读 · 2020年12月31日

VIP会员

文章信息

相关主题

大型语言模型

钙钛矿太阳能电池

相关VIP内容

【2022新书】Python数据科学导论，309页pdf

【2022新书】Python数据科学导论，309页pdf

专知会员服务

82+阅读 · 2022年8月6日

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

125+阅读 · 2022年4月21日

【教程】深度学习Keras与TensorFlow教程，Deep Learning with Keras and Tensorflow in R

【教程】深度学习Keras与TensorFlow教程，Deep Learning with Keras and Tensorflow in R

专知会员服务

32+阅读 · 2022年3月9日

计算机科学课程与视频课件合集，Computer Science courses with video lectures

计算机科学课程与视频课件合集，Computer Science courses with video lectures

专知会员服务

37+阅读 · 2022年1月24日

【新书】Python数据科学食谱（Python Data Science Cookbook）

【新书】Python数据科学食谱（Python Data Science Cookbook）

专知会员服务

117+阅读 · 2020年1月1日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

机器学习在材料科学中的应用综述，21页pdf

机器学习在材料科学中的应用综述，21页pdf

专知会员服务

50+阅读 · 2019年9月24日

【IJCAI 2019 | tutorial】材料学与AI AI for Materials Science , Lars Kotthof

【IJCAI 2019 | tutorial】材料学与AI AI for Materials Science , Lars Kotthof

专知会员服务

18+阅读 · 2019年8月12日

热门VIP内容

开通专知VIP会员享更多权益服务

大语言模型时代的文档智能：综述

蜂窝通信是否是无人机与无人地面战车主宰战场的关键？

文档视觉问答简述

最新新Agent综述！76页327篇论文梳理，北交大桑基韬教授团队发布《迈向模型原生智能体式人工智能的范式转变综述》

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

使用BERT做文本摘要

使用BERT做文本摘要

专知

23+阅读 · 2019年12月7日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

pytorch-pretrained-BERT：BERT PyTorch实现，可加载Google BERT预训练模型

pytorch-pretrained-BERT：BERT PyTorch实现，可加载Google BERT预训练模型

AINLP

35+阅读 · 2018年11月6日

【论文推荐】最新六篇推荐系统相关论文—注意力机制、多任务、协同跨网络、非结构化文本、TransRev、章节推荐

【论文推荐】最新六篇推荐系统相关论文—注意力机制、多任务、协同跨网络、非结构化文本、TransRev、章节推荐

专知

12+阅读 · 2018年4月26日

【论文推荐】最新5篇情感分析相关论文—深度学习情感分析综述、情感分析语料库、情感预测性、上下文和位置感知的因子分解模型、LSTM

【论文推荐】最新5篇情感分析相关论文—深度学习情感分析综述、情感分析语料库、情感预测性、上下文和位置感知的因子分解模型、LSTM

专知

55+阅读 · 2018年1月28日

相关论文

CREATOR: Disentangling Abstract and Concrete Reasonings of Large Language Models through Tool Creation

Arxiv

0+阅读 · 2023年5月23日

On Learning to Summarize with Large Language Models as References

Arxiv

0+阅读 · 2023年5月23日

Active Prompting with Chain-of-Thought for Large Language Models

Arxiv

0+阅读 · 2023年5月23日

ChatGPT: Jack of all trades, master of none

Arxiv

0+阅读 · 2023年5月23日

VideoLLM: Modeling Video Sequence with Large Language Models

Arxiv

0+阅读 · 2023年5月23日

Automatic Code Summarization via ChatGPT: How Far Are We?

Arxiv

0+阅读 · 2023年5月22日

Diving into the Inter-Consistency of Large Language Models: An Insightful Analysis through Debate

Arxiv

0+阅读 · 2023年5月19日

PersonaLLM: Investigating the Ability of GPT-3.5 to Express Personality Traits and Gender Differences

Arxiv

0+阅读 · 2023年5月18日

Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes

Arxiv

22+阅读 · 2023年5月3日

Making Pre-trained Language Models Better Few-shot Learners

Arxiv

14+阅读 · 2020年12月31日

相关基金

LDHs调控水泥混凝土水化硬化过程及抗氯离子-硫酸盐侵蚀的机理研究

国家自然科学基金

0+阅读 · 2015年12月31日

纳米材料性质定量分析中的反问题

国家自然科学基金

1+阅读 · 2014年12月31日

伽玛辐照诱发石墨烯结构的损伤及修复机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

Ce3+/Eu2+离子激活多格位基质的稀土发光材料设计、制备及其动力学行为研究

国家自然科学基金

0+阅读 · 2013年12月31日

点青霉葡萄糖氧化酶热稳定性关键氨基酸研究

国家自然科学基金

0+阅读 · 2012年12月31日

固态（量子点）量子计算的纠错研究

国家自然科学基金

0+阅读 · 2012年12月31日

人类线粒体DNA古老变异潜在致病性的功能验证

国家自然科学基金

0+阅读 · 2011年12月31日

无机纳米材料-聚合物复合结构高效率电致发光

国家自然科学基金

0+阅读 · 2009年12月31日

编码密码学中若干组合对象研究

国家自然科学基金

0+阅读 · 2009年12月31日

非晶稀土氧化物高k栅介质材料的制备及物理特性研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员