Search algorithms for bandit problems are applicable to materials discovery. However, the objective of the conventional bandit problem differs from that of materials discovery: the conventional bandit problem aims to maximize the total reward, whereas materials discovery aims to achieve breakthroughs in material properties. The max K-armed bandit (MKB) problem, which aims to acquire the single best reward, matches discovery tasks better than the conventional bandit. We therefore propose a search algorithm for materials discovery based on the MKB problem, using a pseudo-value of the upper confidence bound of the expected improvement over the best reward. This approach is pseudo-guaranteed to be an asymptotic oracle that does not depend on the time horizon. In addition, compared with other MKB algorithms, the proposed algorithm has only one hyperparameter, which is advantageous in materials discovery. We applied the proposed algorithm to synthetic problems and to molecular-design demonstrations using a Monte Carlo tree search. The results show that the proposed algorithm stably outperformed other bandit algorithms in the late stage of the search, where the optimal arm of the MKB cannot be determined from its expected reward.
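To make the setting concrete, the following is a minimal, hedged sketch of a max K-armed bandit loop in which each arm's selection index is a UCB-style optimistic bound on the improvement over the best reward observed so far. The function and parameter names (`mkb_ucb_ei`, the single exploration weight `c`) are illustrative assumptions for this sketch, not the paper's exact pseudo-value or notation; Gaussian arm rewards are likewise assumed only for the demonstration.

```python
import math
import random


def mkb_ucb_ei(reward_fns, horizon, c=1.0, seed=0):
    """Illustrative max K-armed bandit search (a sketch, not the
    paper's exact algorithm).

    Each arm keeps a running mean and variance (Welford's method);
    the selection index is an optimistic bound on the improvement of
    the next draw over the best reward seen so far. `c` plays the
    role of the single exploration hyperparameter mentioned in the
    abstract (an assumption of this sketch).
    """
    rng = random.Random(seed)
    n_arms = len(reward_fns)
    counts = [0] * n_arms
    means = [0.0] * n_arms
    m2 = [0.0] * n_arms          # sum of squared deviations per arm
    best = -math.inf             # single best reward acquired so far

    for t in range(horizon):
        if t < n_arms:
            a = t                # pull every arm once to initialize
        else:
            def index(i):
                std = math.sqrt(m2[i] / counts[i]) if counts[i] > 1 else 1.0
                bonus = c * std * math.sqrt(math.log(t + 1) / counts[i])
                # optimistic estimate of improvement over the best reward
                return (means[i] + bonus) - best
            a = max(range(n_arms), key=index)

        r = reward_fns[a](rng)   # draw one reward from the chosen arm
        counts[a] += 1
        delta = r - means[a]
        means[a] += delta / counts[a]
        m2[a] += delta * (r - means[a])
        best = max(best, r)

    return best
```

Note that, unlike a cumulative-regret bandit, the loop is scored only by `best`: a high-variance arm with a mediocre mean can still be the right arm to pull, which is exactly the late-stage regime the abstract describes.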