优化美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元 (Optimizing $αμ$) - 专知论文

会员服务 ·

0

优化器 · INFORMS · 不完美信息 · 蒙特卡罗 · 结点 ·

2021 年 1 月 29 日

Optimizing $αμ$

翻译：优化美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元美元

Tristan Cazenave,Swann Legras,Véronique Ventos

$\alpha\mu$ is a search algorithm which repairs two defaults of Perfect Information Monte Carlo search: strategy fusion and non locality. In this paper we optimize $\alpha\mu$ for the game of Bridge, avoiding useless computations. The proposed optimizations are general and apply to other imperfect information turn-based games. We define multiple optimizations involving Pareto fronts, and show that these optimizations speed up the search. Some of these optimizations are cuts that stop the search at a node, while others keep track of which possible worlds have become redundant, avoiding unnecessary, costly evaluations. We also measure the benefits of parallelizing the double dummy searches at the leaves of the $\alpha\mu$ search tree.

翻译：$\ alpha\ mu$ 是一种搜索算法,它修复了蒙特卡洛完美信息搜索的两个默认值: 战略融合和非地点。在本文中, 我们优化了用于桥牌游戏的 $\ alpha\ mu$, 避免了无用的计算。提议的优化是一般性的, 适用于其他不完善的信息翻转游戏。我们定义了涉及 Pareto 的多重优化, 并显示这些优化加快了搜索速度。有些优化是削减, 停止在节点搜索, 而另一些优化则跟踪了哪些可能的世界已经变得多余, 避免了不必要的、昂贵的评估。我们还测量了在$\ alpha\ mu$ 搜索树叶上平行进行双假搜索的好处。

0

相关内容

优化器

【伯克利-Ke Li】学习优化，74页ppt，Learning to Optimize

【伯克利-Ke Li】学习优化，74页ppt，Learning to Optimize

专知会员服务

41+阅读 · 2020年7月23日

【Google-CMU】元伪标签的元学习，Meta Pseudo Labels

【Google-CMU】元伪标签的元学习，Meta Pseudo Labels

专知会员服务

32+阅读 · 2020年3月30日

最大均方差正则化贝叶斯神经网络，Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

最大均方差正则化贝叶斯神经网络，Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

专知会员服务

54+阅读 · 2020年3月5日

近期必读的7篇ICML 2019【Meta-Learning（元学习）】相关论文和代码

近期必读的7篇ICML 2019【Meta-Learning（元学习）】相关论文和代码

专知会员服务

37+阅读 · 2020年1月11日

【斯坦福大学】TASO:基于深度学习优化的自动生成图变换（TASO: Optimizing Deep Learning with Automatic Generation of Graph Substitutions），35页ppt

【斯坦福大学】TASO:基于深度学习优化的自动生成图变换（TASO: Optimizing Deep Learning with Automatic Generation of Graph Substitutions），35页ppt

专知会员服务

10+阅读 · 2019年12月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

MIT新书《强化学习与最优控制》

MIT新书《强化学习与最优控制》

专知会员服务

281+阅读 · 2019年10月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

基于 Carsim 2016 和 Simulink的无人车运动控制联合仿真（四）

基于 Carsim 2016 和 Simulink的无人车运动控制联合仿真（四）

泡泡机器人SLAM

14+阅读 · 2019年4月30日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

Ray RLlib: Scalable 降龙十八掌

Ray RLlib: Scalable 降龙十八掌

CreateAMind

9+阅读 · 2018年12月28日

已删除

将门创投

3+阅读 · 2017年9月12日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

Cautiously Optimistic Policy Optimization and Exploration with Linear Function Approximation

Arxiv

0+阅读 · 2021年3月24日

$\mathcal{G}$-SGD: Optimizing ReLU Neural Networks in its Positively Scale-Invariant Space

Arxiv

0+阅读 · 2021年3月23日

HADAD: A Lightweight Approach for Optimizing Hybrid Complex Analytics Queries (Extended Version)

Arxiv

0+阅读 · 2021年3月23日

Evolving Continuous Optimisers from Scratch

Arxiv

0+阅读 · 2021年3月22日

Sparsity-Inducing Optimal Control via Differential Dynamic Programming

Arxiv

0+阅读 · 2021年3月22日

Optimal Advertising for Information Products

Arxiv

0+阅读 · 2021年3月22日

Approximate Solutions to a Class of Reachability Games

Arxiv

0+阅读 · 2021年3月20日

Zero-Cost Proxies for Lightweight NAS

Arxiv

0+阅读 · 2021年3月19日

Optimizing Fitness-For-Use of Differentially Private Linear Queries

Arxiv

0+阅读 · 2021年3月19日

Generating Adversarial Computer Programs using Optimized Obfuscations

Arxiv

0+阅读 · 2021年3月18日

VIP会员

文章信息

相关主题

不完美信息

相关VIP内容

【伯克利-Ke Li】学习优化，74页ppt，Learning to Optimize

【伯克利-Ke Li】学习优化，74页ppt，Learning to Optimize

专知会员服务

41+阅读 · 2020年7月23日

【Google-CMU】元伪标签的元学习，Meta Pseudo Labels

【Google-CMU】元伪标签的元学习，Meta Pseudo Labels

专知会员服务

32+阅读 · 2020年3月30日

最大均方差正则化贝叶斯神经网络，Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

最大均方差正则化贝叶斯神经网络，Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

专知会员服务

54+阅读 · 2020年3月5日

近期必读的7篇ICML 2019【Meta-Learning（元学习）】相关论文和代码

近期必读的7篇ICML 2019【Meta-Learning（元学习）】相关论文和代码

专知会员服务

37+阅读 · 2020年1月11日

【斯坦福大学】TASO:基于深度学习优化的自动生成图变换（TASO: Optimizing Deep Learning with Automatic Generation of Graph Substitutions），35页ppt

【斯坦福大学】TASO:基于深度学习优化的自动生成图变换（TASO: Optimizing Deep Learning with Automatic Generation of Graph Substitutions），35页ppt

专知会员服务

10+阅读 · 2019年12月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

MIT新书《强化学习与最优控制》

MIT新书《强化学习与最优控制》

专知会员服务

281+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

军事行动中人工智能系统目标交战的附带损伤评估模型 | 最新文献

NeurIPS 2025 | 自动化所新作速览（一）

美陆军协会（AUSA）2025 年会公布的美国十大武器与防务产品创新

NeurIPS 2025 | 自动化所新作速览（二）

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

基于 Carsim 2016 和 Simulink的无人车运动控制联合仿真（四）

基于 Carsim 2016 和 Simulink的无人车运动控制联合仿真（四）

泡泡机器人SLAM

14+阅读 · 2019年4月30日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

Ray RLlib: Scalable 降龙十八掌

Ray RLlib: Scalable 降龙十八掌

CreateAMind

9+阅读 · 2018年12月28日

已删除

将门创投

3+阅读 · 2017年9月12日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

相关论文

Cautiously Optimistic Policy Optimization and Exploration with Linear Function Approximation

Arxiv

0+阅读 · 2021年3月24日

$\mathcal{G}$-SGD: Optimizing ReLU Neural Networks in its Positively Scale-Invariant Space

Arxiv

0+阅读 · 2021年3月23日

HADAD: A Lightweight Approach for Optimizing Hybrid Complex Analytics Queries (Extended Version)

Arxiv

0+阅读 · 2021年3月23日

Evolving Continuous Optimisers from Scratch

Arxiv

0+阅读 · 2021年3月22日

Sparsity-Inducing Optimal Control via Differential Dynamic Programming

Arxiv

0+阅读 · 2021年3月22日

Optimal Advertising for Information Products

Arxiv

0+阅读 · 2021年3月22日

Approximate Solutions to a Class of Reachability Games

Arxiv

0+阅读 · 2021年3月20日

Zero-Cost Proxies for Lightweight NAS

Arxiv

0+阅读 · 2021年3月19日

Optimizing Fitness-For-Use of Differentially Private Linear Queries

Arxiv

0+阅读 · 2021年3月19日

Generating Adversarial Computer Programs using Optimized Obfuscations

Arxiv

0+阅读 · 2021年3月18日

微信扫码咨询专知VIP会员