Hedging of Financial Derivative Contracts via Monte Carlo Tree Search - 专知论文


April 19, 2021


From arXiv. Corrected typos. Shorter presentation. 15 pages, 5 figures.

The construction of approximate replication strategies for pricing and hedging of derivative contracts in incomplete markets is a key problem of financial engineering. Recently, Reinforcement Learning algorithms for hedging under realistic market conditions have attracted significant interest. While research in the derivatives area has mostly focused on variations of $Q$-learning, in artificial intelligence Monte Carlo Tree Search is the recognized state-of-the-art method for various planning problems, such as the games of Hex, Chess, and Go. This article introduces Monte Carlo Tree Search as a method to solve the stochastic optimal control problem behind the pricing and hedging tasks. Compared to $Q$-learning, it combines Reinforcement Learning with tree search techniques. As a consequence, Monte Carlo Tree Search has higher sample efficiency, is less prone to over-fitting to specific market models, and generally learns stronger policies faster. In our experiments we find that Monte Carlo Tree Search, being the world champion in games like Chess and Go, is easily capable of maximizing the utility of the investor's terminal wealth without setting up an auxiliary mathematical framework.
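To make the idea concrete, the following is a minimal sketch of a UCT-style Monte Carlo Tree Search applied to a toy hedging problem. It is not the authors' implementation. Everything in it is an illustrative assumption: the three-step binomial market, the parameter values, the three admissible hedge ratios, the exponential utility, the omission of the option premium, and the function names `step`, `rollout`, and `mcts`. The search estimates, for the initial state, which hedge ratio leads to the highest expected utility of terminal wealth after paying out a short call option.

```python
import math
import random
from collections import defaultdict

# Hypothetical toy market; all values below are illustrative assumptions.
S0, UP, DOWN, STEPS, STRIKE = 100.0, 1.1, 0.9, 3, 100.0
ACTIONS = (0.0, 0.5, 1.0)   # admissible hedge ratios (units of stock held)
RISK_AVERSION = 0.05


def utility(wealth):
    """Exponential utility of terminal wealth (an assumed choice)."""
    return -math.exp(-RISK_AVERSION * wealth)


def step(state, action, rng):
    """Sample one market move; state = (time, price, wealth)."""
    t, price, wealth = state
    new_price = price * (UP if rng.random() < 0.5 else DOWN)
    wealth += action * (new_price - price)   # gain or loss on the hedge
    return (t + 1, round(new_price, 6), round(wealth, 6))


def terminal_reward(state):
    """Utility of wealth after paying the payoff of a short call."""
    _, price, wealth = state
    return utility(wealth - max(price - STRIKE, 0.0))


def rollout(state, rng):
    """Play uniformly random hedge ratios until maturity."""
    while state[0] < STEPS:
        state = step(state, rng.choice(ACTIONS), rng)
    return terminal_reward(state)


def mcts(root, n_sim=5000, c=1.0, seed=0):
    rng = random.Random(seed)
    visits = defaultdict(int)     # visit count per (state, action)
    values = defaultdict(float)   # running mean return per (state, action)
    expanded = set()              # states already added to the tree
    for _ in range(n_sim):
        state, path = root, []
        # Selection: descend through expanded states using the UCB1 rule.
        while state[0] < STEPS and state in expanded:
            total = sum(visits[(state, a)] for a in ACTIONS)

            def ucb(a):
                n = visits[(state, a)]
                if n == 0:
                    return math.inf   # try every action at least once
                bonus = c * math.sqrt(math.log(total) / n)
                return values[(state, a)] + bonus

            action = max(ACTIONS, key=ucb)
            path.append((state, action))
            state = step(state, action, rng)
        # Expansion and simulation.
        if state[0] < STEPS:
            expanded.add(state)
            ret = rollout(state, rng)
        else:
            ret = terminal_reward(state)
        # Backup: update running means along the visited path.
        for key in path:
            visits[key] += 1
            values[key] += (ret - values[key]) / visits[key]
    best = max(ACTIONS, key=lambda a: visits[(root, a)])
    return best, {a: values[(root, a)] for a in ACTIONS}


if __name__ == "__main__":
    best, estimates = mcts((0, S0, 0.0))
    print("most visited initial hedge ratio:", best)
    print("estimated expected utilities:", estimates)
```

The sketch keys tree nodes by the full state (time, price, wealth), because the utility depends on accumulated wealth and not only on the current price. Random rollouts stand in for any learned value or policy estimate, so the numbers it prints only illustrate the search mechanics.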


