非混凝解斯托克最佳最佳控制问题相继接近的全球趋同 (Global Convergence of Successive Approximations for Non-convex Stochastic Optimal Control Problems) - 专知论文

会员服务 ·

0

控制器 · 近似 · 优化器 · 可约的 · 泛函 ·

2022 年 7 月 5 日

Global Convergence of Successive Approximations for Non-convex Stochastic Optimal Control Problems

翻译：非混凝解斯托克最佳最佳控制问题相继接近的全球趋同

Shaolin Ji,Rundong Xu

This paper focuses on finding approximate solutions to the stochastic optimal control problems where the state trajectory is subject to controlled stochastic differential equations permitting controls in the diffusion coefficients. An algorithm based on the method of successive approximations is described for finding a set of small measure, in which the control is varied finitely so as to reduce the value of the functional and, as the control domains are not necessarily convex, the second-order adjoint processes are introduced in each minimization step of the Hamiltonian. Under certain convexity conditions, we prove that the values of the cost functional descend to the global minimum as the number of iterations tends to infinity. In particular, a convergence rate for a class of linear-quadratic systems is available.

翻译：本文的重点是,在国家轨迹受到受控制的随机差分方程管制的情况下,找到能够控制扩散系数的随机最佳控制问题的近似解决办法。根据连续近似法描述一种算法,以寻找一套小的计量方法,其中控制有一定的差别,以降低功能值,由于控制领域不一定是连接的,因此在汉密尔顿群岛的每个步骤中都引入了二级联合程序。在某些交融条件下,我们证明成本功能值随着迭代次数的多寡而降至全球最低值。特别是,可以找到线性水晶系统类别的趋同率。

0

相关内容

控制器

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

关于 Finsler 流形上调和映射与 Laplacian 的若干问题研究

国家自然科学基金

1+阅读 · 2014年12月31日

蓖麻矮化相关RcDof基因功能分析及调控机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

非线性偏微分方程的非线性微分约束

国家自然科学基金

1+阅读 · 2013年12月31日

地球流体力学和物理学中一些非线性偏微分方程研究

国家自然科学基金

0+阅读 · 2011年12月31日

广义Fermat猜想与相关的丢番图方程

国家自然科学基金

1+阅读 · 2009年12月31日

Non-probabilistic Supervised Learning for Non-linear Convex Variational Problems

Arxiv

0+阅读 · 2022年8月26日

Optimal Bound on the Combinatorial Complexity of Approximating Polytopes

Arxiv

0+阅读 · 2022年8月24日

On Fitness Landscape Analysis of Permutation Problems: From Distance Metrics to Mutation Operator Selection

Arxiv

0+阅读 · 2022年8月23日

The Causal Learning of Retail Delinquency

Arxiv

15+阅读 · 2020年12月17日

Nonconvex Optimization Meets Low-Rank Matrix Factorization: An Overview

Nonconvex Optimization Meets Low-Rank Matrix Factorization: An Overview

Arxiv

11+阅读 · 2019年9月19日

VIP会员

文章信息

相关主题

相关VIP内容

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

前沿人工智能趋势报告（Frontier AI Trends Report）

【AAAI2026】善始则事半功倍：基于前缀优化的大语言模型推理强化学习

Andrej Karpathy：2025 年 LLM 年度回顾（2025 LLM Year in Review）

音退化问题：基于输入操控的鲁棒语音转换综述

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

相关论文

Non-probabilistic Supervised Learning for Non-linear Convex Variational Problems

Arxiv

0+阅读 · 2022年8月26日

Optimal Bound on the Combinatorial Complexity of Approximating Polytopes

Arxiv

0+阅读 · 2022年8月24日

On Fitness Landscape Analysis of Permutation Problems: From Distance Metrics to Mutation Operator Selection

Arxiv

0+阅读 · 2022年8月23日

The Causal Learning of Retail Delinquency

Arxiv

15+阅读 · 2020年12月17日

Nonconvex Optimization Meets Low-Rank Matrix Factorization: An Overview

Nonconvex Optimization Meets Low-Rank Matrix Factorization: An Overview

Arxiv

11+阅读 · 2019年9月19日

相关基金

关于 Finsler 流形上调和映射与 Laplacian 的若干问题研究

国家自然科学基金

1+阅读 · 2014年12月31日

蓖麻矮化相关RcDof基因功能分析及调控机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

非线性偏微分方程的非线性微分约束

国家自然科学基金

1+阅读 · 2013年12月31日

地球流体力学和物理学中一些非线性偏微分方程研究

国家自然科学基金

0+阅读 · 2011年12月31日

广义Fermat猜想与相关的丢番图方程

国家自然科学基金

1+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员