We provide a unifying approximate dynamic programming framework that applies to a broad variety of problems involving sequential estimation. We consider first the construction of surrogate cost functions for the purposes of optimization, and we focus on the special case of Bayesian optimization, using the rollout algorithm and some of its variations. We then discuss the more general case of sequential estimation of a random vector using optimal measurement selection, and its application to problems of stochastic and adaptive control. We finally consider related search and sequential decoding problems, and a rollout algorithm for the approximate solution of the Wordle and Mastermind puzzles, recently developed in the paper [BBB22].
翻译:我们首先考虑为优化目的构建代用成本功能,我们侧重于贝叶斯优化的特殊情况,使用推出算法及其某些变异。然后我们讨论使用最佳计量选择对随机矢量进行顺序估算的更一般性案例,以及将其应用于随机矢量的随机量测及其适应性和适应性控制问题。我们最后考虑相关的搜索和顺序解码问题,以及最近由文件[BBBB22] 开发的旨在大致解决Wordle和Mastermind拼图的推出算法。