There is significant interest in learning and optimizing a complex system composed of multiple sub-components, where these components may be agents or autonomous sensors. Among the rich literature on this topic, agent-based and domain-specific simulations can capture complex dynamics and subgroup interaction, but optimizing over such simulations can be computationally and algorithmically challenging. Bayesian approaches, such as Gaussian processes (GPs), can be used to learn a computationally tractable approximation to the underlying dynamics but typically neglect the detailed information about subgroups in the complicated system. We attempt to find the best of both worlds by proposing the idea of decomposed feedback, which captures group-based heterogeneity and dynamics. We introduce a novel decomposed GP regression to incorporate the subgroup decomposed feedback. Our modified regression has provably lower variance -- and thus a more accurate posterior -- compared to previous approaches; it also allows us to introduce a decomposed GP-UCB optimization algorithm that leverages subgroup feedback. The Bayesian nature of our method makes the optimization algorithm trackable with a theoretical guarantee on convergence and no-regret property. To demonstrate the wide applicability of this work, we execute our algorithm on two disparate social problems: infectious disease control in a heterogeneous population and allocation of distributed weather sensors. Experimental results show that our new method provides significant improvement compared to the state-of-the-art.
翻译:学习和优化一个由多个子组成部分组成的复杂系统,这些组成部分可以是代理或自主传感器; 在关于这一专题的丰富文献中,基于代理的和针对具体域的模拟可以捕捉复杂的动态和分组互动,但对这种模拟的优化可以在计算上和逻辑上具有挑战性。 诸如高森进程(GPs)等巴伊西亚方法可以用来学习一种可计算到的基本动态近似,但通常忽视关于复杂系统中分组的详细信息。我们试图通过提出分解反馈的想法,找到两个世界的最好之处,这种反馈可以捕捉基于集团的异质性和动态。我们引入了一种新型的分解GP回归,以纳入分解的分组反馈。我们经过修改的回归与以往方法相比,差异小得多,因而更准确的后台效应。它使我们能够引入一种分解的GP-UCB优化算法,以利用新的分组反馈。我们方法的性质使得最优化的算法可以追踪,同时从理论上保证基于集团的异性异性异性组合和不易变的气候性算法分配:在我们的重要的实验性人口结构上进行这种实验性变的实验性分析。