Complex systems are ubiquitous in the real world and tend to have complicated and poorly understood dynamics. For their control issues, the challenge is to guarantee accuracy, robustness, and generalization in such bloated and troubled environments. Fortunately, a complex system can be divided into multiple modular structures that human cognition appears to exploit. Inspired by this cognition, a novel control method, Causal Coupled Mechanisms (CCMs), is proposed that explores the cooperation in division and competition in combination. Our method employs the theory of hierarchical reinforcement learning (HRL), in which 1) the high-level policy with competitive awareness divides the whole complex system into multiple functional mechanisms, and 2) the low-level policy finishes the control task of each mechanism. Specifically for cooperation, a cascade control module helps the series operation of CCMs, and a forward coupled reasoning module is used to recover the coupling information lost in the division process. On both synthetic systems and a real-world biological regulatory system, the CCM method achieves robust and state-of-the-art control results even with unpredictable random noise. Moreover, generalization results show that reusing prepared specialized CCMs helps to perform well in environments with different confounders and dynamics.
翻译:复杂系统在现实世界中无处不在,而且往往具有复杂和不易理解的动态。对于它们的控制问题,挑战在于保证准确性、稳健性和在如此繁忙和麻烦的环境中普遍化。幸运的是,一个复杂的系统可以分为多种模块结构,人类认知似乎会加以利用。在这种认知的启发下,提出了一种新型控制方法,即Causal Compaced机制(CCCM),探索在分化和竞争结合方面的合作。我们的方法采用了等级强化学习理论(HRL),其中1)具有竞争意识的高层政策将整个复杂系统分成多种功能机制,2)低层次政策完成了每个机制的控制任务。具体地说,一个级联动控制模块有助于CCM的系列运作,并使用一个前向组合推理模块来恢复在分化过程中丢失的信息。在合成系统和现实世界生物管理系统中,CCM方法既能实现稳健和状态控制结果,即使有不可预测的随机噪音。此外,一般化的结果显示,与专门化的CMMD环境的重新开发有助于在不同的同步环境中运行。