In this thesis, we consider two simple but typical control problems and apply deep reinforcement learning to them, i.e., to cool and control a particle which is subject to continuous position measurement in a one-dimensional quadratic potential or in a quartic potential. We compare the performance of reinforcement learning control and conventional control strategies on the two problems, and show that the reinforcement learning achieves a performance comparable to the optimal control for the quadratic case, and outperforms conventional control strategies for the quartic case for which the optimal control strategy is unknown. To our knowledge, this is the first time deep reinforcement learning is applied to quantum control problems in continuous real space. Our research demonstrates that deep reinforcement learning can be used to control a stochastic quantum system in real space effectively as a measurement-feedback closed-loop controller, and our research also shows the ability of AI to discover new control strategies and properties of the quantum systems that are not well understood, and we can gain insights into these problems by learning from the AI, which opens up a new regime for scientific research.
翻译:在此论文中,我们考虑了两个简单但典型的控制问题,并对他们应用了深度强化学习,即冷却和控制一个粒子,该粒子在单维四方潜力或孔径潜力中需持续测量位置。我们比较了强化学习控制的表现和关于这两个问题的常规控制战略。我们比较了这两个问题,并表明强化学习取得了与四方情况最佳控制相近的绩效,并且优于不为最佳控制战略所了解的倾斜情况常规控制战略。据我们了解,这是首次将深度强化学习应用到连续真实空间的量子控制问题。我们的研究显示,深度强化学习可以有效地用于控制真实空间的蒸汽量子系统,作为一种测量和后背闭环控制器,我们的研究还表明,AI能够发现新的控制战略和量子系统不甚为人们所理解的特性,我们可以通过从AI中学习对这些问题的深入了解,这为科学研究开辟了新的制度。