Quantum computing has promised significant improvement in solving difficult computational tasks over classical computers. Designing quantum circuits for practical use, however, is not a trivial objective and requires expert-level knowledge. To aid this endeavor, this paper proposes a machine learning-based method to construct quantum circuit architectures. Previous works have demonstrated that classical deep reinforcement learning (DRL) algorithms can successfully construct quantum circuit architectures without encoded physics knowledge. However, these DRL-based works are not generalizable to settings with changing device noises, thus requiring considerable amounts of training resources to keep the RL models up-to-date. With this in mind, we incorporated continual learning to enhance the performance of our algorithm. In this paper, we present the Probabilistic Policy Reuse with deep Q-learning (PPR-DQL) framework to tackle this circuit design challenge. By conducting numerical simulations over various noise patterns, we demonstrate that the RL agent with PPR was able to find the quantum gate sequence to generate the two-qubit Bell state faster than the agent that was trained from scratch. The proposed framework is general and can be applied to other quantum gate synthesis or control problems -- including the automatic calibration of quantum devices.
翻译:量子计算机承诺在解决古典计算机的难以计算任务方面作出重大改进。然而,设计实际使用的量子电路并不是一个微不足道的目标,需要专家水平的知识。为了帮助这项工作,本文件提出一种基于机器的学习方法,以构建量子电路结构。先前的工作表明,传统的深层强化学习(DRL)算法可以在没有编码物理知识的情况下成功构建量子电路结构。然而,这些基于DRL的工程不能被广泛用于设备噪音变化的环境,因此需要大量的培训资源来保持RL模型的更新。考虑到这一点,我们吸收了不断学习以提高我们算法的性能。在本文中,我们介绍了具有深度Q学习(PPR-DQL)框架的概率政策再利用,以应对这一电路路设计挑战。通过对各种噪音模式进行数字模拟,我们证明PPR的RL代理能够找到离子门序列,以更快的速度生成两QBell状态。拟议的框架是通用的,可以应用于其他量子门或合成装置的校准,包括自动化的容器的校准。