Quantum mechanics has the potential to speedup machine learning algorithms, including reinforcement learning(RL). Previous works have shown that quantum algorithms can efficiently solve RL problems in discrete action space, but could become intractable in continuous domain, suffering notably from the curse of dimensionality due to discretization. In this work, we propose an alternative quantum circuit design that can solve RL problems in continuous action space without the dimensionality problem. Specifically, we propose a quantum version of the Deep Deterministic Policy Gradient method constructed from quantum neural networks, with the potential advantage of obtaining an exponential speedup in gate complexity for each iteration. As applications, we demonstrate that quantum control tasks, including the eigenvalue problem and quantum state generation, can be formulated as sequential decision problems and solved by our method.
翻译:量子力学具有加速机器学习算法的潜力,包括强化学习(RL) 。 先前的工程已经表明量子算法可以有效解决离散行动空间的RL问题,但有可能在连续领域变得难以解决,特别是由于离散而受维度诅咒的影响。在这项工作中,我们提出了一个替代量子电路设计,可以在没有维度问题的情况下在连续行动空间解决RL问题。具体地说,我们提出了从量子神经网络构建的深确定论政策分级法的量子版,其潜在优势是每次迭代都能在门复杂度上获得指数加速加速。 正如应用一样,我们证明量子控制任务,包括电子价值问题和量子状态生成,可以作为顺序决定问题,通过我们的方法加以解决。