With the advent of real-world quantum computing, the idea that parametrized quantum computations can be used as hypothesis families in a quantum-classical machine learning system is gaining increasing traction. Such hybrid systems have already shown the potential to tackle real-world tasks in supervised and generative learning, and recent works have established their provable advantages in special artificial tasks. Yet, in the case of reinforcement learning, which is arguably most challenging and where learning boosts would be extremely valuable, no proposal has been successful in solving even standard benchmarking tasks, nor in showing a theoretical learning advantage over classical algorithms. In this work, we achieve both. We propose a hybrid quantum-classical reinforcement learning model using very few qubits, which we show can be effectively trained to solve several standard benchmarking environments. Moreover, we demonstrate, and formally prove, the ability of parametrized quantum circuits to solve certain learning tasks that are intractable for classical models, including current state-of-art deep neural networks, under the widely-believed classical hardness of the discrete logarithm problem.
翻译:随着现实世界量子计算的到来,在量子古典机器学习系统中,平衡量子计算可以用作假想家庭的想法正在逐渐增加。这种混合系统已经展示了在监督和基因学习中完成现实世界任务的潜力,而最近的工程在特殊的人工任务中确立了其可证实的优势。 然而,在强化学习方面,可以说是最具挑战性的,而且学习的推动将极有价值,但在甚至解决标准基准任务或显示对古典算法的理论学习优势方面,没有任何建议取得成功。在这项工作中,我们都实现了。我们建议使用极少的量子来混合量子级强化学习模式,我们展示了能够有效地解决若干标准基准环境的培训。此外,我们证明并正式证明,在人们广泛相信的离散日志的古典硬性下,配成的量子电路能够解决古典模型难以解决的某些学习任务,包括当前状态的深层神经网络。