Variational quantum algorithms have been acknowledged as a leading strategy to realize near-term quantum advantages in meaningful tasks, including machine learning and combinatorial optimization. When applied to tasks involving classical data, such algorithms generally begin with quantum circuits for data encoding and then train quantum neural networks (QNNs) to minimize target functions. Although QNNs have been widely studied to improve these algorithms' performance on practical tasks, there is a gap in systematically understanding the influence of data encoding on the eventual performance. In this paper, we make progress in filling this gap by considering the common data encoding strategies based on parameterized quantum circuits. We prove that, under reasonable assumptions, the distance between the average encoded state and the maximally mixed state could be explicitly upper-bounded with respect to the width and depth of the encoding circuit. This result in particular implies that the average encoded state will concentrate on the maximally mixed state at an exponential speed on depth. Such concentration seriously limits the capabilities of quantum classifiers, and strictly restricts the distinguishability of encoded states from a quantum information perspective. We further support our findings by numerically verifying these results on both synthetic and public data sets. Our results highlight the significance of quantum data encoding in machine learning tasks and may shed light on future encoding strategies.
翻译:虽然对QNN进行了广泛研究,以改进这些算法在实际任务方面的表现,但在系统理解数据编码对最终性能的影响方面存在差距。在本文件中,我们通过考虑基于参数化量子电路的共同数据编码战略,在填补这一差距方面取得进展。我们证明,在合理的假设下,平均编码状态与最大混合状态之间的距离可以明确以编码电路的宽度和深度为上限。这特别意味着,平均编码状态将以指数化速度以指数化速度集中于最大混合状态。这种集中严重限制了量子分类器的能力,并严格限制了根据量子信息电路进行编码的状态的区别。我们进一步支持我们在定量信息电路上对平均编码状态和最大混合状态之间的距离,在编码电路宽度和深度方面可以明确地以最大限制。这尤其意味着,平均编码状态将以指数化状态在深度上最接近的混合状态为重点。这种集中严重限制了量级分类器的能力,并严格地限制根据量级信息电路法分析编码状态的可辨别性能。我们进一步支持我们的调查结果,通过对合成数据序列和未来数据序列研究的结果进行数字化研究。