Topological data analysis (TDA) is a powerful technique for extracting complex and valuable shape-related summaries of high-dimensional data. However, the computational demands of classical TDA algorithms are exorbitant, and quickly become impractical for high-order characteristics. Quantum computing promises exponential speedup for certain problems. Yet, many existing quantum algorithms with notable asymptotic speedups require a degree of fault tolerance that is currently unavailable. In this paper, we present NISQ-TDA, the first fully implemented end-to-end quantum machine learning algorithm needing only a linear circuit-depth, that is applicable to non-handcrafted high-dimensional classical data, with potential speedup under stringent conditions. The algorithm neither suffers from the data-loading problem nor does it need to store the input data on the quantum computer explicitly. Our approach includes three key innovations: (a) an efficient realization of the full boundary operator as a sum of Pauli operators; (b) a quantum rejection sampling and projection approach to restrict a uniform superposition to the simplices of the desired order in the complex; and (c) a stochastic rank estimation method to estimate the topological features in the form of approximate Betti numbers. We present theoretical results that establish additive error guarantees for NISQ-TDA, and the circuit and computational time and depth complexities for exponentially scaled output estimates, up to the error tolerance. The algorithm was successfully executed on quantum computing devices, as well as on noisy quantum simulators, applied to small datasets. Preliminary empirical results suggest that the algorithm is robust to noise.
翻译:地形数据分析(TDA)是提取复杂和有价值的高维数据形状摘要的有力技术。然而,古典TDA算法的计算要求过高,对于高阶特性来说很快变得不切实际。量子计算意味着某些问题的指数加速。然而,许多现有的量子算法具有明显的无症状加速速度,需要一定程度的过错容忍度,而目前还不具备这一点。在本文件中,我们提出了NISQ-TDA,这是第一个完全执行的端到端量子机学习算法,只需要一个线性电路深度,适用于非手制的高维度古典数据,有可能在严格的条件下加快速度。算法既不会因数据负荷问题而受到影响,也不需要明确地将输入数据储存在量子计算机上。我们的方法包括三个关键创新:(a) 有效地实现完全的边界操作者,这是保利操作者的总和;(b) 量子量拒绝抽样和预测方法,将统一的量子值超值定位方法限制到复杂秩序的精度;以及(c) 用于非手制高维的高度古典的古典的古典数据,我们用来估算的比级计算方法,是用来测测测测测测测测测算。