Kernel methods are an important class of techniques in machine learning. To be effective, good feature maps are crucial for mapping non-linearly separable input data into a higher dimensional (feature) space, thus allowing the data to be linearly separable in feature space. Previous work has shown that quantum feature map design can be automated for a given dataset using NSGA-II, a genetic algorithm, while both minimizing circuit size and maximizing classification accuracy. However, the evaluation of the accuracy achieved by a candidate feature map is costly. In this work, we demonstrate the suitability of kernel-target alignment as a substitute for accuracy in genetic algorithm-based quantum feature map design. Kernel-target alignment is faster to evaluate than accuracy and doesn't require some data points to be reserved for its evaluation. To further accelerate the evaluation of genetic fitness, we provide a method to approximate kernel-target alignment. To improve kernel-target alignment and root mean squared error, the final trainable parameters of the generated circuits are further trained using COBYLA to determine whether a hybrid approach applying conventional circuit parameter training can easily complement the genetic structure optimization approach. A total of eight new approaches are compared to the original across nine varied binary classification problems from the UCI machine learning repository, showing that kernel-target alignment and its approximation produce feature map circuits enabling comparable accuracy to the previous work but with larger margins on training data (in excess of 20\% larger) that improve further with circuit parameter training.
翻译:内核图是机器学习的一个重要技术。要有效,良好的地貌图对于将非线性分解的输入数据绘制成更高维度(地貌)空间至关重要,从而使得数据在地貌空间中可以线性分解。先前的工作表明,利用基因算法NSGA-II(基因算法),量子特征地图设计可以自动化,用于特定数据集,同时尽量减少电路大小和尽量扩大分类准确性。然而,对候选人地貌图所实现的准确性的评价成本很高。在这项工作中,我们证明内核目标对准是替代基于基因算法的量子图设计准确性的合适性,从而使得数据能够比精度更快地评价数据,而不需要为评估保留一些数据点。为了进一步加快对基因健康的评价,我们提供了接近内核目标对齐的方法。为了改进内核目标的校正和根正方差,对产生的电路路路的最后可训练参数将进一步加以培训。在CUBYLA中,以确定使用混合方法来取代基于遗传算法的更精确性参数培训,是否能够轻易地对原精度进行精度的精度,从而推测测测测测地进行八号的机械图。