We propose a method for the approximation of high- or even infinite-dimensional feature vectors, which play an important role in supervised learning. The goal is to reduce the size of the training data, resulting in lower storage consumption and computational complexity. Furthermore, the method can be regarded as a regularization technique, which improves the generalizability of learned target functions. We demonstrate significant improvements in comparison to the computation of data-driven predictions involving the full training data set. The method is applied to classification and regression problems from different application areas such as image recognition, system identification, and oceanographic time series analysis.
翻译:我们提出了一种近似高维甚甚或无限特质矢量的方法,在监督学习中发挥重要作用,目标是缩小培训数据的规模,降低存储消耗量和计算复杂性;此外,该方法可被视为一种正规化技术,可以提高学习目标功能的可概括性;与计算涉及全部培训数据集的数据驱动预测相比,我们显示出了显著的改进;该方法适用于图像识别、系统识别和海洋学时间序列分析等不同应用领域的分类和回归问题。