在非线性支持矢量机分类中选择特征的新嵌入式最小最大值方法 (A novel embedded min-max approach for feature selection in nonlinear support vector machine classification)

In recent years, feature selection has become a challenging problem in several machine learning fields, such as classification problems. Support Vector Machine (SVM) is a well-known technique applied in classification tasks. Various methodologies have been proposed in the literature to select the most relevant features in SVM. Unfortunately, all of them either deal with the feature selection problem in the linear classification setting or propose ad-hoc approaches that are difficult to implement in practice. In contrast, we propose an embedded feature selection method based on a min-max optimization problem, where a trade-off between model complexity and classification accuracy is sought. By leveraging duality theory, we equivalently reformulate the min-max problem and solve it without further ado using off-the-shelf software for nonlinear optimization. The efficiency and usefulness of our approach are tested on several benchmark data sets in terms of accuracy, number of selected features and interpretability.

翻译：近年来,在诸如分类问题等若干机械学习领域,地物选择已成为一个具有挑战性的难题。支持矢量机(SVM)是用于分类任务的一种众所周知的技术。文献中提出了各种方法来选择SVM中最相关的特征。不幸的是,所有这些方法要么处理线性分类设置中的地物选择问题,要么提出在实践中难以执行的临时性方法。相比之下,我们提议基于微量最大优化问题的嵌入性地物选择方法,在模型复杂性和分类准确性之间求得平衡。我们利用双重性理论,将微量轴问题进行等同重新配置,并在不使用现成软件进一步进行非线性优化的情况下加以解决。我们的方法的效率和有用性在精确性、选定特征的数量和可解释性等几个基准数据集上进行了测试。

相关内容

特征选择

关注 5931

特征选择( Feature Selection )也称特征子集选择( Feature Subset Selection , FSS )，或属性选择( Attribute Selection )。是指从已有的M个特征(Feature)中选择N个特征使得系统的特定指标最优化，是从原始特征中选择出一些最有效特征以降低数据集维度的过程,是提高学习算法性能的一个重要手段,也是模式识别中关键的数据预处理步骤。对于一个学习算法来说,好的学习样本是训练模型的关键。

专知会员服务

39+阅读 · 2020年11月3日

【基于模型的强化学习的博弈论框架】A Game Theoretic Framework for Model Based Reinforcement Learning

专知会员服务

131+阅读 · 2020年4月19日