Feature selection is a powerful dimension reduction technique which selects a subset of relevant features for model construction. Numerous feature selection methods have been proposed, but most of them fail under the high-dimensional and low-sample size (HDLSS) setting due to the challenge of overfitting. In this paper, we present a deep learning-based method - GRAph Convolutional nEtwork feature Selector (GRACES) - to select important features for HDLSS data. We demonstrate empirical evidence that GRACES outperforms other feature selection methods on both synthetic and real-world datasets.
翻译:地物选择是一种强大的减少维度技术,它为模型构建选择了一组相关特征。提出了许多特征选择方法,但由于高维和低标尺寸(HDLSS)的设置,其中多数因超装问题而失败。在本文件中,我们介绍了一种深层次的学习方法――巨猿进化新工程特征选择(GRACES)――为HDLSS数据选择重要特征。我们展示了实践证据,证明GRACES在合成和真实世界数据集方面优于其他特征选择方法。