Feature selection is generally used as one of the most important preprocessing techniques in machine learning, as it helps to reduce the dimensionality of data and assists researchers and practitioners in understanding data. Thereby, by utilizing feature selection, better performance and reduced computational consumption, memory complexity and even data amount can be expected. Although there exist approaches leveraging the power of deep neural networks to carry out feature selection, many of them often suffer from sensitive hyperparameters. This paper proposes a feature mask module (FM-module) for feature selection based on a novel batch-wise attenuation and feature mask normalization. The proposed method is almost free from hyperparameters and can be easily integrated into common neural networks as an embedded feature selection method. Experiments on popular image, text and speech datasets have shown that our approach is easy to use and has superior performance in comparison with other state-of-the-art deep-learning-based feature selection methods.
翻译:选择地物通常被用作机器学习中最重要的预处理技术之一,因为它有助于减少数据的维度,协助研究人员和从业人员了解数据。因此,通过利用地物选择,提高性能,减少计算消耗,可以预期内存的复杂性甚至数据的数量。虽然有办法利用深神经网络的力量进行地物选择,但其中许多往往存在敏感的超参数。本文件提议在新型批量减速和特征掩码正常化的基础上,为地物选择使用一个功能遮罩模块(FM-模块)。拟议方法几乎没有超光度计,很容易作为嵌入的地物选择方法纳入普通神经网络。关于流行图像、文本和语音数据集的实验表明,我们的方法很容易使用,并且与其他最先进的深学习地物选择方法相比,其性能更高。