使用批量- Wise 缩放和特性掩码常化, 使用批量- Wise 缩放和特性掩码 (Feature Selection Using Batch-Wise Attenuation and Feature Mask Normalization)

Feature selection is generally used as one of the most important preprocessing techniques in machine learning, as it helps to reduce the dimensionality of data and assists researchers and practitioners in understanding data. Thereby, by utilizing feature selection, better performance and reduced computational consumption, memory complexity and even data amount can be expected. Although there exist approaches leveraging the power of deep neural networks to carry out feature selection, many of them often suffer from sensitive hyperparameters. This paper proposes a feature mask module (FM-module) for feature selection based on a novel batch-wise attenuation and feature mask normalization. The proposed method is almost free from hyperparameters and can be easily integrated into common neural networks as an embedded feature selection method. Experiments on popular image, text and speech datasets have shown that our approach is easy to use and has superior performance in comparison with other state-of-the-art deep-learning-based feature selection methods.

翻译：选择地物通常被用作机器学习中最重要的预处理技术之一,因为它有助于减少数据的维度,协助研究人员和从业人员了解数据。因此,通过利用地物选择,提高性能,减少计算消耗,可以预期内存的复杂性甚至数据的数量。虽然有办法利用深神经网络的力量进行地物选择,但其中许多往往存在敏感的超参数。本文件提议在新型批量减速和特征掩码正常化的基础上,为地物选择使用一个功能遮罩模块(FM-模块)。拟议方法几乎没有超光度计,很容易作为嵌入的地物选择方法纳入普通神经网络。关于流行图像、文本和语音数据集的实验表明,我们的方法很容易使用,并且与其他最先进的深学习地物选择方法相比,其性能更高。

相关内容

特征选择

关注 5931

特征选择( Feature Selection )也称特征子集选择( Feature Subset Selection , FSS )，或属性选择( Attribute Selection )。是指从已有的M个特征(Feature)中选择N个特征使得系统的特定指标最优化，是从原始特征中选择出一些最有效特征以降低数据集维度的过程,是提高学习算法性能的一个重要手段,也是模式识别中关键的数据预处理步骤。对于一个学习算法来说,好的学习样本是训练模型的关键。

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

Graph Normalization (GN)：为图神经网络学习一个有效的图归一化

专知会员服务

16+阅读 · 2020年9月28日

基于Transformer嵌入模型的个性化产品搜索，A Transformer-based Embedding Model for Personalized Product Search

专知会员服务

31+阅读 · 2020年5月20日

【伯克利】再思考 Transformer中的Batch Normalization

专知会员服务

41+阅读 · 2020年3月21日