通过层叠叠图层组合体为深点击区速率预测模型进行神经特征选择的强化探索 (Enhanced Exploration in Neural Feature Selection for Deep Click-Through Rate Prediction Models via Ensemble of Gating Layers)

Feature selection has been an essential step in developing industry-scale deep Click-Through Rate (CTR) prediction systems. The goal of neural feature selection (NFS) is to choose a relatively small subset of features with the best explanatory power as a means to remove redundant features and reduce computational cost. Inspired by gradient-based neural architecture search (NAS) and network pruning methods, people have tackled the NFS problem with Gating approach that inserts a set of differentiable binary gates to drop less informative features. The binary gates are optimized along with the network parameters in an efficient end-to-end manner. In this paper, we analyze the gradient-based solution from an exploration-exploitation perspective and use empirical results to show that Gating approach might suffer from insufficient exploration. To improve the exploration capacity of gradient-based solutions, we propose a simple but effective ensemble learning approach, named Ensemble Gating. We choose two public datasets, namely Avazu and Criteo, to evaluate this approach. Our experiments show that, without adding any computational overhead or introducing any hyper-parameter (except the size of the ensemble), our method is able to consistently improve Gating approach and find a better subset of features on the two datasets with three different underlying deep CTR prediction models.

翻译：在开发工业规模深度点击率(CTR)预测系统的过程中,选择地貌特征一直是开发行业深度点击率(CTR)预测系统的一个必要步骤。神经特征选择的目标是选择相对小的一组具有最佳解释力的特征,作为消除冗余特征和降低计算成本的手段。在基于梯度的神经结构搜索(NAS)和网络修剪方法的启发下,人们已经用定位方法解决NFS问题,即插入一套不同的双进制门,以降低信息量。二进制门与网络参数一道,以高效的端到端方式优化。在本文件中,我们从探索-开发角度分析梯度基解决方案,并使用实验结果来表明Gateg方法可能因探索不足而受到损害。为了提高基于梯度的解决方案的探索能力,我们提出了简单而有效的共通学习方法,名为“Entsemble Getable Gatet。我们选择了两个公共数据集,即Avazu和Criteo来评估这一方法。我们的实验显示,在不增加任何计算间接间接或引入任何超位模型的情况下,我们在不增加任何高分数或引入任何超分数方法的情况下,我们最能的C和最能的C将找到一种方法。

相关内容

特征选择

关注 5931

特征选择( Feature Selection )也称特征子集选择( Feature Subset Selection , FSS )，或属性选择( Attribute Selection )。是指从已有的M个特征(Feature)中选择N个特征使得系统的特定指标最优化，是从原始特征中选择出一些最有效特征以降低数据集维度的过程,是提高学习算法性能的一个重要手段,也是模式识别中关键的数据预处理步骤。对于一个学习算法来说,好的学习样本是训练模型的关键。

首篇「课程学习（Curriculum Learning)」2021综述论文

专知会员服务

50+阅读 · 2021年1月31日

【视频预测深度学习综述论文】A Review on Deep Learning Techniques for Video Prediction

专知会员服务

52+阅读 · 2020年4月15日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

【自监督学习深度神经网络视觉特征学习综述论文】Self-supervised Visual Feature Learning with Deep Neural Networks: A Survey

专知会员服务

87+阅读 · 2020年3月1日