Click-Through Rate (CTR) prediction is critical for industrial recommender systems, where most deep CTR models follow an Embedding \& Feature Interaction paradigm. However, the majority of methods focus on designing network architectures to better capture feature interactions while the feature embedding, especially for numerical features, has been overlooked. Existing approaches for numerical features are difficult to capture informative knowledge because of the low capacity or hard discretization based on the offline expertise feature engineering. In this paper, we propose a novel embedding learning framework for numerical features in CTR prediction (AutoDis) with high model capacity, end-to-end training and unique representation properties preserved. AutoDis consists of three core components: meta-embeddings, automatic discretization and aggregation. Specifically, we propose meta-embeddings for each numerical field to learn global knowledge from the perspective of field with a manageable number of parameters. Then the differentiable automatic discretization performs soft discretization and captures the correlations between the numerical features and meta-embeddings. Finally, distinctive and informative embeddings are learned via an aggregation function. Comprehensive experiments on two public and one industrial datasets are conducted to validate the effectiveness of AutoDis. Moreover, AutoDis has been deployed onto a mainstream advertising platform, where online A/B test demonstrates the improvement over the base model by 2.1% and 2.7% in terms of CTR and eCPM, respectively. In addition, the code of our framework is publicly available in MindSpore(https://gitee.com/mindspore/mindspore/tree/master/model_zoo/research/recommend/autodis).
翻译:点击- 浏览率( CTR) 预测对于工业推荐系统至关重要, 最深的 CTR 模型都遵循嵌入式的“ 功能互动” 模式。 然而, 大多数方法侧重于设计网络结构以更好地捕捉特征互动, 而功能嵌入, 特别是数字特性则被忽略。 数字特征的现有方法很难捕捉信息知识, 因为基于离线专业特征工程的容量低或硬离散。 在本文中, 我们提议为CTR 预测( AutoDis) 中的数字特征建立一个创新的嵌入学习框架, 具有高模型能力、 端到端培训和保存独特的演示属性。 自动Dision由三个核心组成部分组成: 元组合、 自动离散化和聚合。 具体地说, 我们建议每个数字字段的元组合, 以可调控的参数从字段的角度学习全球知识。 然后, 不同的自动离散化运行软离解功能, 并捕捉到数字特征特性和元编集/ 。 最后, 通过汇总的 egnal eal adal adal developmental developmental developmental developmental developmental deal developmental development the the ex developmental developmental developmental developmental deal deal deal deal deal deal deal deal deal deal deal deal deal deal deal deal deal dealdaldaldaldald sild saldaldaldaldaldmental.