Click-Through Rate (CTR) prediction is a core task in nowadays commercial recommender systems. Feature crossing, as the mainline of research on CTR prediction, has shown a promising way to enhance predictive performance. Even though various models are able to learn feature interactions without manual feature engineering, they rarely attempt to individually learn representations for different feature structures. In particular, they mainly focus on the modeling of cross sparse features but neglect to specifically represent cross dense features. Motivated by this, we propose a novel Extreme Cross Network, abbreviated XCrossNet, which aims at learning dense and sparse feature interactions in an explicit manner. XCrossNet as a feature structure-oriented model leads to a more expressive representation and a more precise CTR prediction, which is not only explicit and interpretable, but also time-efficient and easy to implement. Experimental studies on Criteo Kaggle dataset show significant improvement of XCrossNet over state-of-the-art models on both effectiveness and efficiency.
翻译:点击浏览率(CTR)预测是当今商业推荐人系统中的一项核心任务。作为CTR预测研究的主线,特征跨线是提高预测性能的一个很有希望的方法。尽管各种模型能够学习特征互动,而没有手工特征工程,但它们很少试图单独学习不同特征结构的描述。特别是,它们主要侧重于交叉分散特征的建模,但忽视具体代表交叉密集特征。为此,我们提议建立一个新型的极端交叉网络,缩略 XCrossNet,目的是以明确的方式学习密集和稀少特征的相互作用。XCrossNet作为一种以特征结构为导向的模型,可以带来更清晰的表述和更精确的CTR预测,不仅清晰和可解释,而且具有时间效率和易于执行。关于Crito Kagle数据集的实验研究表明,XCrossNet在效力和效率两方面都大大改进了最新模型的XCrossNet。