Advertising click fraud detection plays one of the vital roles in current E-commerce websites as advertising is an essential component of its business model. It aims at, given a set of corresponding features, e.g., demographic information of users and statistical features of clicks, predicting whether a click is fraudulent or not in the community. Recent efforts attempted to incorporate attributed behavior sequence and heterogeneous network for extracting complex features of users and achieved significant effects on click fraud detection. In this paper, we propose a Multimodal and Contrastive learning network for Click Fraud detection (MCCF). Specifically, motivated by the observations on differences of demographic information, behavior sequences and media relationship between fraudsters and genuine users on E-commerce platform, MCCF jointly utilizes wide and deep features, behavior sequence and heterogeneous network to distill click representations. Moreover, these three modules are integrated by contrastive learning and collaboratively contribute to the final predictions. With the real-world datasets containing 2.54 million clicks on Alibaba platform, we investigate the effectiveness of MCCF. The experimental results show that the proposed approach is able to improve AUC by 7.2% and F1-score by 15.6%, compared with the state-of-the-art methods.
翻译:广告点击欺诈检测在当前电子商务网站中发挥着关键作用,因为广告是其商业模式的一个基本组成部分,因此,广告是其商业模式的基本组成部分。它的目标是,考虑到一系列相应的特征,例如用户的人口信息以及点击的统计特征,预测点击是否在社区中是欺诈的。最近努力试图纳入行为顺序和不同网络,以提取用户的复杂特征,并对点击欺诈检测产生重大影响。在本文中,我们建议建立一个多式和反式学习网络,用于点击欺诈检测。具体地说,根据关于人口信息差异、行为序列以及欺诈者与电子商务平台真正用户之间媒体关系差异的观察,CCF联合利用广泛而深层的特点、行为顺序和多样性网络来吸引点击演示。此外,这三个模块通过对比性学习和协作为最后预测作出贡献。在Alibaba平台上包含254万次点击的真实世界数据集,我们调查MCCFF的实效。实验结果显示,拟议的方法能够以7.2%和15.6%的F1-art方法改进AUC和F1-Art-15.6%的状态。