Low-quality listings and bad actor behavior in online retail websites threatens e-commerce business as these result in sub-optimal buying experience and erode customer trust. When a new listing is created, how to tell it has good-quality? Is the method effective, fast, and scalable? Previous approaches often have three limitations/challenges: (1) unable to handle cold start problems where new sellers/listings lack sufficient selling histories. (2) inability of scoring hundreds of millions of listings at scale, or compromise performance for scalability. (3) has space challenges from large-scale graph with giant e-commerce business size. To overcome these limitations/challenges, we proposed ColdGuess, an inductive graph-based risk predictor built upon a heterogeneous seller product graph, which effectively identifies risky seller/product/listings at scale. ColdGuess tackles the large-scale graph by consolidated nodes, and addresses the cold start problems using homogeneous influence1. The evaluation on real data demonstrates that ColdGuess has stable performance as the number of unknown features increases. It outperforms the lightgbm2 by up to 34 pcp ROC-AUC in a cold start case when a new seller sells a new product . The resulting system, ColdGuess, is effective, adaptable to changing risky seller behavior, and is already in production
翻译:在线零售网站的低质量列名和不良行为威胁到电子商务业务,因为这样会导致低劣的购买经验,并削弱客户信任。当新列名创建时,如何告诉它质量良好?方法是否有效、快速和可推广?以往的做法往往有三个限制/挑战:(1) 无法处理新销售者/名单缺乏足够销售历史的冷启动问题。(2) 无法在规模上赢得数亿列名或降低可缩放性能时处理冷启动问题。(3) 具有巨大电子商务规模的大型图表在空间方面遇到挑战。为克服这些限制/挑战,我们提议ColdGuess,一个基于直观图的风险预测器,建在多样化的卖方产品图上,该图能有效识别风险的卖方/产品/清单规模。ColdGuess通过合并节点处理大型的启动问题,并利用同质影响解决冷启动问题。1 对真实数据的评估表明,随着未知特征数量的增加,ColdGuess的性能稳定了稳定运行。为了克服这些限制/挑战,我们建议,ColdGuess,一个基于直方图的风险预测的风险预测的预测器建于34个变冷式产品,在新的销售系统中,而导致的冷质变换后的风险行为是一个新的产品。