A well-known algorithm in privacy-preserving ML is differentially private stochastic gradient descent (DP-SGD). While this algorithm has been evaluated on text and image data, it has not been previously applied to ads data, which are notorious for their high class imbalance and sparse gradient updates. In this work we apply DP-SGD to several ad modeling tasks including predicting click-through rates, conversion rates, and number of conversion events, and evaluate their privacy-utility trade-off on real-world datasets. Our work is the first to empirically demonstrate that DP-SGD can provide both privacy and utility for ad modeling tasks.
翻译:在保护隐私方面众所周知的算法是差异化的私人随机梯度基底(DP-SGD ) 。 虽然这一算法已经对文本和图像数据进行了评估,但以前还没有应用于广告数据,因为广告数据因其高档不平衡和低梯度更新而臭名昭著。 在这项工作中,我们应用DP-SGD 来做一些模拟任务,包括预测点击率、换算率和转换事件的数量,并评估其在真实世界数据集上的隐私-利用权权衡。 我们的工作首先从经验上证明DP-SGD可以提供隐私和用于模拟任务的实用性。