Gaussian smoothed sliced Wasserstein distance has been recently introduced for comparing probability distributions, while preserving privacy on the data. It has been shown, in applications such as domain adaptation, to provide performances similar to its non-private (non-smoothed) counterpart. However, the computational and statistical properties of such a metric is not yet been well-established. In this paper, we analyze the theoretical properties of this distance as well as those of generalized versions denoted as Gaussian smoothed sliced divergences. We show that smoothing and slicing preserve the metric property and the weak topology. We also provide results on the sample complexity of such divergences. Since, the privacy level depends on the amount of Gaussian smoothing, we analyze the impact of this parameter on the divergence. We support our theoretical findings with empirical studies of Gaussian smoothed and sliced version of Wassertein distance, Sinkhorn divergence and maximum mean discrepancy (MMD). In the context of privacy-preserving domain adaptation, we confirm that those Gaussian smoothed sliced Wasserstein and MMD divergences perform very well while ensuring data privacy.
翻译:高斯平滑的瓦瑟斯坦平滑片段距离最近被引入,以比较概率分布,同时保留数据隐私。在域适应等应用中,显示提供类似于非私有(非悬浮)对应方的性能。然而,尚未很好地确定这种计量的计算和统计属性。在本文件中,我们分析了这一距离的理论属性以及通用版本的理论属性,这些通用版本的理论属性被标为高斯平滑的切片差异。我们表明,光滑和切片保存了计量属性和薄弱的表层。我们还提供了这些差异的样本复杂性结果。因为隐私水平取决于高斯平滑的数量,我们分析了这一参数对差异的影响。我们支持我们的理论结论,对高斯平滑的瓦塞尔坦距离、辛克霍恩差异和最大平均值差异(MMMD)进行了经验性研究。在维护隐私的域适应方面,我们确认高斯平滑切片瓦列斯坦和MMD的隐私差异在确保数据运行良好的情况下,这些数据运行良好。