A key problem in a variety of applications is that of domain adaptation from a public source domain, for which a relatively large amount of labeled data with no privacy constraints is at one's disposal, to a private target domain, for which a private sample is available with very few or no labeled data. In regression problems with no privacy constraints on the source or target data, a discrepancy minimization algorithm based on several theoretical guarantees was shown to outperform a number of other adaptation algorithm baselines. Building on that approach, we design differentially private discrepancy-based algorithms for adaptation from a source domain with public labeled data to a target domain with unlabeled private data. The design and analysis of our private algorithms critically hinge upon several key properties we prove for a smooth approximation of the weighted discrepancy, such as its smoothness with respect to the $\ell_1$-norm and the sensitivity of its gradient. Our solutions are based on private variants of Frank-Wolfe and Mirror-Descent algorithms. We show that our adaptation algorithms benefit from strong generalization and privacy guarantees and report the results of experiments demonstrating their effectiveness.
翻译:各种应用中的一个关键问题是,从公共源域到私人目标域的域性适应问题,对于公共源域而言,有相当大数量没有隐私限制的贴标签数据可供个人处置,而私人目标域则可供个人使用,只有极少或没有贴标签数据。在对源或目标数据没有隐私限制的回归问题中,基于若干理论保证的差别最小化算法超过了其他适应算法的基线。在这种方法的基础上,我们设计了基于差异的私人算法,从具有公共标签数据的来源域到具有未贴标签私人数据的目标域的适应。我们私人算法的设计和分析关键取决于若干关键属性,我们证明它们能够顺利地接近加权差异,例如其对美元-美元-美元-美元-诺尔米及其梯度的敏感性。我们的解决办法以弗兰克-沃勒费和镜子-白日算法的私人变体为基础。我们表明,我们的适应算法受益于强有力的一般化和隐私保障,并报告实验结果,以证明其有效性。