While the application of differential privacy (DP) has been well-studied in cross-device federated learning (FL), there is a lack of work considering DP and its implications for cross-silo FL, a setting characterized by a limited number of clients each containing many data subjects. In cross-silo FL, usual notions of client-level DP are less suitable as real-world privacy regulations typically concern the in-silo data subjects rather than the silos themselves. In this work, we instead consider an alternative notion of silo-specific sample-level DP, where silos set their own privacy targets for their local examples. Under this setting, we reconsider the roles of personalization in federated learning. In particular, we show that mean-regularized multi-task learning (MR-MTL), a simple personalization framework, is a strong baseline for cross-silo FL: under stronger privacy requirements, silos are incentivized to federate more with each other to mitigate DP noise, resulting in consistent improvements relative to standard baseline methods. We provide an empirical study of competing methods as well as a theoretical characterization of MR-MTL for mean estimation, highlighting the interplay between privacy and cross-silo data heterogeneity. Our work serves to establish baselines for private cross-silo FL as well as identify key directions of future work in this area.
翻译:虽然不同隐私(DP)的应用在跨行业联合学习(FL)中得到了很好的研究,但是没有考虑到DP及其对于跨SIlo FL的影响,这种背景的特点是,每个包含许多数据主题的客户数量有限。在跨SIlo FL中,客户一级DP的通常概念不太适合,因为现实世界隐私条例通常涉及的是内部保密数据主题,而不是空地本身。在这项工作中,我们考虑的是单独地点抽样一级DP的替代概念,其中筒仓为自己的地方实例设定了自己的隐私目标。在这一背景下,我们重新考虑个人化在联合学习中的作用。特别是,我们表明,一个简单的个性化框架,即平均地常规化多任务学习(MR-MTL)是跨行业的坚实基线:在更强的隐私要求下,筒仓被鼓励更多地相互联结,以缓解DP的噪音,从而与标准基线方法相适应改进。我们提供了一个实证性研究,将个人化方法作为相互竞争的方法,作为MR-MTL的跨行业的理论性工作方向,作为我们隐私(MR-MTL)的跨领域的核心分析基础,作为我们未来隐私(MR-MMMMT-L)基线的理论领域的核心工作的基础和交叉分析。