Federated learning is an effective way of extracting insights from different user devices while preserving the privacy of users. However, new classes with completely unseen data distributions can stream across any device in a federated learning setting, whose data cannot be accessed by the global server or other users. To this end, we propose a unified zero-shot framework to handle these aforementioned challenges during federated learning. We simulate two scenarios here -- 1) when the new class labels are not reported by the user, the traditional FL setting is used; 2) when new class labels are reported by the user, we synthesize Anonymized Data Impressions by calculating class similarity matrices corresponding to each device's new classes followed by unsupervised clustering to distinguish between new classes across different users. Moreover, our proposed framework can also handle statistical heterogeneities in both labels and models across the participating users. We empirically evaluate our framework on-device across different communication rounds (FL iterations) with new classes in both local and global updates, along with heterogeneous labels and models, on two widely used audio classification applications -- keyword spotting and urban sound classification, and observe an average deterministic accuracy increase of ~4.041% and ~4.258% respectively.
翻译:联邦学习是从不同用户设备中提取见解,同时保护用户隐私的有效方法。然而,具有完全不为人知的数据发布方式的新类别可以在联合学习环境中从任何设备中流出,而全球服务器或其他用户无法访问这些数据。为此,我们提议了一个统一零射框架,以便在联合学习期间处理上述挑战。我们在这里模拟两种情景 -- -- 1当用户不报告新类别标签时,使用传统的FL设置;2当用户报告新类别标签时,我们通过计算与每个设备的新类别相对应的类似类矩阵,并随后进行不受监督的分组,以区分不同用户的新类别。此外,我们提议的框架还可以处理参与用户在标签和模型中的统计差异性差异性。我们用经验评估了我们在不同通信周期(FLiel Iterations)中的新分类框架,以及本地和全球更新的新分类中的新类别,以及混类标签和模型,在两种广泛使用的音频分类应用程序 -- 关键词定位和城市声音分类中,4.2 并观察平均的精确度增加比例。