To protect user privacy and comply with legal regulations, federated learning (FL) is attracting significant attention. Training neural machine translation (NMT) models with traditional FL algorithms (e.g., FedAvg) typically relies on multi-round model-based interactions. However, this is impractical and inefficient for machine translation tasks due to the vast communication overhead and heavy synchronization requirements. In this paper, we propose a novel federated nearest neighbor (FedNN) machine translation framework that, instead of multi-round model-based interactions, leverages a one-round memorization-based interaction to share knowledge across different clients and build low-overhead privacy-preserving systems. The approach equips the public NMT model, trained on large-scale accessible data, with a $k$-nearest-neighbor ($k$NN) classifier and integrates the external datastores constructed from the private text data of all clients to form the final FL model. A two-phase datastore encryption strategy is introduced to preserve privacy during this process. Extensive experiments show that FedNN significantly reduces computational and communication costs compared with FedAvg, while maintaining promising performance in different FL settings.
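To make the memorization-based mechanism concrete, the sketch below (not the authors' implementation; the function names `build_datastore`, `knn_probs`, `interpolate`, and the parameters `k`, `temperature`, `lambda_` are illustrative assumptions) shows the standard $k$NN-MT-style recipe the abstract builds on: decoder hidden states act as keys into a datastore of (hidden state, target token) pairs built from a client's private text, and the retrieved neighbors yield a token distribution that is mixed with the base NMT distribution.

```python
# Minimal sketch of kNN-augmented translation prediction, assuming a datastore
# of (decoder hidden state, gold next token) pairs built from private client data.
import numpy as np

def build_datastore(hidden_states, target_tokens):
    """Store decoder hidden states as keys and gold next tokens as values."""
    keys = np.asarray(hidden_states, dtype=np.float32)   # shape (N, d)
    values = np.asarray(target_tokens, dtype=np.int64)   # shape (N,)
    return keys, values

def knn_probs(query, keys, values, vocab_size, k=4, temperature=10.0):
    """Brute-force k-nearest-neighbor search, softmax over negative distances."""
    dists = np.sum((keys - query) ** 2, axis=1)           # squared L2 distances
    nn = np.argsort(dists)[:k]                            # indices of k nearest keys
    weights = np.exp(-dists[nn] / temperature)
    weights /= weights.sum()
    probs = np.zeros(vocab_size)
    for w, v in zip(weights, values[nn]):
        probs[v] += w                                      # accumulate weight per token
    return probs

def interpolate(p_nmt, p_knn, lambda_=0.5):
    """Mix the retrieval-based and model-based next-token distributions."""
    return lambda_ * p_knn + (1.0 - lambda_) * p_nmt

# Toy usage: 3-token vocabulary, 2-dimensional hidden states.
keys, values = build_datastore([[0.1, 0.2], [0.9, 0.8], [0.15, 0.25]], [2, 1, 2])
p_knn = knn_probs(np.array([0.12, 0.22]), keys, values, vocab_size=3, k=2)
p_nmt = np.array([0.6, 0.3, 0.1])
print(interpolate(p_nmt, p_knn))
```

In the federated setting described by the abstract, only such datastores (after the two-phase encryption step) would need to be shared in a single round, rather than model parameters over many rounds.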