Federated Learning Meets Natural Language Processing: A Survey

Federated Learning aims to learn machine learning models from multiple decentralized edge devices (e.g., mobile phones) or servers without sacrificing local data privacy. Recent Natural Language Processing techniques rely on deep learning and large pre-trained language models. However, both large deep neural networks and language models are trained on huge amounts of data, which usually resides on the server side. Since text data largely originates from end users, in this work we look into recent NLP models and techniques that use federated learning as the learning framework. Our survey discusses major challenges in federated natural language processing, including algorithmic challenges, system challenges, and privacy issues. We also provide a critical review of the existing Federated NLP evaluation methods and tools. Finally, we highlight the current research gaps and future directions.

Federated Learning

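Federated Learning is an emerging foundational technology in artificial intelligence. It was first proposed by Google in 2016, originally to address the problem of updating models locally on the devices of Android phone users. Its design goal is to carry out efficient machine learning across multiple participants or compute nodes while ensuring information security during large-scale data exchange, protecting device data and personal privacy, and remaining legally compliant. The machine learning algorithms that can be used in federated learning are not limited to neural networks; they also include other important algorithms such as random forests. Federated learning is expected to become the foundation of the next generation of collaborative AI algorithms and collaborative networks.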
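
To make the idea of learning across multiple participants without exchanging their data concrete, the following is a minimal sketch of federated averaging, a commonly used aggregation scheme that is not named in the text above and is assumed here purely for illustration. The model (a one-dimensional linear regression), the synthetic client data, and all function names (local_update, federated_average, train, make_clients) are hypothetical. Each simulated client trains on its own data and only the model parameters are sent back to be averaged.

```python
import random


def local_update(weights, data, lr=0.1, epochs=5):
    """Run a few gradient-descent steps on one client's private data.

    The model is y = w * x + b. Only (w, b) is returned to the server;
    the raw (x, y) pairs never leave this function.
    """
    w, b = weights
    n = len(data)
    for _ in range(epochs):
        # Gradients of the mean squared error over this client's data.
        grad_w = sum(2 * (w * x + b - y) * x for x, y in data) / n
        grad_b = sum(2 * (w * x + b - y) for x, y in data) / n
        w -= lr * grad_w
        b -= lr * grad_b
    return (w, b)


def federated_average(client_weights, client_sizes):
    """Average client parameters, weighted by each client's sample count."""
    total = sum(client_sizes)
    w = sum(cw[0] * n for cw, n in zip(client_weights, client_sizes)) / total
    b = sum(cw[1] * n for cw, n in zip(client_weights, client_sizes)) / total
    return (w, b)


def train(clients, rounds=50):
    """Alternate local training on every client with server-side averaging."""
    global_weights = (0.0, 0.0)
    for _ in range(rounds):
        updates = [local_update(global_weights, data) for data in clients]
        sizes = [len(data) for data in clients]
        global_weights = federated_average(updates, sizes)
    return global_weights


def make_clients(seed=0):
    """Hypothetical synthetic data: three clients sampling y = 2x + 1."""
    rng = random.Random(seed)
    clients = []
    for size in (20, 30, 50):
        xs = [rng.uniform(-1, 1) for _ in range(size)]
        clients.append([(x, 2.0 * x + 1.0) for x in xs])
    return clients


if __name__ == "__main__":
    w, b = train(make_clients())
    print(f"learned w={w:.3f}, b={b:.3f}")
```

This sketch leaves out everything a real deployment would need, such as client sampling, secure aggregation, and communication handling. It only illustrates the division of labor between local training and central averaging.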
