Although backdoor learning is an active research topic in the NLP domain, the literature lacks studies that systematically categorize and summarize backdoor attacks and defenses. To bridge the gap, we present a comprehensive and unifying study of backdoor learning for NLP by summarizing the literature in a systematic manner. We first present and motivate the importance of backdoor learning for building robust NLP systems. Next, we provide a thorough account of backdoor attack techniques, their applications, defenses against backdoor attacks, and various mitigation techniques to remove backdoor attacks. We then provide a detailed review and analysis of evaluation metrics, benchmark datasets, threat models, and challenges related to backdoor learning in NLP. Ultimately, our work aims to crystallize and contextualize the landscape of existing literature in backdoor learning for the text domain and motivate further research in the field. To this end, we identify troubling gaps in the literature and offer insights and ideas into open challenges and future research directions. Finally, we provide a GitHub repository with a list of backdoor learning papers that will be continuously updated at https://github.com/marwanomar1/Backdoor-Learning-for-NLP.
翻译:虽然后门学习是国家劳工政策领域一个积极的研究专题,但文献缺乏系统分类和总结后门攻击和防御的研究。为弥合差距,我们通过系统总结文献,为国家劳工政策提出全面、统一的后门学习研究。我们首先介绍并激励后门学习对于建设强有力的国家劳工政策系统的重要性。接着,我们全面介绍后门攻击技术、其应用、防止后门攻击的防御和各种消除后门攻击的缓解技术。然后我们详细审查和分析评价指标、基准数据集、威胁模型和与国家劳工政策中的后门学习有关的挑战。最终,我们的工作目标是将现有的后门学习文本领域的文献背景化和背景化,并激励该领域的进一步研究。为此,我们查明文献中的令人不安的差距,并就公开的挑战和未来研究方向提供见解和想法。最后,我们向GitHub存放一个后门学习文件的清单,将在https://github.com/maro1/Back-Lasow-Lestor-Lestor上不断更新。