Code-Switching, a common phenomenon in written text and conversation, has been studied over decades by the natural language processing (NLP) research community. Initially, code-switching is intensively explored by leveraging linguistic theories and, currently, more machine-learning oriented approaches to develop models. We introduce a comprehensive systematic survey on code-switching research in natural language processing to understand the progress of the past decades and conceptualize the challenges and tasks on the code-switching topic. Finally, we summarize the trends and findings and conclude with a discussion for future direction and open questions for further investigation.
翻译:密码转换是书面文本和谈话中常见的现象,几十年来,自然语言处理研究界一直在研究这种现象,最初,通过利用语言理论和目前更注重机器学习的方法开发模型,对代码转换进行深入探讨,我们采用对自然语言处理中代码转换研究的全面系统调查,以了解过去几十年的进展,并构思代码转换专题的挑战和任务。最后,我们总结趋势和结论,最后讨论未来方向和开放问题,供进一步调查。