Building open-domain conversational systems (or chatbots) that produce convincing responses is a recognized challenge. Recent state-of-the-art (SoTA) transformer-based models for the generation of natural language dialogue have demonstrated impressive performance in simulating human-like, single-turn conversations in English. This work investigates, by an empirical study, the potential for transfer learning of such models to Swedish language. DialoGPT, an English language pre-trained model, is adapted by training on three different Swedish language conversational datasets obtained from publicly available sources. Perplexity score (an automated intrinsic language model metric) and surveys by human evaluation were used to assess the performances of the fine-tuned models, with results that indicate that the capacity for transfer learning can be exploited with considerable success. Human evaluators asked to score the simulated dialogue judged over 57% of the chatbot responses to be human-like for the model trained on the largest (Swedish) dataset. We provide the demos and model checkpoints of our English and Swedish chatbots on the HuggingFace platform for public use.
翻译:建立能产生令人信服的反应的开放对话系统(或聊天机)是一个公认的挑战。最近为产生自然语言对话而采用的最新最先进的基于艺术的变压器模型在模拟英语人样的单回合对话方面表现出了令人印象深刻的成绩。这项工作通过一项经验研究调查了将这种模型的学习能力转移至瑞典语的可能性。英语预先培训的DialoGPT模式,通过培训从公开来源获得的三种不同的瑞典语言对话数据集而加以调整。常识分(自动内在语言模型)和人类评估调查被用来评估精细调模型的性能,结果表明转移学习的能力可以相当成功地加以利用。人类评价者要求将模拟对话评分到超过57%的聊天机反应中,以便像人类一样在最大的(瑞典语)数据集上培训的模型。我们提供了我们英语和瑞典语聊天机在HuggingFace平台上供公众使用的演示和示范检查站。