We present the InterviewBot that dynamically integrates conversation history and customized topics into a coherent embedding space to conduct 10 mins hybrid-domain (open and closed) conversations with foreign students applying to U.S. colleges for assessing their academic and cultural readiness. To build a neural-based end-to-end dialogue model, 7,361 audio recordings of human-to-human interviews are automatically transcribed, where 440 are manually corrected for finetuning and evaluation. To overcome the input/output size limit of a transformer-based encoder-decoder model, two new methods are proposed, context attention and topic storing, allowing the model to make relevant and consistent interactions. Our final model is tested both statistically by comparing its responses to the interview data and dynamically by inviting professional interviewers and various students to interact with it in real-time, finding it highly satisfactory in fluency and context awareness.
翻译:我们提出了InterviewBot,它将对话历史和定制话题动态地集成到一致的嵌入空间中,以进行10分钟的混合领域(开放和封闭)与申请美国大学的外国学生进行面试,以评估他们的学术和文化准备情况。为了构建基于神经元的端到端对话模型,我们自动转录了7,361个人对人的面试音频录音,其中440个进行了手动纠正以供微调和评估。为了克服变压器编码器解码器模型的输入/输出大小限制,我们提出了两种新方法,上下文注意力和主题存储,使模型可以进行相关和一致的交互。我们的最终模型通过比较其响应面试数据来进行统计测试,并邀请专业面试官和各种学生在实时环境中与其交互来进行动态测试,在流畅性和上下文意识方面都达到了高度的满意度。