We aim to develop an AI agent that can watch video clips and have a conversation with human about the video story. Developing video understanding intelligence is a significantly challenging task, and evaluation methods for adequately measuring and analyzing the progress of AI agent are lacking as well. In this paper, we propose the Video Turing Test to provide effective and practical assessments of video understanding intelligence as well as human-likeness evaluation of AI agents. We define a general format and procedure of the Video Turing Test and present a case study to confirm the effectiveness and usefulness of the proposed test.
翻译:我们的目标是开发一个能够观看视频剪辑并与人就视频故事进行交谈的AI代理机构; 开发视频理解智能是一项艰巨的任务,也缺乏适当衡量和分析AI代理机构进展情况的评估方法; 在本文中,我们提议视频图灵测试,以便对AI代理机构视频理解智能以及人性评估进行有效和实用的评估; 我们界定视频图灵测试的一般格式和程序,并提交一份案例研究,以确认拟议测试的有效性和有用性。