Video annotation is a crucial process in computer science and social science alike. Many video annotation tools (VATs) offer a wide range of features to support annotation. We conducted an extensive survey of over 59 VATs and interviewed interdisciplinary researchers to evaluate their usability. Our findings suggest that most current VATs have overwhelming user interfaces, poor interaction techniques, and difficult-to-understand features. These issues often lead to longer annotation times, label inconsistencies, and user fatigue. We introduce FEVA, a video annotation tool with streamlined interaction techniques and a dynamic interface that makes labeling tasks easy and fast. FEVA focuses on speed, accuracy, and simplicity to make annotation quick, consistent, and straightforward. For example, annotators can control the speed and direction of the video and mark the onset and offset of a label in real time with single key presses. In our user study, FEVA users required, on average, 36% less interaction than with the most popular annotation tools (Advene, ANVIL, ELAN, VIA, and VIAN). The participants (N=32) rated FEVA as more intuitive and less mentally demanding. The code and demo are available at http://www.snehesh.com/feva.