Gaze tracking is a useful human-to-computer interface, which plays an increasingly important role in a range of mobile applications. Gaze calibration is an indispensable component of gaze tracking, which transforms the eye coordinates to the screen coordinates. The existing approaches of gaze tracking either have limited accuracy or require the user's cooperation in calibration and in turn hurt the quality of experience. We in this paper propose vGaze, continuous gaze tracking with implicit saliency-aware calibration on mobile devices. The design of vGaze stems from our insight on the temporal and spatial dependent relation between the visual saliency and the user's gaze. vGaze is implemented as a light-weight software that identifies video frames with "useful" saliency information, sensing the user's head movement, performs opportunistic calibration using only those "useful" frames, and leverages historical information for accelerating saliency detection. We implement vGaze on a commercial mobile device and evaluate its performance in various scenarios. The results show that vGaze can work at real time with video playback applications. The average error of gaze tracking is 1.51 cm (2.884 degree) which decreases to 0.99 cm (1.891 degree) with historical information and 0.57 cm (1.089 degree) with an indicator.
翻译:Gaze 跟踪是一种有用的人与计算机界面,它在一系列移动应用中发挥着越来越重要的作用。 Gaze 校准是凝视跟踪的一个不可或缺的组成部分,它将眼坐标转换为屏幕坐标。现有的凝视跟踪方法要么准确性有限,要么要求用户在校准方面进行合作,从而损害经验的质量。我们在本文件中建议 vGaze, 连续凝视跟踪,在移动设备上以隐含的显著性能校准方式进行。 vGaze 的设计源于我们对视觉突出度与用户的凝视之间的时间和空间依赖关系的洞察。 vGaze 是一个轻量软件,用于识别带有“有用”显著信息的视频框架,感测用户头部运动,仅使用这些“有用”框架进行机会校准,并利用历史信息加速显要度检测。我们用商业移动设备进行VGaze,并评估其在不同情况下的性能。结果显示, vGaze 可以在实时回放应用中工作。VGaze 的视觉跟踪平均误差为1.51厘米(284米) 和0.98米(0.98米) 。