Gaze-tracking is a novel way of interacting with computers which allows new scenarios, such as enabling people with motor-neuron disabilities to control their computers or doctors to interact with patient information without touching screen or keyboard. Further, there are emerging applications of gaze-tracking in interactive gaming, user experience research, human attention analysis and behavioral studies. Accurate estimation of the gaze may involve accounting for head-pose, head-position, eye rotation, distance from the object as well as operating conditions such as illumination, occlusion, background noise and various biological aspects of the user. Commercially available gaze-trackers utilize specialized sensor assemblies that usually consist of an infrared light source and camera. There are several challenges in the universal proliferation of gaze-tracking as accessibility technologies, specifically its affordability, reliability, and ease-of-use. In this paper, we try to address these challenges through the development of a hardware-agnostic gaze-tracker. We present a deep neural network architecture as an appearance-based method for constrained gaze-tracking that utilizes facial imagery captured on an ordinary RGB camera ubiquitous in all modern computing devices. Our system achieved an error of 1.8073cm on GazeCapture dataset without any calibration or device specific fine-tuning. This research shows promise that one day soon any computer, tablet, or phone will be controllable using just your eyes due to the prediction capabilities of deep neutral networks.
翻译:与计算机进行跟踪是一种新颖的互动方式,它使得新情况得以与计算机进行互动,例如,使运动中性残疾者能够控制其计算机或医生在不触摸屏幕或键盘的情况下与病人信息进行互动互动信息互动;此外,在互动游戏、用户经验研究、人类注意力分析和行为研究方面,正在出现视视跟踪应用。对视跟踪的准确估计可能涉及对头部、头部位置、眼旋转、与对象的距离以及操作条件进行核算,如用户的照明、中性隐蔽、背景噪音和各种生物方面。商业可用的视跟踪器使用专门传感器组件,通常由红外线光源和相机组成。在作为无障碍技术、特别是其可负担性、可靠性和使用便利性研究的普及方面,出现了若干挑战。在本文中,我们试图通过开发一个硬性神经跟踪跟踪器来应对这些挑战。我们提出一个深线网络架构,作为一种基于深视跟踪的外表基方法,利用普通RGB摄像网络的面图像进行专门监测,通常由红色光源源和相机组成。在普通RGB摄像机网络上进行跟踪,将利用这一固定的日常定位网络进行实时计算机定位系统,将显示所有特定的计算机定位定位的系统,将很快地显示。