Eye tracking has become an essential human-machine interaction modality for providing immersive experience in numerous virtual and augmented reality (VR/AR) applications desiring high throughput (e.g., 240 FPS), small-form, and enhanced visual privacy. However, existing eye tracking systems are still limited by their: (1) large form-factor largely due to the adopted bulky lens-based cameras; and (2) high communication cost required between the camera and backend processor, thus prohibiting their more extensive applications. To this end, we propose a lensless FlatCam-based eye tracking algorithm and accelerator co-design framework dubbed EyeCoD to enable eye tracking systems with a much reduced form-factor and boosted system efficiency without sacrificing the tracking accuracy, paving the way for next-generation eye tracking solutions. On the system level, we advocate the use of lensless FlatCams to facilitate the small form-factor need in mobile eye tracking systems. On the algorithm level, EyeCoD integrates a predict-then-focus pipeline that first predicts the region-of-interest (ROI) via segmentation and then only focuses on the ROI parts to estimate gaze directions, greatly reducing redundant computations and data movements. On the hardware level, we further develop a dedicated accelerator that (1) integrates a novel workload orchestration between the aforementioned segmentation and gaze estimation models, (2) leverages intra-channel reuse opportunities for depth-wise layers, and (3) utilizes input feature-wise partition to save activation memory size. On-silicon measurement validates that our EyeCoD consistently reduces both the communication and computation costs, leading to an overall system speedup of 10.95x, 3.21x, and 12.85x over CPUs, GPUs, and a prior-art eye tracking processor called CIS-GEP, respectively, while maintaining the tracking accuracy.
翻译:眼跟踪已成为一种至关重要的人体机器互动模式,用于在许多虚拟的、扩大的现实(VR/AR)应用程序中提供沉浸经验,这些应用程序需要高传输量(例如240个FPS)、小窗体和增强视觉隐私。然而,现有的眼跟踪系统仍然受到以下因素的限制:(1) 大型形式因素,主要由于采用大片镜头相机;(2) 相机和后端处理器之间需要高额的通信成本,从而禁止其更广泛的应用。为此,我们提议建立一个无镜可读的 FlatCam 眼跟踪算法和加速器共同设计框架,称为CEyeCoD,以便能够在不降低形式因素和增强系统效率的情况下进行眼睛跟踪系统跟踪,同时为下一代眼睛跟踪解决方案铺平道路。 在系统一级,我们主张使用无透镜的FlatCam, 从而便利移动眼跟踪系统中的小格式需求。在算法层面上,EyeCoD整合了预测的当前管道,首先预测区域利益(ROI)的深度数据,然后通过分路路段估算一个持续降低内部成本,然后为Glical deal deal dalal der 数据流进行预估测算。