In addition to the high cost and complex setup, the main reason for the limitation of the three-dimensional (3D) display is the problem of accurately estimating the user's current point-of-gaze (PoG) in a 3D space. In this paper, we present a novel noncontact technique for the PoG estimation in a stereoscopic environment, which integrates a 3D stereoscopic display system and an eye-tracking system. The 3D stereoscopic display system can provide users with a friendly and immersive high-definition viewing experience without wearing any equipment. To accurately locate the user's 3D PoG in the field of view, we build a regression-based 3D eye-tracking model with the eye movement data and stereo stimulus videos as input. Besides, to train an optimal regression model, we also design and annotate a dataset that contains 30 users' eye-tracking data corresponding to two designed stereo test scenes. Innovatively, this dataset introduces feature vectors between eye region landmarks for the gaze vector estimation and a combined feature set for the gaze depth estimation. Moreover, five traditional regression models are trained and evaluated based on this dataset. Experimental results show that the average errors of the 3D PoG are about 0.90~cm on the X-axis, 0.83~cm on the Y-axis, and 1.48~cm$/$0.12~m along the Z-axis with the scene-depth range in 75~cm$/$8~m, respectively.
翻译:除了高成本和复杂设置外,三维(3D)显示限制3维(3D)显示的主要原因在于准确估计用户在3D空间的当前加热点(PoG)的问题。在本文中,我们展示了一种新型的非接触技术,用于在立体环境中对PoG进行估计,其中包括3D立体立体显影系统和眼睛跟踪系统。3D立体立体显示系统可以为用户提供友好和隐蔽的高清晰度查看经验,而无需佩戴任何设备。为了准确确定用户在视野领域的3D PoG目前加热点(PoG)的位置,我们用眼运动数据和立体刺激视频作为输入建立一个基于回归的3D的3D眼跟踪模型。此外,为了培训最佳的回归模型,我们还设计了一套包含30个用户眼睛跟踪数据的数据集,与两个设计的立体测试场相匹配。创新地,该数据集引入了用于观测矢量矢量估计的眼区域标志和用于视觉深度估计的一组地段。1,在X-YC0的深度估计上,5个传统回归模型是经过训练和评估的。