Eye movements are known to reflect cognitive processes in reading, and psychological reading research has shown that eye gaze patterns differ between readers with and without dyslexia. In recent years, researchers have attempted to classify readers with dyslexia based on their eye movements using Support Vector Machines (SVMs). However, these approaches (i) are based on highly aggregated features averaged over all words read by a participant, thus disregarding the sequential nature of the eye movements, and (ii) do not consider the linguistic stimulus and its interaction with the reader's eye movements. In the present work, we propose two simple sequence models that process eye movements on the entire stimulus without the need of aggregating features across the sentence. Additionally, we incorporate the linguistic stimulus into the model in two ways -- contextualized word embeddings and manually extracted linguistic features. The models are evaluated on a Mandarin Chinese dataset containing eye movements from children with and without dyslexia. Our results show that (i) even for a logographic script such as Chinese, sequence models are able to classify dyslexia on eye gaze sequences, reaching state-of-the-art performance, and (ii) incorporating the linguistic stimulus does not help to improve classification performance.
翻译:已知眼睛运动反映了阅读过程中的认知过程,心理阅读研究表明,有阅读障碍的读者和没有阅读障碍的读者眼视模式不同,近年来,研究人员试图使用支持矢量机(SVMs)对阅读者眼运动进行阅读障碍分类,但是,这些方法(一) 是基于参与者阅读的所有单词的高度综合特征,从而无视眼睛运动的顺序性质,并且(二) 不考虑语言刺激及其与阅读者眼睛运动的相互作用。在目前的工作中,我们提议了两个简单的序列模型,处理整个刺激的动作,而不需要在整个句子上集成特征。此外,我们以两种方式将语言刺激纳入模型 -- -- 背景化的字嵌入和手动提取的语言特征。这些模型是在包含有和没有阅读障碍的儿童眼睛运动的中文数据集上进行评估的。我们的结果显示,(一) 即使是像中文这样的逻辑脚本,序列模型也能对眼视序列的动作进行分类,而无需综合整个句子特征。此外,我们用两种方法将语言刺激纳入模型。 (二) 将语言表现进行帮助改进语言刺激。