International large-scale assessments (ILSAs) play an important role in educational research and policy making. They collect valuable data on education quality and performance development across many education systems, giving countries the opportunity to share techniques, organisational structures and policies that have proven efficient and successful. To gain insights from ILSA data, we identify non-cognitive variables associated with students' academic performance. This problem has three analytical challenges: 1) students' academic performance is measured by cognitive items under a matrix sampling design; 2) there are often many missing values in the non-cognitive variables; and 3) multiple comparisons due to a large number of non-cognitive variables. We consider an application to data from the Programme for International Student Assessment (PISA), aiming to identify non-cognitive variables associated with students' performance in science. We formulate it as a variable selection problem under a latent variable model and further propose a knockoff method that conducts variable selection with a controlled error rate for false selections. Keywords: Model-X knockoffs, item response theory, missing data, variable selection, international large-scale assessment
翻译:国际大型评估(ILSA)在教育研究和政策制定中发挥着重要作用,收集了许多教育系统教育质量和绩效发展的宝贵数据,使各国有机会分享技术、组织结构和政策,这些都证明是有效和成功的。为了从ILSA数据中深入了解情况,我们从学生学术表现中找出与学生学术表现有关的非认知变量。这个问题有三个分析挑战:1)学生的学术表现在矩阵抽样设计下以认知项目衡量;2)非认知变量中往往有许多缺失值;和3)由于大量非认知变量而出现多重比较。我们考虑应用国际学生评估方案的数据,目的是查明与学生科学表现有关的非认知变量。我们在潜在变量模型下将其作为可变选择问题,并进一步提出一种取舍方法,以可控误差率选择错误。关键词:模型-X取错率、项目响应理论、缺失数据、变量选择、国际大规模评估。