This paper proposes a new method for assessing differential item functioning (DIF) in item response theory (IRT) models. The method does not require pre-specification of anchor items, which is its main virtue. It is developed in two main steps, first by showing how DIF can be re-formulated as a problem of outlier detection in IRT-based scaling, then tackling the latter using established methods from robust statistics. The proposal is a redescending M-estimator of IRT scaling parameters that is tuned to flag items with DIF at the desired asymptotic Type I Error rate during estimation. Theoretical results guarantee that the method performs reasonably well when fewer than one-half of the items on a test exhibit DIF. Data simulations show that the proposed method compares favorably to currently available approaches, and a real data example illustrates its application in a research context where pre-specification of anchor items is infeasible. The focus of the paper is the two-parameter logistic model in two independent groups, with extensions to other settings considered in the conclusion.
翻译:本文件提出了在项目响应理论(IRT)模型中评估不同项功能的新方法。该方法并不要求预先确定锁定项目,这是其主要优点。该方法分为两个主要步骤:首先,表明如何重新拟订DIF,将其作为在IRT基础上进行异常测量的一个问题,然后利用可靠统计数字的既定方法处理后者。该提议是重新确定IRT比例参数的M-估计器,该参数在估计期间按理想的无症状类型I错误率对带有DIF的项目进行标记。理论结果保证该方法在试验展览DIF不到一半的情况下运行得合理良好。数据模拟表明,拟议的方法比现有方法要好,而一个真实的数据实例则表明,在无法对锁定项目进行预先测定的研究环境中应用该方法。该文件的重点是两个独立组的两个参数的物流模型,在结论中考虑了其他环境的扩展。