Item Response Theory (IRT) is a ubiquitous model for understanding human behaviors and attitudes based on their responses to questions. Large modern datasets offer opportunities to capture more nuances in human behavior, potentially improving psychometric modeling leading to improved scientific understanding and public policy. However, while larger datasets allow for more flexible approaches, many contemporary algorithms for fitting IRT models may also have massive computational demands that forbid real-world application. To address this bottleneck, we introduce a variational Bayesian inference algorithm for IRT, and show that it is fast and scalable without sacrificing accuracy. Applying this method to five large-scale item response datasets from cognitive science and education yields higher log likelihoods and higher accuracy in imputing missing data than alternative inference algorithms. Using this new inference approach we then generalize IRT with expressive Bayesian models of responses, leveraging recent advances in deep learning to capture nonlinear item characteristic curves (ICC) with neural networks. Using an eigth-grade mathematics test from TIMSS, we show our nonlinear IRT models can capture interesting asymmetric ICCs. The algorithm implementation is open-source, and easily usable.
翻译:大型现代数据集提供了获取人类行为中更多细微差别的机会,有可能改进心理模型,从而改善科学理解和公共政策。然而,虽然更大的数据集允许采取更灵活的方法,但许多适合光学模型的当代算法也可能具有巨大的计算要求,禁止真实世界应用。为了解决这一瓶颈问题,我们为光学阵引入了一种变异的贝氏推断算法,并表明该方法在不牺牲准确性的情况下是快速和可缩放的。将这种方法应用到认知科学和教育的5个大型项目响应数据集中,比替代的推断算法在估计缺失数据方面产生更高的日志概率和准确性。我们利用这种新的推论方法,然后将光学阵(IRT)与直观的贝氏反应模型普遍化,利用近期的深层次学习进展,用神经网络来捕捉非线性物品特征曲线(ICC),并显示它在不牺牲准确性的情况下是快速和可缩缩放的。我们展示了非线性光源的IRT模型可以捕捉取。