Country comparisons using standardized test scores may in some cases be misleading unless we make sure that the potential sample selection bias created by drop-outs and non-enrollment patterns does not alter the analysis. In this paper, I propose an answer to this issue which consists of identifying the counterfactual distribution of achievement (I mean the distribution of achievement if there was hypothetically no selection) from the observed distribution of achievements. International comparison measures like means, quantiles, and inequality measures have to be computed using that counterfactual distribution which is statistically closer to the observed one for a low proportion of out-of-school children. I identify the quantiles of that latent distribution by readjusting the percentile levels of the observed quantile function of achievement. Because the data on test scores is by nature truncated, I have to rely on auxiliary data to borrow identification power. I finally applied my method to compute selection corrected means using PISA 2018 and PASEC 2019 and I found that ranking/comparisons can change.
翻译:暂无翻译