2019-03-26T08:54:33Z
2019-03-26T08:54:33Z
2018-02
2019-03-26T08:54:33Z
The Programme for International Student Assessment (PISA) is an important international study of 15‐olds' knowledge and skills. New results are released every 3 years, and have a substantial impact upon education policy. Yet, despite its influence, the methodology underpinning PISA has received significant criticism. Much of this criticism has focused upon the psychometric scaling model used to create the proficiency scores. The aim of this article is to therefore investigate the robustness of cross‐country comparisons of PISA scores to subtle changes to the underlying scaling model used. This includes the specification of the item‐response model, whether the difficulty and discrimination of items are allowed to vary across countries (item‐by‐country interactions) and how test questions not reached by pupils are treated. Our key finding is that these technical choices make little substantive difference to the overall country‐level results.
Article
Accepted version
English
Rendiment acadèmic; Avaluació educativa; Mètode comparatiu; Academic achievement; Educational evaluation; Comparative method
John Wiley & Sons
Versió postprint del document publicat a: https://doi.org/10.1111/emip.12211
Educational Measurement-Issues And Practice, 2018, vol. 37, num. 4, p. 28-39
https://doi.org/10.1111/emip.12211
(c) National Council on Measurement in Education, 2018
Economia [1045]