How robust are cross-country comparisons of PISA scores to the scaling model used?

Publication date

2019-03-26T08:54:33Z

2019-03-26T08:54:33Z

2018-02

2019-03-26T08:54:33Z

Abstract

The Programme for International Student Assessment (PISA) is an important international study of 15‐olds' knowledge and skills. New results are released every 3 years, and have a substantial impact upon education policy. Yet, despite its influence, the methodology underpinning PISA has received significant criticism. Much of this criticism has focused upon the psychometric scaling model used to create the proficiency scores. The aim of this article is to therefore investigate the robustness of cross‐country comparisons of PISA scores to subtle changes to the underlying scaling model used. This includes the specification of the item‐response model, whether the difficulty and discrimination of items are allowed to vary across countries (item‐by‐country interactions) and how test questions not reached by pupils are treated. Our key finding is that these technical choices make little substantive difference to the overall country‐level results.

Document Type

Article


Accepted version

Language

English

Publisher

John Wiley & Sons

Related items

Versió postprint del document publicat a: https://doi.org/10.1111/emip.12211

Educational Measurement-Issues And Practice, 2018, vol. 37, num. 4, p. 28-39

https://doi.org/10.1111/emip.12211

Recommended citation

This citation was generated automatically.

Rights

(c) National Council on Measurement in Education, 2018

This item appears in the following Collection(s)