Programme for International Student Assessment
|Purpose||Comparison of education attainment across the world|
|59 government education departments|
Head of the Early Childhood and Schools Division
|PISA Governing Body (Chair – Lorna Bertrand, England)|
The Programme for International Student Assessment (PISA) is a worldwide study by the Organisation for Economic Co-operation and Development (OECD) in member and non-member nations of 15-year-old school pupils' scholastic performance on mathematics, science, and reading. It was first performed in 2000 and then repeated every three years. It is done with a view to improving education policies and outcomes. It measures problem solving and cognition in daily life.
The 2012 version of the test involved 34 OECD countries and 31 partner countries, with a total of 510,000 participating students. The 2015 version of the test will be published in December 2016.
The Trends in International Mathematics and Science Study (TIMSS) and the Progress in International Reading Literacy Study (PIRLS) by the International Association for the Evaluation of Educational Achievement are similar studies.
- 1 Framework
- 2 Implementation
- 3 Method of testing
- 4 Results
- 5 Previous years
- 6 Reception
- 7 Research on possible causes of PISA disparities in different countries
- 8 See also
- 9 References
- 10 Further reading
- 11 External links
PISA stands in a tradition of international school studies, undertaken since the late 1950s by the International Association for the Evaluation of Educational Achievement (IEA). Much of PISA's methodology follows the example of the Trends in International Mathematics and Science Study (TIMSS, started in 1995), which in turn was much influenced by the U.S. National Assessment of Educational Progress (NAEP). The reading component of PISA is inspired by the IEA's Progress in International Reading Literacy Study (PIRLS).
The PISA mathematics literacy test asks students to apply their mathematical knowledge to solve problems set in real-world contexts. To solve the problems students must activate a number of mathematical competencies as well as a broad range of mathematical content knowledge. TIMSS, on the other hand, measures more traditional classroom content such as an understanding of fractions and decimals and the relationship between them (curriculum attainment). PISA claims to measure education's application to real-life problems and lifelong learning (workforce knowledge).
In the reading test, "OECD/PISA does not measure the extent to which 15-year-old students are fluent readers or how competent they are at word recognition tasks or spelling." Instead, they should be able to "construct, extend and reflect on the meaning of what they have read across a wide range of continuous and non-continuous texts."
PISA is sponsored, governed, and coordinated by the OECD.
Method of testing
The students tested by PISA are aged between 15 years and 3 months and 16 years and 2 months at the beginning of the assessment period. The school year pupils are in is not taken into consideration. Only students at school are tested, not home-schoolers. In PISA 2006, however, several countries also used a grade-based sample of students. This made it possible to study how age and school year interact.
To fulfill OECD requirements, each country must draw a sample of at least 5,000 students. In small countries like Iceland and Luxembourg, where there are fewer than 5,000 students per year, an entire age cohort is tested. Some countries used much larger samples than required to allow comparisons between regions.
Each student takes a two-hour handwritten test. Part of the test is multiple-choice and part involves fuller answers. There are six and a half hours of assessment material, but each student is not tested on all the parts. Following the cognitive test, participating students spend nearly one more hour answering a questionnaire on their background including learning habits, motivation, and family. School directors fill in a questionnaire describing school demographics, funding, etc. In 2012 the participants were, for the first time in the history of large-scale testing and assessments, offered a new type of problem, i.e. interactive (complex) problems requiring exploration of a novel virtual device.
In selected countries, PISA started experimentation with computer adaptive testing.
Countries are allowed to combine PISA with complementary national tests.
Germany does this in a very extensive way: On the day following the international test, students take a national test called PISA-E (E=Ergänzung=complement). Test items of PISA-E are closer to TIMSS than to PISA. While only about 5,000 German students participate in the international and the national test, another 45,000 take only the latter. This large sample is needed to allow an analysis by federal states. Following a clash about the interpretation of 2006 results, the OECD warned Germany that it might withdraw the right to use the "PISA" label for national tests.
From the beginning, PISA has been designed with one particular method of data analysis in mind. Since students work on different test booklets, raw scores must be 'scaled' to allow meaningful comparisons. Scores are thus scaled so that the OECD average in each domain (mathematics, reading and science) is 500 and the standard deviation is 100. This is true only for the initial PISA cycle when the scale was first introduced, though, subsequent cycles are linked to the previous cycles through IRT scale linking methods.
This generation of proficiency estimates is done using a latent regression extension of the Rasch model, a model of item response theory (IRT), also known as conditioning model or population model. The proficiency estimates are provided in the form of so-called plausible values, which allow unbiased estimates of differences between groups. The latent regression, together with the use of a Gaussian prior probability distribution of student competencies allows estimation of the proficiency distributions of groups of participating students. The scaling and conditioning procedures are described in nearly identical terms in the Technical Reports of PISA 2000, 2003, 2006. NAEP and TIMSS use similar scaling methods.
All PISA results are tabulated by country; recent PISA cycles have separate provincial or regional results for some countries. Most public attention concentrates on just one outcome: the mean scores of countries and their rankings of countries against one another. In the official reports, however, country-by-country rankings are given not as simple league tables but as cross tables indicating for each pair of countries whether or not mean score differences are statistically significant (unlikely to be due to random fluctuations in student sampling or in item functioning). In favorable cases, a difference of 9 points is sufficient to be considered significant.
PISA never combines mathematics, science and reading domain scores into an overall score. However, commentators have sometimes combined test results from all three domains into an overall country ranking. Such meta-analysis is not endorsed by the OECD, although official summaries sometimes use scores from a testing cycle's principal domain as a proxy for overall student ability.
PISA 2012 was presented on 3 December 2013, with results for around 510,000 participating students in all 34 OECD member countries and 31 partner countries. This testing cycle had a particular focus on mathematics, where the mean score was 494. A sample of 1,688 students from Puerto Rico took the assessment, scoring 379 in math, 404 in reading and 401 in science. A subgroup of 44 countries and economies with about 85 000 students also took part in an optional computer-based assessment of problem solving.
Shanghai had the highest score in all three subjects. It was followed by Singapore, Hong Kong, Chinese Taipei and Korea in mathematics; Hong Kong, Singapore, Japan and Korea in reading and Hong Kong, Singapore, Japan and Finland in science.
They were a sample of about 28 million in the same age group in 65 countries and economies, including the OECD countries, several Chinese cities, Vietnam, Indonesia and several countries in South America.
PISA 2012 was presented on 3 December 2013, with results for around 510,000 participating students in all 34 OECD member countries and 31 partner countries. This testing cycle had a particular focus on mathematics, where the mean score was 494. The mean score in reading was 496 and in science 501.
The results show distinct groups of high-performers in mathematics: the East Asian countries, with Shanghai, scoring the best result of 613, followed closely by Hong Kong, Japan, Chinese Taipei and South Korea. Among the Europeans, Liechtenstein and Switzerland performed best, with Netherlands, Estonia, Finland, Poland, Belgium, Germany, Austria all posting mathematics scores "not significantly statistically different from" one another. The United Kingdom, Ireland, Australia and New Zealand were similarly clustered around the OECD average of 494, with the USA trailing this group at 481.
Qatar, Kazakhstan and Malaysia were the countries which showed the greatest improvement in mathematics. The USA and the United Kingdom showed no significant change. Sweden had the greatest fall in mathematics performance over the last ten years, with a similar falling trend also in the two other subjects, and leading politicians in Sweden expressed great worry over the results.
On average boys scored better than girls in mathematics, girls scored better than boys in reading and the two sexes had quite similar scores in science.
Indonesia, Albania, Peru, Thailand and Colombia were the countries where most students reported being happy at school, while students in Korea, the Czech Republic, the Slovak Republic, Estonia and Finland reported least happiness.
|Period||Focus||OECD countries||Partner countries||Participating students||Notes|
|2000||Reading||28||4 + 11||265,000||The Netherlands disqualified from data analysis. 11 additional non-OECD countries took the test in 2002.|
|2003||Mathematics||30||11||275,000||UK disqualified from data analysis. Also included test in problem solving.|
|2006||Science||30||27||400,000||Reading scores for US disqualified from analysis due to misprint in testing materials.|
|2009||Reading||34||41 + 10||470,000||10 additional non-OECD countries took the test in 2010.|
China didn't participate as a nation in the 2012 test, but Shanghai, Hong Kong and Macau participated as separate entities. Shanghai has participated for the second time, topping the rankings in all three subjects, as well as improving scores in the subjects compared to the 2009 tests. Shanghai's score of 613 in mathematics was 113 points above the average score, putting the performance of Shanghai pupils about 3 school years ahead of pupils in average countries. Educational experts debated to which degree the result reflected the quality of the general educational system in China, pointing out that Shanghai has greater wealth and better-paid teachers than the rest of China. Hong Kong placed second in reading and science and third in maths.
China is expected to participate as a country in 2018, and will participate with 4 provinces in 2015. The four provinces will be Jiangsu, Guangdong, Beijing, and Shanghai with a total population of over 230 million.
Critics of PISA counter that in Shanghai and other Chinese cities, most children of migrant workers can only attend city schools up to the ninth grade, and must return to their parents' hometowns for high school due to hukou restrictions, thus skewing the composition of the city's high school students in favor of wealthier local families. A population chart of Shanghai reproduced in The New York Times shows a steep drop off in the number of 15-year-olds residing there. According to Schleicher, 27% of Shanghai's 15-year-olds are excluded from its school system (and hence from testing). As a result, the percentage of Shanghai's 15-year-olds tested by PISA was 73%, lower than the 89% tested in the US.
Finland, which received several top positions in the first tests, fell in all three subjects, but remained the best performing country overall in Europe, achieving their best result in science with 545 points (5th) and worst in mathematics with 519 (12th) in which the country was outperformed by four other European countries. The drop in mathematics was 25 points since 2003, the last time mathematics was the focus of the tests. For the first time Finnish girls outperformed boys in the subject, but only narrowly. It was also the first time pupils in Finnish-speaking schools did not perform better than pupils in Swedish-speaking schools. Minister of Education and Science Krista Kiuru expressed concern for the overall drop, as well as the fact that the number of low-performers had increased from 7% to 12%.
India participated in the 2009 round of testing but pulled out of the 2012 PISA testing, in August 2012, with the Indian government attributing its action to the unfairness of PISA testing to Indian students. The Indian Express reported on 9/3/2012 that "The ministry (of education) has concluded that there was a socio-cultural disconnect between the questions and Indian students. The ministry will write to the OECD and drive home the need to factor in India's "socio-cultural milieu". India's participation in the next PISA cycle will hinge on this". The Indian Express also noted that "Considering that over 70 nations participate in PISA, it is uncertain whether an exception would be made for India".
In June 2013, the Indian government, still concerned with the future prospect of fairness of PISA testing relating to Indian students, again pulled India out from the 2015 round of PISA testing.
Sweden's result dropped in all three subjects in the 2012 test, which was a continuation of a trend from 2006 and 2009. In mathematics, the nation had the sharpest fall in mathematic performance over 10 years among the countries that have participated in all tests, with a drop in score from 509 in 2003 to 478 in 2012. The score in reading showed a drop from 516 in 2000 to 483 in 2012. The country performed below the OECD average in all three subjects. The leader of the opposition, Social Democrat Stefan Löfven, described the situation as a national crisis. Along with the party's spokesperson on education, Ibrahim Baylan, he pointed to the downward trend in reading as most severe.
In the 2012 test, as in 2009, the result was slightly above average for the United Kingdom, with the science ranking being highest (20). England, Wales, Scotland and Northern Ireland also participated as separated entities, showing the worst result for Wales which in mathematics was 43 of the 65 countries and economies. Minister of Education in Wales Huw Lewis expressed disappointment in the results, said that there was no "quick fixes", but hoped that several educational reform that has been implented the last years would give better results in the next round of tests. The United Kingdom had a greater gap between high- and low-scoring students than the average. There was little difference between public and private schools when adjusted for socio-economic background of students. The gender difference in favour of girls was less than in most other countries, as was the difference between natives and immigrants.
Writing in the Daily Telegraph, Ambrose Evans-Pritchard warned against putting too much emphasis on the UK's international ranking, arguing that an overfocus on scholarly performances in East Asia might have contributed to the area's low birthrate, which he argued could harm the economic performance in the future more than a good PISA score would outweigh.
In 2013, the Times Educational Supplement (TES) published an article, "Is PISA Fundamentally Flawed?" by William Stewart, detailing serious critiques of PISA's conceptual foundations and methods advanced by statisticians at major universities.
In the article, Professor Harvey Goldstein of the University of Bristol was quoted as saying that when the OECD tries to rule out questions suspected of bias, it can have the effect of "smoothing out" key differences between countries. "That is leaving out many of the important things,” he warned. "They simply don't get commented on. What you are looking at is something that happens to be common. But (is it) worth looking at? PISA results are taken at face value as providing some sort of common standard across countries. But as soon as you begin to unpick it, I think that all falls apart."
Queen's University Belfast mathematician Dr. Hugh Morrison stated that he found the statistical model underlying PISA to contain a fundamental, insoluble mathematical error that renders Pisa rankings "valueless". Goldstein remarked that Dr. Morrison's objection highlights “an important technical issue” if not a “profound conceptual error”. However, Goldstein cautioned that PISA has been "used inappropriately", contending that some of the blame for this "lies with PISA itself. I think it tends to say too much for what it can do and it tends not to publicise the negative or the weaker aspects.” Professors Morrison and Goldstein expressed dismay at the OECD's response to criticism. Morrison said that when he first published his criticisms of PISA in 2004 and also personally queried several of the OECS's "senior people" about them, his points were met with “absolute silence” and have yet to be addressed. “I was amazed at how unforthcoming they were,” he told TES. “That makes me suspicious.” “Pisa steadfastly ignored many of these issues,” he says. “I am still concerned.”
Professor Kreiner[who?] agreed: “One of the problems that everybody has with PISA is that they don’t want to discuss things with people criticising or asking questions concerning the results. They didn’t want to talk to me at all. I am sure it is because they can’t defend themselves.
The American result of 2012 was average in science and reading, but lagged behind in mathematics compared to other developed nations. There was little change from the previous test in 2009. The result was described as “a picture of educational stagnation” by Education Secretary Arne Duncan, who said the result was not compatible with the American goal of having the world's best educated workers. Randi Weingarten of the American Federation of Teachers stated that an overemphasis on standardised tests contributed to the lack of improvement in education performance. Dennis Van Roekel of the National Education Association said a failure to address poverty among students had hampered progress.
About 9% of the U.S. students scored in the top two mathematics levels compared to 13% in all countries and economies.
For the first time, three U.S. states participated in the tests as separate entities, with Massachusetts scoring well above both the American and international average, particularly in reading. An approximate corresponding OECD ranking is shown along with the United States average.
Research on possible causes of PISA disparities in different countries
Although PISA and TIMSS officials and researchers themselves generally refrain from hypothesizing about the large and stable differences in student achievement between countries, since 2000, literature on the differences in PISA and TIMSS results and their possible causes has emerged. Data from PISA have furnished several economists, notably Eric Hanushek, Ludger Woessmann, Heiner Rindermann, and Stephen J. Ceci, with material for books and articles about the relationship between student achievement and economic development, democratization, and health; as well as the roles of such single educational factors as high-stakes exams, the presence or absence of private schools, and the effects and timing of ability tracking.
- Teaching And Learning International Survey (TALIS)
- Trends in International Mathematics and Science Study (TIMSS)
- Gender gaps in mathematics and reading in PISA 2009
- Berger, Kathleen. Invitation to The Life Span (second ed.). worth. ISBN 978-1-4641-7205-2.
- PISA 2012 Results in Focus (PDF), OECD, 3 December 2013, retrieved 4 December 2013
- "Programme for International Student Assessment (PISA)". The Council of Ministers of Education, Canada. Retrieved 2016-06-05.
- Hefling, Kimberly. "Asian nations dominate international test". Yahoo!.
- "Chapter 2 of the publication 'PISA 2003 Assessment Framework'" (pdf). Pisa.oecd.org.
- Keeley B. PISA, we have a problem… OECD Insights, April 2014.
- Poddiakov A.N. Complex Problem Solving at PISA 2012 and PISA 2015: Interaction with Complex Reality. // Translated from Russian. Reference to the original Russian text: Poddiakov, A. (2012.) Reshenie kompleksnykh problem v PISA-2012 i PISA-2015: vzaimodeistvie so slozhnoi real'nost'yu. Obrazovatel'naya Politika, 6, 34-53.
- C. Füller: Pisa hat einen kleinen, fröhlichen Bruder. taz, 5.12.2007 
- Stanat, P; Artelt, C; Baumert, J; Klieme, E; Neubrand, M; Prenzel, M; Schiefele, U; Schneider, W (2002), PISA 2000: Overview of the study—Design, method and results, Berlin: Max Planck Institute for Human Development
- Mazzeo, John; von Davier, Matthias (2013), Linking Scales in International Large-Scale Assessments, chapter 10 in Rutkowski, L. von Davier, M. & Rutkowski, D. (eds.) Handbook of International Large-Scale Assessment: Background, Technical Issues, and Methods of Data Analysis., New York: Chapman and Hall/CRC.
- von Davier, Matthias; Sinharay, Sandip (2013), Analytics in International Large-Scale Assessments: Item Response Theory and Population Models, chapter 7 in Rutkowski, L. von Davier, M. & Rutkowski, D. (eds.) Handbook of International Large-Scale Assessment: Background, Technical Issues, and Methods of Data Analysis., New York: Chapman and Hall/CRC.
- CB Online Staff. "PR scores low on global report card", Caribbean Business, September 26, 2014. Retrieved on January 3, 2015.
- OECD (2014): PISA 2012 results: Creative problem solving: Students’ skills in tackling real-life problems (Volume V), http://www.oecd-ilibrary.org/education/pisa-2012-results-skills-for-life-volume-v_9789264208070-en
- PISA 2012 Results OECD. Retrieved 4 December 2013
- Sedghi, Ami; Arnett, George; Chalabi, Mona (2013-12-03), Pisa 2012 results: which country does best at reading, maths and science?, The Guardian, retrieved 2013-02-14
- Adams, Richard (2013-12-03), Swedish results fall abruptly as free school revolution falters, The Guardian, retrieved 2013-12-03
- Kärrman, Jens (2013-12-03), Löfven om Pisa: Nationell kris, Dagens Nyheter, retrieved 2013-12-03
- Baldi, Stéphane; Jin, Ying; Skemer, Melanie; Green, Patricia J; Herget, Deborah; Xie, Holly (2007-12-10), Highlights From PISA 2006: Performance of U.S. 15-Year-Old Students in Science and Mathematics Literacy in an International Context (PDF), NCES, retrieved 2013-12-14,
PISA 2006 reading literacy results are not reported for the United States because of an error in printing the test booklets. Furthermore, as a result of the printing error, the mean performance in mathematics and science may be misestimated by approximately 1 score point. The impact is below one standard error.
- PISA 2009 Results: Executive Summary (PDF), OECD, 2010-12-07
- ACER releases results of PISA 2009+ participant economies, ACER, 2011-12-16
- Walker, Maurice (2011), PISA 2009 Plus Results (PDF), OECD, retrieved 2012-06-28
- Tom Phillips (3 December 2013) OECD education report: Shanghai's formula is world-beating The Telegraph. Retrieved 8 December 2013
- Helen Gao, "Shanghai Test Scores and the Mystery of the Missing Children", New York Times, January 23, 2014. For Schleicher's initial response to these criticisms see his post, "Are the Chinese Cheating in PISA Or Are We Cheating Ourselves?" on the OECD's website blog, Education Today, December 10, 2013.
- William Stewart, "More than a quarter of Shanghai pupils missed by international Pisa rankings", Times Educational Supplement, March 6, 2014.
- PISA 2012: Proficiency of Finnish youth declining University of Jyväskylä. Retrieved 9 December 2013
- Hemali Chhapia, TNN (3 August 2012). "India backs out of global education test for 15-year-olds". The Times of India.
- "Poor PISA score: Govt blames 'disconnect' with India". The Indian Express. 3 September 2012.
- "India chickens out of international students assessment programme again". The Times of India. 1 June 2013.
- Lars Näslund (3 December 2013) Svenska skolan rasar i stor jämförelse Expressen. Retrieved 4 December 2013 (Swedish)
- Jens Kärrman (3 December 2013) Löfven om Pisa: Nationell kris Dagens Nyheter. Retrieved 8 December 2013 (Swedish)
- Adams, Richard (2013-12-03), UK students stuck in educational doldrums, OECD study finds, The Guardian, retrieved 2013-12-04
- Pisa ranks Wales' education the worst in the UK BBC. 3 December 2013. Retrieved 4 December 2013.
- Ambrose Evans-Pritchard (3 December 2013) Ambrose Evans-Pritchard Telegraph.co.uk. Retrieved 4 December 2013.
- William Stewart, "Is Pisa fundamentally flawed?" Times Educational Supplement, July 26, 2013.
- Stewart, "Is PISA fundamentally flawed?" TES (2013).
- Motoko Rich (3 December 2013) American 15-Year-Olds Lag, Mainly in Math, on International Standardized Tests New York Times. Retrieved 4 December 2013
- Simon, Stephanie (2013-12-03), PISA results show "educational stagnation" in US, Politico, retrieved 2013-12-03
- Vaznis, James (2013-12-03), Mass. students excel on global examinations, Boston Globe, retrieved 2013-12-14
- 2012 Program for International Student Assessment (PISA) Results (PDF), Massachusetts Department of Education, retrieved 2014-12-11
- Hanushek, Eric A., and Ludger Woessmann. 2011. "The economics of international differences in educational achievement." In Handbook of the Economics of Education, Vol. 3, edited by Eric A. Hanushek, Stephen Machin, and Ludger Woessmann. Amsterdam: North Holland: 89–200.
- Hanushek, Eric; Woessmann, Ludger (2008), "The role of cognitive skills in economic development" (PDF), Journal of Economic Literature, 46 (3): 607–668, doi:10.1257/jel.46.3.607
- Rindermann, Heiner; Ceci, Stephen J (2009), "Educational policy and country outcomes in international cognitive competence studies", Perspectives on Psychological Science, 4 (6): 551–577, doi:10.1111/j.1745-6924.2009.01165.x
- Bishop, John H (1997), "The effect of national standards and curriculum-based exams on achievement", American Economic Review, 87 (2): 260–264
- Hanushek, Eric; Woessmann, Ludger (2006), "Does educational tracking affect performance and inequality? Differences-in-differences evidence across countries" (PDF), Economic Journal, 116 (510): C63–C76, doi:10.1111/j.1468-0297.2006.01076.x
Official websites and reports
- OECD/PISA website