Stanford–Binet Intelligence Scales
|Stanford–Binet Intelligence scales|
The Stanford–Binet Intelligence Scales (or more commonly the Stanford-Binet) is an individually administered intelligence test that was revised from the original Binet-Simon Scale by Lewis M. Terman, a psychologist at Stanford University. The Stanford–Binet Intelligence Scale is now in its fifth edition (SB5) and was released in 2003. It is a cognitive ability and intelligence test that is used to diagnose developmental or intellectual deficiencies in young children. The test measures five weighted factors and consists of both verbal and nonverbal subtests. The five factors being tested are knowledge, quantitative reasoning, visual-spatial processing, working memory, and fluid reasoning.
The development of the Stanford–Binet initiated the modern field of intelligence testing and was one of the first examples of an adaptive test. The test originated in France, then was revised in the United States. It was initially created by the French psychologist Alfred Binet, who, following the introduction of a law mandating universal education by the French government, began developing a method of identifying "slow" children for their placement in special education programs (rather than removing them to asylums as "sick"). As Binet indicated, case studies might be more detailed and helpful, but the time required to test many people would be excessive. In 1916, at Stanford University, the psychologist Lewis Terman released a revised examination which became known as the "Stanford–Binet test".
As discussed by Fancher & Rutherford in 2012, the Stanford–Binet is a modified version of the Binet-Simon Intelligence scale. The Binet-Simon scale was created by the French psychologist Alfred Binet and his student Theodore Simon. Due to changing education laws of the time, Binet had been requested by a government commission to come up with a way to detect children with significantly below-average intelligence and mental retardation.
To create their test, Binet and Simon first created a baseline of intelligence. A wide range of children were tested on a broad spectrum of measures in an effort to discover a clear indicator of intelligence. Failing to find a single identifier of intelligence, Binet and Simon instead compared children in each category by age. The children’s highest levels of achievement were sorted by age and common levels of achievement considered the normal level for that age. Because this testing method merely compares a person's ability to the common ability level of others their age, the general practices of the test can easily be transferred to test different populations, even if the measures used are changed.
One of the first intelligence tests, the Binet-Simon test quickly gained support in the psychological community, many of whom further spread it to the public. Lewis M. Terman, a psychologist at Stanford University, was one of the first to create a version of the test for people in the United States, naming the localized version the Stanford–Binet Intelligence Scale. Terman used the test not only to help identify children with learning difficulties but also to find children and adults who had above average levels of intelligence. In creating his version, Terman also tested additional methods for his Stanford revision, publishing his first official version as The Measurement of Intelligence: An Explanation of and a Complete Guide for the Use of the Stanford Revision and Extension of the Binet-Simon Intelligence Scale (Fancher & Rutherford, 2012) (Becker, 2003).
The original tests in the 1905 form include:
- "Le Regard"
- Prehension Provoked by a Tactile Stimulus
- Prehension Provoked by a Visual Perception
- Recognition of Food
- Quest of Food Complicated by a Slight Mechanical Difficulty
- Execution of Simple Commands and Imitation of Simple Gestures
- Verbal Knowledge of Objects
- Verbal Knowledge of Pictures
- Naming of Designated Objects
- Immediate Comparison of Two Lines of Unequal Lengths
- Repetition of Three Figures
- Comparison of Two Weights
- Verbal Definition of Known Objects
- Repetition of Sentences of Fifteen Words
- Comparison of Known Objects from Memory
- Exercise of Memory on Pictures
- Drawing a Design from Memory
- Immediate Repetition of Figures
- Resemblances of Several Known Objects Given from Memory
- Comparison of Lengths
- Five Weights to be Placed in Order
- Gap in Weights
- Exercise upon Rhymes
- Verbal Gaps to be Filled
- Synthesis of Three Words in One Sentence
- Reply to an Abstract Question
- Reversal of the Hands of a Clock
- Paper Cutting
- Definitions of Abstract Terms
One hindrance to widespread understanding of the test is its use of a variety of different measures. In an effort to simplify the information gained from the Binet-Simon test into a more comprehensible and easier to understand form, German psychologist William Stern created the now well known Intelligence Quotient (IQ). By comparing the age a child scored at to their biological age, a ratio is created to show the rate of their mental progress as IQ. Terman quickly grasped the idea for his Stanford revision with the adjustment of multiplying the ratios by 100 to make them easier to read.
As also discussed by Leslie, in 2000, Terman was another of the main forces in spreading intelligence testing in the United States (Becker, 2003). Terman quickly promoted the use of the Stanford–Binet for schools across the United States where it saw a high rate of acceptance. Terman’s work also had the attention of the U.S. government, who recruited him to apply the ideas from his Stanford–Binet test for military recruitment near the start of World War I. With over 1.7 million military recruits taking a version of the test and the acceptance of the test by the government, the Stanford–Binet saw an increase in awareness and acceptance (Fancher & Rutherford, 2012).
Given the perceived importance of intelligence and with new ways to measure intelligence, many influential individuals, including Terman, began promoting controversial ideas to increase the nation's overall intelligence. These ideas included things such as discouraging individuals with low IQ from having children and granting important positions based on high IQ scores. While there was significant opposition, many institutions proceeded to adjust students' education based on their IQ scores, often with a heavy influence on future career possibilities (Leslie, 2000).
Revisions of the Stanford–Binet Intelligence Scale
Since the first publication in 1916, there have been four additional revised editions of the Stanford–Binet Intelligence Scales, the first of which was developed by Lewis Terman. Over twenty years later, Maud Merrill was accepted into Stanford’s education program shortly before Terman became the head of the psychology department. She completed both her Masters Degree and Ph.D. under Terman and quickly became a colleague of his as they started the revisions of the second edition together. There were 3,200 examinees, aged one and a half to eighteen years, ranging in different geographic regions as well as socioeconomic levels in attempts to comprise a broader normative sample (Roid & Barram, 2004). This edition incorporated more objectified scoring methods, while placing less emphasis on recall memory and including a greater range of nonverbal abilities (Roid & Barram, 2004) compared to the 1916 edition.
When Terman died in 1956, the revisions for the third edition were well underway, and Merrill was able to publish the final revision in 1960 (Roid & Barram, 2004). The use of the deviation IQ made its first appearance in this third edition by replacing the ratio IQ. While new features were added, there were no newly created items included in this revision. Instead, any items from the 1937 form that showed no substantial change in difficulty from the 1930s to the 1950s were either eliminated or adjusted (Roid & Barram, 2004).
Robert Thorndike was asked to take over after Merrill’s retirement. With the help of Elizabeth Hagen and Jerome Sattler, Thorndike produced the fourth edition of the Stanford–Binet Intelligence Scale in 1986. This edition covers the ages two through twenty-three and has some considerable changes compared to its predecessors (Graham & Naglieri, 2003). This edition was the first to use the fifteen subtests with point scales in place of using the previous age scale format. In an attempt to broaden cognitive ability, the subtests were grouped and resulted in four area scores, which improved flexibility for administration and interpretation (Youngstrom, Glutting, & Watkins, 2003). The fourth edition is known for assessing children that may be referred for gifted programs. This edition includes a broad range of abilities which provides more challenging items for those in their early adolescent years, whereas other intelligence tests of the time did not provide difficult enough items for the older children (Laurent, Swerdlik, & Ryburn, 1992).
Gale Roid published the most recent edition of the Stanford–Binet Intelligence Scale. Roid attended Harvard University where he was a research assistant to David McClelland. McClelland is well known for his studies on the need for achievement. While the fifth edition incorporates some of the classical traditions of these scales, there were several significant changes made.
- April 1905: Development of Binet-Simon Test announced at a conference in Rome
- June 1905: Binet-Simon Intelligence Test introduced
- 1908 and 1911: New Versions of Binet-Simon Intelligence Test
- 1916: Stanford–Binet First Edition by Terman
- 1937: Second Edition by Terman and Merrill
- 1973: Third Edition by Merrill
- 1986: Fourth Edition by Thorndike, Hagen, and Sattler
- 2003: Fifth Edition by Roid
Stanford–Binet Intelligence Scale: Fifth Edition
Just as it was used when Binet first developed the IQ test, the Stanford–Binet Intelligence Scale: Fifth Edition (SB5) is based in the schooling process to assess intelligence. It continuously and efficiently assesses all levels of ability in individuals with a broader range in age. It is also capable of measuring multiple dimensions of abilities (Ruf, 2003).
The SB5 can be administered to individuals as early as two years of age. There are ten subsets included in this revision including both verbal and nonverbal domains. Five factors are also incorporated in this scale, which are directly related to Cattell-Horn-Carroll (CHC) hierarchical model of cognitive abilities. These factors include fluid reasoning, knowledge, quantitative reasoning, visual-spatial processing, and working memory (Bain & Allin, 2005). Many of the familiar picture absurdities, vocabulary, memory for sentences, and verbal absurdities still remain from the previous editions (Janzen, Obrzut, & Marusiak, 2003) however with more modern artwork and item content for the revised fifth edition.
For every verbal subtest that is used there is a nonverbal counterpart across all factors. These nonverbal tasks consist of making movement responses such as pointing or assembling manipulatives (Bain & Allin, 2005). These counterparts have been included in order to address the language-reduced assessments in multicultural societies. Depending on age and ability, administration can range from fifteen minutes to an hour and fifteen minutes.
The fifth edition incorporated a new scoring system, which can provide a wide range of information such as four intelligence score composites, five factor indices, and ten subtest scores. Additional scoring information includes percentile ranks, age equivalents, and a change-sensitive score (Janzen, Obrzut, & Marusiak, 2003). Extended IQ scores and gifted composite scores are available with the SB5 in order to optimize the assessment for gifted programs (Ruf, 2003). In order to reduce errors and increase diagnostic precision, scores are obtained electronically through the use of computers now.
The standardization sample for the SB5 included 4,800 participants varying in age, sex, race/ethnicity, geographic region, and socioeconomic level (Bain & Allin, 2005).
Several reliability tests have been performed on the SB5 including split-half reliability, standard error of measurement, plotting of test information curves, test-retest stability, and inter-scorer agreement. On average, the IQ scores for this scale have been found to be quite stable across time (Janzen, Obrzut, & Marusiak, 2003). Internal consistency was tested by split-half reliability and was reported to be substantial and comparable to other cognitive batteries (Bain & Allin, 2005). The median interscorer correlation was found to be .90 on average (Janzen, Obrzut, & Marusiak, 2003). The SB5 has also been found to have great precision at advanced levels of performance meaning that the test is especially useful in testing children for giftedness (Bain & Allin, 2005). There have only been a small amount of practice effects and familiarity of testing procedures with retest reliability; however, these have proven to be insignificant. Readministration of the SB5 can occur in a six-month interval rather than one year due to the small mean differences in reliability (Bain & Allin, 2005).
Content validity has been found based on the professional judgements Roid received concerning fairness of items and item content as well as items concerning the assessment of giftedness (Bain & Allin, 2005). With an examination of age trends, construct validity was supported along with empirical justification of a more substantial g loading for the SB5 compared to previous editions. The potential for a variety of comparisons, especially for within or across factors and verbal/nonverbal domains, has been appreciated with the scores received from the SB5 (Bain & Allin, 2005).
The test publisher includes suggested score classifications in the test manual.
|IQ Range ("deviation IQ")||IQ Classification|
|145–160||Very gifted or highly advanced|
|130–144||Gifted or very advanced|
|70–79||Borderline impaired or delayed|
|55–69||Mildly impaired or delayed|
|40–54||Moderately impaired or delayed|
The classifications of scores used in the Fifth Edition differ from those used in earlier versions of the test.
Subtests and factors
|Fluid reasoning||Knowledge||Quantitative reasoning||Visual-spatial processing||Working memory|
|Early reasoning||Vocabulary||Non-verbal quantitative reasoning (non-verbal)||Form board and form patterns
|Delayed response (non-verbal)|
|Verbal absurdities||Procedural knowledge (non-verbal)||Verbal quantitative reasoning||Position and direction||Block span (non-verbal)|
|Verbal analogies||Picture absurdities (non-verbal)||Memory for sentences|
|Object series matrices (non-verbal)||Last word|
Since its inception, the Stanford–Binet has been revised several times. Currently, the test is in its fifth edition, which is called the Stanford–Binet Intelligence Scales, Fifth Edition, or SB5. According to the publisher's website, "The SB5 was normed on a stratified random sample of 4,800 individuals that matches the 2000 U.S. Census". By administering the Stanford–Binet test to large numbers of individuals selected at random from different parts of the United States, it has been found that the scores approximate a normal distribution. The revised edition of the Stanford–Binet over time has devised substantial changes in the way the tests are presented. The test has improved when looking at the introduction of a more parallel form and more demonstrative standards. For one, a non-verbal IQ component is included in the present day tests whereas in the past, there was only a verbal component. In fact, it now has equal balance of verbal and non-verbal content in the tests. It is also more animated than the other tests, providing the test-takers with more colourful artwork, toys and manipulatives. This allows the test to have a higher range in the age of the test takers. This test is purportedly useful in assessing the intellectual capabilities of people ranging from young children all the way to young adults. However, the test has come under criticism for not being able to compare people of different age categories, since each category gets a different set of tests. Furthermore, very young children tend to do poorly on the test due to the fact that they are lacking in the concentration needed to finish the test.
Current uses for the test include clinical and neuropsychological assessment, educational placement, compensation evaluations, career assessment, adult neuropsychological treatment, forensics, and research on aptitude. Various high-IQ societies also accept this test for admission into their ranks; for example, the Triple Nine Society accepts a minimum qualifying score of 151 for Form L or M, 149 for Form LM if taken in 1986 or earlier, 149 for SB-IV, and 146 for SB-V; in all cases the applicant must have been at least 16 years old at the date of the test.
- Nicolas, S., Andrieu, B., Croizet, J.-C., Sanitioso, R. B., & Burman, J. T. (2013). Sick? Or slow? On the origins of intelligence as a psychological object. Intelligence, 41(5), 699–711. doi:10.1016/j.intell.2013.08.006 (This is an open access article, made freely available by Elsevier.)
- Kaufman, Alan S. (2009). IQ Testing 101. New York: Springer Publishing. p. 112. ISBN 978-0-8261-0629-2. Sattler, Jerome M. (2008). Assessment of Children: Cognitive Foundations. La Mesa (CA): Jerome M. Sattler, Publisher. inside back cover. ISBN 978-0-9702671-4-6. Lay summary (28 July 2010).
- Chase, Danielle (2005). "Underlying Factor Structures of the Stanford-Binet Intelligence Scales – Fifth Edition". Drexel University.
- Bain, S. K., & Allin, J. D. (2005). Book review: Stanford–Binet intelligence scales, fifth edition. Journal of Psychoeducational Assessment, 23, 87–95.
- Becker, K. A. (2003). History of the Stanford–Binet intelligence scales: Content and psychometrics.
- Fancher, Raymond E., & Rutherford, Alexandra. (2012). Pioneers of psychology. New York, NY: W. W. Norton & Company, Inc.
- Graham, J. & Naglieri, J. (2003). Handbook of Psychology. Hoboken, New Jersey: John Wiley & Sons, Inc.
- Janzen, H., Obrzut, J., & Marusiak, C. (2004). Test review: Roid, G. H. (2003). Stanford–binet intelligence scales, fifth edition (sb:v). Canadian Journal of School Psychology, 19, 235–244.
- Laurent, J., Swerdlik, M., & Ryburn, M. (1992). Review of validity research on the stanford–Binet intelligence scale: Fourth edition. Psychological Assessment, 4, 102–112.
- Leslie, M. (2000). The vexing legacy of Lewis Terman. Retrieved from http://alumni.stanford.edu/get/page/magazine/article/?article_id=40678
- Roid, G. (n.d.). Stanford–Binet Intelligence Scales, Fifth Edition
- Roid, G. & Barram, R. (2004). Essentials of Stanford–Binet Intelligence Scales (SB5) Assessment. Hoboken, New Jersey: John Wiley & Sons, Inc.
- Roid, Kamphaus, Randy W., Martha D. Petoskey, and ANNA WALTERS Morgan. "A history of intelligence test interpretation." Contemporary intellectual assessment: Theories, tests, and issues (1997): 3–16.
- Ruf, D. L. (2003). Use of the SB5 in the Assessment of High Abilities. Itasca, IL: Riverside Publishing Company.
- Stanovich, K. E. (2009). What intelligence tests miss: The psychology of rational thought. Yale University Press.
- Youngstrom, E., Glutting, J., & Watkins, M. (2003). Stanford–Binet intelligence scale: Fourth edition (sb4): Evaluating the empirical bases for interpretations. Handbook of Psychological and Educational Assessment: Intelligence, Aptitude, and Achievement, 2, 217–242.
- Becker, K.A (2003). "History of the Stanford–Binet Intelligence scales: Content and psychometrics.". Stanford–Binet Intelligence Scales, Fifth Edition Assessment Service Bulletin No. 1.
- Binet, Alfred; Simon, Th. (1916). The development of intelligence in children: The Binet–Simon Scale. Publications of the Training School at Vineland New Jersey Department of Research No. 11. E. S. Kite (Trans.). Baltimore: Williams & Wilkins. Retrieved 18 July 2010.
- Brown, A. L.; French, L. A. (1979). "The Zone of Potential Development: Implications for Intelligence Testing in the Year 2000". Intelligence. 3 (3): 255–273. doi:10.1016/0160-2896(79)90021-7.
- Fancher, Raymond E. (1985). The Intelligence Men: Makers of the IQ Controversy. New York (NY): W. W. Norton. ISBN 978-0-393-95525-5.
- Freides, D. (1972). "Review of Stanford–Binet Intelligence Scale, Third Revision". In Oscar Buros. Seventh Mental Measurements Yearbook. Highland Park (NJ): Gryphon Press. pp. 772–773.
- Gould, Stephen Jay (1981). The Mismeasure of Man. New York (NY): W. W. Norton. ISBN 978-0-393-31425-0. Lay summary (10 July 2010).
- McNemar, Quinn (1942). The revision of the Stanford–Binet Scale. Boston: Houghton Mifflin.
- Pinneau, Samuel R. (1961). Changes in Intelligence Quotient Infancy to Maturity: New Insights from the Berkeley Growth Study with Implications for the Stanford–Binet Scales and Applications to Professional Practice. Boston: Houghton Mifflin.
- Terman, Lewis Madison; Merrill, Maude A. (1937). Measuring intelligence: A guide to the administration of the new revised Stanford–Binet tests of intelligence. Riverside textbooks in education. Boston (MA): Houghton Mifflin.
- Terman, Lewis Madison; Merrill, Maude A. (1960). Stanford–Binet Intelligence Scale: Manual for the Third Revision Form L–M with Revised IQ Tables by Samuel R. Pinneau. Boston (MA): Houghton Mifflin.
- Richardson, Nancy (1992). "Stanford–Binet IV, of Course!: Time Marches On! (originally published as Which Stanford–Binet for the Brightest?)". Roeper Review. 15 (1): 32–34. doi:10.1080/02783199209553453.
- Waddell, Deborah D. (1980). "The Stanford–Binet: An Evaluation of the Technical Data Available since the 1972 Restandardization". Journal of School Psychology. 18 (3): 203–209. doi:10.1016/0022-4405(80)90060-6. Retrieved 29 June 2010.