Spearman's hypothesis has two formulations. The original formulation was that the magnitudes of the black-white differences on tests of cognitive ability positively correlate with the tests' g-loading. The subsequent formulation was that the magnitude of the black-white difference on tests of cognitive ability is entirely or mainly a function of the extent to which a test measures general mental ability, or g.
Claims of validity of Spearman's hypothesis have been criticized on methodological grounds. Such claims have been used to support scientific racism.
The hypothesis, first formalized by Arthur Jensen in the 1980s based on Charles Spearman's earlier comments on the topic, stated that the magnitude of the differences is directly related to the magnitude of the g-loadings of a test. Jensen devised the method of correlated vectors (MCV) to study this hypothesis and published his research in his paper, "The nature of the black–white difference on various psychometric tests: Spearman's hypothesis", a paper which elicited 29 comments from experts in the field. Jensen went on to argue that the finding of a positive correlation between group differences and g-loadings would support the hypothesis that the group differences were largely in general mental ability—the later formulation of Spearman's hypothesis. The validity of this inference has been much debated.
Closely related to Spearman's hypothesis is the hypothesis that the magnitude of certain group differences correlates with within-group heritability estimates. Arthur Jensen and J. Phillippe Rushton, for example, reported in 2010 that the found psychometric meta-analytic correlation between g-loadings and heritability estimates was 1.
The international data, though, has challenged this finding: a study from Japan published in 2014 did find a correlation between g-loading and heritability, but only a third of that found by Jensen & Rushton; one from 2015 covering Korea that found no clear link between heritability and g-loadings of IQ subtests; and one from 2016 by Voronin, Te Nijenhuis, and Malykh found negative correlations between g-loadings and heritability in two samples of Russian twins.
Spearman's hypothesis and the arguments that invoke it have received multiple criticisms, including criticisms of Jensen's method of correlated vectors, of the inference that the first formulation of Spearman's hypothesis supports the second, and of genetic and biological arguments made on the basis of Spearman's hypothesis.
First, Jensen's MCV has been criticized with regards to the original formulation of Spearman's hypothesis. Peter Schönemann and others argue that MCV produces tautological results. However, according to Dolan and Lubke (2001), Schönemann's analysis is incorrect; using simulations, they show that the correlations are not positive by mathematical necessity. Ashton and Lee (2005) argue that MCV can produce spurious results. However, according to te Nijenhuis et al. (2007), one can use psychometric meta-analysis to negate this problem.
Second, Jensen's MCV has been criticized with regards to the claim that it supports the later formulation of Spearman's hypothesis. Dolan et al. (2004) argue that MCV lacks specificity: that is, that instances not including g differences could create a positive correlation between the magnitude of the group differences and the g-loadings. Dolan et al. (2004) note that they are specifically criticizing MCV as a way of proving that group differences largely or totally represent g differences; they don't argue against Spearman's hypothesis as originally formulated and they do not argue that the larger body of evidence does not support Spearman's hypothesis as later formulated. Hunt and Carlson summarize criticism:
The essence of these objections is that the method of correlated vectors does not consider alternative hypotheses concerning the latent traits that might give rise to the observed difference in test scores. When a more appropriate method of analysis, multigroup confirmatory factor analysis, is applied, it has been found that Spearman's hypothesis (i.e., that the difference is due to differences in general intelligence) is only one of several models that could give rise to the observed distributions in test scores (Dolan, 2000). These findings render the method of correlated vectors ambiguous—which is not the same as saying that the Jensen-Rushton position is incorrect. Our point is that the argument for the default hypothesis is an indirect one. It would be far better if a direct causal argument could be made linking racial/ethnic genetic differences to studies of the development of the brain.
However, Rushton and Jensen (2010) argued that this criticism misses the point because there was no absolute claim that g had been proven—only that the results were what would have been expected if a g difference did in fact exist.
Third, arguments based on Spearman's hypothesis have been criticized. Some have argued that culturally caused differences could produce a correlation between g-loadings and group differences. Flynn (2010) has criticized the basic assumption that confirmation of Spearman's hypothesis would support a partially genetic explanation for IQ differences. He argues that environmental causes for average group IQ differences would cause the differences to be greater for more complex tasks.
- Jensen, A. R. (1985). The nature of the black–white difference on various psychometric tests: Spearman's hypothesis. Behavioral and Brain Sciences, 8(02), 193–219.
- Hunt, E., & Carlson, J. (2007). Considerations relating to the study of group differences in intelligence. Perspectives on Psychological Science, 2(2), 194–213.
- te Nijenhuis, J. (2012). Jan te Nijenhuis. The Flynn effect, group differences, and g loadings. Personality and Individual Differences.
- Dolan, C. V., Roorda, W., & Wicherts, J. M. (2004). Two failures of Spearman's hypothesis: The GATB in Holland and the JAT in South Africa. Intelligence, 32(2), 155–173.
- Rushton, J. P., & Jensen, A. R. (2010). "The rise and fall of the Flynn Effect as a reason to expect a narrowing of the Black–White IQ gap". Intelligence, 38(2), 213–219.
- Te Nijenhuis, Jan; Kura, Kenya; Hur, Yoon-Mi (2014). "The correlation between g loadings and heritability in Japan: A meta-analysis". Intelligence. 46: 275–282. doi:10.1016/j.intell.2014.07.008.
- Choi, Yu Yong; Cho, Sun Hee; Lee, Kun Ho (August 2015). "No Clear Link between G Loadings and Heritability: A Twin Study from Korea". Psychological Reports. 117 (1): 291–297. doi:10.2466/19.04.PR0.117c10z1. ISSN 0033-2941. PMID 26126212. S2CID 24209390.
- Voronin, Ivan; Te Nijenhuis, Jan; Malykh, Sergey B. (November 2016). "THE CORRELATION BETWEEN g LOADINGS AND HERITABILITY IN RUSSIA". Journal of Biosocial Science. 48 (6): 833–843. doi:10.1017/S0021932015000395. ISSN 1469-7599. PMID 26507902. S2CID 2286258.
- Fairchild, Harlford (1991). "Scientific Racism: The Cloak of Objectivity". Journal of Social Issues. 47 (3): 101–115. doi:10.1111/j.1540-4560.1991.tb01825.x.
- Schönemann, Peter (1997). "Famous artefacts: Spearman's hypothesis" (PDF). Current Psychology of Cognition. 16 (6): 665–694.[dead link]
- Dolan, C.V. and Lubke, G.H. (2001). Viewing Spearman's hypothesis from the perspective of multigroup PCA: A comment on Schönemann's criticism. Intelligence 29, 231–245.
- Ashton, M. C., & Lee, K. (2005). Problems with the method of correlated vectors. Intelligence, 33(4), 431–444.
- Flynn, J. R. (2010). The spectacles through which I see the race and IQ debate. Intelligence, 38(4), 363–366.