Race and genetics: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
Undid revision 556496802 by Aprock (talk)WP:IDIDNTHEARTHAT. It's not cherry. You've now reverted this 5 times. You should know when to stop.
Undid revision 556502702 by BlackHades (talk) rv misrepresentation of sources.
Line 83: Line 83:


While acknowledging Lewontin's observation that racial groups are genetically homogeneous, [[A. W. F. Edwards]] in his 2003 paper "[[Lewontin's Fallacy|Human Genetic Diversity: Lewontin's Fallacy]]" argued that his conclusion (that races cannot be genetically distinguished from each other) is incorrect. Edwards argued that when multiple alleles are taken into account, genetic differences cluster in geographic patterns (roughly corresponding to groups commonly defined as "races"). Most information distinguishing populations from each other is hidden in the correlation structure of allele frequency, making it possible to classify individuals using mathematical techniques. Edwards argued that even if the probability of misclassifying an individual based on a single genetic marker is as high as 30 percent (as Lewontin reported in 1972), the misclassification probability nears zero if enough genetic markers are studied simultaneously. Edwards saw Lewontin's argument as based on a political stance, denying biological differences to argue for social equality.<ref name="Edwards2003"/>
While acknowledging Lewontin's observation that racial groups are genetically homogeneous, [[A. W. F. Edwards]] in his 2003 paper "[[Lewontin's Fallacy|Human Genetic Diversity: Lewontin's Fallacy]]" argued that his conclusion (that races cannot be genetically distinguished from each other) is incorrect. Edwards argued that when multiple alleles are taken into account, genetic differences cluster in geographic patterns (roughly corresponding to groups commonly defined as "races"). Most information distinguishing populations from each other is hidden in the correlation structure of allele frequency, making it possible to classify individuals using mathematical techniques. Edwards argued that even if the probability of misclassifying an individual based on a single genetic marker is as high as 30 percent (as Lewontin reported in 1972), the misclassification probability nears zero if enough genetic markers are studied simultaneously. Edwards saw Lewontin's argument as based on a political stance, denying biological differences to argue for social equality.<ref name="Edwards2003"/>

[[Richard Dawkins]] (2005) agreed with Edwards' view, summarizing the argument against Lewontin as being, "However small the racial partition of the total variation may be, if such racial characteristics as there are highly correlate with other racial characteristics, they are by definition informative, and therefore of taxonomic significance." He explained this position by stating "suppose we took standard full-face photographs of 20 randomly chosen natives of each of the following countries: Japan, Uganda, Iceland, Sri Lanka, Papua New Guinea and Egypt. If we presented all 120 people with all 120 photographs, my guess is that every single one of them would achieve 100 percent success rates in sorting them into six different categories."<ref>{{cite book |title=The Ancestor's Tale: A Pilgrimage to the Dawn of Evolution |last=Dawkins |first=Richard |authorlink= |coauthors=Wong, Yan |year=2005 |publisher=[[Houghton Mifflin Harcourt]] |location= |isbn= 978-0-618-61916-0|page= |pages=406–407 |url=http://books.google.com/?id=rR9XPnaqvCMC&pg=PA406&dq=%22Lewontin's+Fallacy%22#v=onepage&q=%22Lewontin's%20Fallacy%22&f=false |accessdate=July 13, 2011}}</ref>


[[Neil Risch]] (2005), who was involved in research published in the article ''Genetic Structure, Self-Identified Race/Ethnicity, and Confounding in Case-Control Association Studies'',<ref name=Tang2005/> noted: "In a recent study, when we looked at the correlation between genetic structure [based on microsatellite markers] versus self-description, we found 99.9 percent concordance between the two. We actually had a higher discordance rate between self-reported sex and markers on the X chromosome! So you could argue that sex is also a problematic category".<ref name=Gitschier2005>{{cite journal |author=Risch N |title=The whole side of it--an interview with Neil Risch by Jane Gitschier |journal=PLoS Genetics |volume=1 |issue=1 |pages=e14 |year=2005 |month=July |pmid=17411332 |doi=10.1371/journal.pgen.0010014}}</ref> [[Henry Harpending]] (2002) has argued that the magnitude of human F<sub>ST</sub> values implies that "kinship between two individuals of the same human population is equivalent to kinship between grandparent and grandchild or between half siblings. The widespread assertion that this is small and insignificant should be reexamined". <ref>Harpending H. 2002. Kinship and population subdivision. Population and Environment 24(2):141-147.</ref> Sarich and Miele (2004) have argued that estimates of genetic differences between individuals in different populations fail to take into account human [[ploidy|diploidity]]: "The point is that we are diploid organisms, getting one set of chromosomes from one parent and a second from the other. To the extent that your mother and father are not especially closely related, then, those two sets of chromosomes will come close to being a random sample of the chromosomes in your population. And the sets present in some randomly chosen member of yours will also be about as different from your two sets as they are from one another. So how much of the variability will be distributed where? First is the 15 percent that is interpopulational. The other 85 percent will then split half and half (42.5 percent) between the intra- and interindividual within-population comparisons. The increase in variability in between-population comparisons is thus 15 percent against the 42.5 percent that is between-individual within-population. Thus, 15/42.5 is 32.5 percent, a much more impressive and, more important, more legitimate value than 15 percent".<ref>Sarich VM, Miele F. Race: The Reality of Human Differences. Westview Press (2004). ISBN 0-8133-4086-1</ref>
[[Neil Risch]] (2005), who was involved in research published in the article ''Genetic Structure, Self-Identified Race/Ethnicity, and Confounding in Case-Control Association Studies'',<ref name=Tang2005/> noted: "In a recent study, when we looked at the correlation between genetic structure [based on microsatellite markers] versus self-description, we found 99.9 percent concordance between the two. We actually had a higher discordance rate between self-reported sex and markers on the X chromosome! So you could argue that sex is also a problematic category".<ref name=Gitschier2005>{{cite journal |author=Risch N |title=The whole side of it--an interview with Neil Risch by Jane Gitschier |journal=PLoS Genetics |volume=1 |issue=1 |pages=e14 |year=2005 |month=July |pmid=17411332 |doi=10.1371/journal.pgen.0010014}}</ref> [[Henry Harpending]] (2002) has argued that the magnitude of human F<sub>ST</sub> values implies that "kinship between two individuals of the same human population is equivalent to kinship between grandparent and grandchild or between half siblings. The widespread assertion that this is small and insignificant should be reexamined". <ref>Harpending H. 2002. Kinship and population subdivision. Population and Environment 24(2):141-147.</ref> Sarich and Miele (2004) have argued that estimates of genetic differences between individuals in different populations fail to take into account human [[ploidy|diploidity]]: "The point is that we are diploid organisms, getting one set of chromosomes from one parent and a second from the other. To the extent that your mother and father are not especially closely related, then, those two sets of chromosomes will come close to being a random sample of the chromosomes in your population. And the sets present in some randomly chosen member of yours will also be about as different from your two sets as they are from one another. So how much of the variability will be distributed where? First is the 15 percent that is interpopulational. The other 85 percent will then split half and half (42.5 percent) between the intra- and interindividual within-population comparisons. The increase in variability in between-population comparisons is thus 15 percent against the 42.5 percent that is between-individual within-population. Thus, 15/42.5 is 32.5 percent, a much more impressive and, more important, more legitimate value than 15 percent".<ref>Sarich VM, Miele F. Race: The Reality of Human Differences. Westview Press (2004). ISBN 0-8133-4086-1</ref>

Revision as of 01:49, 24 May 2013

The relationship between race and genetics is relevant to the controversy concerning race. Genetic analysis enables us to determine the geographic ancestry of a person. Such analyses may pinpoint the migrational history of a person's ancestors with a high degree of accuracy.

Due to endogamy, allele frequencies cluster around kin groups and lineages or national, cultural or linguistic boundaries. This correlates genetic clusters and population groups when a number of alleles are evaluated.

The biological variation of a human genetic trait is clinal, with a gradual change in trait frequency between population clusters. Clines do not align around the same centers, resulting in more complex variations than those caused by large continental groups.

Research in genetics offers a means to classify humans which is more precise than broad phenotypes (such as skin color), and does not correlate with geographic ancestry. Some anthropologists, particularly forensic anthropologists, consider race a useful biological category. It is possible to determine race from physical remains; what is identified is the geographic phenotype.

Evolution

Multicolored world map
Map of early human migrations[1]
1. Homo sapiens
2. Neanderthals
3. Early Hominids

Humans and chimpanzees diverged from a common ancestor about 5–7 million years ago. The genus Homo evolved by about 2.3 to 2.4 million years ago from Australopithecines. Several species and subspecies of Homo evolved, which are now extinct; these included Homo erectus (who inhabited Asia) and Homo sapiens neanderthalensis (who inhabited western Eurasia). Archaic Homo sapiens evolved from 400,000 to 250,000 years ago.

The dominant view among scientists of the origin of anatomically modern humans is the "out of Africa" hypothesis; this believes that Homo sapiens arose in Africa and migrated from the continent 50,000 to 100,000 years ago, replacing populations of Homo erectus in Asia and Homo neanderthalensis in Europe. This theory has replaced the multiregional hypothesis, which argues that Homo sapiens evolved as geographically separate (but interbreeding) populations stemming from a worldwide migration of Homo erectus out of Africa about 2.5 million years ago. Although scientists have generally replaced Homo erectus with Homo sapiens as the common ancestor of modern humans, DNA evidence demonstrates that Neanderthal genes may have contributed about four percent of non-African heredity and the recently discovered Denisova hominin may have contributed six percent of the Melanesian genome.

Genetic variation

Genetic variation arises from mutations in genetic material, migration between populations (gene flow) and the reshuffling of genes through sexual reproduction. Evolution is caused by natural selection and genetic drift. Genetic drift may be hampered by the founder effect. Epigenetic inheritance are heritable changes in phenotype (appearance) or gene expression caused by mechanisms other than changes in the DNA sequence.

Human phenotypes are polygenic (dependent on interaction by many genes) and may be influenced by environment as well as genetics. The measure of genetic role in phenotypes is heritability.

Nucleotide diversity is based on single mutations, single nucleotide polymorphisms (SNPs). The nucleotide diversity between humans is about 0.1 percent (one difference per one thousand nucleotides between two humans chosen at random). This amounts to approximately three million SNPs (since the human genome has about three billion nucleotides). There are an estimated ten million SNPs in the human population.

Research has shown that non-SNP (structural) variation accounts for more human genetic variation than single nucleotide diversity. Structural variation includes copy-number variation and results from deletions, inversions, insertions and duplications. It is estimated that approximately 0.4 percent of the genomes of unrelated people differ, apart from copy number. When copy-number variation is included, human-to-human genetic variation is estimated to be at least 0.5 percent.

Research methods

Trait, protein and gene studies

Early classification attempts measured surface traits.

Multicolored world map
Geographic distribution of blood group A
Multicolored world map
Geographic distribution of blood group B

Before the discovery of DNA, scientists used blood proteins (the human blood group systems) to study human genetic variation. Research by Ludwik and Hanka Herschfeld during World War I found that the incidence of blood groups A and B differed by region; for example, among Europeans 15 percent were group B and 40 percent group A. Eastern Europeans and Russians had a higher incidence of group B; people from India had the greatest incidence. The Herschfelds concluded that humans comprised two "biochemical races", originating separately. It was hypothesized that these two races later mixed, resulting in the patterns of groups A and B. This was one of the first theories of racial differences to include the idea that human variation did not correlate with genetic variation. It was expected that groups with similar proportions of blood groups would be more closely related, but instead it was often found that groups separated by great distances (such as those from Madagascar and Russia), had similar incidences.[2] Researchers currently use genetic testing, which may involve hundreds (or thousands) of genetic markers or the entire genome.

Population genetics

Structure

Multi-Locus Allele Clusters


In a haploid population, when a single locus is considered (blue), with two alleles, + and - we can see a differential geographical distribution between Population I (70% +) and Population II (30% +).

When we want to assign an individual to one of these populations using this single locus we will assign any + to population I because the probability (p) of this allele belonging to Population I is p = 0.7, the probability (q) of incorrectly assigning this allele to Population I is q = 1 − p, or 0.3. This amounts to a Bernoulli trial because the answer to the question "is this the correct population?" is a simple yes or no. This makes the test binomially distributed but with a single trial.

But when three loci per individual are taken into account, each with p = 0.7 for a + allele in Population I the average number of + alleles per individual becomes kp = 2.1 (number of trials (k = 3) × probability for each allele (p = 0.7)) and 0.9 (3 × 0.3) + alleles per individual in Population II. This is sometimes referred to as the population trait value. Because alleles are discrete entities we can only assign an individual to a population based on the number of whole + alleles it contains. Therefore we will assign any individual with three or two + alleles to Population I, and any individual with one or fewer + alleles to population II.

The binomial distribution with three trials and a probability of 0.7 shows that the probability of an individual from this population having a single + allele is 0.189 and for zero + alleles it is 0.027, which gives a misclassification rate of 0.189 + 0.027 = 0.216, which is a smaller chance of misclassification than for a single allele. Misclassification becomes much smaller as we use more alleles. When more loci are taken into account, each new locus adds an extra independent test to the binomial distribution, decreasing the chance of misclassification.

Using modern computer software and the abundance of genetic data now available, it is possible not only to distinguish such correlations for hundreds or even thousands of alleles, which form clusters, it is also possible to assign individuals to given populations with very little chance of error.[citation needed] It should be noted, however, that genes tend to vary clinally, and there are likely to be intermediate populations that reside in the geographical areas between our sample populations (Population III, for example, may lie equidistantly from Population I and Population II). In this case it may well be that Population III may display characteristics of both population I and Population II and have intermediate frequencies for many of the alleles used for classification, causing this population to be more prone to misclassification.
Multi-colored world map
World map based on genetic principal component analysis of human populations from Luigi Luca Cavalli-Sforza's 1994 History and Geography of Human Genes.[3]

Several methods to examine and quantify genetic subgroups exist, including cluster and principal components analysis. Genetic markers from individuals are examined to find a population's genetic structure. While subgroups overlap when examining variants of one marker only, when a number of markers are examined different subgroups have different average genetic structure. An individual may be described as belonging to several subgroups. These subgroups may be more or less distinct, depending on how much overlap there is with other subgroups. One such mathematical method is.. The population genetic structure found is often similar.[4][5][6]

In cluster analysis, the number of clusters to search for K is determined in advance; how distinct the clusters are varies. The results obtained from cluster analyses depend on several factors:

  • A large number genetic markers studied facilitates finding distinct clusters.[7]
  • Some genetic markers vary more than others, so fewer are required to find distinct clusters.[8] Ancestry-informative markers exhibits substantially different frequencies between populations from different geographical regions. Using AIMs, scientists can determine a person's ancestral continent of origin based solely on their DNA. AIMs can also be used to determine someone's admixture proportions.[9]
  • The more individuals studied, the easier it becomes to detect distinct clusters (statistical noise is reduced).[8]
  • Low genetic variation makes it more difficult to find distinct clusters.[8] Greater geographic distance generally increases genetic variation, making identifying clusters easier.[10]
  • A similar cluster structure is seen with different genetic markers when the number of genetic markers included is sufficiently large. The clustering structure obtained with different statistical techniques is similar. A similar cluster structure is found in the original sample with a subsample of the original sample.[11]

Distance

Genetic distance is genetic divergence between species or populations of a species. It may compare the genetic similarity of related species, such as humans and chimpanzees. Within a species, genetic distance measures divergence between subgroups.

Genetic distance significantly correlates to geographic distance between populations, a phenomenon sometimes known as "isolation by distance".[12] Genetic distance may be the result of physical boundaries restricting gene flow such as islands, deserts, mountains or forests.

Genetic distance is measured by the fixation index (FST). FST is the correlation of randomly chosen alleles in a subgroup to a larger population. It is often expressed as a proportion of genetic diversity. This comparison of genetic variability within (and between) populations is used in population genetics. The values range from 0 to 1; zero indicates the two populations are freely interbreeding, and one would indicate that two populations are separate.

History and geography

Cavalli-Sforza has described two methods of ancestry analysis.[13] Current-population genetic structure does not imply that differing clusters or components indicate only one ancestral home per group; for example, a genetic cluster in the US comprises Hispanics with European, Native American and African ancestry.[7]

Geographic analyses attempt to identify places of origin, their relative importance and possible causes of genetic variation in an area. The results can be presented as maps showing genetic variation. Cavalli-Sforza and colleagues argue that if genetic variations are investigated, they often correspond to population migrations due to new sources of food, improved transportation or shifts in political power. For example, in Europe the most significant direction of genetic variation corresponds to the spread of agriculture from the Middle East to Europe between 10,000 and 6,000 years ago.[13] Such geographic analysis works best in the absence of recent large-scale, rapid migrations.

Historic analyses use differences in genetic variation (measured by genetic distance) as a molecular clock indicating the evolutionary relation of species or groups, and can be used to create evolutionary trees reconstructing population separations.[13]

Validation

Results of genetic-ancestry research are supported if they agree with research results from other fields, such as linguistics or archeology.[13] Cavalli-Sforza and colleagues have argued that there is a correspondence between language families found in linguistic research and the population tree they found in their 1994 study. There are generally shorter genetic distances between populations using languages from the same linguistic family. Exceptions to this rule are Sami, Tibetans and Ethiopians, who are genetically associated with populations speaking languages from other linguistic families. The Sami speak a Uralic language, but are genetically primarily European. This is argued to have resulted from migration (and interbreeding) with Europeans while retaining their original language. Agreement also exists between research dates in archeology and those calculated using genetic distance.[8][13]

Ancestral populations

A 1994 study by Cavalli-Sforza and colleagues evaluated genetic distances among 42 native populations based on 120 blood polymorphisms. The populations were grouped into nine clusters: African (sub-Saharan), Caucasoid (European), Caucasoid (extra-European), northern Mongoloid (excluding Arctic populations), northeast Asian Arctic, southern Mongoloid (mainland and insular Southeast Asia), Pacific islander, New Guinean and Australian, and American (Amerindian). Although the clusters demonstrate varying degrees of homogeneity, the nine-cluster model represents a majority (80 out of 120) of single-trait trees and is useful in demonstrating the historic phylogenetic relationship among these populations.[14]

The greatest genetic distance between two continents is between Africa and Oceania, at 0.2470. Based on physical appearance this is counterintuitive, since indigenous Australians and New Guineans resemble Africans (with dark skin and curly hair). This measure of genetic distance reflects the isolation of Australia and New Guinea since the end of the last glacial maximum, when the continent was isolated from mainland Asia due to rising sea levels. The next-largest genetic distance is between Africa and the Americas, at 0.2260. This is expected, since the longest geographic distance by land is between Africa and South America. The shortest genetic distance, 0.0155, is between European and extra-European Caucasoids. Africa is the most genetically divergent continent, with all other groups more related to each other than to sub-Saharan Africans. This is expected, according to the single-origin hypothesis. Europe has a general genetic variation about three times less than that of other continents; the genetic contribution of Asia and Africa to Europe is thought to be two-thirds and one-third, respectively.[13][14]

Recent studies have been published using an increasing number of genetic markers.[8][11][15][16][17][18][19][20] Research has also been done on smaller regions (for example, the genetic history of Europe, the genetic history of Italy and genetic studies on Jews).

Population structures

Definitions of "race" are rooted in taxonomic classifications first developed in 18th- and 19th-century Europe. "Race" has overlapped with a debate about species known as the species problem.

Since the 1960s scientists have understood race as a social construct imposed on phenotypes in culturally determined ways, rather than a biological concept. A 2000 study by Celera Genomics found that human DNA does not differ significantly across populations. Citizens of any village in the world, in Scotland or Tanzania, have 90 percent of the genetic variability humanity has to offer. Only .01 percent of genes account for a person's appearance.[21] Biological adaptation plays a role in bodily features and skin type. According to Luigi Luca Cavalli-Sforza, "From a scientific point of view, the concept of race has failed to obtain any consensus; none is likely, given the gradual variation in existence. It may be objected that the racial stereotypes have a consistency that allows even the layman to classify individuals. However, the major stereotypes, all based on skin color, hair color and form, and facial traits, reflect superficial differences that are not confirmed by deeper analysis with more reliable genetic traits and whose origin dates from recent evolution mostly under the effect of climate and perhaps sexual selection".[3]

Group size

Research techniques can be used to detect genetic population differences if enough genetic markers are used; the Japanese and Chinese East Asian populations have been identified.[22] Sub-Saharan Africans have greater genetic diversity than other populations.[23]

Lewontin's argument and criticism

In 1972, Richard Lewontin performed a FST statistical analysis using 17 markers (including blood-group proteins). He found that the majority of genetic differences between humans (85.4 percent) were found within a population, 8.3 percent were found between populations within a race and 6.3 percent were found to differentiate races (Caucasian, African, Mongoloid, South Asian Aborigines, Amerinds, Oceanians, and Australian Aborigines in his study). Since then, other analyses have found FST values of 6–10 percent between continental human groups, 5–15 percent between different populations on the same continent and 75–85 percent within populations.[24][25][26][27][28]

While acknowledging Lewontin's observation that racial groups are genetically homogeneous, A. W. F. Edwards in his 2003 paper "Human Genetic Diversity: Lewontin's Fallacy" argued that his conclusion (that races cannot be genetically distinguished from each other) is incorrect. Edwards argued that when multiple alleles are taken into account, genetic differences cluster in geographic patterns (roughly corresponding to groups commonly defined as "races"). Most information distinguishing populations from each other is hidden in the correlation structure of allele frequency, making it possible to classify individuals using mathematical techniques. Edwards argued that even if the probability of misclassifying an individual based on a single genetic marker is as high as 30 percent (as Lewontin reported in 1972), the misclassification probability nears zero if enough genetic markers are studied simultaneously. Edwards saw Lewontin's argument as based on a political stance, denying biological differences to argue for social equality.[5]

Neil Risch (2005), who was involved in research published in the article Genetic Structure, Self-Identified Race/Ethnicity, and Confounding in Case-Control Association Studies,[7] noted: "In a recent study, when we looked at the correlation between genetic structure [based on microsatellite markers] versus self-description, we found 99.9 percent concordance between the two. We actually had a higher discordance rate between self-reported sex and markers on the X chromosome! So you could argue that sex is also a problematic category".[29] Henry Harpending (2002) has argued that the magnitude of human FST values implies that "kinship between two individuals of the same human population is equivalent to kinship between grandparent and grandchild or between half siblings. The widespread assertion that this is small and insignificant should be reexamined". [30] Sarich and Miele (2004) have argued that estimates of genetic differences between individuals in different populations fail to take into account human diploidity: "The point is that we are diploid organisms, getting one set of chromosomes from one parent and a second from the other. To the extent that your mother and father are not especially closely related, then, those two sets of chromosomes will come close to being a random sample of the chromosomes in your population. And the sets present in some randomly chosen member of yours will also be about as different from your two sets as they are from one another. So how much of the variability will be distributed where? First is the 15 percent that is interpopulational. The other 85 percent will then split half and half (42.5 percent) between the intra- and interindividual within-population comparisons. The increase in variability in between-population comparisons is thus 15 percent against the 42.5 percent that is between-individual within-population. Thus, 15/42.5 is 32.5 percent, a much more impressive and, more important, more legitimate value than 15 percent".[31]

Anthropologists (such as C. Loring Brace)[32] and Jonathan Kaplan[33] and geneticist Joseph Graves[34] have argued that while it is possible to find biological and genetic variation roughly corresponding to race, this is true for almost all geographically distinct populations: the cluster structure of genetic data is dependent on the initial hypotheses of the researcher and the populations sampled. When one samples continental groups, the clusters become continental; with other sampling patterns, the clusters would be different. Weiss and Fullerton note that if one sampled only Icelanders, Mayans and Maoris, three distinct clusters would form; all other populations would be composed of genetic admixtures of Maori, Icelandic and Mayan material.[35] Kaplan argues that Lewontin and Edwards are correct, concluding that while racial groups are characterized by allele frequency racial classification is a taxonomy of the human species; other genetic patterns can be found in human populations which crosscut racial distinctions. In this view, races are social constructs with a biological reality (largely an artifact of the category's construction).

Self-identification

Jorde and Wooding (2004) wrote that clusters from genetic markers did not correspond to subjects' self-identified race or ethnic group. However, the studies cited were based on relatively few genetic markers and deemed insufficient. In contrast, studies based on a higher number of genetic markers have found more agreement.[22]

A 2005 study by Tang and colleagues used 326 genetic markers to determine genetic clusters. The 3,636 subjects, from the United States and Taiwan, self-identified as belonging to white, African American, East Asian or Hispanic ethnic groups. The study found "nearly perfect correspondence between genetic cluster and SIRE for major ethnic groups living in the United States, with a discrepancy rate of only 0.14 percent".[7]

Paschou et al. (2010) found "essentially perfect" agreement between 51 self-identified populations and the population's genetic structure, using 650,000 genetic markers. Selecting for informative genetic makers allowed a reduction to less than 650, while retaining near-total accuracy.[36]

Correspondence between genetic clusters in a population (such as the current US population) and self-identified race or ethnic groups does not mean that such a cluster (or group) corresponds to only one ethnic group. African Americans have an estimated 10–20-percent European genetic admixture; Hispanics have European, Native American and African ancestry.[7] In Brazil there has been extensive admixture between Europeans, Amerindians and Africans, resulting in no clear differences in skin color and relatively weak associations between self-reported race and African ancestry.[37][38]

Genetic-distance increase

Colored circles, illustrating gene-pool changes
A change in a gene pool may be abrupt or clinal.

Genetic distances generally increase continually with geographic distance, which makes a dividing line arbitrary. Any two neighboring settlements will exhibit some genetic difference from each other, which could be defined as a race. Therefore, attempts to classify races impose an artificial discontinuity on a naturally occurring phenomenon. This explains why studies on population genetic structure yield varying results, depending on methodology.[39]

Rosenberg and colleagues (2005) have argued, based on cluster analysis, that populations do not always vary continuously and a population's genetic structure is consistent if enough genetic markers (and subjects) are included. "Examination of the relationship between genetic and geographic distance supports a view in which the clusters arise not as an artifact of the sampling scheme, but from small discontinuous jumps in genetic distance for most population pairs on opposite sides of geographic barriers, in comparison with genetic distance for pairs on the same side. Thus, analysis of the 993-locus dataset corroborates our earlier results: if enough markers are used with a sufficiently large worldwide sample, individuals can be partitioned into genetic clusters that match major geographic subdivisions of the globe, with some individuals from intermediate geographic locations having mixed membership in the clusters that correspond to neighboring regions." They also wrote, regarding a model with five clusters corresponding to Africa, Eurasia (Europe, Middle East, and Central/South Asia), East Asia, Oceania, and the Americas: "For population pairs from the same cluster, as geographic distance increases, genetic distance increases in a linear manner, consistent with a clinal population structure. However, for pairs from different clusters, genetic distance is generally larger than that between intracluster pairs that have the same geographic distance. For example, genetic distances for population pairs with one population in Eurasia and the other in East Asia are greater than those for pairs at equivalent geographic distance within Eurasia or within East Asia. Loosely speaking, it is these small discontinuous jumps in genetic distance—across oceans, the Himalayas, and the Sahara—that provide the basis for the ability of STRUCTURE to identify clusters that correspond to geographic regions".[11] This applies to populations in their ancestral homes when migrations and gene flow were slow; large, rapid migrations exhibit different characteristics. Tang and colleagues (2004) wrote, "we detected only modest genetic differentiation between different current geographic locales within each race/ethnicity group. Thus, ancient geographic ancestry, which is highly correlated with self-identified race/ethnicity—as opposed to current residence—is the major determinant of genetic structure in the U.S. population".[7]

Number of clusters

Cluster analysis has been criticized because the number of clusters to search for is decided in advance, with different values possible (although with varying degrees of probability).[40] Principal component analysis does not decide in advance how many components for which to search,[4] and it has been used in an increasing number of studies.[citation needed]

Utility

It has been argued that knowledge of a person's race is limited in value, since people of the same race vary from one another.[22] Witherspoon and colleagues (2007) have argued that when individuals are assigned to population groups, two randomly chosen individuals from different populations can resemble each other more than a randomly chosen member of their own group. They found that many thousands of genetic markers had to be used for the answer to "How often is a pair of individuals from one population genetically more dissimilar than two individuals chosen from two different populations?" to be "never". This assumed three population groups, separated by large geographic distances (European, African and East Asian). The global human population is more complex, and studying a large number of groups would require an increased number of markers for the same answer. They conclude that "caution should be used when using geographic or genetic ancestry to make inferences about individual phenotypes",[41] and "The fact that, given enough genetic data, individuals can be correctly assigned to their populations of origin is compatible with the observation that most human genetic variation is found within populations, not between them. It is also compatible with our finding that, even when the most distinct populations are considered and hundreds of loci are used, individuals are frequently more similar to members of other populations than to members of their own population".[42]

This is similar to the conclusion reached by anthropologist Norman Sauer in a 1992 article on the ability of forensic anthropologists to assign "race" to a skeleton, based on craniofacial features and limb morphology. Sauer said, "the successful assignment of race to a skeletal specimen is not a vindication of the race concept, but rather a prediction that an individual, while alive was assigned to a particular socially constructed 'racial' category. A specimen may display features that point to African ancestry. In this country that person is likely to have been labeled Black regardless of whether or not such a race actually exists in nature".[43]

Race and medicine

Neil Risch states that studies have documented biological differences among the races in susceptibility to chronic disease.[29] Genes change in response to local diseases; for example, people who are Duffy-negative tend to have a higher resistance to malaria (most Africans are Duffy-negative, and most non-Africans are Duffy-positive).[44] A number of genetic diseases prevalent in malaria-endemic areas may provide genetic resistance to malaria, including sickle cell disease, thalassaemias and glucose-6-phosphate dehydrogenase. Cystic fibrosis is the most common life-limiting autosomal recessive disease among people of European ancestry; a hypothesized heterozygote advantage, providing resistance to diseases earlier common in Europe, has been challenged.[45]

Information about a person's population of origin may aid in diagnosis, and adverse drug responses may vary by group.[8] Because of the correlation between self-identified race and genetic clusters, medical treatments influenced by genetics have varying rates of success between self-defined racial groups.[46] For this reason, some physicians consider a patient's race in choosing the most effective treatment,[47] and some drugs are marketed with race-specific instructions.[48] Jorde and Wooding (2004) have argued that because of genetic variation within racial groups, when "it finally becomes feasible and available, individual genetic assessment of relevant genes will probably prove more useful than race in medical decision making". However, race continues to be a factor when examining groups (such as epidemiologic research).[22]

See also

Regional: Archaeogenetics

References

  1. ^ Literature: Göran Burenhult: Die ersten Menschen, Weltbild Verlag, 2000. ISBN 3-8289-0741-5
  2. ^ Sykes, Bryan (2001). "From Blood Groups to Genes". The seven daughters of Eve. New York: Norton. pp. 32–51. ISBN 0-393-02018-5.
  3. ^ a b Cavalli-Sforza, L. L.; Menozzi, P.; Piazza, A. (1994). The history and geography of human genes. Princeton, N.J: Princeton University Press. ISBN 9780691087504. {{cite book}}: Invalid |ref=harv (help)
  4. ^ a b Attention: This template ({{cite doi}}) is deprecated. To cite the publication identified by doi:10.1371/journal.pgen.0020190, please use {{cite journal}} (if it was published in a bona fide academic journal, otherwise {{cite report}} with |doi=10.1371/journal.pgen.0020190 instead.
  5. ^ a b Edwards AW (2003). "Human genetic diversity: Lewontin's fallacy". BioEssays. 25 (8): 798–801. doi:10.1002/bies.10315. PMID 12879450. {{cite journal}}: Unknown parameter |month= ignored (help)
  6. ^ Witherspoon, D. J.; Wooding, S.; Rogers, A. R.; Marchani, E. E.; Watkins, W. S.; Batzer, M. A.; Jorde, L. B. (2007). "Genetic Similarities Within and Between Human Populations". Genetics. 176 (1): 351–359. doi:10.1534/genetics.106.067355. PMC 1893020. PMID 17339205.
  7. ^ a b c d e f Tang H, Quertermous T, Rodriguez B; et al. (2005). "Genetic Structure, Self-Identified Race/Ethnicity, and Confounding in Case-Control Association Studies". American Journal of Human Genetics. 76 (2): 268–75. doi:10.1086/427888. PMC 1196372. PMID 15625622. {{cite journal}}: Explicit use of et al. in: |author= (help); Unknown parameter |month= ignored (help)CS1 maint: multiple names: authors list (link)
  8. ^ a b c d e f Rosenberg NA, Pritchard JK, Weber JL; et al. (2002). "Genetic structure of human populations". Science. 298 (5602): 2381–5. Bibcode:2002Sci...298.2381R. doi:10.1126/science.1078311. PMID 12493913. {{cite journal}}: Explicit use of et al. in: |author= (help); Unknown parameter |month= ignored (help)CS1 maint: multiple names: authors list (link)
  9. ^ Lewontin, R.C. "Confusions About Human Races".
  10. ^ Kittles RA, Weiss KM (2003). "Race, ancestry, and genes: implications for defining disease risk". Annual Review of Genomics and Human Genetics. 4: 33–67. doi:10.1146/annurev.genom.4.070802.110356. PMID 14527296.
  11. ^ a b c Rosenberg NA, Mahajan S, Ramachandran S, Zhao C, Pritchard JK, Feldman MW (2005). "Clines, Clusters, and the Effect of Study Design on the Inference of Human Population Structure". PLoS Genetics. 1 (6): e70. doi:10.1371/journal.pgen.0010070. PMC 1310579. PMID 16355252. {{cite journal}}: Unknown parameter |month= ignored (help)CS1 maint: multiple names: authors list (link) CS1 maint: unflagged free DOI (link)
  12. ^ Support from the relationship of genetic and geographic distance in human populations for a serial founder effect originating in Africa
  13. ^ a b c d e f Genes, peoples, and languages, Luigi Luca Cavalli-Sforza, Proceedings of the National Academy of Science, 1997, vol.94, pp.7719–7724, doi=10.1073/pnas.94.15.7719 http://www.pnas.org/cgi/content/full/94/15/7719
  14. ^ a b Cavalli-Sforza, L. L., P. Menozzi, A. Piazza. 1994. The History and Geography of Human Genes. Princeton University Press, Princeton. ISBN 0-691-02905-9
  15. ^ Attention: This template ({{cite doi}}) is deprecated. To cite the publication identified by doi:10.1126/science.1153717, please use {{cite journal}} (if it was published in a bona fide academic journal, otherwise {{cite report}} with |doi=10.1126/science.1153717 instead.
  16. ^ Attention: This template ({{cite doi}}) is deprecated. To cite the publication identified by doi:10.1038/nature06742, please use {{cite journal}} (if it was published in a bona fide academic journal, otherwise {{cite report}} with |doi=10.1038/nature06742 instead.
  17. ^ Attention: This template ({{cite doi}}) is deprecated. To cite the publication identified by doi:10.1101/gr.085589.108, please use {{cite journal}} (if it was published in a bona fide academic journal, otherwise {{cite report}} with |doi=10.1101/gr.085589.108 instead.
  18. ^ Attention: This template ({{cite doi}}) is deprecated. To cite the publication identified by doi:10.1371/journal.pone.0007888, please use {{cite journal}} (if it was published in a bona fide academic journal, otherwise {{cite report}} with |doi=10.1371/journal.pone.0007888 instead.
  19. ^ Report by Masatoshi Nei and Arun K
  20. ^ Peter J. Taylor's schematic map
  21. ^ "Do Races Differ? Not Really, DNA Shows". The New York Times. 22 August 2000.
  22. ^ a b c d Jordge, Lynn B. and Stephen P. Wooding. "Genetic Variation, classification and 'race'". Nature, Vol. 36 Num. 11, November 2004.
  23. ^ Attention: This template ({{cite doi}}) is deprecated. To cite the publication identified by doi:10.1002/ajpa.21011, please use {{cite journal}} (if it was published in a bona fide academic journal, otherwise {{cite report}} with |doi=10.1002/ajpa.21011 instead.
  24. ^ Lewontin, R. C. (1972). "The Apportionment of Human Diversity" (PDF). Evolutionary Biology. 6 (38): 391–398.
  25. ^ Risch, Neil; Burchard, Esteban; Ziv, Elad; Tang, Hua (2002). "Categorization of humans in biomedical research: genes, race and disease". Genome Biology. 3 (7): comment2007.1. doi:10.1186/gb-2002-3-7-comment2007. PMC 139378. PMID 12184798.{{cite journal}}: CS1 maint: unflagged free DOI (link)
  26. ^ Templeton, Alan R. (2003). "Human Races in the Context of Recent Human Evolution: A Molecular Genetic Perspective". In Goodman, Alan H.; Heath, Deborah; Lindee, M. Susan (eds.). Genetic nature/culture: anthropology and science beyond the two-culture divide (PDF). Berkeley: University of California Press. pp. 234–257. ISBN 0-520-23792-7.
  27. ^ Ossorio and Duster, 2005[verification needed]
  28. ^ Lewontin, R. C. (2005). Confusions About Human Races from Race and Genomics, Social Sciences Research Council. Retrieved 28 December 2006.
  29. ^ a b Risch N (2005). "The whole side of it--an interview with Neil Risch by Jane Gitschier". PLoS Genetics. 1 (1): e14. doi:10.1371/journal.pgen.0010014. PMID 17411332. {{cite journal}}: Unknown parameter |month= ignored (help)CS1 maint: unflagged free DOI (link)
  30. ^ Harpending H. 2002. Kinship and population subdivision. Population and Environment 24(2):141-147.
  31. ^ Sarich VM, Miele F. Race: The Reality of Human Differences. Westview Press (2004). ISBN 0-8133-4086-1
  32. ^ Loring Brace, C. 2005. Race is a four letter word. Oxford University Press.
  33. ^ Kaplan, Jonathan Michael (January 2011) 'Race': What Biology Can Tell Us about a Social Construct. In: Encyclopedia of Life Sciences (ELS). John Wiley & Sons, Ltd: Chichester
  34. ^ Graves, Joseph. 2001. The Emperor's New Clothes. Rutgers University Press
  35. ^ Weiss KM and Fullerton SM (2005) Racing around, getting nowhere. Evolutionary Anthropology 14: 165–169
  36. ^ Ancestry informative markers for fine-scale individual assignment to worldwide populations, Peristera Paschou, Jamey Lewis, Asif Javed, Petros Drineas, J Med Genet doi:10.1136/jmg.2010.078212
  37. ^ Attention: This template ({{cite doi}}) is deprecated. To cite the publication identified by doi:10.1371/journal.pone.0017063, please use {{cite journal}} (if it was published in a bona fide academic journal, otherwise {{cite report}} with |doi=10.1371/journal.pone.0017063 instead.
  38. ^ Attention: This template ({{cite doi}}) is deprecated. To cite the publication identified by doi:10.1073/pnas.0126614100, please use {{cite journal}} (if it was published in a bona fide academic journal, otherwise {{cite report}} with |doi=10.1073/pnas.0126614100 instead.
  39. ^ Back with a Vengeance: the Reemergence of a Biological Conceptualization of Race in Research on Race/Ethnic Disparities in Health Reanne Frank
  40. ^ Bolnick, Deborah A. (2008). "Individual Ancestry Inference and the Reification of Race as a Biological Phenomenon". In Koenig, Barbara A.; Richardson, Sarah S.; Lee, Sandra Soo-Jin (eds.). Revisiting race in a genomic age. Rutgers University Press. ISBN 978-0-8135-4324-6.
  41. ^ Witherspoon DJ, Wooding S, Rogers AR; et al. (2007). "Genetic Similarities Within and Between Human Populations". Genetics. 176 (1): 351–9. doi:10.1534/genetics.106.067355. PMC 1893020. PMID 17339205. {{cite journal}}: Explicit use of et al. in: |author= (help); Unknown parameter |month= ignored (help)CS1 maint: multiple names: authors list (link)
  42. ^ Witherspoon DJ, Wooding S, Rogers AR; et al. (2007). "Genetic Similarities Within and Between Human Populations". Genetics. 176 (1): 358. doi:10.1534/genetics.106.067355. PMC 1893020. PMID 17339205. {{cite journal}}: Explicit use of et al. in: |author= (help); Unknown parameter |month= ignored (help)CS1 maint: multiple names: authors list (link)
  43. ^ Sauer 1992
  44. ^ Malaria and the Red Cell, Harvard University. 2002 url=http://sickle.bwh.harvard.edu/malaria_sickle.html
  45. ^ Högenauer C, Santa Ana CA, Porter JL; et al. (2000). "Active intestinal chloride secretion in human carriers of cystic fibrosis mutations: an evaluation of the hypothesis that heterozygotes have subnormal active intestinal chloride secretion". Am. J. Hum. Genet. 67 (6): 1422–7. doi:10.1086/316911. PMC 1287919. PMID 11055897. {{cite journal}}: Explicit use of et al. in: |author= (help); Unknown parameter |month= ignored (help)CS1 maint: multiple names: authors list (link)
  46. ^ Racial Differences in the Response to Drugs — Pointers to Genetic Differences. New England Journal of Medicine, Volume 344:1393-1396, May 3, 2001.
  47. ^ Bloche, Gregg M. Race-Based Therapeutics. New England Journal of Medicine, Volume 351:2035-2037, November 11, 2004.
  48. ^ Drug information for the drug Crestor. Warnings for this drug state, "People of Asian descent may absorb rosuvastatin at a higher rate than other people. Make sure your doctor knows if you are Asian. You may need a lower than normal starting dose."

Further reading

External links