|Possible time of origin||perhaps 4,000 years BP|
|Possible place of origin||perhaps Turkey|
|Ancestor||Haplogroup G2a2b (L30/PF3267/S126)|
|Defining mutations||M406 (G2a2b1), L14/S130 (G2a2b1a1a), L645 (G2a2b1a1b2a )|
In human genetics, Haplogroup G-M406 is a Y-chromosome haplogroup. G-M406 is a branch of Haplogroup G Y-DNA (M201). More specifically in descending order, G-M406 is a subbranch also of G2 (P287), G2a (P15) and finally G2a2b (L30/S126) Haplogroup G-M406 seems most common in Turkey and Greece. Secondary concentrations of G-M406 are found in the northern and eastern Mediterranean, and it is found in very small numbers in more inland areas of Europe, the Middle East, and the southern Caucasus Mountains area.
- 1 Genetic features
- 2 Age of origin of G-M406
- 3 Geographical distribution
- 4 G-M406 subgroups
- 5 Migrations of G-M406 persons
- 6 See also
- 7 References
A large number of G-M406 persons have the value of 21 at short tandem repeat (STR) marker DYS390, and all G-M406 men will have the M406 SNP mutation which characterizes this group. The 21 at DYS390 is uncommon among G persons outside the G-M406 group. In G-M406 persons, the DYS391 marker has mostly a value of 10, but sometimes 11, and DYS392 is almost always 11 except in one distinct cluster. If a sample meets the criteria indicated for these three markers, it is likely the sample is G-M406.
This M406 SNP was first reported in 2008. The M designation indicates it was first identified at Stanford University. M406 is located on the Y chromosome at position 2809995, reference SNP ID i4000120. The mutation involves a change from T to G.
The SNPs L184 and L185 (respectively 17586994 with a mutation from A to G, and 20922998 with a mutation from C to G) were found to exist in only some members of a M406+ family and are now considered private SNPs with no practical use outside this family.
Age of origin of G-M406
Research studies have not addressed the age of G-M406. Based on available 67-marker STR samples, it would seem that the mutation that defines G-M406 arose perhaps about 4,000 years ago.
The ability to describe G-M406 distribution accurately is handicapped by the rather recent identification of the defining G-M406 SNP and the resulting paucity of samples tested for M406. Some estimate of G-M406 percentages can be made, however, by noting the percentage of men with STR marker G samples which have the distinctive 21 value at DYS390 found almost exclustively among G persons within G-M406 and making use of samples tested at commercial labs and in research papers.
Turkey and Greece
Perhaps about 5% of men in Turkey are G-M406, the highest percentage of the general population in any country yet sampled. This 5% figure is based on the finding of the value of 21 at STR marker DYS390 in 21 of 57 G samples from throughout Turkey. Among G persons, the 21 value is seen overwhelmingly in G-M406 persons. But values other than 21 occur to a small extent in G-M406 persons. So the total G-M406 percentage within Turkish G is likely close to 50 percent, and the 57 SNP-confirmed G samples represent 11% of 523 Turkish samples obtained in the largest study yet conducted of Turkish population genetics.
In adjacent Greece, SNP testing determined that half of eight G samples were G-M406. The G samples represented 5% of 171 Greek samples. In contrast in nearby Crete, G-M406 was only 20% of the 21 G samples, with G samples representing 11% of 193 island samples. Though treated separately in this study, Crete is part of Greece but with a different settlement history. Farther out in the Mediterranean, in a smaller sample size from Cyprus 4 of 7 G samples have the distinctive 21 value seen overwhelmingly in G-M406 persons.
Just to the south of Turkey among the Kurds of Iraq 7 of 14 likely G STR samples in the YHRD database have the value of 21 at DYS390 suggesting half the G population there belongs to G-M406. This relatively high percentage of G-M406 is confined to the country's northern Kurdish region.
Lebanon, Jordan and Palestine also have significant G-M406 populations though small sample sizes make broad conclusions difficult. In one study, 4 of 5 Palestinian G samples have the distinctive DYS390=21 value. In Lebanon, at least 10 of 37 G samples have G-M406 features and are found among all the major religions there. In Jordan, 7 of 15 available G samples have 21 at marker DYS390. None of the G samples among the Druze peoples in these locations have STR marker values typical of G-M406 persons according to recent studies. However, recent testing on Family TreeDNA points to some families whose origins lie in Palestine/Israel, but who have moved to Lebanon or Syria prior to the 1500-1700 period who happen to be carrying around 50-60% of the Druze among the Haplogroup G population in Druze. There is initially reasonable assurance the samples are within the DYS459=8 subgroup (that is also negative for its own DYS392=12 subgroup). Overall, these sets of related subgroups make up a major part of G-M406 men and are negative for the L14 and L645 SNP-defined subgroups. The sample has a few marker values that are a little different and the Druze samples have 22 at DYS390 compared to the more typical 21 in the majority of G-M406 samples. So this is good indirect evidence that at least half of the G among the Druze is G-M406. In Syria, G-M406 seems less common than in the countries closer to the Mediterranean. Only 3 of 17 G samples there have the 21 value discussed.
The largest source of G STR samples from Iran is the YHRD database where only 1 of 340 Iranian samples has a value of 21 for DYS390 together with the other marker values consistent with G-M406 samples. It is possible G-M406 persons in Iran have different STR marker values. The countries that share the Persian Gulf and Gulf of Oman with Iran were the subject of a separate small study which provided STR samples with G patterns similar to those in Iran. But the percentage of G-M406 (based on the 21 value) would seem higher than in Iran in Dubai with half of 6 G samples with 21. None of the three G samples from Yemen and Saudi Arabia had this DYS390 value.
In the Nasidze study of the Caucasus Mountains area, his 87 likely G STR samples now in the YHRD database have only 8 samples with the distinctive DYS390 value of 21 that is the overwhelming feature of the G-M406 group. Only in Georgia where 3 of 16 likely G samples have 21 does G-M406 apparently have a significant presence.
The Haplogroup G Project newsletter is gradually summarizing the percentage of G-M406 men in other areas based on the likely or proven G-M406 samples in the project's large project G database. This source has more reliability in identifying the totality of G-M406 samples because it does not rely on the value at STR marker DYS390 as the sole criterion.
The newsletter indicates (in descending order) in Italy 20% of 156 G samples......Spain 15% of 56 G samples.....Netherlands 15% of 20 G samples......Switzerland 8% of 51 G samples.....Iran 6% of 34 G samples....Poland 4% of 75 G samples......France 4% of 46 G samples......Ireland 3% of 29 G samples.......India 3% of 18 G samples.
In addition, a research study which provided Indian and Pakistani STR marker samples did not find that any of the 20 G samples from various groups contained the DYS390=21 value so typical of G-M406. And a study of Sardinia found 7% of 51 G STR samples had the characteristic 21 value. Possibly of significance, half of these "21" samples were found in the asylum land of the ancient Sardinian population in the isolated central highlands which has no evidence of occupation by external colonial powers beginning with the Phoenician period. In the Spanish islands to the west of Sardinia, in another study none of 7 G STR samples from Ibiza are G-M406, but 2 of 4 G samples in Majorca have the DSY390 value of 21. This same study found one 21 value among 25 G Sephardic Jews whose history is intertwined with Spain. Of the 291 STR samples of all types in the YHRD database from Egypt none of the DYS390=21 samples has the same features of known G-M406 samples from other areas of the Mediterranean.
G2a2b1a1a is characterized by possessing the SNP mutation designated either as L14 or S130. G2a2b1a1a persons typically have DYF395S1 values of 16,16 and a DYS565 value of 11.
L14 and S130 are different designations for the same SNP mutation. The L14 version was developed at Family Tree DNA, and S130 was developed at Ethnoancestry. But this mutation was first identified at the University of Central Florida in 2006 as U16 and well before the associated researchers published a formal description in 2009 of its location at 21327383 on the Y chromosome, reference number rs35474563. The mutation involves a C to T change.
Another SNP of interest is S133 which was first identified in 2008 at Ethnoancestry in London, England in a G-M406 person, but it is also available as L90 elsewhere. Its location on the Y chromosome is given as 20087688, reference number rs35169834. The mutation is a change from G to A. In the several persons in which this mutation have been found it does not form a separate category from G2a2b1a1a because the tested persons also had the mutation that defines G2a2b1a1a (L14 or S130). It is possible there are persons who are positive (derived) for either L14/S130 while negative (ancestral) for L90/S133, and such a situation would necessitate creating a separate G-M406 category.
G2a2b1a1b2a is characterized by possessing the SNP mutation designated as L645. G2a2b1a1b2a persons so far all have the value of 9 at STR marker DYS578. Both European and Near Eastern members of this subgroup have been identified. L645 was first identified at Family Tree DNA in summer 2011. It was found at chromosome position 2948673 and represents a mutation from A to C.
The DYS454=12 cluster
There is a cluster of G-M406 men not belonging to the G2a2b1a1a subgroup who (1) have the unusual (for G-M406 men) value of 12 at STR marker DYS454 instead of the ancestral 11 value (2) and have a relatively close genetic distance when 67 markers are compared. The shared common ancestor may have lived about 2,000 years ago or so. This DYS454 STR marker is not as reliable as a SNP mutation for categorizing men, but the 12 value plus other uncommon features and the close genetic distance make is certain this a distinct subgroup. A few persons who belong to this cluster may have had more recent ancestors who mutated to a value other than 12, but such samples have not surfaced.
Many men in this cluster have developed one or two other mutations at STR markers which make the cluster even easier to identify. The largest number of cluster men have a mutation at DYS459a from the ancestral 9 to 8. And many have a mutation from the ancestral 11 to 12 at DYS392. The DYS392=12 value is seen almost exclusively in G1 persons, with this G-M406 cluster being a major exception.
This cluster seems overwhelmingly made up of three identifiable ethnic groups (a) Ashkenazi Jews based on distinctive surnames together with ancestral origins in northeastern Europe south of Scandinavia, (b) Conversos with Hispanic surnames and (c) Men with Italian ancestral origins. However, an available sample from Iran also likely belongs to this cluster. The Ashkenazi Jews are so far not found in the group with the ancestral 11 at DYS392. And the Hispanics are so far not found within the DYS392=12 group. (See also page covering Jews with Haplogroup G (Y-DNA)).
The DYS459b=10 cluster
The men in this cluster of G-M406 men do not belong to the G2a2b1a1a subgroup and have (1) the unusual (for G-M406 men) value of 10 at STR marker DYS459b instead of the ancestral 9 value (2) and have a relatively close genetic distance when 67 markers are compared. This is so far a relatively small cluster with French and British ancestral histories. Too few samples are available to estimate with some reliability when the common male ancestor of this cluster may have lived.
The DYS436=14 cluster
The men in this cluster of G-M406 men do not belong to the G2a2b1a1a subgroup and have the unusual (for G-M406 men) value of 14 at STR marker DYS436 instead of the ancestral 12 value This is so far a relatively small cluster with ancestral histories confined to Galicia and Basque country in Spain and to various parts of Portugal. Too few samples are available to estimate with some reliability when the common male ancestor of this cluster may have lived.
Migrations of G-M406 persons
It is logical that G-M406 spread westward along the Mediterranean from the area where it is most concentrated today (Turkey and the eastern Mediterranean countries) in conjunction with the trading, slave-selling and other migratory events originating in these lands. The first great trading empire that joined both ends of the Mediterranean was the Phoenician that originated in the Israel-Lebanon-Jordan area. After the demise of the Phoenicians, the Greeks, Romans, Byzantines and some "barbarians" could have spread G-M406 from the eastern Mediterranean to the west.
In comparisons of 67-marker G-M406 STR samples available from inland Europe with similar samples from (1) Turkey (2) Lebanon-Jordan and (3) Armenia certain deductions can be made. Most G-M406 Europeans have Armenians as their nearest relatives. Based on the number of mutations observed, some of these probably share common male ancestors as recently as the Dark Ages. Only one European showed Jordan/Lebanon samples as the nearest G relatives. Likewise none of the Europeans showed Turks alone as nearest relatives, but rather some European samples had Turks and Armenians equally related.
The sharing of common ancestors much farther back in time (perhaps 3,500 years ago) among some of these samples does not provide information so useful because the migration westward could have occurred anytime in the earlier period. The finding of likely G-M406 samples in the ancient isolated highlands of Sardinia, however, suggests the arrival of G-M406 in that island prior to the arrival of the Phoenicians. The latter began a Sardinian coastal presence about 3,000 years ago.
There is currently no information to explain the dispersion of G-M406 to the southern Indian area.
- King, R. J.; Özcan, S. S.; Carter, T.; Kalfoğlu, E.; Atasoy, S.; Triantaphyllidis, C.; Kouvatsi, A.; Lin, A. A.; et al. (2008). "Differential Y-chromosome Anatolian Influences on the Greek and Cretan Neolithic". Annals of Human Genetics. 72 (2): 205–214. doi:10.1111/j.1469-1809.2007.00414.x. PMID 18269686.
- Cinnioglu, C.; King, R; Kivisild, T; Kalfoğlu, E; Atasoy, S; Cavalleri, GL; Lillie, AS; Roseman, CC; et al. (January 2004). "Excavating Y-chromosome haplotype strata in Anatolia" (PDF). Human Genetics. 114 (2): 127–48. doi:10.1007/s00439-003-1031-4. PMID 14586639. Archived from the original (PDF) on 2006-06-19.
- El Sibai, M.; et al. (2009). "Geographical structure of the Y-chromosomal genetic landscape of the Levant: a coastal-inland contrast". Annals of Human Genetics. 73 (6): 568–81. doi:10.1111/j.1469-1809.2009.00538.x. PMC . PMID 19686289.
- Zalloua, P.; et al. (2008). "Identifying Genetic Traces of Historical Expansions: Phoenician Footprints in the Mediterranean". American Journal of Human Genetics. 83 (5): 633–42. doi:10.1016/j.ajhg.2008.10.012. PMC . PMID 18976729.
- Zalloua, P.; et al. (2008). "Y-Chromosomal Diversity in Lebanon Is Structured by Recent Historical Events". American Journal of Human Genetics. 82 (4): 873–82. doi:10.1016/j.ajhg.2008.01.020. PMC . PMID 18374297.
- Shlush, L.; et al. (2008). Gemmell, Neil John, ed. "The Druze: A Population Genetic Refugium of the Near East". PLoS ONE. 3 (5): e2105. Bibcode:2008PLoSO...3.2105S. doi:10.1371/journal.pone.0002105. PMC . PMID 18461126.
- Alshamali, F; et al. (2009). "Local population structure in Arabian Peninsula revealed by Y-STR diversity". Human Heredity. 68 (1): 45–54. doi:10.1159/000210448. PMID 19339785.
- Nasidze, I.; et al. (2003). "Testing hypotheses of language replacement in the Caucasus: evidence from the Y-chromosome" (PDF). Human Genetics. 112 (3): 255–61. doi:10.1007/s00439-002-0874-4 (inactive 2017-01-16). PMID 12596050.
- Senpugta, S.; et al. (2006). "Polarity and Temporality of High-Resolution Y-Chromosome Distributions in India Identify Both Indigenous and Exogenous Expansions and Reveal Minor Genetic Influence of Central Asian Pastoralists". American Journal of Human Genetics. 78 (2): 202–21. doi:10.1086/499411. PMC . PMID 16400607.
- Contu, D.; et al. (2008). Hawks, John, ed. "Y-Chromosome Based Evidence for Pre-Neolithic Origin of the Genetically Homogeneous but Diverse Sardinian Population: Inference for Association Scans". PLoS ONE. 3 (1): e1430. Bibcode:2008PLoSO...3.1430C. doi:10.1371/journal.pone.0001430. PMC . PMID 18183308.
- Zei, G.; et al. (2003). "From surnames to the history of Y chromosomes: the Sardinian population as a paradigm". European Journal of Human Genetics. 11 (10): 802–07. doi:10.1038/sj.ejhg.5201040. PMID 14512971.
- Adams, F.; et al. (2009). "The Genetic Legacy of Religious Diversity and Intolerance: Paternal Lineages of Christians, Jews, and Muslims in the Iberian Peninsula". American Journal of Human Genetics. 83 (6): 725–36. doi:10.1016/j.ajhg.2008.11.007. PMC . PMID 19061982.
- Sims, L.; et al. (2009). Batzer, Mark A., ed. "Improved Resolution Haplogroup G Phylogeny in the Y Chromosome, Revealed by a Set of Newly Characterized SNPs". PLoS ONE. 4 (6): e5792. Bibcode:2009PLoSO...4.5792S. doi:10.1371/journal.pone.0005792. PMC . PMID 19495413.