Haplogroup L-M20

Possible time of origin: 25,000–30,000 years BP[citation needed]
Possible place of origin: Middle East, West Asia, South Asia or Pamir Mountains
Defining mutations: M11, M20, M61, M185, L656, L863, L878, L879[web 1]
Highest frequencies: Raqqa (Syria), Balochistan, Northern Afghanistan, South India, Kallar, Tarkhan, Jats, Kalash, Nuristanis, Burusho, Pashtuns, Lazs, Afshar village, Fascia, Veneto, Southern Tyrol

Haplogroup L-M20 is a human Y-DNA haplogroup, which is defined by SNPs M11, M20, M61 and M185. As a secondary descendant of haplogroup K and a primary branch of haplogroup LT, haplogroup L currently has the alternative phylogenetic name of K1a, and is a sibling of haplogroup T (a.k.a. K1b).

The presence of L-M20 has been observed at varying levels throughout South Asia, peaking in populations native to Balochistan (28%),[1] Northern Afghanistan (25%)[2] and Southern India (19%).[3] The clade also occurs in Tajikistan and Anatolia, as well as at lower frequencies in Iran. It has also been present for millennia at very low levels in the Caucasus, Europe and Central Asia. The subclade L2 (L-L595) has been found in Europe and Western Asia, but is extremely rare.

Phylogenetic tree

Several confirmed and proposed phylogenetic trees are available for haplogroup L-M20. The scientifically accepted tree is that of the Y-Chromosome Consortium (YCC), published in Karafet 2008 and subsequently updated. A draft tree that shows emerging science is provided by Thomas Krahn at the Genomic Research Center in Houston, Texas.[web 1] The International Society of Genetic Genealogy (ISOGG) also provides an amateur tree.

This is the draft proposed tree for haplogroup L-M20 from Thomas Krahn at the Genomic Research Center:[web 1]

  • L-M20 M11, M20, M61, M185, L656, L863, L878, L879
    • L-M22 (L1) M22, M295, PAGES00121
      • L-M317 (L1b) M317, L655
        • L-M349 (L1b1) M349
        • L-M274 M274
        • L-L1310 L1310
      • L-L1304 L1304
        • L-M27 (L1a1) M27, M76, P329.1, L1318, L1319, L1320, L1321
        • L-M357 (L1a2) M357
          • L-PK3 PK3
          • L-L1305 L1305, L1306, L1307
    • L-L595 (L2) L595
      • L-L864 L864, L865, L866, L867, L868, L869, L870, L877
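
The branching above can be handled as a nested data structure. The following is a minimal sketch, not part of the cited draft tree: it stores the clade names from the list as nested dictionaries and walks them to recover the chain of ancestors for a given clade. The names `TREE` and `lineage` are illustrative, and the defining SNP lists are omitted for brevity.

```python
# Clades from the draft tree above, stored as nested dicts: clade -> subclades.
# Only the clade names are kept; the defining SNP lists are left out.
TREE = {
    "L-M20": {
        "L-M22 (L1)": {
            "L-M317 (L1b)": {
                "L-M349 (L1b1)": {},
                "L-M274": {},
                "L-L1310": {},
            },
            "L-L1304": {
                "L-M27 (L1a1)": {},
                "L-M357 (L1a2)": {
                    "L-PK3": {},
                    "L-L1305": {},
                },
            },
        },
        "L-L595 (L2)": {
            "L-L864": {},
        },
    }
}


def lineage(tree, target, path=()):
    """Return the tuple of clades from the root down to `target`, or None."""
    for clade, subclades in tree.items():
        here = path + (clade,)
        # Compare against the marker-based name only, e.g. "L-M357".
        if clade.split()[0] == target:
            return here
        found = lineage(subclades, target, here)
        if found:
            return found
    return None


print(" > ".join(lineage(TREE, "L-PK3")))
```

Under this representation, L-PK3 resolves to a path through L-M22, L-L1304 and L-M357, which matches the indentation of the list.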


L-M20 is a descendant of Haplogroup LT,[4][5] which is a descendant of haplogroup K-M9.[6][5] According to Dr. Spencer Wells, L-M20 was hypothesized to have originated in the Middle East or India ca. 30,000 years ago.[7] Other studies have proposed a West Asian origin for L-M20 and associated its expansion in the Indus Valley with Neolithic farmers.[8][9][10][11][12][13] McElreavey and Quintana-Murci, writing on the Indus Valley Civilisation, state that:

One Y-chromosome haplogroup (L-M20) has a high mean frequency of 14% in Pakistan and so differs from all other haplogroups in its frequency distribution. L-M20 is also observed, although at lower frequencies, in neighbouring countries, such as India, Tajikistan, Uzbekistan and Russia. Both the frequency distribution and estimated expansion time (~7,000 YBP) of this lineage suggest that its spread in the Indus Valley may be associated with the expansion of local farming groups during the Neolithic period.[14]

Sengupta et al. (2006) observed three subbranches of haplogroup L: L1-M76 (L1a1), L2-M317 (L1b) and L3-M357 (L1a2), with distinctive geographic affiliations.[15] Almost all Indian members of haplogroup L are L1 derived, with L3-M357 occurring only sporadically (0.4%).[16][17] Conversely, in Pakistan the L3-M357 subclade accounts for 86% of L-M20 chromosomes and reaches an intermediate frequency of 6.8% overall.[18] L1-M76 occurs at a frequency of 7.5% in India and 5.1% in Pakistan, exhibiting peak variance distribution in the Maharashtra region in coastal western India.[19]
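
Because older publications label the same branches differently (L3-M357 in Sengupta 2006, L1c-M357 in other studies cited in this article), the defining SNP is the more stable key. The sketch below is illustrative only: it assumes a label of the form "name-SNP" and maps it to the current clade name using the equivalences stated in the paragraph and tree above. `CURRENT_NAME` and `normalize` are hypothetical names.

```python
# Current clade name for each defining SNP, as given in the tree and the
# paragraph above (M27 and M76 both define L1a1).
CURRENT_NAME = {
    "M27": "L1a1",
    "M76": "L1a1",
    "M317": "L1b",
    "M357": "L1a2",
}


def normalize(label):
    """Rewrite a label such as 'L3-M357' as 'L1a2-M357', keyed on the SNP."""
    _, _, marker = label.partition("-")
    if marker not in CURRENT_NAME:
        raise ValueError(f"no current name known for {label!r}")
    return f"{CURRENT_NAME[marker]}-{marker}"


for old in ("L1-M76", "L2-M317", "L3-M357", "L1c-M357"):
    print(old, "->", normalize(old))
```

Labels without a recognised SNP suffix are rejected rather than guessed, since a name such as "L1c" alone is ambiguous across nomenclatures.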

Geographical distribution

In India, L-M20 has a higher frequency among Dravidian castes, but is somewhat rarer in Indo-Aryan castes.[15] In Pakistan, it has its highest frequency in Balochistan.[20]

It has also been found at low frequencies among populations of Central Asia and South West Asia (including Arabia, Iraq, Syria, Turkey, Lebanon, Egypt, and Yemen) as well as in Southern Europe (especially areas adjoining the Mediterranean Sea).[citation needed]

Preliminary evidence gleaned from non-scientific sources, such as individuals who have had their Y-chromosomes tested by commercial labs,[web 2] suggests that most European examples of Haplogroup L-M20 might belong to the subclade L2-M317, which is, among South Asian populations, generally the rarest of the subclades of Haplogroup L.[web 2]

South Asia


In India, L-M20 has a higher frequency among Dravidian castes (ca. 17–19%) but is somewhat rarer in Indo-Aryan castes (ca. 5–6%).[15] It reaches up to 68% in some tribes and castes of Karnataka,[21] 38% in some castes in Gujarat,[21] 48% in some castes in Tamil Nadu and an overall frequency of 12% in Punjab.[15][21][22] Earlier reports (e.g. Wells 2001) of a very high frequency (approaching 80%) of Haplogroup L-M20 in Tamil Nadu appear to have been due to extrapolation from data obtained from a sample of 84 Kallars, a Tamil-speaking higher ruler caste of Tamil Nadu, among whom 40 (approx. 48%) displayed the M20 mutation that defines Haplogroup L. Haplogroup L-M20 is rare among tribal groups (ca. 5.6–7%) (Cordaux 2004, Sengupta 2006, and Thanseem 2006).

L-M20 was found at 68% in the Korova tribe from Karnataka, 38% in the Bharwad caste from Junagarh district in Gujarat, 21% in the Charan caste from Junagarh district in Gujarat and 17% in the Kare Vokkal tribe from Uttara Kannada in Karnataka (Shah 2011). It is also found at low frequency in other populations from Junagarh district and Uttara Kannada. L-M20 is the single largest male lineage (36.8%) among the Jat people of Northern India and is found at 16.33% among the Gujars of Jammu and Kashmir.[23][24] It also occurs at 18.6% among the Konkanastha Brahmins of the Konkan region[22] and at 15% among the Marathas of Maharashtra.[25] L-M20 is also found at 32.35% in the Vokkaligas and at 17.82% in the Lingayats of Karnataka.[26] L-M20 is also found at 20.7% among the Ambalakarar, 16.7% among the Iyengar and 17.2% among the Iyer castes of Tamil Nadu.[25] L-M11 is found at frequencies of 8–16% among Indian Jews.[27] L-M11 has also been reported in 2% of Siddis (Shah 2011). Haplogroup L-M20 is currently present in the Indian population at an overall frequency of ca. 7–15%.[Footnote 1]


The greatest concentration of Haplogroup L-M20 is along the Indus River in Pakistan, where the Indus Valley Civilization flourished during 3300–1300 BCE, with its mature period between 2600 and 1900 BCE. L-M357's highest frequency and diversity is found in the Balochistan province, at 28%,[20] with a moderate distribution among the general Pakistani population at 11.6% (Firasat 2007). It is also found among related ethnic groups in Afghanistan, such as the Pashtuns and Balochis. L-M357 is found frequently among Burusho (approx. 12%; Firasat 2007) and Pashtuns (approx. 7%; Firasat 2007).

According to Julie di Cristofaro et al. 2013, L1a and L1c-M357 are found at 24% among Balochis, L1a and L1c at 8% among the Dravidian-speaking Brahui, L1c at 25% among Kalash, L1c at 15% among Burusho, L1a-M76 and L1b-M317 at 2% among the Makranis, and L1c at 3.6% among Sindhis.[28] L-M20 is found at 17.78% among the Parsis.[29] L3a is found at 23% among the Nuristanis in both Pakistan and Afghanistan.[30]

L-PK3 is found in approximately 23% of Kalash in northwest Pakistan (Firasat 2007).

Middle East and Anatolia

L-M20 was found in 51% of Syrians from Raqqa, a northern Syrian city whose previous inhabitants were wiped out by Mongol genocides and repopulated in recent times by local Bedouin populations and Chechen war refugees from Russia (El-Sibai 2009). In a small sample of Israeli Druze haplogroup L-M20 was found in 7 out of 20 (35%). However, studies done on bigger samples showed that L-M20 averages 5% in Israeli Druze,[Footnote 2] 8% in Lebanese Druze,[Footnote 3] and it was not found in a sample of 59 Syrian Druze. Haplogroup L-M20 has been found in 2.0% (1/50) (Wells 2001) to 5.25% (48/914) of Lebanese (Zalloua 2008).

Populations | Distribution | Source
Turkey | 57% in Afshar village, 12% (10/83) in Black Sea Region, 6.6% (7/106) of Turks in Turkey, 4.2% (1/523 L-M349 and 21/523 L-M11(xM27, M349)) | Cinnioğlu 2004, Gokcumen 2008 and Karafet 2016
Iran | 54.9% (42/71) L in Priest Zoroastrian Parsis; 22.2% L1b and L1c in South Iran (2/9); 8% to 16% L2-L595, L1a, L1b and L1c of Kurds in Kordestan (2-4/25); 9.1% L-M20 (7/77) of Persians in Eastern Iran; 3.4% L-M76 (4/117) and 2.6% L-M317 (3/117), for a total of 6.0% (7/117) haplogroup L-M20, in Southern Iran; 3.0% (1/33) L-M357 in Northern Iran; 4.2% L1c-M357 of Azeris in East Azeris (1/21); 4.8% L1a and L1b of Persians in Esfahan (2/42) | Regueiro 2006, Cristofaro 2013, Malyarchuk 2013 and Lopez 2017
Syria | 51.0% (33/65) of Syrians in Raqqa, 31.0% of Eastern Syrians | El-Sibai 2009
Laz | 41.7% (15/36) L1b-M317 | O. Balanovsky 2017
Saudi Arabians | 15.6% (4/32 of L-M76 and 1/32 of L-M317), 1.91% (2/157 = 1.27% L-M76 and 1/157 = 0.64% L-M357) | Karafet 2016 and AbuAmero 2009
Kurds | 3.2% of Kurds in Southeast Turkey | Flores 2005
Iraq | 3.1% (2/64) L-M22 | Sanchez 2005
Armenians | 1.63% (12/734) to 4.3% (2/47) | Weale 2001 and Wells 2001
Omanis | 1% L-M11 | Luis 2004
Qataris | 2.8% (2/72 L-M76) | Cadenas 2008
UAE Arabs | 3.0% (4/164 L-M76 and 1/164 L-M357) | Cadenas 2008

Central Asia

A study of Pashtun male lineages in Afghanistan found that Haplogroup L-M20, with an overall frequency of 9.5%, is the second most abundant male lineage among them.[31] It exhibits substantial disparity in its distribution on either side of the Hindu Kush range, with 25% of the northern Afghan Pashtuns belonging to this lineage, compared with only 4.8% of males from the south.[31] Specifically, paragroup L3*-M357 accounts for the majority of the L-M20 chromosomes among Afghan Pashtuns in both the north (20.5%) and south (4.1%).[31] An earlier study involving fewer samples had reported that L1c comprises 12.24% of the Afghan Pashtun male lineages.[32] L1c-M357 also occurs at significant frequencies in the Burusho and Kalash (15% and 25%, respectively).[33] L1c is also found at 7.69% among the Balochs of Afghanistan.[32] However, L1a-M76 occurs at a much higher frequency among the Balochs (20% to 61.54%)[33] and is found at lower levels in Kyrgyz, Tajik, Uzbek and Turkmen populations.[33]

Populations | Distribution | Source
Tajiks | 22.5% (9/40), 11.1% (6/54) L1a and L1c in Balkh Province, 9.0% (7/78), 6.3% (1/16) L1c in Samangan Province, 5.4% (2/37) L1c in Badakhshan Province | Malyarchuk 2013, Wells 2001
Uzbeks | 20% (1/5) L1c in Balkh Province, 14.3% (4/28) L1a and L1c in Sar-e Pol Province, 7.5% (7/94) L1a, L1b and L1c in Jawzjan Province, 3.0% (11/366) to 3.7% (2/54) | Wells 2001, Karafet 2001 and Cristofaro 2013
Uyghurs | 16.7% (1/6) L1c-M357 in Kyrgyzstan | Cristofaro 2013
Pamiris | 16% (7/44) of Shugnanis, 12% (3/25) of Ishkashimis, 0/30 Bartangis | Wells 2001
Hazaras | 12.5% (1/8) L1a in Balkh Province, 1.9% (2/69) L1a in Bamiyan Province | Cristofaro 2013
Yagnobis | 9.7% (3/31) | Wells 2001
Bukharan Arabs | 9.5% (4/42) | Wells 2001
Pashtuns | 9.4% (5/53) L1a and L1b in Kunduz Province, 2.9% (1/34) L1c in Baghlan Province | Cristofaro 2013
Dungans | 8.3% (1/12) L1c in Kyrgyzstan | Cristofaro 2013
Uyghurs (Lopliks) | 7.8% (5/64) L-M357 in Qarchugha Village, Lopnur County, Xinjiang[34] | Liu 2018
Karakalpaks | 4.5% (2/44) | Wells 2001
Uyghurs | 4.4% (3/68) | Karafet 2001 and Hammer 2005[Footnote 4]
Turkmens | 4.1% (3/74) L1a in Jawzjan Province | Cristofaro 2013
Chelkans | 4.0% (1/25) | Dulik 2012
Kyrgyzes | 2.7% (1/37) L1c in Northwest Kyrgyzstan and 2.5% (1/40) L1a in Central Kyrgyzstan | Cristofaro 2013
Kazan Tatars | 2.6% (1/38) | Wells 2001
Hui | 1.9% (1/54) | Karafet 2001
Bashkirs | 0.64% (3/471) | Lobov 2009

East Asia

Researchers studying samples of Y-DNA from populations of East Asia have rarely tested their samples for any of the mutations that define Haplogroup L. However, mutations for Haplogroup L have been tested and detected in samples of Balinese (13/641 = 2.0% L-M20), Han Chinese (1/57 = 1.8%),[35] Dolgans from Sakha and Taymyr (1/67 = 1.5% L-M20) and Koreans (3/506 = 0.6% L-M20).[36][37][38]


An article by O. Semino et al. published in the journal Science (Volume 290, 10 November 2000) reported the detection of the M11-G mutation, which is one of the mutations that defines Haplogroup L, in approximately 1% to 3% of samples from Georgia, Greece, Hungary, Calabria (Italy), and Andalusia (Spain). The sizes of the samples analyzed in this study were generally quite small, so it is possible that the actual frequency of Haplogroup L-M20 among Mediterranean European populations may be slightly lower or higher than that reported by Semino et al., but there seems to be no study to date that has described more precisely the distribution of Haplogroup L-M20 in Southwest Asia and Europe.
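
The effect of small sample sizes noted above can be made concrete with a confidence interval. The sketch below is an illustration, not a calculation from Semino et al.: it applies the Wilson score interval (a standard interval for a binomial proportion) to a carrier count and sample size. The function name `wilson_interval` and the choice of a 95% level (z = 1.96) are assumptions of this example.

```python
import math


def wilson_interval(carriers, sample_size, z=1.96):
    """Wilson score interval for a proportion; z=1.96 gives roughly 95%."""
    p = carriers / sample_size
    denom = 1 + z**2 / sample_size
    centre = (p + z**2 / (2 * sample_size)) / denom
    half_width = (
        z
        * math.sqrt(p * (1 - p) / sample_size + z**2 / (4 * sample_size**2))
        / denom
    )
    return centre - half_width, centre + half_width


# Example: 1 carrier in 63 samples, the Georgian L-M11 count listed
# in the table below.
low, high = wilson_interval(1, 63)
print(f"observed {1 / 63:.1%}, interval {low:.1%} to {high:.1%}")
```

For a single carrier among 63 samples, the interval spans roughly 0.3% to 8.5%, which shows why an observed frequency of 1% to 3% in a small sample only loosely constrains the population frequency.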

Populations | Distribution | Source
Fascia, Italy | 19.2% of Fascians L-M20 | Valentina Coia 2013
Nonstal, Italy | 10% of Nonesi L-M20 | F. di Giacomo 2003
Samnium, Italy | 10% of Aquilanis L-M20 | Alessio Boattini 2013
Vicenza, Italy | 10% of Venetians L-M20 | Alessio Boattini 2013
South Tyrol, Italy | 8.9% of Ladin speakers from Val Badia, 8.3% of Val Badia, 2.9% of Puster Valley, 2.2% of German speakers from Val Badia, 2% of German speakers from Upper Vinschgau, 1.9% of German speakers from Lower Vinschgau and 1.7% of Italian speakers from Bolzano | Pichler 2006 and Thomas 2007
Georgians | 20% (2/10) of Georgians in Gali, 14.3% (2/14) of Georgians in Chokhatauri, 12.5% (2/16) of Georgians in Martvili, 11.8% (2/17) of Georgians in Abasha, 11.1% (2/18) of Georgians in Baghdati, 10% (1/10) of Georgians in Gardabani, 9.1% (1/11) of Georgians in Adigeni, 6.9% (2/29) of Georgians in Omalo, 5.9% (1/17) of Georgians in Gurjaani, 5.9% (1/17) of Georgians in Lentekhi and 1.5% (1/66) L-M357(xPK3) to 1.6% (1/63) L-M11 | Battaglia 2008, Semino 2000 and Tarkhnishvili 2014
Daghestan, Russia | 10% of Chechens in Daghestan, 9.5% (4/42) of Avars, 8.3% (2/24) of Tats, 3.7% (1/27) of Chamalins | Yunusbaev 2006, Caciagli 2009 and Karafet 2016
Arkhangelsk Oblast, Russia | 5.9% of Russians L1c-M357 | Hongyang Xu 2014
Estonia | L2-L595 and L1-M22 are found in 5.3%, 3.5%, 1.4% and 0.8% of Estonians | Scozzari 2001 and Lappalainen 2007
Balkarians, Russia | 5.3% (2/38) L-M317 | Battaglia 2008
Portugal | 5.0% of Coimbra | Beleza 2006
Bulgaria | 3.9% of Bulgarians | Karafet 2016
Flanders | L1a*: 3.17% of Mechelen, 2.4% of Turnhout and 1.3% of Kempen. L1b*: 0.74% of West Flanders and East Flanders | Larmuseau 2010 and Larmuseau 2011
Antsiferovo, Novgorod | 2.3% of Russians | Balanovska 2017
East Tyrol, Austria | L-M20 is found in 1.9% of Tyroleans in Region B (Isel, Lower Drau, Defereggen, Virgen, and Kals valley) | H. Niederstätter 2012
Gipuzkoa, Basque Country | L1b is found in 1.7% of Gipuzkoans | Young 2011
North Tyrol, Austria | L-M20 is found in 0.8% of Tyroleans in Reutte | D. Erhart 2012

Subclade distribution

L1 (M295)

L-M295 is found from Western Europe to South Asia.[Footnote 5]

The L1 subclade is also found at low frequencies on the Comoros Islands.[39]

L1a1 (M27)

L-M27 is found in 14.5% of Indians and 15% of Sri Lankans, with a moderate distribution in other populations of Pakistan, southern Iran and Europe, and a slightly higher frequency in Middle Eastern Arab populations[citation needed] (Karafet 2016). There is also a very minor presence among Siddis (2%).[40]

L1a2 (M357)

L-M357 is found frequently among Burushos, Kalashas, and Pashtuns, with a moderate distribution among other populations in Pakistan, Georgia,[41] Chechens,[42] Ingushes,[42] northern Iran, India, the UAE, and Saudi Arabia.[citation needed]

A Chinese study published in 2018 found L-M357/L1307 in 7.8% (5/64) of a sample of Loplik Uyghurs from Qarchugha Village, Lopnur County, Xinjiang.[34]


L-PK3, which is downstream of L-M357,[43] is found frequently among Kalash.[citation needed]

L1b (M317)

L-M317 is found at low frequency in Central Asia, Southwest Asia, and Europe.[citation needed]

In Europe, L-M317 has been found in Northeast Italians (3/67 = 4.5%)[41] and Greeks (1/92 = 1.1%).[41]

In Caucasia, L-M317 has been found in Mountain Jews (2/10 = 20%[44]), Avars (4/42 = 9.5%,[44] 3%[42]), Balkarians (2/38 = 5.3%),[41] Abkhaz (8/162 = 4.9%,[44] 2/58 = 3.4%[42]), Chamalals (1/27 = 3.7%[44]), Abazins (2/88 = 2.3%[44]), Adyghes (3/154 = 1.9%[44]), Chechens (3/165 = 1.8%[44]), Armenians (1/57 = 1.8%[44]), Lezgins (1/81 = 1.2%[42]), and Ossetes (1/132 = 0.76% North Ossetians,[44] 2/230 = 0.9% Iron[42]).

L-M317 has been found in Makranis (2/20 = 10%) in Pakistan, Iranians (3/186 = 1.6%), Pashtuns in Afghanistan (1/87 = 1.1%), and Uzbeks in Afghanistan (1/127 = 0.79%).[45]

L1b1 (M349)

L-M349 is principally found in Europe.[citation needed]

L2 (L595)

L2-L595 is extremely rare, and has been identified by private testing in individuals from Europe and Western Asia.

Two confirmed L2-L595 individuals from Iran were reported in the supplementary material of a 2020 study.[46] Possible but unconfirmed cases of L2 include 4% (1/25) L-M11(xM76, M27, M317, M357) in a sample of Iranians in Kordestan[45] and 2% (2/100) L-M20(xM27, M317, M357) in a sample of Shapsugs,[42] among other rare reported cases of L that do not fall into the common branches.

L2 in modern populations
Region | Population | n/Sample size | Percentage | Source
West Asia | Azerbaijan | 2/204 | 1 | [47]
Central Europe | Germany | 1/8641 | 0.01 | [48]
Southern Europe | Greece | 1/753 | 0.1 | [49]
West Asia | Iran | 2/800 | 0.25 | [50]
Southern Europe | Italy | 3/913 | 0.3 | [51]
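
The percentage column follows from the counts: carriers divided by sample size, multiplied by 100. A minimal sketch that recomputes it from the n/sample-size values in the table is shown below; the names `ROWS` and `percentage` are illustrative.

```python
# (population, L2 carriers, sample size) taken from the table above.
ROWS = [
    ("Azerbaijan", 2, 204),
    ("Germany", 1, 8641),
    ("Greece", 1, 753),
    ("Iran", 2, 800),
    ("Italy", 3, 913),
]


def percentage(carriers, sample_size):
    """Frequency of carriers in the sample, expressed in percent."""
    return 100 * carriers / sample_size


for population, carriers, sample_size in ROWS:
    print(f"{population}: {percentage(carriers, sample_size):.2f}%")
```

Recomputing from the raw counts is a quick way to check a tabulated percentage, especially for very rare lineages where a misplaced decimal point is easy to overlook.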

Ancient DNA

  • Three individuals from Maykop culture c. 5200 BCE were found to belong to haplogroup L2-L595.[52]
  • Three individuals who lived in the Chalcolithic era (c. 5700–6250 years BP), found in the Areni-1 ("Bird's Eye") cave in the South Caucasus mountains (present-day Vayots Dzor Province, Armenia), were also identified as belonging to haplogroup L1a. One individual's genome indicated that he had red hair and blue eyes.
  • Haplogroup L1a-M27 was found in the remains from a 5th-century burial site in Pannonia, Hungary, attributed to an elite member of the Huns.[53]
  • Narasimhan et al. (2018) analyzed skeletons from the BMAC sites in Uzbekistan and identified 2 individuals as belonging to haplogroup L1a. One of these specimens was found in Bustan and the other in Sappali Tepe; both ascertained to be Bronze Age sites.[54]
  • Skourtanioti et al. (2020) analyzed skeletons from the Alalakh sites in Syria and identified one individual (ALA084) c. 2006-1777 BC as belonging to haplogroup L-L595.[55]
  • An ancient Viking individual that lived in Öland, Sweden circa 847 ± 65 CE was determined to belong to L-L595.[56]

Elite Hun grave

Remains found in grave in Pannonia (attributed to an elite member of the Huns)
Population | Huns
Language | unknown; possibly Hunnic
Date (YBP) | 1540–1500
Burial / Location | Hungary
Members / Sample Size | 1/1
mtDNA | D4j12
Source | Laboratory of population genetics of Kazakhstan

Chalcolithic South Caucasus

Areni-1 Cave
Property | Areni-I | Areni-II | Areni-III
ID | AR1/44 I1634 | AR1/46 I1632 | ARE12 I1407
Y DNA | L1a | L1a1-M27 | L1a
Population | Chalcolithic (Horizon III) | Chalcolithic (Horizon III) | Chalcolithic (Horizon II)
Culture | Late Chalcolithic | Late Chalcolithic | Late Chalcolithic
Date (YBP) | 6161 ± 89 | 6086 ± 72 | 6025 ± 325
Burial / Location | Burial 2 / Areni-1 Cave | Burial 3 / Areni-1 Cave | Trench 2A, Unit 7, Square S33/T33, Locus 9, Spit 23 / Areni-1 Cave
Members / Sample Size | 1/3 | 1/3 | 1/3
Percentage | 33.3% | 33.3% | 33.3%
mtDNA | H2a1 | K1a8 | H*
Eye color (HIrisPlex System) | Likely Blue
Hair color (HIrisPlex System) | Likely Red
Skin pigmentation | Likely light
ABO Blood Group | Likely O or B
Lactase Persistence | Likely lactose-intolerant
Age at Death | 11 ± 2.5; 15 ± 2.5
Source | [57]
Notes | World's earliest evidence of footwear and wine making


Prior to 2002, there were in academic literature at least seven naming systems for the Y-Chromosome Phylogenetic tree. This led to considerable confusion. In 2002, the major research groups came together and formed the Y-Chromosome Consortium (YCC). They published a joint paper that created a single new tree that all agreed to use. Later, a group of citizen scientists with an interest in population genetics and genetic genealogy formed a working group to create an amateur tree aiming at being above all timely. The table below brings together all of these works at the point of the landmark 2002 YCC Tree. This allows a researcher reviewing older published literature to quickly move between nomenclatures.

YCC 2002/2008 (Shorthand) | (α) | (β) | (γ) | (δ) | (ε) | (ζ) | (η) | YCC 2002 (Longhand) | YCC 2005 (Longhand) | YCC 2008 (Longhand) | YCC 2010r (Longhand) | ISOGG 2006 | ISOGG 2007 | ISOGG 2008 | ISOGG 2009 | ISOGG 2010 | ISOGG 2011 | ISOGG 2012
L-M20 | 28 | VIII | 1U | 27 | Eu17 | H5 | F | L* | L | L | L | - | - | - | - | - | - | -
L-M27 | 28 | VIII | 1U | 27 | Eu17 | H5 | F | L1 | L1 | L1 | L1 | - | - | - | - | - | - | -

The Y-Chromosome Consortium tree

This is the official scientific tree produced by the Y-Chromosome Consortium (YCC). The last major update was in 2008 (Karafet 2008). Subsequent updates have been quarterly and biannual. The current version is a revision of the 2010 update.[58]

Original research publications

The following research teams per their publications were represented in the creation of the YCC Tree.


  1. ^ See Basu 2003, Cordaux 2004, Sengupta 2006, and Thanseem 2006.
  2. ^ 12/222 Shlush et al. 2008
  3. ^ 1/25 Shlush et al. 2008
  4. ^ In Hammer 2005, see the Supplementary Material.
  5. ^ FTDNA lab results, May 2011


  1. ^ Mahal, David G.; Matsoukas, Ianis G. (23 January 2018). "The Geographic Origins of Ethnic Groups in the Indian Subcontinent: Exploring Ancient Footprints with Y-DNA Haplogroups". Frontiers in Genetics. 9: 4. doi:10.3389/fgene.2018.00004. ISSN 1664-8021. PMC 5787057. PMID 29410676.
  2. ^ Lacau, Harlette; Gayden, Tenzin; Regueiro, Maria; Chennakrishnaiah, Shilpa; Bukhari, Areej; Underhill, Peter A; Garcia-Bertrand, Ralph L; Herrera, Rene J (18 April 2012). "Afghanistan from a Y-chromosome perspective". European Journal of Human Genetics. 20 (10): 1063–1070. doi:10.1038/ejhg.2012.59. ISSN 1018-4813. PMC 3449065. PMID 22510847.
  3. ^ Sengupta, Sanghamitra; Zhivotovsky, Lev A.; King, Roy; Mehdi, S. Q.; Edmonds, Christopher A.; Chow, Cheryl-Emiliane T.; Lin, Alice A.; Mitra, Mitashree; Sil, Samir K.; Ramesh, A.; Usha Rani, M. V.; Thakur, Chitra M.; Cavalli-Sforza, L. Luca; Majumder, Partha P.; Underhill, Peter A. (2006). "Polarity and Temporality of High-Resolution Y-Chromosome Distributions in India Identify Both Indigenous and Exogenous Expansions and Reveal Minor Genetic Influence of Central Asian Pastoralists". American Journal of Human Genetics. 78 (2): 202–221. doi:10.1086/499411. ISSN 0002-9297. PMC 1380230. PMID 16400607.
  4. ^ International Society of Genetic Genealogy, 2015, Y-DNA Haplogroup Tree 2015 (30 May 2015).
  5. ^ a b Chiaroni, J.; Underhill, P. A.; Cavalli-Sforza, L. L. (December 2009). "Y chromosome diversity, human expansion, drift, and cultural evolution". Proc. Natl. Acad. Sci. U.S.A. 106 (48): 20174–20179. Bibcode:2009PNAS..10620174C. doi:10.1073/pnas.0910803106. JSTOR 25593348. PMC 2787129. PMID 19920170.
  6. ^ International Society of Genetic Genealogy, 2015 Y-DNA Haplogroup K and its Subclades – 2015 (5 April 2015).
  7. ^ Wells, Spencer (2007). Deep ancestry : inside the Genographic project. Washington, D.C.: National Geographic. p. 103. ISBN 978-1426201189.
  8. ^ Qamar, Raheel; Ayub, Qasim; Mohyuddin, Aisha; Helgason, Agnar; Mazhar, Kehkashan; Mansoor, Atika; Zerjal, Tatiana; Tyler-Smith, Chris; Mehdi, S. Qasim (2002). "Y-Chromosomal DNA Variation in Pakistan". American Journal of Human Genetics. 70 (5): 1107–1124. doi:10.1086/339929. ISSN 0002-9297. PMC 447589. PMID 11898125.
  9. ^ Zhao, Zhongming; Khan, Faisal; Borkar, Minal; Herrera, Rene; Agrawal, Suraksha (2009). "Presence of three different paternal lineages among North Indians: A study of 560 Y chromosomes". Annals of Human Biology. 36 (1): 46–59. doi:10.1080/03014460802558522. ISSN 0301-4460. PMC 2755252. PMID 19058044.
  10. ^ Thanseem, Ismail; Thangaraj, Kumarasamy; Chaubey, Gyaneshwer; Singh, Vijay Kumar; Bhaskar, Lakkakula VKS; Reddy, B Mohan; Reddy, Alla G; Singh, Lalji (7 August 2006). "Genetic affinities among the lower castes and tribal groups of India: inference from Y chromosome and mitochondrial DNA". BMC Genetics. 7: 42. doi:10.1186/1471-2156-7-42. ISSN 1471-2156. PMC 1569435. PMID 16893451.
  11. ^ Cordaux, Richard; Aunger, Robert; Bentley, Gillian; Nasidze, Ivane; Sirajuddin, S. M.; Stoneking, Mark (3 February 2004). "Independent origins of Indian caste and tribal paternal lineages". Current Biology. 14 (3): 231–235. doi:10.1016/j.cub.2004.01.024. ISSN 0960-9822. PMID 14761656. S2CID 5721248.
  12. ^ McElreavey, K.; Quintana-Murci, L. (2005). "A population genetics perspective of the Indus Valley through uniparentally-inherited markers". Annals of Human Biology. 32 (2): 154–162. doi:10.1080/03014460500076223. ISSN 0301-4460. PMID 16096211. S2CID 109014.
  13. ^ Thangaraj, Kumarasamy; Naidu, B. Prathap; Crivellaro, Federica; Tamang, Rakesh; Upadhyay, Shashank; Sharma, Varun Kumar; Reddy, Alla G.; Walimbe, S. R.; Chaubey, Gyaneshwer; Kivisild, Toomas; Singh, Lalji (20 December 2010). "The Influence of Natural Barriers in Shaping the Genetic Structure of Maharashtra Populations". PLOS ONE. 5 (12): e15283. Bibcode:2010PLoSO...515283T. doi:10.1371/journal.pone.0015283. ISSN 1932-6203. PMC 3004917. PMID 21187967.
  14. ^ K. McElreavey and L. Quintana-Murci (2005), "A population genetics perspective of the Indus Valley through uniparentally-inherited markers". Archived 2017-01-16 at the Wayback Machine.
  15. ^ a b c d Sengupta 2006.
  16. ^ Sengupta 2006, p. 218.
  17. ^ Kivisild, T.; Rootsi, S.; Metspalu, M.; Mastana, S.; Kaldma, K.; Parik, J.; Metspalu, E.; Adojaan, M.; Tolk, H.-V.; Stepanov, V.; Gölge, M.; Usanga, E.; Papiha, S. S.; Cinnioğlu, C.; King, R.; Cavalli-Sforza, L.; Underhill, P. A.; Villems, R. (2003). "The Genetic Heritage of the Earliest Settlers Persists Both in Indian Tribal and Caste Populations". American Journal of Human Genetics. 72 (2): 313–332. doi:10.1086/346068. ISSN 0002-9297. PMC 379225. PMID 12536373.
  18. ^ Sengupta 2006, p. 219.
  19. ^ Sengupta 2006, p. 220.
  20. ^ a b Qamar 2002.
  21. ^ a b c Shah 2011.
  22. ^ a b Kivisild, T; Rootsi, S; Metspalu, M; et al. (February 2003). "The Genetic Heritage of the Earliest Settlers Persists Both in Indian Tribal and Caste Populations". Am. J. Hum. Genet. 72 (2): 313–32. doi:10.1086/346068. PMC 379225. PMID 12536373.
  23. ^ Mahal, David G.; Matsoukas, Ianis G. (20 September 2017). "Y-STR Haplogroup Diversity in the Jat Population Reveals Several Different Ancient Origins". Frontiers in Genetics. 8: 121. doi:10.3389/fgene.2017.00121. ISSN 1664-8021. PMC 5611447. PMID 28979290.
  24. ^ Sharma, S; Rai, E; Sharma, P; et al. (January 2009). "The Indian origin of paternal haplogroup R1a1* substantiates the autochthonous origin of Brahmins and the caste system". Journal of Human Genetics. 54 (1): 47–55. doi:10.1038/jhg.2008.2. PMID 19158816.
  25. ^ a b Sengupta, S; Zhivotovsky, LA; King, R; et al. (February 2006). "Polarity and Temporality of High-Resolution Y-Chromosome Distributions in India Identify Both Indigenous and Exogenous Expansions and Reveal Minor Genetic Influence of Central Asian Pastoralists". Am. J. Hum. Genet. 78 (2): 202–21. doi:10.1086/499411. PMC 1380230. PMID 16400607.
  26. ^ "Analysis of Y-chromosome Diversity in Lingayat and Vokkaliga Populations of Southern India". 2011. CiteSeerX.
  27. ^ Chaubey, Gyaneshwer (2016). "Genetic affinities of the Jewish populations of India". Scientific Reports. 6: 19166. Bibcode:2016NatSR...619166C. doi:10.1038/srep19166. PMC 4725824. PMID 26759184.
  28. ^ Di Cristofaro, Julie; Pennarun, Erwan; Mazières, Stéphane; Myres, Natalie M.; Lin, Alice A.; Temori, Shah Aga; Metspalu, Mait; Metspalu, Ene; Witzel, Michael; King, Roy J.; Underhill, Peter A.; Villems, Richard; Chiaroni, Jacques (2013). "Afghan Hindu Kush: Where Eurasian Sub-Continent Gene Flows Converge". PLOS ONE. 8 (10): e76748. Bibcode:2013PLoSO...876748D. doi:10.1371/journal.pone.0076748. PMC 3799995. PMID 24204668.
  29. ^ Qamar, R; Ayub, Q; Mohyuddin, A; et al. (May 2002). "Y-Chromosomal DNA Variation in Pakistan". Am. J. Hum. Genet. 70 (5): 1107–24. doi:10.1086/339929. PMC 447589. PMID 11898125.
  30. ^ Firasat, S; Khaliq, S; Mohyuddin, A; et al. (January 2007). "Y-chromosomal evidence for a limited Greek contribution to the Pathan population of Pakistan". Eur. J. Hum. Genet. 15 (1): 121–26. doi:10.1038/sj.ejhg.5201726. PMC 2588664. PMID 17047675.
  31. ^ a b c Lacau, H; Gayden, T; Regueiro, M; Chennakrishnaiah, S; Bukhari, A; Underhill, PA; Garcia-Bertrand, RL; Herrera, RJ (Oct 2012). "Afghanistan from a Y-chromosome perspective". European Journal of Human Genetics. 20 (10): 1063–70. doi:10.1038/ejhg.2012.59. PMC 3449065. PMID 22510847.
  32. ^ a b Haber, M; Platt, DE; Ashrafian Bonab, M; et al. (2012). "Afghanistan's Ethnic Groups Share a Y-Chromosomal Heritage Structured by Historical Events". PLOS ONE. 7 (3): e34288. Bibcode:2012PLoSO...734288H. doi:10.1371/journal.pone.0034288. PMC 3314501. PMID 22470552.
  33. ^ a b c d Di Cristofaro, J; Pennarun, E; Mazières, S; Myres, NM; Lin, AA; Temori, SA; Metspalu, M; Metspalu, E; Witzel, M; King, RJ; Underhill, PA; Villems, R; Chiaroni, J (2013). "Afghan Hindu Kush: Where Eurasian Sub-Continent Gene Flows Converge". PLOS ONE. 8 (10): e76748. Bibcode:2013PLoSO...876748D. doi:10.1371/journal.pone.0076748. PMC 3799995. PMID 24204668.
  34. ^ a b Liu SH, N, Yilihamu, R Bake, et al. (2018), "A study of genetic diversity of three isolated populations in Xinjiang using Y-SNP." Acta Anthropologica Sinica, 2018, 37(1): 146-156.
  35. ^ Zhong 2010.
  36. ^ Fedorova 2013.
  37. ^ Karafet 2010.
  38. ^ Kim 2011.
  39. ^ Msaidie, Said; et al. (2011). "Genetic diversity on the Comoros Islands shows early seafaring as major determinant of human biocultural evolution in the Western Indian Ocean" (PDF). European Journal of Human Genetics. 19 (1): 89–94. doi:10.1038/ejhg.2010.128. PMC 3039498. PMID 20700146.
  40. ^ Shah, AM; Tamang, R; Moorjani, P; Rani, DS; Govindaraj, P; Kulkarni, G; Bhattacharya, T; Mustak, MS; Bhaskar, LV; Reddy, AG; Gadhvi, D; Gai, PB; Chaubey, G; Patterson, N; Reich, D; Tyler-Smith, C; Singh, L; Thangaraj, K (2011). "Indian Siddis: African Descendants with Indian Admixture". Am. J. Hum. Genet. 89 (1): 154–61. doi:10.1016/j.ajhg.2011.05.030. PMC 3135801. PMID 21741027.
  41. ^ a b c d Battaglia, Vincenza; Fornarino, Simona; Al-Zahery, Nadia; et al. (2009). "Y-chromosomal evidence of the cultural diffusion of agriculture in southeast Europe". European Journal of Human Genetics. 17: 820–830. doi:10.1038/ejhg.2008.249. Published online 24 December 2008.
  42. ^ a b c d e f g Balanovsky, Oleg; Dibirova, Khadizhat; Dybo, Anna; et al. (October 2011). "Parallel Evolution of Genes and Languages in the Caucasus Region". Molecular Biology and Evolution. 28 (10): 2905–2920. doi:10.1093/molbev/msr126. PMC 3355373. PMID 21571925.
  43. ^ ISOGG 2016.
  44. ^ a b c d e f g h i Yunusbayev, Bayazit; Metspalu, Mait; Järve, Mari; et al. (2012). "The Caucasus as an Asymmetric Semipermeable Barrier to Ancient Human Migrations". Molecular Biology and Evolution. 29 (1): 359–365. doi:10.1093/molbev/msr221. PMID 21917723.
  45. ^ a b Di Cristofaro, J; Pennarun, E; Mazières, S; Myres, NM; Lin, AA; et al. (2013). "Afghan Hindu Kush: Where Eurasian Sub-Continent Gene Flows Converge". PLOS ONE. 8 (10): e76748. Bibcode:2013PLoSO...876748D. doi:10.1371/journal.pone.0076748. PMC 3799995. PMID 24204668.
  46. ^ Platt, D.E.; Artinian, H.; Mouzaya, F. (2020). "Autosomal genetics and Y-chromosome haplogroup L1b-M317 reveal Mount Lebanon Maronites as a persistently non-emigrating population". European Journal of Human Genetics. 29 (4): 581–592. doi:10.1038/s41431-020-00765-x. PMID 33273712.
  47. ^ "Azerbaijan DNA". FamilyTreeDNA. Gene by Gene, Ltd.
  48. ^ "Germany- YDNA". FamilyTreeDNA. Gene by Gene, Ltd.
  49. ^ "Greek DNA Project". FamilyTreeDNA. Gene by Gene, Ltd.
  50. ^ Platt, D.E.; Artinian, H.; Mouzaya, F. (2020). "Autosomal genetics and Y-chromosome haplogroup L1b-M317 reveal Mount Lebanon Maronites as a persistently non-emigrating population". European Journal of Human Genetics. 29 (4): 581–592. doi:10.1038/s41431-020-00765-x. PMID 33273712.
  51. ^ "L - Y Haplogroup L". FamilyTreeDNA. Gene by Gene, Ltd.
  52. ^ Chuan-Chao Wang, Sabine Reinhold, Alexey Kalmykov, Antje Wissgott, Guido Brandt, Choongwon Jeong, Olivia Cheronet, Matthew Ferry, Eadaoin Harney, Denise Keating, Swapan Mallick, Nadin Rohland, Kristin Stewardson, Anatoly R. Kantorovich, Vladimir E. Maslov, Vladimira G. Petrenko, Vladimir R. Erlikh, Biaslan Ch. Atabiev, Rabadan G. Magomedov, Philipp L. Kohl, Kurt W. Alt, Sandra L. Pichler, Claudia Gerling, Harald Meller, Benik Vardanyan, Larisa Yeganyan, Alexey D. Rezepkin, Dirk Mariaschk, Natalia Berezina, Julia Gresky, Katharina Fuchs, Corina Knipper, Stephan Schiffels, Elena Balanovska, Oleg Balanovsky, Iain Mathieson, Thomas Higham, Yakov B. Berezin, Alexandra Buzhilova, Viktor Trifonov, Ron Pinhasi, Andrej B. Belinskiy, David Reich, Svend Hansen, Johannes Krause, Wolfgang Haak. bioRxiv 10.1101/322347. Now published in Nature Communications. doi:10.1038/s41467-018-08220-8.
  53. ^ Uali, Murat (25 November 2016). "Казахстанский ДНК-проект" [Kazakhstan DNA Project]. Np.kz (in Russian). Archived from the original on 26 November 2016. Retrieved 10 January 2016.
  54. ^ "The Genomic Formation of South and Central Asia". bioRxiv: 292581. 31 March 2018. doi:10.1101/292581.
  55. ^ Skourtanioti, Eirini; Erdal, Yilmaz S.; Frangipane, Marcella; Balossi Restelli, Francesca; Yener, K. Aslıhan; Pinnock, Frances; Matthiae, Paolo; Özbal, Rana; Schoop, Ulf-Dietrich; Guliyev, Farhad; Akhundov, Tufan; Lyonnet, Bertille; Hammer, Emily L.; Nugent, Selin E.; Burri, Marta; Neumann, Gunnar U.; Penske, Sandra; Ingman, Tara; Akar, Murat; Shafiq, Rula; Palumbi, Giulio; Eisenmann, Stefanie; d'Andrea, Marta; Rohrlach, Adam B.; Warinner, Christina; Jeong, Choongwon; Stockhammer, Philipp W.; Haak, Wolfgang; Krause, Johannes (2020). "Genomic History of Neolithic to Bronze Age Anatolia, Northern Levant, and Southern Caucasus". Cell. 181 (5): 1158–1175.e28. doi:10.1016/j.cell.2020.04.044. PMID 32470401. S2CID 219105572.
  56. ^ Margaryan, Ashot (2020). "Population genomics of the Viking world". Nature. 585 (7825): 390–396. doi:10.1038/s41586-020-2688-8. PMID 32939067.
  57. ^ Lazaridis, Iosif; et al. (25 July 2016). "Genomic insights into the origin of farming in the ancient Near East". Nature. 536 (7617): 419–24. Bibcode:2016Natur.536..419L. bioRxiv 10.1101/059311. doi:10.1038/nature19310. PMC 5003663. PMID 27459054.
  58. ^ "Y-DNA Haplotree". Family Tree DNA uses the Y-Chromosome Consortium tree and posts it on their website.

  1. ^ a b c Krahn, T.; FTDNA. "FTDNA Draft Y-DNA Tree (AKA YTree)". Family Tree DNA. Archived from the original on 2015-08-15. Retrieved 2013-01-01.
  2. ^ a b Henson, G.; Hrechdakian, P.; FTDNA (2013). "L – The Y-Haplogroup L Project". Retrieved 2013-01-01.

External links