TY - JOUR
T1 - Deep phylogenetic analysis of haplogroup G1 provides estimates of SNP and STR mutation rates on the human Y-chromosome and reveals migrations of Iranic speakers
AU - Balanovsky, Oleg
AU - Zhabagin, Maxat
AU - Agdzhoyan, Anastasiya
AU - Chukhryaeva, Marina
AU - Zaporozhchenko, Valery
AU - Utevska, Olga
AU - Highnam, Gareth
AU - Sabitov, Zhaxylyk
AU - Greenspan, Elliott
AU - Dibirova, Khadizhat
AU - Skhalyakho, Roza
AU - Kuznetsova, Marina
AU - Koshel, Sergey
AU - Yusupov, Yuldash
AU - Nymadawa, Pagbajabyn
AU - Zhumadilov, Zhaxybay
AU - Pocheshkhova, Elvira
AU - Haber, Marc
AU - Zalloua, Pierre A.
AU - Yepiskoposyan, Levon
AU - Dybo, Anna
AU - Tyler-Smith, Chris
AU - Balanovska, Elena
PY - 2015/4/7
Y1 - 2015/4/7
N2 - Y-chromosomal haplogroup G1 is a minor component of the overall gene pool of South-West and Central Asia but reaches up to 80% frequency in some populations scattered within this area. We have genotyped the G1-defining marker M285 in 27 Eurasian populations (n = 5,346), analyzed 367 M285-positive samples using 17 Y-STRs, and sequenced ∼11 Mb of the Y-chromosome in 20 of these samples to an average coverage of 67X. This allowed detailed phylogenetic reconstruction. We identified five branches, all with high geographical specificity: G1-L1323 in Kazakhs, the closely related G1-GG1 in Mongols, G1-GG265 in Armenians and its distant brother clade G1-GG162 in Bashkirs, and G1-GG362 in West Indians. The haplotype diversity, which decreased from West Iran to Central Asia, allows us to hypothesize that this rare haplogroup could have been carried by the expansion of Iranic speakers northwards to the Eurasian steppe and via founder effects became a predominant genetic component of some populations, including the Argyn tribe of the Kazakhs. The remarkable agreement between genetic and genealogical trees of Argyns allowed us to calibrate the molecular clock using a historical date (1405 AD) of the most recent common genealogical ancestor. The mutation rate for Y-chromosomal sequence data obtained was 0.78×10⁻⁹ per bp per year, falling within the range of published rates. The mutation rate for Y-chromosomal STRs was 0.0022 per locus per generation, very close to the so-called genealogical rate. The "clan-based" approach to estimating the mutation rate provides a third, middle way between direct father-to-son comparisons and using archeologically known migrations, whose dates are subject to revision and of uncertain relationship to genetic events.
AB - Y-chromosomal haplogroup G1 is a minor component of the overall gene pool of South-West and Central Asia but reaches up to 80% frequency in some populations scattered within this area. We have genotyped the G1-defining marker M285 in 27 Eurasian populations (n = 5,346), analyzed 367 M285-positive samples using 17 Y-STRs, and sequenced ∼11 Mb of the Y-chromosome in 20 of these samples to an average coverage of 67X. This allowed detailed phylogenetic reconstruction. We identified five branches, all with high geographical specificity: G1-L1323 in Kazakhs, the closely related G1-GG1 in Mongols, G1-GG265 in Armenians and its distant brother clade G1-GG162 in Bashkirs, and G1-GG362 in West Indians. The haplotype diversity, which decreased from West Iran to Central Asia, allows us to hypothesize that this rare haplogroup could have been carried by the expansion of Iranic speakers northwards to the Eurasian steppe and via founder effects became a predominant genetic component of some populations, including the Argyn tribe of the Kazakhs. The remarkable agreement between genetic and genealogical trees of Argyns allowed us to calibrate the molecular clock using a historical date (1405 AD) of the most recent common genealogical ancestor. The mutation rate for Y-chromosomal sequence data obtained was 0.78×10⁻⁹ per bp per year, falling within the range of published rates. The mutation rate for Y-chromosomal STRs was 0.0022 per locus per generation, very close to the so-called genealogical rate. The "clan-based" approach to estimating the mutation rate provides a third, middle way between direct father-to-son comparisons and using archeologically known migrations, whose dates are subject to revision and of uncertain relationship to genetic events.
UR - http://www.scopus.com/inward/record.url?scp=84927537818&partnerID=8YFLogxK
U2 - 10.1371/journal.pone.0122968
DO - 10.1371/journal.pone.0122968
M3 - Article
C2 - 25849548
AN - SCOPUS:84927537818
SN - 1932-6203
VL - 10
JO - PLoS ONE
JF - PLoS ONE
IS - 4
M1 - e0122968
ER -