The phylogenetic relationships of numerous branches within the core Y-chromosome haplogroup R-M207 support a West Asian origin of haplogroup R1b, its initial differentiation there followed by a rapid spread of one of its sub-clades carrying the M269 mutation to Europe. Here, we present phylogeographically resolved data for 2043 M269-derived Y-chromosomes from 118 West Asian and European populations assessed for the M412 SNP that largely separates the majority of Central and West European R1b lineages from those observed in Eastern Europe, the Circum-Uralic region, the Near East, the Caucasus and Pakistan. Within the M412 dichotomy, the major S116 sub-clade shows a frequency peak in the upper Danube basin and Paris area with declining frequency toward Italy, Iberia, Southern France and the British Isles. Although this frequency pattern closely approximates the spread of the Linearbandkeramik (LBK) Neolithic culture, an advent leading to a number of pre-historic cultural developments during the past ≤10 thousand years, more complex pre-Neolithic scenarios remain possible for the L23(xM412) components in Southeast Europe and elsewhere.
Despite strides in characterizing human history from genetic polymorphism data, progress in identifying genetic signatures of recent demography has been limited. Here we identify very recent fine-scale population structure in North America from a network of over 500 million genetic (identity-by-descent, IBD) connections among 770,000 genotyped individuals of US origin. We detect densely connected clusters within the network and annotate these clusters using a database of over 20 million genealogical records. Recent population patterns captured by IBD clustering include immigrants such as Scandinavians and French Canadians; groups with continental admixture such as Puerto Ricans; settlers such as the Amish and Appalachians who experienced geographic or cultural isolation; and broad historical trends, including reduced north-south gene flow. Our results yield a detailed historical portrait of North America after European settlement and support substantial genetic heterogeneity in the United States beyond that uncovered by previous studies.
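The idea of grouping individuals by their IBD connections can be illustrated with a deliberately simplified sketch. It is not the clustering procedure used in the study, which the abstract describes only as detecting densely connected clusters. Here, connected components over thresholded edges stand in for that step, and the function name `ibd_clusters`, the 12 cM cut-off and the toy edges are all hypothetical.

```python
from collections import defaultdict


def ibd_clusters(edges, min_shared_cm=12.0):
    """Group individuals into connected components of a toy IBD network.

    edges: iterable of (person_a, person_b, shared_cm) tuples.
    min_shared_cm: hypothetical threshold; weaker links are ignored.
    """
    parent = {}

    def find(x):
        # Union-find lookup with path halving.
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b, shared_cm in edges:
        root_a, root_b = find(a), find(b)  # registers both individuals
        if shared_cm >= min_shared_cm and root_a != root_b:
            parent[root_a] = root_b

    groups = defaultdict(set)
    for person in list(parent):
        groups[find(person)].add(person)

    # Largest clusters first, members sorted for stable output.
    return sorted((sorted(g) for g in groups.values()),
                  key=lambda g: (-len(g), g))


# Toy data (invented for illustration): the weak C-D link is dropped.
toy_edges = [("A", "B", 40.0), ("B", "C", 25.0),
             ("C", "D", 5.0), ("D", "E", 30.0)]
clusters = ibd_clusters(toy_edges)
```

With the toy edges, the weak link between C and D falls below the threshold, so the individuals split into two groups. A real analysis of hundreds of millions of connections would need a community-detection method that tolerates sparse links between clusters, which plain connected components does not.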
Although Afghanistan is located at the crossroads of Asia, the genetics of its populations have been largely overlooked. It is currently inhabited by five major ethnic populations: Pashtun, Tajik, Hazara, Uzbek and Turkmen. Here we present autosomal data from a subset of our samples, together with mitochondrial and Y-chromosome data from over 500 Afghan samples among these five ethnic groups. These Afghan data were supplemented with the same Y-chromosome analyses of samples from Iran, Kyrgyzstan, Mongolia and updated Pakistani samples (HGDP-CEPH). The data presented here were integrated into existing knowledge of pan-Eurasian genetic diversity. The pattern of genetic variation, revealed by structure-like and Principal Component analyses and Analysis of Molecular Variance, indicates that the people of Afghanistan are made up of a mosaic of components representing various geographic regions of Eurasian ancestry. The absence of a major Central Asian-specific component indicates that the Hindu Kush, like the gene pool of Central Asian populations in general, is a confluence of gene flows rather than a source of distinctly autochthonous populations that have arisen in situ: a conclusion that is reinforced by the phylogeography of both haploid loci.
Sequence diversity and the ages of the deepest nodes of the MSY phylogeny remain largely unexplored due to the severely biased collection of SNPs available for study. We characterized 68 worldwide Y chromosomes by high-coverage next-generation sequencing, including 18 deep-rooting ones, and identified 2386 SNPs, 80% of which were novel. Many aspects of this pool of variants resembled the pattern observed among genome-wide de novo events, suggesting that in the MSY, a large proportion of newly arisen alleles has survived in the phylogeny. Some degree of purifying selection emerged in the form of an excess of private missense variants. Our tree recapitulated the previously known topology, but the relative lengths of major branches were drastically modified and the associated node ages were remarkably older. We found significantly different branch lengths when comparing the rare deep-rooted A1b African lineage with the rest of the tree. Our dating results and phylogeography led to the following main conclusions: (1) Patrilineal lineages with ages approaching those of early AMH fossils survive today only in central-western Africa; (2) only a few evolutionarily successful MSY lineages survived between 160 and 115 kya; and (3) an early exit out of Africa (before 70 kya), which fits recent western Asian archaeological evidence, should be considered. Our experimental design produced an unbiased resource of new MSY markers informative for the initial formation of the anatomically modern human gene pool, i.e., a period of our evolution that had been previously considered to be poorly accessible with paternally inherited markers.
It is widely accepted that the ancestors of Native Americans arrived in the New World via Beringia approximately 10 to 30 thousand years ago (kya). However, the arrival time(s), number of expansion events, and migration routes into the Western Hemisphere remain controversial because linguistic, archaeological, and genetic evidence have not yet provided coherent answers. Notably, most of the genetic evidence has been acquired from the analysis of the common pan-American mitochondrial DNA (mtDNA) haplogroups. In this study, we have instead identified and analyzed mtDNAs belonging to two rare Native American haplogroups named D4h3 and X2a.
Phylogeographic analyses at the highest level of molecular resolution (69 entire mitochondrial genomes) reveal that two almost concomitant paths of migration from Beringia led to the Paleo-Indian dispersal approximately 15–17 kya. Haplogroup D4h3 spread into the Americas along the Pacific coast, whereas X2a entered through the ice-free corridor between the Laurentide and Cordilleran ice sheets. The examination of an additional 276 entire mtDNA sequences provides similar entry times for all common Native American haplogroups, thus indicating at least a dual origin for Paleo-Indians.
A dual origin for the first Americans is a striking novelty from the genetic point of view, and it makes plausible a scenario positing that within a rather short period of time, there may have been several entries into the Americas from a dynamically changing Beringian source. Moreover, this implies that most probably more than one language family was carried along with the Paleo-Indians.
Pan-American mitochondrial DNA (mtDNA) haplogroup C1 has been recently subdivided into three branches, two of which (C1b and C1c) are characterized by ages and geographical distributions that are indicative of an early arrival from Beringia with Paleo-Indians. In contrast, the estimated ages of C1d, the third subset of C1, looked too young to fit the above scenario. To define the origin of this enigmatic C1 branch, we completely sequenced 63 C1d mitochondrial genomes from a wide range of geographically diverse, mixed, and indigenous American populations. The revised phylogeny not only brings the age of C1d within the range of that of its two sister clades, but reveals that there were two C1d founder genomes for Paleo-Indians. Thus, the recognized maternal founding lineages of Native Americans are at least 15, indicating that the overall number of Beringian or Asian founder mitochondrial genomes will probably increase extensively when all Native American haplogroups reach the same level of phylogenetic and genomic resolution as obtained here for C1d.
Factors affecting the rate and pattern of the mutational process are being identified for human autosomes, but the same relationships for the male-specific portion of the Y chromosome (MSY) are not established. We considered 3,390 mutations occurring in 19 sequence bins identified by sequencing 1.5 Mb of the MSY from each of 104 present-day chromosomes. The occurrence of mutations was not proportional to the amount of sequenced bases in each bin, with a 2-fold variation. The regression of the number of mutations per unit sequence against a number of indicators of the genomic features of each bin revealed the same fundamental patterns as in the autosomes. By considering the sequences of the same region from two precisely dated ancient specimens, we obtained a calibrated region-specific substitution rate of 0.716 × 10⁻⁹/site/year. Despite its lack of recombination and other peculiar features, the MSY then resembles the autosomes in displaying a marked regional heterogeneity of the mutation rate. An immediate implication is that a given figure for the substitution rate only makes sense if bound to a specific DNA region. By strictly applying this principle we obtained an unbiased estimate of the antiquity of lineages relevant to the genetic history of the human Y chromosome. In particular, the two deepest nodes of the tree highlight the survival, in Central-Western Africa, of lineages whose coalescence (291 ky, 95% C.I. 253-343) predates the emergence of anatomically modern features in the fossil record.
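How a region-specific substitution rate turns mutation counts into ages can be shown with back-of-the-envelope arithmetic. The sketch below is not the dating method used in the study. It assumes a strict molecular clock and treats the full 1.5 Mb as callable in every sample. The function names are hypothetical; only the rate, the sequenced length and the 291 ky node age come from the abstract.

```python
# Illustrative arithmetic only: strict clock, whole region callable
# (both are simplifying assumptions, not claims about the study).

RATE_PER_SITE_PER_YEAR = 0.716e-9  # calibrated rate quoted in the abstract
SEQUENCED_SITES = 1.5e6            # 1.5 Mb of MSY sequence per chromosome


def mutations_per_year(rate=RATE_PER_SITE_PER_YEAR, sites=SEQUENCED_SITES):
    """Expected mutations accumulating along one lineage per year."""
    return rate * sites


def expected_mutations(age_years, rate=RATE_PER_SITE_PER_YEAR,
                       sites=SEQUENCED_SITES):
    """Expected mutations from a node of the given age down to one tip."""
    return age_years * rate * sites


def node_age_years(mean_mutations_to_tips, rate=RATE_PER_SITE_PER_YEAR,
                   sites=SEQUENCED_SITES):
    """Node age implied by the mean number of mutations from node to tips."""
    return mean_mutations_to_tips / (rate * sites)


if __name__ == "__main__":
    years_per_mutation = 1.0 / mutations_per_year()
    print(f"about one mutation every {years_per_mutation:.0f} years per lineage")
    print(f"about {expected_mutations(291_000):.0f} mutations expected over 291 ky")
```

Under these assumptions the sequenced region accumulates roughly one mutation per lineage every 930 years, so a node dated to 291 ky corresponds to a little over 300 mutations between the node and each tip. Because the rate is bound to this specific region, applying it to a different stretch of the MSY would not be justified.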
R1a-M420 is one of the most widely spread Y-chromosome haplogroups; however, its substructure within Europe and Asia has remained poorly characterized. Using a panel of 16 244 male subjects from 126 populations sampled across Eurasia, we identified 2923 R1a-M420 Y-chromosomes and analyzed them to a highly granular phylogeographic resolution. Whole Y-chromosome sequence analysis of eight R1a and five R1b individuals suggests a divergence time of ∼25,000 (95% CI: 21,300-29,000) years ago and a coalescence time within R1a-M417 of ∼5800 (95% CI: 4800-6800) years. The spatial frequency distributions of R1a sub-haplogroups conclusively indicate two major groups, one found primarily in Europe and the other confined to Central and South Asia. Beyond the major European versus Asian dichotomy, we describe several younger sub-haplogroups. Based on spatial distributions and diversity patterns within the R1a-M420 clade, particularly rare basal branches detected primarily within Iran and eastern Turkey, we conclude that the initial episodes of haplogroup R1a diversification likely occurred in the vicinity of present-day Iran.
One hundred and forty-six previously detected mutations were more precisely positioned in the human Y chromosome phylogeny by the analysis of 51 representative Y chromosome haplogroups and the use of 59 mutations from literature. Twenty-two new mutations were also described and incorporated in the revised phylogeny. This analysis made it possible to identify new haplogroups and to resolve a deep trifurcation within haplogroup B2. Our data provide a highly resolved branching in the African-specific portion of the Y tree and support the hypothesis of an origin in the north-western quadrant of the African continent for the human MSY diversity.
The process of Greek colonization of the central and western Mediterranean during the Archaic and Classical Eras has been understudied from the perspective of population genetics. To investigate the Y chromosomal demography of Greek colonization in the western Mediterranean, Y-chromosome data consisting of 29 YSNPs and 37 YSTRs were compared from 51 subjects from Provence, 58 subjects from Smyrna and 31 subjects whose paternal ancestry derives from Asia Minor Phokaia, the ancestral embarkation port to the 6th century BCE Greek colonies of Massalia (Marseilles) and Alalie (Aleria, Corsica).
19% of the Phokaian and 12% of the Smyrnian representatives were derived for haplogroup E-V13, characteristic of the Greek and Balkan mainland, while 4% of the Provencal, 4.6% of East Corsican and 1.6% of West Corsican samples were derived for E-V13. An admixture analysis estimated that 17% of the Y-chromosomes of Provence may be attributed to Greek colonization. Using the following putative Neolithic Anatolian lineages: J2a-DYS445 = 6, G2a-M406 and J2a1b1-M92, the data predict a 0% Neolithic contribution to Provence from Anatolia. Estimates of colonial Greek vs. indigenous Celto-Ligurian demography predict a maximum of a 10% Greek contribution, suggesting a Greek male elite-dominant input into the Iron Age Provence population.
Given that the origin of viniculture in Provence is ascribed to Massalia, these results suggest that E-V13 may trace the demographic and socio-cultural impact of Greek colonization in Mediterranean Europe, a contribution that appears to be considerably larger than that of a Neolithic pioneer colonization.
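The E-V13 frequencies reported above come from small samples, so it helps to see how much sampling uncertainty they carry. The sketch below computes a Wilson score interval from a reported proportion and sample size. It is a generic illustration, not an analysis from the study. It pairs the rounded percentages (19% and 12%) with the sample sizes of 31 Phokaian and 58 Smyrnian subjects, so the results are approximate. The function name `wilson_interval` is hypothetical.

```python
import math


def wilson_interval(proportion, n, z=1.96):
    """Wilson score interval for a binomial proportion (z=1.96 for ~95%)."""
    if n <= 0:
        raise ValueError("n must be positive")
    if not 0.0 <= proportion <= 1.0:
        raise ValueError("proportion must lie in [0, 1]")
    z2 = z * z
    denom = 1.0 + z2 / n
    centre = (proportion + z2 / (2.0 * n)) / denom
    half_width = z * math.sqrt(
        proportion * (1.0 - proportion) / n + z2 / (4.0 * n * n)
    ) / denom
    return max(0.0, centre - half_width), min(1.0, centre + half_width)


# Rounded percentages and sample sizes as quoted in the abstract.
phokaia_low, phokaia_high = wilson_interval(0.19, 31)
smyrna_low, smyrna_high = wilson_interval(0.12, 58)

if __name__ == "__main__":
    print(f"Phokaia E-V13: {phokaia_low:.1%} to {phokaia_high:.1%}")
    print(f"Smyrna  E-V13: {smyrna_low:.1%} to {smyrna_high:.1%}")
```

Intervals this wide are typical of samples with a few dozen subjects, so comparisons between populations based on such frequencies are best read with that uncertainty in mind.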