Human Y chromosome haplogroup J1-M267 is a common male lineage in West Asia. One high-frequency region-encompassing the Arabian Peninsula, southern Mesopotamia, and the southern Levant-resides ~ ...2000 km away from the other one found in the Caucasus. The region between them, although has a lower frequency, nevertheless demonstrates high genetic diversity. Studies associate this haplogroup with the spread of farming from the Fertile Crescent to Europe, the spread of mobile pastoralism in the desert regions of the Arabian Peninsula, the history of the Jews, and the spread of Islam. Here, we study past human male demography in West Asia with 172 high-coverage whole Y chromosome sequences and 889 genotyped samples of haplogroup J1-M267. We show that this haplogroup evolved ~ 20,000 years ago somewhere in northwestern Iran, the Caucasus, the Armenian Highland, and northern Mesopotamia. The major branch-J1a1a1-P58-evolved during the early Holocene ~ 9500 years ago somewhere in the Arabian Peninsula, the Levant, and southern Mesopotamia. Haplogroup J1-M267 expanded during the Chalcolithic, the Bronze Age, and the Iron Age. Most probably, the spread of Afro-Asiatic languages, the spread of mobile pastoralism in the arid zones, or both of these events together explain the distribution of haplogroup J1-M267 we see today in the southern regions of West Asia.
The genetic origins of Uralic speakers from across a vast territory in the temperate zone of North Eurasia have remained elusive. Previous studies have shown contrasting proportions of Eastern and ...Western Eurasian ancestry in their mitochondrial and Y chromosomal gene pools. While the maternal lineages reflect by and large the geographic background of a given Uralic-speaking population, the frequency of Y chromosomes of Eastern Eurasian origin is distinctively high among European Uralic speakers. The autosomal variation of Uralic speakers, however, has not yet been studied comprehensively.
Here, we present a genome-wide analysis of 15 Uralic-speaking populations which cover all main groups of the linguistic family. We show that contemporary Uralic speakers are genetically very similar to their local geographical neighbours. However, when studying relationships among geographically distant populations, we find that most of the Uralic speakers and some of their neighbours share a genetic component of possibly Siberian origin. Additionally, we show that most Uralic speakers share significantly more genomic segments identity-by-descent with each other than with geographically equidistant speakers of other languages. We find that correlated genome-wide genetic and lexical distances among Uralic speakers suggest co-dispersion of genes and languages. Yet, we do not find long-range genetic ties between Estonians and Hungarians with their linguistic sisters that would distinguish them from their non-Uralic-speaking neighbours.
We show that most Uralic speakers share a distinct ancestry component of likely Siberian origin, which suggests that the spread of Uralic languages involved at least some demic component.
The phylogenetic analysis of Y chromosomal haplogroup O2a-M95 was crucial to determine the nested structure of South Asian branches within the larger tree, predominantly present in East and Southeast ...Asia. However, it had previously been unclear that how many founders brought the haplogroup O2a-M95 to South Asia. On the basis of the updated Y chromosomal tree for haplogroup O2a-M95, we analysed 1437 male samples from South Asia for various novel downstream markers, carefully selected from the extant phylogenetic tree. With this increased resolution of genetic markers, we were able to identify at least three founders downstream to haplogroup O2a-M95, who are likely to have been associated with the dispersal of Austroasiatic languages to South Asia. The fourth founder was exclusively present amongst Tibeto-Burman speakers of Manipur and Bangladesh. In sum, our new results suggest the arrival of Austroasiatic languages in South Asia during last 5000 years.
The most frequent Y-chromosomal (chrY) haplogroups in northern and eastern Europe (NEE) are well-known and thoroughly characterised. Yet a considerable number of men in every population carry rare ...paternal lineages with estimated frequencies around 5%. So far, limited sample-sizes and insufficient resolution of genotyping have obstructed a truly comprehensive look into the variety of rare paternal lineages segregating within populations and potential signals of population history that such lineages might convey. Here we harness the power of massive re-sequencing of human Y chromosomes to identify previously unknown population-specific clusters among rare paternal lineages in NEE. We construct dated phylogenies for haplogroups E2-M215, J2-M172, G-M201 and Q-M242 on the basis of 421 (of them 282 novel) high-coverage chrY sequences collected from large-scale databases focusing on populations of NEE. Within these otherwise rare haplogroups we disclose lineages that began to radiate ~1-3 thousand years ago in Estonia and Sweden and reveal male phylogenetic patterns testifying of comparatively recent local demographic expansions. Conversely, haplogroup Q lineages bear evidence of ancient Siberian influence lingering in the modern paternal gene pool of northern Europe. We assess the possible direction of influx of ancestral carriers for some of these male lineages. In addition, we demonstrate the congruency of paternal haplogroup composition of our dataset with two independent population-based cohorts from Estonia and Sweden.
The Parsis are one of the smallest religious communities in the world. To understand the population structure and demographic history of this group in detail, we analyzed Indian and Pakistani Parsi ...populations using high-resolution genetic variation data on autosomal and uniparental loci (Y-chromosomal and mitochondrial DNA). Additionally, we also assayed mitochondrial DNA polymorphisms among ancient Parsi DNA samples excavated from Sanjan, in present day Gujarat, the place of their original settlement in India.
Among present-day populations, the Parsis are genetically closest to Iranian and the Caucasus populations rather than their South Asian neighbors. They also share the highest number of haplotypes with present-day Iranians and we estimate that the admixture of the Parsis with Indian populations occurred ~1,200 years ago. Enriched homozygosity in the Parsi reflects their recent isolation and inbreeding. We also observed 48% South-Asian-specific mitochondrial lineages among the ancient samples, which might have resulted from the assimilation of local females during the initial settlement. Finally, we show that Parsis are genetically closer to Neolithic Iranians than to modern Iranians, who have witnessed a more recent wave of admixture from the Near East.
Our results are consistent with the historically-recorded migration of the Parsi populations to South Asia in the 7th century and in agreement with their assimilation into the Indian sub-continent's population and cultural milieu "like sugar in milk". Moreover, in a wider context our results support a major demographic transition in West Asia due to the Islamic conquest.
Hungarians who live in Central Europe today are one of the westernmost Uralic speakers. Despite of the proposed Volga-Ural/West Siberian roots of the Hungarian language, the present-day Hungarian ...gene pool is highly similar to that of the surrounding Indo-European speaking populations. However, a limited portion of specific Y-chromosomal lineages from haplogroup N, sometimes associated with the spread of Uralic languages, link modern Hungarians with populations living close to the Ural Mountain range on the border of Europe and Asia. Here we investigate the paternal genetic connection between these spatially separated populations. We reconstruct the phylogeny of N3a4-Z1936 clade by using 33 high-coverage Y-chromosomal sequences and estimate the coalescent times of its sub-clades. We genotype close to 5000 samples from 46 Eurasian populations to show the presence of N3a4-B539 lineages among Hungarians and in the populations from Ural Mountain region, including Ob-Ugric-speakers from West Siberia who are geographically distant but linguistically closest to Hungarians. This sub-clade splits from its sister-branch N3a4-B535, frequent today among Northeast European Uralic speakers, 4000-5000 ya, which is in the time-frame of the proposed divergence of Ugric languages.
Kalmyks, the only Mongolic-speaking population in Europe, live in the southeast of the European Plain, in Russia. They adhere to Buddhism and speak a dialect of the Mongolian language. Historical and ...linguistic evidence, as well a shared clan names, suggests a common origin with Oirats of western Mongolia; yet, only a limited number of genetic studies have focused on this topic. Here we compare the paternal genetic relationship of Kalmyk clans with ethnographically related groups from Mongolia, Kyrgyzstan and China, within the context of their neighbouring populations. A phylogeny of 37 high-coverage Y-chromosome sequences, together with further genotyping of larger sample sets, reveals that all the Oirat-speaking populations studied here, including Kalmyks, share, as a dominant paternal lineage, Y-chromosomal haplogroup C3c1-M77, which is also present in several geographically distant native Siberian populations. We identify a subset of this clade, C3c1b-F6379, specifically enriched in Kalmyks as well as in Oirat-speaking clans in Inner Asia. This sub-clade coalesces at around 1500 years before present, before the Genghis Khan era, and significantly earlier than the split between Kalmyks and other Oirat speakers about 400 years ago. We also show that split between the dominant hg C variant among Buryats-C3-M407-and that of C3-F6379, took place in the Early Upper Palaeolithic, suggesting an extremely long duration for the dissipation of hg C3-M217 carriers across northern Eurasia, which cuts through today's major linguistic phyla.
The debate concerning the origin of the Polynesian speaking peoples has been recently reinvigorated by genetic evidence for secondary migrations to western Polynesia from the New Guinea region during ...the 2nd millennium BP. Using genome-wide autosomal data from the Leeward Society Islands, the ancient cultural hub of eastern Polynesia, we find that the inhabitants' genomes also demonstrate evidence of this episode of admixture, dating to 1,700-1,200 BP. This supports a late settlement chronology for eastern Polynesia, commencing ~1,000 BP, after the internal differentiation of Polynesian society. More than 70% of the autosomal ancestry of Leeward Society Islanders derives from Island Southeast Asia with the lowland populations of the Philippines as the single largest potential source. These long-distance migrants into Polynesia experienced additional admixture with northern Melanesians prior to the secondary migrations of the 2nd millennium BP. Moreover, the genetic diversity of mtDNA and Y chromosome lineages in the Leeward Society Islands is consistent with linguistic evidence for settlement of eastern Polynesia proceeding from the central northern Polynesian outliers in the Solomon Islands. These results stress the complex demographic history of the Leeward Society Islands and challenge phylogenetic models of cultural evolution predicated on eastern Polynesia being settled from Samoa.
Uniparental genetic systems are unique sex indicators and complement the study of autosomal diversity by providing landmarks of human migrations that repeatedly shaped the structure of extant ...populations. Our knowledge of the variation of the male-specific region of the Y chromosome in Native Americans is still rather scarce and scattered, but by merging sequence information from modern and ancient individuals, we here provide a comprehensive and updated phylogeny of the distinctive Native American branches of haplogroups C and Q. Our analyses confirm C-MPB373, C-P39, Q-Z780, Q-M848, and Q-Y4276 as the main founding haplogroups and identify traces of unsuccessful (pre-Q-F1096) or extinct (C-L1373*, Q-YP4010*) Y-chromosome lineages, indicating that haplogroup diversity of the founder populations that first entered the Americas was greater than that observed in the Indigenous component of modern populations. In addition, through a diachronic and phylogeographic dissection of newly identified Q-M848 branches, we provide the first Y-chromosome insights into the early peopling of the South American hinterland (Q-BY104773 and Q-BY15730) and on overlying inland migrations (Q-BY139813).
Linguistic and genetic studies on Roma populations inhabited in Europe have unequivocally traced these populations to the Indian subcontinent. However, the exact parental population group and time of ...the out-of-India dispersal have remained disputed. In the absence of archaeological records and with only scanty historical documentation of the Roma, comparative linguistic studies were the first to identify their Indian origin. Recently, molecular studies on the basis of disease-causing mutations and haploid DNA markers (i.e. mtDNA and Y-chromosome) supported the linguistic view. The presence of Indian-specific Y-chromosome haplogroup H1a1a-M82 and mtDNA haplogroups M5a1, M18 and M35b among Roma has corroborated that their South Asian origins and later admixture with Near Eastern and European populations. However, previous studies have left unanswered questions about the exact parental population groups in South Asia. Here we present a detailed phylogeographical study of Y-chromosomal haplogroup H1a1a-M82 in a data set of more than 10,000 global samples to discern a more precise ancestral source of European Romani populations. The phylogeographical patterns and diversity estimates indicate an early origin of this haplogroup in the Indian subcontinent and its further expansion to other regions. Tellingly, the short tandem repeat (STR) based network of H1a1a-M82 lineages displayed the closest connection of Romani haplotypes with the traditional scheduled caste and scheduled tribe population groups of northwestern India.