We report genome-wide ancient DNA from 44 ancient Near Easterners ranging in time between ~12,000 and 1,400 bc, from Natufian hunter-gatherers to Bronze Age farmers. We show that the earliest ...populations of the Near East derived around half their ancestry from a 'Basal Eurasian' lineage that had little if any Neanderthal admixture and that separated from other non-African lineages before their separation from each other. The first farmers of the southern Levant (Israel and Jordan) and Zagros Mountains (Iran) were strongly genetically differentiated, and each descended from local hunter-gatherers. By the time of the Bronze Age, these two populations and Anatolian-related farmers had mixed with each other and with the hunter-gatherers of Europe to greatly reduce genetic differentiation. The impact of the Near Eastern farmers extended beyond the Near East: farmers related to those of Anatolia spread westward into Europe; farmers related to those of the Levant spread southward into East Africa; farmers related to those of Iran spread northward into the Eurasian steppe; and people related to both the early farmers of Iran and to the pastoralists of the Eurasian steppe spread eastward into South Asia.
North African populations are distinct from sub-Saharan Africans based on cultural, linguistic, and phenotypic attributes; however, the time and the extent of genetic divergence between populations ...north and south of the Sahara remain poorly understood. Here, we interrogate the multilayered history of North Africa by characterizing the effect of hypothesized migrations from the Near East, Europe, and sub-Saharan Africa on current genetic diversity. We present dense, genome-wide SNP genotyping array data (730,000 sites) from seven North African populations, spanning from Egypt to Morocco, and one Spanish population. We identify a gradient of likely autochthonous Maghrebi ancestry that increases from east to west across northern Africa; this ancestry is likely derived from "back-to-Africa" gene flow more than 12,000 years ago (ya), prior to the Holocene. The indigenous North African ancestry is more frequent in populations with historical Berber ethnicity. In most North African populations we also see substantial shared ancestry with the Near East, and to a lesser extent sub-Saharan Africa and Europe. To estimate the time of migration from sub-Saharan populations into North Africa, we implement a maximum likelihood dating method based on the distribution of migrant tracts. In order to first identify migrant tracts, we assign local ancestry to haplotypes using a novel, principal component-based analysis of three ancestral populations. We estimate that a migration of western African origin into Morocco began about 40 generations ago (approximately 1,200 ya); a migration of individuals with Nilotic ancestry into Egypt occurred about 25 generations ago (approximately 750 ya). Our genomic data reveal an extraordinarily complex history of migrations, involving at least five ancestral populations, into North Africa.
The Roma population is the largest transnational ethnic minority in Europe, characterized by a linguistic, cultural and historical heterogeneity. Comparative linguistics and genetic studies have ...placed the origin of European Roma in the Northwest of India. After their migration across Persia, they entered into the Balkan Peninsula, from where they spread into Europe, arriving in the Iberian Peninsula in the 15th century. Their particular demographic history has genetic implications linked to rare and common diseases. However, the South Asian source of the proto-Roma remains still untargeted and the West Eurasian Roma component has not been yet deeply characterized. Here, in order to describe both the South Asian and West Eurasian ancestries, we analyze previously published genome-wide data of 152 European Roma and 34 new Iberian Roma samples at a fine-scale and haplotype-based level, with special focus on the Iberian Roma genetic substructure. Our results suggest that the putative origin of the proto-Roma involves a Punjabi group with low levels of West Eurasian ancestry. In addition, we have identified a complex West Eurasian component (around 65%) in the Roma, as a result of the admixture events occurred with non-proto-Roma populations between 1270-1580. Particularly, we have detected the Balkan genetic footprint in all European Roma, and the Baltic and Iberian components in the Northern and Western Roma groups, respectively. Finally, our results show genetic substructure within the Iberian Roma, with different levels of West Eurasian admixture, as a result of the complex historical events occurred in the Peninsula.
Population demography and gene flow among African groups, as well as the putative archaic introgression of ancient hominins, have been poorly explored at the genome level.
Here, we examine 15 African ...populations covering all major continental linguistic groups, ecosystems, and lifestyles within Africa through analysis of whole-genome sequence data of 21 individuals sequenced at deep coverage. We observe a remarkable correlation among genetic diversity and geographic distance, with the hunter-gatherer groups being more genetically differentiated and having larger effective population sizes throughout most modern-human history. Admixture signals are found between neighbor populations from both hunter-gatherer and agriculturalists groups, whereas North African individuals are closely related to Eurasian populations. Regarding archaic gene flow, we test six complex demographic models that consider recent admixture as well as archaic introgression. We identify the fingerprint of an archaic introgression event in the sub-Saharan populations included in the models (~ 4.0% in Khoisan, ~ 4.3% in Mbuti Pygmies, and ~ 5.8% in Mandenka) from an early divergent and currently extinct ghost modern human lineage.
The present study represents an in-depth genomic analysis of a Pan African set of individuals, which emphasizes their complex relationships and demographic history at population level.
To shed light on the peopling of South Asia and the origins of the morphological adaptations found there, we analyzed whole-genome sequences from 10 Andamanese individuals and compared them with ...sequences for 60 individuals from mainland Indian populations with different ethnic histories and with publicly available data from other populations. We show that all Asian and Pacific populations share a single origin and expansion out of Africa, contradicting an earlier proposal of two independent waves of migration. We also show that populations from South and Southeast Asia harbor a small proportion of ancestry from an unknown extinct hominin, and this ancestry is absent from Europeans and East Asians. The footprints of adaptive selection in the genomes of the Andamanese show that the characteristic distinctive phenotypes of this population (including very short stature) do not reflect an ancient African origin but instead result from strong natural selection on genes related to human body size.
Tandem repeats (TRs) are stretches of DNA that are highly variable in length and mutate rapidly. They are thus an important source of genetic variation. This variation is highly informative for ...population and conservation genetics. It has also been associated with several pathological conditions and with gene expression regulation. However, genome-wide surveys of TR variation in humans and closely related species have been scarce due to technical difficulties derived from short-read technology. Here we explored the genome-wide diversity of TRs in a panel of 83 human and nonhuman great ape genomes, in a total of six different species, and studied their impact on gene expression evolution. We found that population diversity patterns can be efficiently captured with short TRs (repeat unit length, 1-5 bp). We examined the potential evolutionary role of TRs in gene expression differences between humans and primates by using 30,275 larger TRs (repeat unit length, 2-50 bp). Genes that contained TRs in the promoters, in their 3' untranslated region, in introns, and in exons had higher expression divergence than genes without repeats in the regions. Polymorphic small repeats (1-5 bp) had also higher expression divergence compared with genes with fixed or no TRs in the gene promoters. Our findings highlight the potential contribution of TRs to human evolution through gene regulation.
The demographic history of human populations in North Africa has been characterized by complex processes of admixture and isolation that have modeled its current gene pool. Diverse genetic ancestral ...components with different origins (autochthonous, European, Middle Eastern, and sub-Saharan) and genetic heterogeneity in the region have been described. In this complex genetic landscape, Algeria, the largest country in Africa, has been poorly covered, with most of the studies using a single Algerian sample. In order to evaluate the genetic heterogeneity of Algeria, Y-chromosome, mtDNA and autosomal genome-wide makers have been analyzed in several Berber- and Arab-speaking groups. Our results show that the genetic heterogeneity found in Algeria is not correlated with geography or linguistics, challenging the idea of Berber groups being genetically isolated and Arab groups open to gene flow. In addition, we have found that external sources of gene flow into North Africa have been carried more often by females than males, while the North African autochthonous component is more frequent in paternally transmitted genome regions. Our results highlight the different demographic history revealed by different markers and urge to be cautious when deriving general conclusions from partial genomic information or from single samples as representatives of the total population of a region.
Abstract Background North African human populations present a complex demographic scenario due to the presence of an autochthonous genetic component and population substructure, plus extensive gene ...flow from the Middle East, Europe, and sub-Saharan Africa. Results We conducted a comprehensive analysis of 364 genomes to construct detailed demographic models for the North African region, encompassing its two primary ethnic groups, the Arab and Amazigh populations. This was achieved through an Approximate Bayesian Computation with Deep Learning (ABC-DL) framework and a novel algorithm called Genetic Programming for Population Genetics (GP4PG). This innovative approach enabled us to effectively model intricate demographic scenarios, utilizing a subset of 16 whole genomes at > 30X coverage. The demographic model suggested by GP4PG exhibited a closer alignment with the observed data compared to the ABC-DL model. Both point to a back-to-Africa origin of North African individuals and a close relationship with Eurasian populations. Results support different origins for Amazigh and Arab populations, with Amazigh populations originating back in Epipaleolithic times, while GP4PG supports Arabization as the main source of Middle Eastern ancestry. The GP4PG model includes population substructure in surrounding populations (sub-Saharan Africa and Middle East) with continuous decaying gene flow after population split. Contrary to ABC-DL, the best GP4PG model does not require pulses of admixture from surrounding populations into North Africa pointing to soft splits as drivers of divergence in North Africa. Conclusions We have built a demographic model on North Africa that points to a back-to-Africa expansion and a differential origin between Arab and Amazigh populations.
North Africa is characterized by its diverse cultural and linguistic groups and its genetic heterogeneity. Genomic data has shown an amalgam of components mixed since pre-Holocean times. Though no ...differences have been found in uniparental and classical markers between Berbers and Arabs, the two main ethnic groups in the region, the scanty genomic data available have highlighted the singularity of Berbers. We characterize the genetic heterogeneity of North African groups, focusing on the putative differences of Berbers and Arabs, and estimate migration dates. We analyze genome-wide autosomal data in five Berber and six Arab groups, and compare them to Middle Easterns, sub-Saharans, and Europeans. Haplotype-based methods show a lack of correlation between geographical and genetic populations, and a high degree of genetic heterogeneity, without strong differences between Berbers and Arabs. Berbers enclose genetically diverse groups, from isolated endogamous groups with high autochthonous component frequencies, large homozygosity runs and low effective population sizes, to admixed groups with high frequencies of sub-Saharan and Middle Eastern components. Admixture time estimates show a complex pattern of recent historical migrations, with a peak around the 7th century C.E. coincident with the Arabization of the region; sub-Saharan migrations since the 1st century B.C. in agreement with Roman slave trade; and a strong migration in the 17th century C.E., coincident with a huge impact of the trans-Atlantic and trans-Saharan trade of sub-Saharan slaves in the Modern Era. The genetic complexity found should be taken into account when selecting reference groups in population genetics and biomedical studies.
Despite being enclosed between the Mediterranean Sea and the Sahara Desert, North Africa has been the scenario of multiple human migrations that have shaped the genetic structure of its present-day ...populations. Despite its richness, North Africa remains underrepresented in genomic studies. To overcome this, we have sequenced and analyzed 264 mitogenomes from the Algerian Chaoui-speaking Imazighen (a.k.a. Berbers) living in the Aurès region. The maternal genetic composition of the Aurès is similar to Arab populations in the region, dominated by West Eurasian lineages with a moderate presence of M1/U6 North African and L sub-Saharan lineages. When focusing on the time and geographic origin of the North African specific clades within the non-autochthonous haplogroups, different geographical neighboring regions contributed to the North African maternal gene pool during time periods that could be attributed to previously suggested admixture events in the region, since Paleolithic times to recent historical movements such as the Arabization. We have also observed the role of North Africa as a source of geneflow mainly in Southern European regions since Neolithic times. Finally, the present work constitutes an effort to increase the representation of North African populations in genetic databases, which is key to understand their history.