Africa is the origin of modern humans within the past 300 thousand years. To infer the complex demographic history of African populations and adaptation to diverse environments, we sequenced the ...genomes of 92 individuals from 44 indigenous African populations.
Genetic structure analyses indicate that among Africans, genetic ancestry is largely partitioned by geography and language, though we observe mixed ancestry in many individuals, consistent with both short- and long-range migration events followed by admixture. Phylogenetic analysis indicates that the San genetic lineage is basal to all modern human lineages. The San and Niger-Congo, Afroasiatic, and Nilo-Saharan lineages were substantially diverged by 160 kya (thousand years ago). In contrast, the San and Central African rainforest hunter-gatherer (CRHG), Hadza hunter-gatherer, and Sandawe hunter-gatherer lineages were diverged by ~ 120-100 kya. Niger-Congo, Nilo-Saharan, and Afroasiatic lineages diverged more recently by ~ 54-16 kya. Eastern and western CRHG lineages diverged by ~ 50-31 kya, and the western CRHG lineages diverged by ~ 18-12 kya. The San and CRHG populations maintained the largest effective population size compared to other populations prior to 60 kya. Further, we observed signatures of positive selection at genes involved in muscle development, bone synthesis, reproduction, immune function, energy metabolism, and cell signaling, which may contribute to local adaptation of African populations.
We observe high levels of genomic variation between ethnically diverse Africans which is largely correlated with geography and language. Our study indicates ancient population substructure and local adaptation of Africans.
We report a method called ContamLD for estimating autosomal ancient DNA (aDNA) contamination by measuring the breakdown of linkage disequilibrium in a sequenced individual due to the introduction of ...contaminant DNA. ContamLD leverages the idea that contaminants should have haplotypes uncorrelated to those of the studied individual. Using simulated data, we confirm that ContamLD accurately infers contamination rates with low standard errors: for example, less than 1.5% standard error in cases with less than 10% contamination and 500,000 sequences covering SNPs. This method is optimized for application to aDNA, taking advantage of characteristic aDNA damage patterns to provide calibrated contamination estimates, and is available at https://github.com/nathan-nakatsuka/ContamLD .
While the series of events that shaped the transition between foraging societies and food producers are well described for Central and Southern Europe, genetic evidence from Northern Europe ...surrounding the Baltic Sea is still sparse. Here, we report genome-wide DNA data from 38 ancient North Europeans ranging from ~9500 to 2200 years before present. Our analysis provides genetic evidence that hunter-gatherers settled Scandinavia via two routes. We reveal that the first Scandinavian farmers derive their ancestry from Anatolia 1000 years earlier than previously demonstrated. The range of Mesolithic Western hunter-gatherers extended to the east of the Baltic Sea, where these populations persisted without gene-flow from Central European farmers during the Early and Middle Neolithic. The arrival of steppe pastoralists in the Late Neolithic introduced a major shift in economy and mediated the spread of a new ancestry associated with the Corded Ware Complex in Northern Europe.
The history of southern Africa involved interactions between indigenous hunter–gatherers and a range of populations that moved into the region. Here we use genome-wide genetic data to show that there ...are at least two admixture events in the history of Khoisan populations (southern African hunter–gatherers and pastoralists who speak non-Bantu languages with click consonants). One involved populations related to Niger–Congo-speaking African populations, and the other introduced ancestry most closely related to west Eurasian (European or Middle Eastern) populations. We date this latter admixture event to ∼900–1,800 y ago and show that it had the largest demographic impact in Khoisan populations that speak Khoe–Kwadi languages. A similar signal of west Eurasian ancestry is present throughout eastern Africa. In particular, we also find evidence for two admixture events in the history of Kenyan, Tanzanian, and Ethiopian populations, the earlier of which involved populations related to west Eurasians and which we date to ∼2,700–3,300 y ago. We reconstruct the allele frequencies of the putative west Eurasian population in eastern Africa and show that this population is a good proxy for the west Eurasian ancestry in southern Africa. The most parsimonious explanation for these findings is that west Eurasian ancestry entered southern Africa indirectly through eastern Africa.
For societies with writing systems, hereditary leadership is documented as one of the hallmarks of early political complexity and governance. In contrast, it is unknown whether hereditary succession ...played a role in the early formation of prehistoric complex societies that lacked writing. Here we use an archaeogenomic approach to identify an elite matriline that persisted between 800 and 1130 CE in Chaco Canyon, the centre of an expansive prehistoric complex society in the Southwestern United States. We show that nine individuals buried in an elite crypt at Pueblo Bonito, the largest structure in the canyon, have identical mitochondrial genomes. Analyses of nuclear genome data from six samples with the highest DNA preservation demonstrate mother-daughter and grandmother-grandson relationships, evidence for a multigenerational matrilineal descent group. Together, these results demonstrate the persistence of an elite matriline in Chaco for ∼330 years.
The establishment of agrarian economy in Eneolithic East Europe is associated with the Pre-Cucuteni-Cucuteni-Trypillia complex (PCCTC). PCCTC farmers interacted with Eneolithic forager-pastoralist ...groups of the North Pontic steppe as PCCTC extended from the Carpathian foothills to the Dnipro Valley beginning in the late 5th millennium BCE. While the cultural interaction between the two groups is evident through the Cucuteni C pottery style that carries steppe influence, the extent of biological interactions between Trypillian farmers and the steppe remains unclear. Here we report the analysis of artefacts from the late 5th millennium Trypillian settlement at the Kolomiytsiv Yar Tract (KYT) archaeological complex in central Ukraine, focusing on a human bone fragment found in the Trypillian context at KYT. Diet stable isotope ratios obtained from the bone fragment suggest the diet of the KYT individual to be within the range of forager-pastoralists of the North Pontic area. Strontium isotope ratios of the KYT individual are consistent with having originated from contexts of the Serednii Stih (Sredny Stog) culture sites of the Middle Dnipro Valley. Genetic analysis of the KYT individual indicates ancestry derived from a proto-Yamna population such as Serednii Stih. Overall, the KYT archaeological site presents evidence of interactions between Trypillians and Eneolithic Pontic steppe inhabitants of the Serednii Stih horizon and suggests a potential for gene flow between the two groups as early as the beginning of the 4th millennium BCE.
Our understanding of population history in deep time has been assisted by fitting admixture graphs (AGs) to data: models that specify the ordering of population splits and mixtures, which along with ...the amount of genetic drift and the proportions of mixture, is the only information needed to predict the patterns of allele frequency correlation among populations. The space of possible AGs relating populations is vast, and thus most published studies have identified fitting AGs through a manual process driven by prior hypotheses, leaving the majority of alternative models unexplored. Here, we develop a method for systematically searching the space of all AGs that can incorporate non-genetic information in the form of topology constraints. We implement this
tool within a software package,
, which is a reimplementation of the
software with new features and large performance gains. We apply this methodology to identify alternative models to AGs that played key roles in eight publications and find that in nearly all cases many alternative models fit nominally or significantly better than the published one. Our results suggest that strong claims about population history from AGs should only be made when all well-fitting and temporally plausible models share common topological features. Our re-evaluation of published data also provides insight into the population histories of humans, dogs, and horses, identifying features that are stable across the models we explored, as well as scenarios of populations relationships that differ in important ways from models that have been highlighted in the literature.