Next-Generation Sequencing has revolutionized our approach to ancient DNA (aDNA) research, by providing complete genomic sequences of ancient individuals and extinct species. However, the recovery of ...genetic material from long-dead organisms is still complicated by a number of issues, including post-mortem DNA damage and high levels of environmental contamination. Together with error profiles specific to the type of sequencing platforms used, these specificities could limit our ability to map sequencing reads against modern reference genomes and therefore limit our ability to identify endogenous ancient reads, reducing the efficiency of shotgun sequencing aDNA.
In this study, we compare different computational methods for improving the accuracy and sensitivity of aDNA sequence identification, based on shotgun sequencing reads recovered from Pleistocene horse extracts using Illumina GAIIx and Helicos Heliscope platforms. We show that the performance of the Burrows Wheeler Aligner (BWA), that has been developed for mapping of undamaged sequencing reads using platforms with low rates of indel-types of sequencing errors, can be employed at acceptable run-times by modifying default parameters in a platform-specific manner. We also examine if trimming likely damaged positions at read ends can increase the recovery of genuine aDNA fragments and if accurate identification of human contamination can be achieved using a strategy previously suggested based on best hit filtering. We show that combining our different mapping and filtering approaches can increase the number of high-quality endogenous hits recovered by up to 33%.
We have shown that Illumina and Helicos sequences recovered from aDNA extracts could not be aligned to modern reference genomes with the same efficiency unless mapping parameters are optimized for the specific types of errors generated by these platforms and by post-mortem DNA damage. Our findings have important implications for future aDNA research, as we define mapping guidelines that improve our ability to identify genuine aDNA sequences, which in turn could improve the genotyping accuracy of ancient specimens. Our framework provides a significant improvement to the standard procedures used for characterizing ancient genomes, which is challenged by contamination and often low amounts of DNA material.
Large-scale genomic analyses of ancient human populations have become feasible partly due to refined sampling methods. The inner part of petrous bones and the cementum layer in teeth roots are ...currently recognized as the best substrates for such research. We present a comparative analysis of DNA preservation in these two substrates obtained from the same human skulls, across a range of different ages and preservation environments. Both substrates display significantly higher endogenous DNA content (average of 16.4% and 40.0% for teeth and petrous bones, respectively) than parietal skull bone (average of 2.2%). Despite sample-to-sample variation, petrous bone overall performs better than tooth cementum (p = 0.001). This difference, however, is driven largely by a cluster of viking skeletons from one particular locality, showing relatively poor molecular tooth preservation (<10% endogenous DNA). In the remaining skeletons there is no systematic difference between the two substrates. A crude preservation (good/bad) applied to each sample prior to DNA-extraction predicted the above/below 10% endogenous DNA threshold in 80% of the cases. Interestingly, we observe signficantly higher levels of cytosine to thymine deamination damage and lower proportions of mitochondrial/nuclear DNA in petrous bone compared to tooth cementum. Lastly, we show that petrous bones from ancient cremated individuals contain no measurable levels of authentic human DNA. Based on these findings we discuss the pros and cons of sampling the different elements.
The farming way of life originated in the Near East some 11,000 years ago and had reached most of the European continent 5000 years later. However, the impact of the agricultural revolution on ...demography and patterns of genomic variation in Europe remains unknown. We obtained 249 million base pairs of genomic DNA from ~5000-year-old remains of three hunter-gatherers and one farmer excavated in Scandinavia and find that the farmer is genetically most similar to extant southern Europeans, contrasting sharply to the hunter-gatherers, whose distinct genetic signature is most similar to that of extant northern Europeans. Our results suggest that migration from southern Europe catalyzed the spread of agriculture and that admixture in the wake of this expansion eventually shaped the genomic landscape of modern-day Europe.
Present-day hunter-gatherers (HGs) live in multilevel social groups essential to sustain a population structure characterized by limited levels of within-band relatedness and inbreeding. When these ...wider social networks evolved among HGs is unknown. To investigate whether the contemporary HG strategy was already present in the Upper Paleolithic, we used complete genome sequences from Sunghir, a site dated to ~34,000 years before the present, containing multiple anatomically modern human individuals. We show that individuals at Sunghir derive from a population of small effective size, with limited kinship and levels of inbreeding similar to HG populations. Our findings suggest that Upper Paleolithic social organization was similar to that of living HGs, with limited relatedness within residential groups embedded in a larger mating network.
Prehistoric population structure associated with the transition to an agricultural lifestyle in Europe remains a contentious idea. Population-genomic data from 11 Scandinavian Stone Age human remains ...suggest that hunter-gatherers had lower genetic diversity than that of farmers. Despite their close geographical proximity, the genetic differentiation between the two Stone Age groups was greater than that observed among extant European populations. Additionally, the Scandinavian Neolithic farmers exhibited a greater degree of hunter-gatherer–related admixture than that of the Tyrolean Iceman, who also originated from a farming context. In contrast, Scandinavian hunter-gatherers displayed no significant evidence of introgression from farmers. Our findings suggest that Stone Age foraging groups were historically in low numbers, likely owing to oscillating living conditions or restricted carrying capacity, and that they were partially incorporated into expanding farming groups.
During the Last Glacial Maximum, continental ice sheets isolated Beringia (northeast Siberia and northwest North America) from unglaciated North America. By around 15 to 14 thousand calibrated ...radiocarbon years before present (cal. kyr bp), glacial retreat opened an approximately 1,500-km-long corridor between the ice sheets. It remains unclear when plants and animals colonized this corridor and it became biologically viable for human migration. We obtained radiocarbon dates, pollen, macrofossils and metagenomic DNA from lake sediment cores in a bottleneck portion of the corridor. We find evidence of steppe vegetation, bison and mammoth by approximately 12.6 cal. kyr bp, followed by open forest, with evidence of moose and elk at about 11.5 cal. kyr bp, and boreal forest approximately 10 cal. kyr bp. Our findings reveal that the first Americans, whether Clovis or earlier groups in unglaciated North America before 12.6 cal. kyr bp, are unlikely to have travelled by this route into the Americas. However, later groups may have used this north-south passageway.
Ancient DNA extracts consist of a mixture of contaminant DNA molecules, most often originating from environmental microbes, and endogenous fragments exhibiting substantial levels of DNA damage. The ...latter introduce specific nucleotide misincorporations and DNA fragmentation signatures in sequencing reads that could be advantageously used to argue for sequence validity. mapDamage is a Perl script that computes nucleotide misincorporation and fragmentation patterns using next-generation sequencing reads mapped against a reference genome. The Perl script outputs are further automatically processed in embedded R script in order to detect typical patterns of genuine ancient DNA sequences.
Availability and implementation: The Perl script mapDamage is freely available with documentation and example files at http://geogenetics.ku.dk/all_literature/mapdamage/. The script requires prior installation of the SAMtools suite and R environment and has been validated on both GNU/Linux and MacOSX operating systems.
Contact:
aginolhac@snm.ku.dk
Supplementary information:
Supplementary data available at Bioinformatics online
The origin of contemporary Europeans remains contentious. We obtained a genome sequence from Kostenki 14 in European Russia dating from 38,700 to 36,200 years ago, one of the oldest fossils of ...anatomically modern humans from Europe. We find that Kostenki 14 shares a close ancestry with the 24,000-year-old Mal'ta boy from central Siberia, European Mesolithic hunter-gatherers, some contemporary western Siberians, and many Europeans, but not eastern Asians. Additionally, the Kostenki 14 genome shows evidence of shared ancestry with a population basal to all Eurasians that also relates to later European Neolithic farmers. We find that Kostenki 14 contains more Neandertal DNA that is contained in longer tracts than present Europeans. Our findings reveal the timing of divergence of western Eurasians and East Asians to be more than 36,200 years ago and that European genomic structure today dates back to the Upper Paleolithic and derives from a metapopulation that at times stretched from Europe to central Asia.
Schistosomiasis is a water-based, infectious disease with high morbidity and significant economic burdens affecting >250 million people globally. Disease control has, with notable success, for ...decades focused on drug treatment of infected human populations, but a recent paradigm shift now entails moving from control to elimination. To achieve this ambitious goal, more sensitive diagnostic tools are needed to monitor progress toward transmission interruption in the environment, especially in low-intensity infection areas. We report on the development of an environmental DNA (eDNA)-based tool to efficiently detect DNA traces of the parasite Schistosoma mansoni directly in the aquatic environment, where the nonhuman part of the parasite life cycle occurs. This is a report of the successful detection of S. mansoni in freshwater samples by using aquatic eDNA. True eDNA was detected in as few as 10 cercariae per liter of water in laboratory experiments. The field applicability of the method was tested at known transmission sites in Kenya, where comparison of schistosome detection by conventional snail surveys (snail collection and cercariae shedding) with eDNA (water samples) showed 71% agreement between the methods. The eDNA method furthermore detected schistosome presence at two additional sites where snail shedding failed, demonstrating a higher sensitivity of eDNA sampling. We conclude that eDNA provides a promising tool to substantially improve the environmental surveillance of S. mansoni. Given the proper method and guideline development, eDNA could become an essential future component of the schistosomiasis control tool box needed to achieve the goal of elimination.
A major factor for the population decline of Native Americans after European contact has been attributed to infectious disease susceptibility. To investigate whether a pre-existing genetic component ...contributed to this phenomenon, here we analyse 50 exomes of a continuous population from the Northwest Coast of North America, dating from before and after European contact. We model the population collapse after European contact, inferring a 57% reduction in effective population size. We also identify signatures of positive selection on immune-related genes in the ancient but not the modern group, with the strongest signal deriving from the human leucocyte antigen (HLA) gene HLA-DQA1. The modern individuals show a marked frequency decrease in the same alleles, likely due to the environmental change associated with European colonization, whereby negative selection may have acted on the same gene after contact. The evident shift in selection pressures correlates to the regional European-borne epidemics of the 1800s.