Sample preparation for high-throughput sequencing (HTS) includes treatment with various laboratory components, potentially carrying viral nucleic acids, the extent of which has not been thoroughly ...investigated. Our aim was to systematically examine a diverse repertoire of laboratory components used to prepare samples for HTS in order to identify contaminating viral sequences.
A total of 322 samples of mainly human origin were analysed using eight protocols, applying a wide variety of laboratory components. Several samples (60% of human specimens) were processed using different protocols. In total, 712 sequencing libraries were investigated for viral sequence contamination.
Among sequences showing similarity to viruses, 493 were significantly associated with the use of laboratory components. Each of these viral sequences had sporadic appearance, only being identified in a subset of the samples treated with the linked laboratory component, and some were not identified in the non-template control samples. Remarkably, more than 65% of all viral sequences identified were within viral clusters linked to the use of laboratory components.
We show that high prevalence of contaminating viral sequences can be expected in HTS-based virome data and provide an extensive list of novel contaminating viral sequences that can be used for evaluation of viral findings in future virome and metagenome studies. Moreover, we show that detection can be problematic due to stochastic appearance and limited non-template controls. Although the exact origin of these viral sequences requires further research, our results support laboratory-component-linked viral sequence contamination of both biological and synthetic origin.
Display omitted
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UILJ, UL, UM, UPCLJ, UPUK, ZAGLJ, ZRSKP
Ecosystems across the globe are threatened by climate change and human activities. New rapid survey approaches for monitoring biodiversity would greatly advance assessment and understanding of these ...threats. Taking advantage of next‐generation DNA sequencing, we tested an approach we call metabarcoding: high‐throughput and simultaneous taxa identification based on a very short (usually <100 base pairs) but informative DNA fragment. Short DNA fragments allow the use of degraded DNA from environmental samples. All analyses included amplification using plant‐specific versatile primers, sequencing and estimation of taxonomic diversity. We tested in three steps whether degraded DNA from dead material in soil has the potential of efficiently assessing biodiversity in different biomes. First, soil DNA from eight boreal plant communities located in two different vegetation types (meadow and heath) was amplified. Plant diversity detected from boreal soil was highly consistent with plant taxonomic and growth form diversity estimated from conventional above‐ground surveys. Second, we assessed DNA persistence using samples from formerly cultivated soils in temperate environments. We found that the number of crop DNA sequences retrieved strongly varied with years since last cultivation, and crop sequences were absent from nearby, uncultivated plots. Third, we assessed the universal applicability of DNA metabarcoding using soil samples from tropical environments: a large proportion of species and families from the study site were efficiently recovered. The results open unprecedented opportunities for large‐scale DNA‐based biodiversity studies across a range of taxonomic groups using standardized metabarcoding approaches.
See also the Perspective by Chariton
Full text
Available for:
BFBNIB, FZAB, GIS, IJS, KILJ, NLZOH, NUK, OILJ, SAZU, SBCE, SBMB, UL, UM, UPUK
DNA barcoding should provide rapid, accurate and automatable species identifications by using a standardized DNA region as a tag. Based on sequences available in GenBank and sequences produced for ...this study, we evaluated the resolution power of the whole chloroplast trnL (UAA) intron (254-767 bp) and of a shorter fragment of this intron (the P6 loop, 10-143 bp) amplified with highly conserved primers. The main limitation of the whole trnL intron for DNA barcoding remains its relatively low resolution (67.3% of the species from GenBank unambiguously identified). The resolution of the P6 loop is lower (19.5% identified) but remains higher than those of existing alternative systems. The resolution is much higher in specific contexts such as species originating from a single ecosystem, or commonly eaten plants. Despite the relatively low resolution, the whole trnL intron and its P6 loop have many advantages: the primers are highly conserved, and the amplification system is very robust. The P6 loop can even be amplified when using highly degraded DNA from processed food or from permafrost samples, and has the potential to be extensively used in food industry, in forensic science, in diet analyses based on feces and in ancient DNA studies.
Palaeoenvironments and former climates are typically inferred from pollen and macrofossil records. This approach is time-consuming and suffers from low taxonomic resolution and biased taxon sampling. ...Here, we test an alternative DNA-based approach utilizing the P6 loop in the chloroplast trnL (UAA) intron; a short (13-158 bp) and variable region with highly conserved flanking sequences. For taxonomic reference, a whole trnL intron sequence database was constructed from recently collected material of 842 species, representing all widespread and/or ecologically important taxa of the species-poor arctic flora. The P6 loop alone allowed identification of all families, most genera (>75%) and one-third of the species, thus providing much higher taxonomic resolution than pollen records. The suitability of the P6 loop for analysis of samples containing degraded ancient DNA from a mixture of species is demonstrated by high-throughput parallel pyrosequencing of permafrost-preserved DNA and reconstruction of two plant communities from the last glacial period. Our approach opens new possibilities for DNA-based assessment of ancient as well as modern biodiversity of many groups of organisms using environmental samples.
Full text
Available for:
BFBNIB, FZAB, GIS, IJS, KILJ, NLZOH, NUK, OILJ, SAZU, SBCE, SBMB, UL, UM, UPUK
The ratite moa (Aves: Dinornithiformes) were a speciose group of massive graviportal avian herbivores that dominated the New Zealand (NZ) ecosystem until their extinction almost equal to600 years ...ago. The phylogeny and evolutionary history of this morphologically diverse order has remained controversial since their initial description in 1839. We synthesize mitochondrial phylogenetic information from 263 subfossil moa specimens from across NZ with morphological, ecological, and new geological data to create the first comprehensive phylogeny, taxonomy, and evolutionary timeframe for all of the species of an extinct order. We also present an important new geological/paleogeographical model of late Cenozoic NZ, which suggests that terrestrial biota on the North and South Island landmasses were isolated for most of the past 20-30 Ma. The data reveal that the patterns of genetic diversity within and between different moa clades reflect a complex history following a major marine transgression in the Oligocene, affected by marine barriers, tectonic activity, and glacial cycles. Surprisingly, the remarkable morphological radiation of moa appears to have occurred much more recently than previous early Miocene (ca. 15 Ma) estimates, and was coincident with the accelerated uplift of the Southern Alps just ca. 5-8.5 Ma. Together with recent fossil evidence, these data suggest that the recent evolutionary history of nearly all of the iconic NZ terrestrial biota occurred principally on just the South Island.
Full text
Available for:
BFBNIB, NMLJ, NUK, PNG, SAZU, UL, UM, UPUK
High Arctic environments are particularly sensitive to climate changes, but retrieval of paleoecological data is challenging due to low productivity and biomass. At the same time, Arctic soils and ...sediments have proven exceptional for long-term DNA preservation due to their constantly low temperatures. Lake sediments contain DNA paleorecords of the surrounding ecosystems and can be used to retrieve a variety of organismal groups from a single sample. In this study, we analyzed vascular plant, bryophyte, algal (in particular diatom) and copepod DNA retrieved from a sediment core spanning the Holocene, taken from Bliss Lake on the northernmost coast of Greenland. A previous multi-proxy study including microscopic diatom analyses showed that this lake experienced changes between marine and lacustrine conditions. We inferred the same environmental changes from algal DNA preserved in the sediment core. Our DNA record was stratigraphically coherent, with no indication of leaching between layers, and our cross-taxon comparisons were in accordance with previously inferred local ecosystem changes. Authentic ancient plant DNA was retrieved from nearly all layers, both from the marine and the limnic phases, and distinct temporal changes in plant presence were recovered. The plant DNA was mostly in agreement with expected vegetation history, but very early occurrences of vascular plants, including the woody Empetrum nigrum, document terrestrial vegetation very shortly after glacial retreat. Our study shows that multi-taxon metabarcoding of sedimentary ancient DNA from lake cores is a valuable tool both for terrestrial and aquatic paleoecology, even in low-productivity ecosystems such as the High Arctic.
•We retrieved DNA of multiple organisms in a sediment core from North Greenland.•This low-productivity ecosystem provided DNA from throughout the Holocene.•Plants and algae indicate relatively high summer temperatures in the early Holocene.•Phases as a marine bay and as a freshwater lake were reflected in the DNA record.•The DNA record was consistent with sedimentological and microscopical data.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UL, UM, UPCLJ, UPUK
House mice (Mus musculus) are commensals of humans and therefore their phylogeography can reflect human colonization and settlement patterns. Previous studies have linked the distribution of house ...mouse mitochondrial (mt) DNA clades to areas formerly occupied by the Norwegian Vikings in Norway and the British Isles. Norwegian Viking activity also extended further westwards in the North Atlantic with the settlement of Iceland, short-lived colonies in Greenland and a fleeting colony in Newfoundland in 1000 AD. Here we investigate whether house mouse mtDNA sequences reflect human history in these other regions as well.
House mice samples from Iceland, whether from archaeological Viking Age material or from modern-day specimens, had an identical mtDNA haplotype to the clade previously linked with Norwegian Vikings. From mtDNA and microsatellite data, the modern-day Icelandic mice also share the low genetic diversity shown by their human hosts on Iceland. Viking Age mice from Greenland had an mtDNA haplotype deriving from the Icelandic haplotype, but the modern-day Greenlandic mice belong to an entirely different mtDNA clade. We found no genetic association between modern Newfoundland mice and the Icelandic/ancient Greenlandic mice (no ancient Newfoundland mice were available). The modern day Icelandic and Newfoundland mice belong to the subspecies M. m. domesticus, the Greenlandic mice to M. m. musculus.
In the North Atlantic region, human settlement history over a thousand years is reflected remarkably by the mtDNA phylogeny of house mice. In Iceland, the mtDNA data show the arrival and continuity of the house mouse population to the present day, while in Greenland the data suggest the arrival, subsequent extinction and recolonization of house mice--in both places mirroring the history of the European human host populations. If house mice arrived in Newfoundland with the Viking settlers at all, then, like the humans, their presence was also fleeting and left no genetic trace. The continuity of mtDNA haplotype in Iceland over 1000 years illustrates that mtDNA can retain the signature of the ancestral house mouse founders. We also show that, in terms of genetic variability, house mouse populations may also track their host human populations.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Although ancient DNA (aDNA) miscoding lesions have been studied since the earliest days of the field, their nature remains a source of debate. A variety of conflicting hypotheses exist about which ...miscoding lesions constitute true aDNA damage as opposed to PCR polymerase amplification error. Furthermore, considerable disagreement and speculation exists on which specific damage events underlie observed miscoding lesions. The root of the problem is that it has previously been difficult to assemble sufficient data to test the hypotheses, and near-impossible to accurately determine the specific strand of origin of observed damage events. With the advent of emulsion-based clonal amplification (emPCR) and the sequencing-by-synthesis technology this has changed. In this paper we demonstrate how data produced on the Roche GS20 genome sequencer can determine miscoding lesion strands of origin, and subsequently be interpreted to enable characterization of the aDNA damage behind the observed phenotypes. Through comparative analyses on 390 965 bp of modern chloroplast and 131 474 bp of ancient woolly mammoth GS20 sequence data we conclusively demonstrate that in this sample at least, a permafrost preserved specimen, Type 2 (cytosine→thymine/guanine→adenine) miscoding lesions represent the overwhelming majority of damage-derived miscoding lesions. Additionally, we show that an as yet unidentified guanine→adenine analogue modification, not the conventionally argued cytosine→uracil deamination, underpins a significant proportion of Type 2 damage. How widespread these implications are for aDNA will become apparent as future studies analyse data recovered from a wider range of substrates.
The field of ancient DNA (aDNA) is casting new light on many evolutionary questions. However, problems associated with the postmortem instability of DNA may complicate the interpretation of aDNA ...data. For example, in population genetic studies, the inclusion of damaged DNA may inflate estimates of diversity. In this paper, we examine the effect of DNA damage on population genetic estimates of ancestral population size. We simulate data using standard coalescent simulations that include postmortem damage and show that estimates of effective population sizes are inflated around, or right after, the sampling time of the ancestral DNA sequences. This bias leads to estimates of increasing, and then decreasing, population sizes, as observed in several recently published studies. We reanalyze a recently published data set of DNA sequences from the Bison (Bison bison/Bison priscus) and show that the signal for a change in effective population size in this data set vanishes once the effects of putative damage are removed. Our results suggest that population genetic analyses of aDNA sequences, which do not accurately account for damage, should be interpreted with great caution.