Genomic GC content varies widely among microbes for reasons unknown. While mutation bias partially explains this variation, prokaryotes near-universally have a higher GC content than predicted solely ...by this bias. Debate surrounds the relative importance of the remaining explanations of selection versus biased gene conversion favoring GC alleles. Some environments (e.g. soils) are associated with a high genomic GC content of their inhabitants, which implies that either high GC content is a selective adaptation to particular habitats, or that certain habitats favor increased rates of gene conversion. Here, we report a novel association between the presence of the non-homologous end joining DNA double-strand break repair pathway and GC content; this observation suggests that DNA damage may be a fundamental driver of GC content, leading in part to the many environmental patterns observed to-date. We discuss potential mechanisms accounting for the observed association, and provide preliminary evidence that sites experiencing higher rates of double-strand breaks are under selection for increased GC content relative to the genomic background.
Ancient DNA (aDNA) molecules in fossilized bones and teeth, coprolites, sediments, mummified specimens and museum collections represent fantastic sources of information for evolutionary biologists, ...revealing the agents of past epidemics and the dynamics of past populations. However, the analysis of aDNA generally faces two major issues. Firstly, sequences consist of a mixture of endogenous and various exogenous backgrounds, mostly microbial. Secondly, high nucleotide misincorporation rates can be observed as a result of severe post-mortem DNA damage. Such misincorporation patterns are instrumental to authenticate ancient sequences versus modern contaminants. We recently developed the user-friendly mapDamage package that identifies such patterns from next-generation sequencing (NGS) sequence datasets. The absence of formal statistical modeling of the DNA damage process, however, precluded rigorous quantitative comparisons across samples.
Here, we describe mapDamage 2.0 that extends the original features of mapDamage by incorporating a statistical model of DNA damage. Assuming that damage events depend only on sequencing position and post-mortem deamination, our Bayesian statistical framework provides estimates of four key features of aDNA molecules: the average length of overhangs (λ), nick frequency (ν) and cytosine deamination rates in both double-stranded regions ( ) and overhangs ( ). Our model enables rescaling base quality scores according to their probability of being damaged. mapDamage 2.0 handles NGS datasets with ease and is compatible with a wide range of DNA library protocols.
mapDamage 2.0 is available at ginolhac.github.io/mapDamage/ as a Python package and documentation is maintained at the Centre for GeoGenetics Web site (geogenetics.ku.dk/publications/mapdamage2.0/).
Supplementary data are available at Bioinformatics online.
Recent analyses of de novo DNA mutations in modern humans have suggested a nuclear substitution rate that is approximately half that of previous estimates based on fossil calibration. This result has ...led to suggestions that major events in human evolution occurred far earlier than previously thought.
Here, we use mitochondrial genome sequences from ten securely dated ancient modern humans spanning 40,000 years as calibration points for the mitochondrial clock, thus yielding a direct estimate of the mitochondrial substitution rate. Our clock yields mitochondrial divergence times that are in agreement with earlier estimates based on calibration points derived from either fossils or archaeological material. In particular, our results imply a separation of non-Africans from the most closely related sub-Saharan African mitochondrial DNAs (haplogroup L3) that occurred less than 62–95 kya.
Though single loci like mitochondrial DNA (mtDNA) can only provide biased estimates of population divergence times, they can provide valid upper bounds. Our results exclude most of the older dates for African and non-African population divergences recently suggested by de novo mutation rate estimates in the nuclear genome.
► Human mutation rates are directly calculated using securely dated ancient human mtDNAs ► The study provides improved molecular estimates for human evolutionary events ► The last major gene flow event between Africans and non-Africans was calculated to 95 kya
It is now possible to perform whole-genome shotgun sequencing as well as capture of specific genomic regions for extinct organisms. However, targeted resequencing of large parts of nuclear genomes ...has yet to be demonstrated for ancient DNA. Here we show that hybridization capture on microarrays can successfully recover more than a megabase of target regions from Neandertal DNA even in the presence of approximately 99.8% microbial DNA. Using this approach, we have sequenced approximately 14,000 protein-coding positions inferred to have changed on the human lineage since the last common ancestor shared with chimpanzees. By generating the sequence of one Neandertal and 50 present-day humans at these positions, we have identified 88 amino acid substitutions that have become fixed in humans since our divergence from the Neandertals.
Using DNA extracted from a finger bone found in Denisova Cave in southern Siberia, we have sequenced the genome of an archaic hominin to about 1.9-fold coverage. This individual is from a group that ...shares a common origin with Neanderthals. This population was not involved in the putative gene flow from Neanderthals into Eurasians; however, the data suggest that it contributed 4-6% of its genetic material to the genomes of present-day Melanesians. We designate this hominin population 'Denisovans' and suggest that it may have been widespread in Asia during the Late Pleistocene epoch. A tooth found in Denisova Cave carries a mitochondrial genome highly similar to that of the finger bone. This tooth shares no derived morphological features with Neanderthals or modern humans, further indicating that Denisovans have an evolutionary history distinct from Neanderthals and modern humans.
Understanding how immunological memory lasts a lifetime requires quantifying changes in the number of memory cells as well as how their division and death rates change over time. We address these ...questions by using a statistically powerful mixed-effects differential equations framework to analyze data from two human studies that follow CD8 T cell responses to the yellow fever vaccine (YFV-17D). Models were first fit to the frequency of YFV-specific memory CD8 T cells and deuterium enrichment in those cells 42 days to 1 year post-vaccination. A different dataset, on the loss of YFV-specific CD8 T cells over three decades, was used to assess out of sample predictions of our models. The commonly used exponential and bi-exponential decline models performed relatively poorly. Models with the cell loss following a power law (exactly or approximately) were most predictive. Notably, using only the first year of data, these models accurately predicted T cell frequencies up to 30 years post-vaccination. Our analyses suggest that division rates of these cells drop and plateau at a low level (0.1% per day, ∼ double the estimated values for naive T cells) within one year following vaccination, whereas death rates continue to decline for much longer. Our results show that power laws can be predictive for T cell memory, a finding that may be useful for vaccine evaluation and epidemiological modeling. Moreover, since power laws asymptotically decline more slowly than any exponential decline, our results help explain the longevity of immune memory phenomenologically.
High-throughput direct sequencing techniques have recently opened the possibility to sequence genomes from Pleistocene organisms. Here we analyze DNA sequences determined from a Neandertal, a ...mammoth, and a cave bear. We show that purines are overrepresented at positions adjacent to the breaks in the ancient DNA, suggesting that depurination has contributed to its degradation. We furthermore show that substitutions resulting from miscoding cytosine residues are vastly overrepresented in the DNA sequences and drastically clustered in the ends of the molecules, whereas other substitutions are rare. We present a model where the observed substitution patterns are used to estimate the rate of deamination of cytosine residues in single- and double-stranded portions of the DNA, the length of single-stranded ends, and the frequency of nicks. The results suggest that reliable genome sequences can be obtained from Pleistocene organisms.
Significance Current vaccine development against persistent infections such as HIV and tuberculosis focuses on eliciting CD8 T cell immunity through the use of replication-incompetent or single-cycle ...vectors. Although inherently safe, these vectors deliver limited amounts of antigen. We investigate how antigen load affects the CD8 response by analyzing the viral load and the magnitude of the specific CD8 response after immunization with the live attenuated yellow fever vaccine (YFV-17D). Our results show that the magnitude of the CD8 response is proportional to the amount of antigen when virus load is below a threshold value and saturates above. This finding highlights the requirement that T cell-based vaccines deliver sufficient antigen to elicit a large CD8 response that may be needed for protection.
CD8 T cells are a potent tool for eliminating intracellular pathogens and tumor cells. Thus, eliciting robust CD8 T-cell immunity is the basis for many vaccines under development. However, the relationship between antigen load and the magnitude of the CD8 T-cell response is not well-described in a human immune response. Here we address this issue by quantifying viral load and the CD8 T-cell response in a cohort of 80 individuals immunized with the live attenuated yellow fever vaccine (YFV-17D) by sampling peripheral blood at days 0, 1, 2, 3, 5, 7, 9, 11, 14, 30, and 90. When the virus load was below a threshold (peak virus load < 225 genomes per mL, or integrated virus load < 400 genome days per mL), the magnitude of the CD8 T-cell response correlated strongly with the virus load ( R ² ∼ 0.63). As the virus load increased above this threshold, the magnitude of the CD8 T-cell responses saturated. Recent advances in CD8 T-cell–based vaccines have focused on replication-incompetent or single-cycle vectors. However, these approaches deliver relatively limited amounts of antigen after immunization. Our results highlight the requirement that T-cell–based vaccines should deliver sufficient antigen during the initial period of the immune response to elicit a large number of CD8 T cells that may be needed for protection.
Draft Sequence of the Neandertal Genome Green, Richard E; Krause, Johannes; Briggs, Adrian W ...
Science (American Association for the Advancement of Science),
05/2010, Volume:
328, Issue:
5979
Journal Article
Peer reviewed
Open access
Neandertals, the closest evolutionary relatives of present-day humans, lived in large parts of Europe and western Asia before disappearing 30,000 years ago. We present a draft sequence of the ...Neandertal genome composed of more than 4 billion nucleotides from three individuals. Comparisons of the Neandertal genome to the genomes of five present-day humans from different parts of the world identify a number of genomic regions that may have been affected by positive selection in ancestral modern humans, including genes involved in metabolism and in cognitive and skeletal development. We show that Neandertals shared more genetic variants with present-day humans in Eurasia than with present-day humans in sub-Saharan Africa, suggesting that gene flow from Neandertals into the ancestors of non-Africans occurred before the divergence of Eurasian groups from each other.
Abstract
Background
Even when microbial communities vary wildly in their taxonomic composition, their functional composition is often surprisingly stable. This suggests that a functional perspective ...could provide much deeper insight into the principles governing microbiome assembly. Much work to date analyzing the functional composition of microbial communities, however, relies heavily on inference from genomic features. Unfortunately, output from these methods can be hard to interpret and often suffers from relatively high error rates.
Results
We built and analyzed a domain-specific microbial trait database from known microbe-trait pairs recorded in the literature to better understand the functional composition of the human microbiome. Using a combination of phylogentically conscious machine learning tools and a network science approach, we were able to link particular traits to areas of the human body, discover traits that determine the range of body areas a microbe can inhabit, and uncover drivers of metabolic breadth.
Conclusions
Domain-specific trait databases are an effective compromise between noisy methods to infer complex traits from genomic data and exhaustive, expensive attempts at database curation from the literature that do not focus on any one subset of taxa. They provide an accurate account of microbial traits and, by limiting the number of taxa considered, are feasible to build within a reasonable time-frame. We present a database specific for the human microbiome, in the hopes that this will prove useful for research into the functional composition of human-associated microbial communities.