Overwhelming evidence supports the endosymbiosis theory that mitochondria originated once from the Alphaproteobacteria. However, its exact position in the tree of life remains highly debated. This is ...because systematic errors, including biased taxonomic sampling, high evolutionary rates and sequence composition bias have long plagued the mitochondrial phylogenetics. In this study, we address this issue by 1) increasing the taxonomic representation of alphaproteobacterial genomes by sequencing 18 phylogenetically novel species. They include 5 Rickettsiales and 4 Rhodospirillales, two orders that have shown close affiliations with mitochondria previously, 2) using a set of 29 slowly evolving mitochondria-derived nuclear genes that are less biased than mitochondria-encoded genes as the alternative "well behaved" markers for phylogenetic analysis, 3) applying site heterogeneous mixture models that account for the sequence composition bias. With the integrated phylogenomic approach, we are able to for the first time place mitochondria unequivocally within the Rickettsiales order, as a sister clade to the Rickettsiaceae and Anaplasmataceae families, all subtended by the Holosporaceae family. Our results suggest that mitochondria most likely originated from a Rickettsiales endosymbiont already residing in the host, but not from the distantly related free-living Pelagibacter and Rhodospirillales.
Full text
Available for:
IZUM, KILJ, NUK, PILJ, PNG, SAZU, UL, UM, UPUK
The explosive growth of genomic data provides an opportunity to make increased use of protein markers for phylogenetic inference. We have developed an automated pipeline for phylogenomic analysis ...(AMPHORA) that overcomes the existing bottlenecks limiting large-scale protein phylogenetic inference. We demonstrated its high throughput capabilities and high quality results by constructing a genome tree of 578 bacterial species and by assigning phylotypes to 18,607 protein markers identified in metagenomic data collected from the Sargasso Sea.
Molecular evolutionary rate varies significantly among species and a strict global molecular clock has been rejected across the tree of life. Generation time is one primary life-history trait that ...influences the molecular evolutionary rate. Theory predicts that organisms with shorter generation times evolve faster because of the accumulation of more DNA replication errors per unit time. Although the generation-time effect has been demonstrated consistently in plants and animals, the evidence of its existence in bacteria is lacking. The bacterial phylum Firmicutes offers an excellent system for testing generation-time effect because some of its members can enter a dormant, nonreproductive endospore state in response to harsh environmental conditions. It follows that spore-forming bacteria would—with their longer generation times—evolve more slowly than their nonspore-forming relatives. It is therefore surprising that a previous study found no generation-time effect in Firmicutes. Using a phylogenetic comparative approach and leveraging on a large number of Firmicutes genomes, we found sporulation significantly reduces the genome-wide spontaneous DNA mutation rate and protein evolutionary rate. Contrary to the previous study, our results provide strong evidence that the evolutionary rates of bacteria, like those of plants and animals, are influenced by generation time.
Full text
Available for:
BFBNIB, FZAB, GIS, IJS, KILJ, NLZOH, NMLJ, NUK, OILJ, PNG, SAZU, SBCE, SBMB, UL, UM, UPUK
The lack of a consensus bacterial species concept greatly hampers our ability to understand and organize bacterial diversity. Operational taxonomic units (OTUs), which are clustered on the basis of ...DNA sequence identity alone, are the most commonly used microbial diversity unit. Although it is understood that OTUs can be phylogenetically incoherent, the degree and the extent of the phylogenetic inconsistency have not been explicitly studied. Here, we tested the phylogenetic signal of OTUs in a broad range of bacterial genera from various phyla. Strikingly, we found that very few OTUs were monophyletic, and many showed evidence of multiple independent origins. Using previously established bacterial habitats as benchmarks, we showed that OTUs frequently spanned multiple ecological habitats. We demonstrated that ecological heterogeneity within OTUs is caused by their phylogenetic inconsistency, and not merely due to 'lumping' of taxa resulting from using relaxed identity cut-offs. We argue that ecotypes, as described by the Stable Ecotype Model, are phylogenetically and ecologically more consistent than OTUs and therefore could serve as an alternative unit for bacterial diversity studies. In addition, we introduce QuickES, a new wrapper program for the Ecotype Simulation algorithm, which is capable of demarcating ecotypes in data sets with tens of thousands of sequences.
Uncertainty in multiple sequence alignments has a large impact on phylogenetic analyses. Little has been done to evaluate the quality of individual positions in protein sequence alignments, which ...directly impact the accuracy of phylogenetic trees. Here we describe ZORRO, a probabilistic masking program that accounts for alignment uncertainty by assigning confidence scores to each alignment position. Using the BALIBASE database and in simulation studies, we demonstrate that masking by ZORRO significantly reduces the alignment uncertainty and improves the tree accuracy.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
The abundance of different SSU rRNA ("16S") gene sequences in environmental samples is widely used in studies of microbial ecology as a measure of microbial community structure and diversity. ...However, the genomic copy number of the 16S gene varies greatly - from one in many species to up to 15 in some bacteria and to hundreds in some microbial eukaryotes. As a result of this variation the relative abundance of 16S genes in environmental samples can be attributed both to variation in the relative abundance of different organisms, and to variation in genomic 16S copy number among those organisms. Despite this fact, many studies assume that the abundance of 16S gene sequences is a surrogate measure of the relative abundance of the organisms containing those sequences. Here we present a method that uses data on sequences and genomic copy number of 16S genes along with phylogenetic placement and ancestral state estimation to estimate organismal abundances from environmental DNA sequence data. We use theory and simulations to demonstrate that 16S genomic copy number can be accurately estimated from the short reads typically obtained from high-throughput environmental sequencing of the 16S gene, and that organismal abundances in microbial communities are more strongly correlated with estimated abundances obtained from our method than with gene abundances. We re-analyze several published empirical data sets and demonstrate that the use of gene abundance versus estimated organismal abundance can lead to different inferences about community diversity and structure and the identity of the dominant taxa in microbial communities. Our approach will allow microbial ecologists to make more accurate inferences about microbial diversity and abundance based on 16S sequence data.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Genome size and complexity vary tremendously among eukaryotic species and their organelles. Comparisons across deeply divergent eukaryotic lineages have suggested that variation in mutation rates may ...explain this diversity, with increased mutational burdens favoring reduced genome size and complexity. The discovery that mitochondrial mutation rates can differ by orders of magnitude among closely related angiosperm species presents a unique opportunity to test this hypothesis. We sequenced the mitochondrial genomes from two species in the angiosperm genus Silene with recent and dramatic accelerations in their mitochondrial mutation rates. Contrary to theoretical predictions, these genomes have experienced a massive proliferation of noncoding content. At 6.7 and 11.3 Mb, they are by far the largest known mitochondrial genomes, larger than most bacterial genomes and even some nuclear genomes. In contrast, two slowly evolving Silene mitochondrial genomes are smaller than average for angiosperms. Consequently, this genus captures approximately 98% of known variation in organelle genome size. The expanded genomes reveal several architectural changes, including the evolution of complex multichromosomal structures (with 59 and 128 circular-mapping chromosomes, ranging in size from 44 to 192 kb). They also exhibit a substantial reduction in recombination and gene conversion activity as measured by the relative frequency of alternative genome conformations and the level of sequence divergence between repeat copies. The evolution of mutation rate, genome size, and chromosome structure can therefore be extremely rapid and interrelated in ways not predicted by current evolutionary theories. Our results raise the hypothesis that changes in recombinational processes, including gene conversion, may be a central force driving the evolution of both mutation rate and genome structure.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Reconstruction of mitochondrial ancestor has great impact on our understanding of the origin of mitochondria. Previous studies have largely focused on reconstructing the last common ancestor of all ...contemporary mitochondria (proto-mitochondria), but not on the more informative pre-mitochondria (the last common ancestor of mitochondria and their alphaproteobacterial sister clade). Using a phylogenomic approach and leveraging on the increased taxonomic sampling of alphaproteobacterial and eukaryotic genomes, we reconstructed the metabolisms of both proto-mitochondria and pre-mitochondria. Our reconstruction depicts a more streamlined proto-mitochondrion than these predicted by previous studies, and revealed several novel insights into the mitochondria-derived eukaryotic metabolisms including the lipid metabolism. Most strikingly, pre-mitochondrion was predicted to possess a plastid/parasite type of ATP/ADP translocase that imports ATP from the host, which posits pre-mitochondrion as an energy parasite that directly contrasts with the current role of mitochondria as the cell's energy producer. In addition, pre-mitochondrion was predicted to encode a large number of flagellar genes and several cytochrome oxidases functioning under low oxygen level, strongly supporting the previous finding that the mitochondrial ancestor was likely motile and capable of oxidative phosphorylation under microoxic condition.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Adaptive radiations provide unique opportunities to test whether and how recent ecological and evolutionary diversification of host species structures the composition of entire bacterial communities. ...We used 16S rRNA gene sequencing of faecal samples to test for differences in the gut microbiota of six species of Puerto Rican Anolis lizards characterized by the evolution of distinct ‘ecomorphs’ related to differences in habitat use. We found substantial variation in the composition of the microbiota within each species and ecomorph (trunk‐crown, trunk‐ground, grass‐bush), but no differences in bacterial alpha diversity among species or ecomorphs. Beta diversity analyses revealed subtle but significant differences in bacterial composition related to host phylogeny and species, but these differences were not consistently associated with Anolis ecomorph. Comparison of a trunk‐ground species from this clade (A. cristatellus) with a distantly related member of the same ecomorph class (A. sagrei) where the two species have been introduced and are now sympatric in Florida revealed pronounced differences in the alpha diversity and beta diversity of their microbiota despite their ecological similarity. Comparisons of these populations with allopatric conspecifics also revealed geographic differences in bacterial alpha diversity and beta diversity within each species. Finally, we observed high intraindividual variation over time and strong effects of a simplified laboratory diet on the microbiota of A. sagrei. Collectively, our results indicate that bacterial communities are only weakly shaped by the diversification of their lizard hosts due to the strikingly high levels of bacterial diversity and variation observed within Anolis species.
Full text
Available for:
BFBNIB, FZAB, GIS, IJS, KILJ, NLZOH, NUK, OILJ, SBCE, SBMB, UL, UM, UPUK