Genome size is a biodiversity trait that shows staggering diversity across eukaryotes, varying over 64,000-fold. Of all major taxonomic groups, land plants stand out due to their exceptional genome size diversity, ranging ca. 2400-fold. As our understanding of the implications and significance of this remarkable genome size diversity in land plants grows, it is becoming increasingly evident that this trait plays an important role not only in shaping the evolution of plant genomes, but also in influencing plant community assemblages at the ecosystem level. Recent advances in sequencing technologies and analytical tools make it possible to gain critical insights into the genomic and epigenetic mechanisms underpinning genome size changes. In this review we provide an overview of our current understanding of genome size diversity across the different land plant groups, its implications for the biology of the genome and what future directions need to be addressed to fill key knowledge gaps.
Sequencing of target-enriched libraries is an efficient and cost-effective method for obtaining DNA sequence data from hundreds of nuclear loci for phylogeny reconstruction. Much of the cost of developing targeted sequencing approaches is associated with the generation of preliminary data needed for the identification of orthologous loci for probe design. In plants, identifying orthologous loci has proven difficult due to a large number of whole-genome duplication events, especially in the angiosperms (flowering plants). We used multiple sequence alignments from over 600 angiosperms for 353 putatively single-copy protein-coding genes identified by the One Thousand Plant Transcriptomes Initiative to design a set of targeted sequencing probes for phylogenetic studies of any angiosperm group. To maximize the phylogenetic potential of the probes, while minimizing the cost of production, we introduce a k-medoids clustering approach to identify the minimum number of sequences necessary to represent each coding sequence in the final probe set. Using this method, 5–15 representative sequences were selected per orthologous locus, representing the sequence diversity of angiosperms more efficiently than if probes were designed using available sequenced genomes alone. To test our approximately 80,000 probes, we hybridized libraries from 42 species spanning all higher-order groups of angiosperms, with a focus on taxa not present in the sequence alignments used to design the probes. Out of a possible 353 coding sequences, we recovered an average of 283 per species and at least 100 in all species. Differences among taxa in sequence recovery could not be explained by relatedness to the representative taxa selected for probe design, suggesting that there is no phylogenetic bias in the probe set.
Our probe set, which targeted 260 kbp of coding sequence, achieved a median recovery of 137 kbp per taxon in coding regions, a maximum recovery of 250 kbp, and an additional median of 212 kbp per taxon in flanking non-coding regions across all species. These results suggest that the Angiosperms353 probe set described here is effective for any group of flowering plants and would be useful for phylogenetic studies from the species level to higher-order groups, including the entire angiosperm clade itself.
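The idea of choosing a few representative sequences per locus by k-medoids clustering can be illustrated with a minimal sketch. The abstract does not state the distance measure or the implementation used, so everything here is an assumption made for illustration: the uncorrected p-distance, the simple alternating assign/update loop, the deterministic starting medoids, and the function names `p_distance` and `k_medoids`.

```python
from typing import List


def p_distance(a: str, b: str) -> float:
    """Proportion of differing sites between two aligned sequences.

    Sites where either sequence has a gap ('-') are ignored
    (an assumption, not taken from the abstract).
    """
    pairs = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not pairs:
        return 1.0
    return sum(x != y for x, y in pairs) / len(pairs)


def k_medoids(dist: List[List[float]], k: int, max_iter: int = 100) -> List[int]:
    """Return the indices of k medoids for a square distance matrix.

    Alternates between assigning every item to its nearest medoid and
    moving each medoid to the cluster member with the smallest summed
    distance to the other members, until the medoids stop changing.
    """
    n = len(dist)
    medoids = list(range(k))  # deterministic start: the first k items
    for _ in range(max_iter):
        # Assignment step: each item joins the cluster of its nearest medoid.
        clusters = {m: [] for m in medoids}
        for i in range(n):
            nearest = min(medoids, key=lambda m: dist[i][m])
            clusters[nearest].append(i)
        # Update step: pick the most central member of each cluster.
        new_medoids = []
        for m, members in clusters.items():
            if not members:  # can happen when two medoids are identical
                new_medoids.append(m)
                continue
            best = min(members, key=lambda c: sum(dist[c][j] for j in members))
            new_medoids.append(best)
        if sorted(new_medoids) == sorted(medoids):
            break
        medoids = new_medoids
    return sorted(medoids)


# Toy alignment for one hypothetical locus: two clearly distinct groups.
seqs = [
    "AAAAAAAA", "AAAAAAAT", "AAAAAATT",
    "CCCCCCCC", "CCCCCCCG", "CCCCCCGG",
]
dist = [[p_distance(a, b) for b in seqs] for a in seqs]
representatives = k_medoids(dist, k=2)
```

In this toy case the two selected indices are the central sequence of each group. Because medoids are actual members of the data set, each one corresponds to a real sequence from which a probe could be designed, which is the property that makes k-medoids a natural fit here.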
We investigate the timing of diversification in allopolyploids of Nicotiana (Solanaceae) utilising sequence data of maternal and paternal origin to look for evidence of a lag phase during which diploidisation took place. Bayesian relaxed clock phylogenetic methods show recent allopolyploids are a result of several unique polyploidisation events, and older allopolyploid sections have undergone subsequent speciation at the polyploid level (i.e. a number of these polyploid species share a singular origin). The independently formed recent polyploid species in the genus all have mean age estimates below 1 million years ago (Ma). Nicotiana section Polydicliae (two species) evolved 1.5 Ma, N. section Repandae (four species) formed 4 Ma, and N. section Suaveolentes (~35 species) is about 6 million years old. A general trend of higher speciation rates in older polyploids is evident, but diversification dramatically increases at approximately 6 Ma (in section Suaveolentes). Nicotiana sect. Suaveolentes has spectacularly radiated to form 35 species in Australia and some Pacific islands following a lag phase of almost 6 million years. Species have filled new ecological niches and undergone extensive diploidisation (e.g. chromosome fusions bringing the ancestral allotetraploid number, n = 24, down to n = 15 and ribosomal loci numbers back to diploid condition). Considering the progenitors of Suaveolentes inhabit South America, this represents the colonisation of Australia by polyploids that have subsequently undergone a recent radiation into new environments. To our knowledge, this study is the first report of a substantial lag phase being investigated below the family level.
Recent phylogenomic analyses based on the maternally inherited plastid organelle have clarified evolutionary relationships between the subfamilies of Orchidaceae and most of the tribes. However, uncertainty remains within several subtribes and genera for which phylogenetic relationships have never been tested in a phylogenomic context. To address these knowledge gaps, we here provide the most extensively sampled analysis of the orchid family to date, based on 78 plastid coding genes representing 264 species, 117 genera, 18 tribes and 28 subtribes. Divergence times are also provided as inferred from strict and relaxed molecular clocks and birth-death tree models. Our taxon sampling includes 51 newly sequenced plastid genomes produced by a genome skimming approach. We focus our sampling efforts on previously unplaced clades within tribes Cymbidieae and Epidendreae. Our results confirmed phylogenetic relationships in Orchidaceae as recovered in previous studies, most of which were recovered with maximum support (209 of the 262 tree branches). We provide for the first time a clear phylogenetic placement for Codonorchideae within subfamily Orchidoideae, and Podochilieae and Collabieae within subfamily Epidendroideae. We also identify relationships that have been persistently problematic across multiple studies, regardless of the different details of sampling and genomic datasets used for phylogenetic reconstructions. Our study provides an expanded, robust temporal phylogenomic framework of the Orchidaceae that paves the way for biogeographical and macroevolutionary studies.
Mitochondrial genomes are readily sequenced with recent technology and thus evolutionary lineages can be densely sampled. This permits better phylogenetic estimates and assessment of potential biases resulting from heterogeneity in nucleotide composition and rate of change. We gathered 245 mitochondrial sequences for the Coleoptera representing all 4 suborders, 15 superfamilies of Polyphaga, and altogether 97 families, including 159 newly sequenced full or partial mitogenomes. Compositional heterogeneity greatly affected 3rd codon positions, and to a lesser extent the 1st and 2nd positions, even after RY coding. Heterogeneity also affected the encoded protein sequence, in particular in the nad2, nad4, nad5, and nad6 genes. Credible tree topologies were obtained with the nhPhyML ("nonhomogeneous") algorithm implementing a model for branch-specific equilibrium frequencies. Likelihood searches using RAxML were improved by data partitioning by gene and codon position. Finally, the PhyloBayes software, which allows different substitution processes for amino acid replacement at various sites, produced a tree that best matched known higher level taxa and defined basal relationships in Coleoptera. After rooting with Neuropterida outgroups, suborder relationships were resolved as (Polyphaga (Myxophaga (Archostemata + Adephaga))). The infraorder relationships in Polyphaga were (Scirtiformia (Elateriformia ((Staphyliniformia + Scarabaeiformia) (Bostrichiformia (Cucujiformia))))). Polyphagan superfamilies were recovered as monophyla except Staphylinoidea (paraphyletic for Scarabaeiformia) and Cucujoidea, which can no longer be considered a valid taxon. The study shows that, although compositional heterogeneity is not universal, it cannot be eliminated for some mitochondrial genes, but dense taxon sampling and the use of appropriate Bayesian analyses can still produce robust phylogenetic trees.
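For readers unfamiliar with the RY coding mentioned above, it reduces the four nucleotides to two states, purines (A, G) as R and pyrimidines (C, T) as Y, so that only transversions remain visible. The sketch below shows this standard recoding in general terms; it is not the pipeline used in the study, and the names `ry_recode` and `ry_recode_third_positions` are hypothetical helpers introduced for illustration.

```python
# Purines -> R, pyrimidines -> Y; gaps and ambiguity codes are left as they are.
RY_MAP = str.maketrans({"A": "R", "G": "R", "C": "Y", "T": "Y", "U": "Y"})


def ry_recode(seq: str) -> str:
    """RY-recode every site of a nucleotide sequence."""
    return seq.upper().translate(RY_MAP)


def ry_recode_third_positions(seq: str) -> str:
    """RY-recode only 3rd codon positions, assuming the sequence is in frame."""
    seq = seq.upper()
    return "".join(
        base.translate(RY_MAP) if i % 3 == 2 else base
        for i, base in enumerate(seq)
    )
```

Recoding only the third positions is a common variant, since those sites usually show the strongest compositional bias; whether recoding was applied to all positions or to a subset is not stated in the abstract.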
The world’s herbaria collectively house millions of diverse plant specimens, including endangered or extinct species and type specimens. Unlocking genetic data from the typically highly degraded DNA obtained from herbarium specimens was difficult until the arrival of high-throughput sequencing approaches, which can be applied to low quantities of severely fragmented DNA. Target enrichment involves using short molecular probes that hybridise and capture genomic regions of interest for high-throughput sequencing. In this study on herbariomics, we used this targeted sequencing approach and the Angiosperms353 universal probe set to recover up to 351 nuclear genes from 435 herbarium specimens that are up to 204 years old and span the breadth of angiosperm diversity. We show that on average 207 genes were successfully retrieved from herbarium specimens, although the mean number of genes retrieved and target enrichment efficiency are significantly higher for silica gel-dried specimens. Forty-seven target nuclear genes were recovered from a herbarium specimen of the critically endangered St Helena boxwood, Mellissia begoniifolia, collected in 1815. Herbarium specimens yield significantly less high-molecular-weight DNA than silica gel-dried specimens, and genomic DNA quality declines with sample age, which is negatively correlated with target enrichment efficiency. Climate, taxon-specific traits, and collection strategies additionally impact target sequence recovery. We also detected taxonomic bias in targeted sequencing outcomes for the 10 most numerous angiosperm families that were investigated in depth. We recommend that (1) for species distributed in wet tropical climates, silica gel-dried specimens should be used preferentially; (2) for species distributed in seasonally dry tropical climates, herbarium and silica gel-dried specimens yield similar results, and either collection can be used; (3) taxon-specific traits should be explored and established for effective optimisation of taxon-specific studies using herbarium specimens; (4) all herbarium sheets should, in future, be annotated with details of the preservation method used; (5) long-term storage of herbarium specimens should be in stable, low-humidity, and low-temperature environments; and (6) targeted sequencing with universal probes, such as Angiosperms353, should be investigated closely as a new approach for DNA barcoding that will ensure better exploitation of herbarium specimens than traditional Sanger sequencing approaches.
Extremely high levels of plant diversity in the American tropics are derived from multiple interactions between biotic and abiotic factors. Previous studies have focused on macro-evolutionary dynamics of the Tropical Andes, Amazonia, and Brazil's Cerrado and Atlantic forests during the last decade. Yet, other equally important Neotropical biodiversity hotspots have been severely neglected. This is particularly true for the Chocó region on the north-western coast of South and Central America. This geologically complex region is Earth's ninth most biodiverse hotspot, hosting approximately 3% of all known plant species. Here, we test Gentry's (1982a,b) hypothesis of a northern Andean-Central American Pleistocene origin of the Chocoan flora using phylogenetic reconstructions of representative plant lineages in the American tropics. We show that plant diversity in the Chocó is derived mostly from Andean immigrants. Contributions from more distant biogeographical areas also exist but are fewer. We also identify a strong floristic connection between the Chocó and Central America, revealed by multiple migrations into the Chocó during the last 5 Ma. The dated phylogenetic reconstructions suggest a Plio-Pleistocene onset of the extant Chocó flora. Taken together, these results support, to a limited extent, Gentry's hypothesis of a Pleistocene origin and of a compound assembly of the Chocoan biodiversity hotspot. Strong Central American-Chocoan floristic affinity may be partly explained by the accretion of a land mass derived from the Caribbean plate to north-western South America. Additional densely sampled phylogenies of Chocoan lineages that are also well represented across the Neotropics could illuminate the role of land mass movements through time in the assembly of floras in Neotropical biodiversity hotspots.
At the shoot apex of plants is a small region known as the shoot apical meristem (SAM) that maintains a population of undifferentiated (stem) cells whilst providing cells for developing lateral organs and the stem. All aerial structures of the plant develop from the SAM post-embryogenesis, enabling plants to grow in a characteristic modular fashion with great phenotypic and developmental plasticity throughout their lifetime. The maintenance of the stem cell population is intimately balanced with cell recruitment into differentiating tissues through intercellular communication involving a complex signalling network. Recent studies have shown that diverse regulators function in SAM maintenance, many of which converge on the WUSCHEL (WUS) gene. In this review the diverse regulatory modules that function in SAM maintenance are discussed: transcriptional and epigenetic control, hormonal regulation, and the balance with organogenesis. The central role of WUS as an integrator of multiple signals is highlighted; in addition, accessory feedback loops emerge as a feature enabling dynamic regulation of the stem cell niche.
In petaloid monocots, expansion of B-gene expression into whorl 1 of the flower results in two whorls of petaloid organs (tepals), as opposed to sepals in whorl 1 of typical eudicot flowers. Recently, new gene-silencing technologies have provided the first functional data to support this, in the genus Tricyrtis (Liliaceae).