Abstract
Background and Aims
With the advance of high-throughput sequencing, reduced-representation methods such as target capture sequencing (TCS) emerged as cost-efficient ways of gathering genomic ...information, particularly from coding regions. As the off-target reads from such sequencing are expected to be similar to genome skimming (GS), we assessed the quality of repeat characterization in plant genomes using these data.
Methods
Repeat composition obtained from TCS datasets of five Rhynchospora (Cyperaceae) species were compared with GS data from the same taxa. In addition, a FISH probe was designed based on the most abundant satellite found in the TCS dataset of Rhynchospora cephalotes. Finally, repeat-based phylogenies of the five Rhynchospora species were constructed based on the GS and TCS datasets and the topologies were compared with a gene-alignment-based phylogenetic tree.
Key Results
All the major repetitive DNA families were identified in TCS, including repeats that showed abundances as low as 0.01 % in the GS data. Rank correlations between GS and TCS repeat abundances were moderately high (r = 0.58–0.85), increasing after filtering out the targeted loci from the raw TCS reads (r = 0.66–0.92). Repeat data obtained by TCS were also reliable in developing a cytogenetic probe of a new variant of the holocentromeric satellite Tyba. Repeat-based phylogenies from TCS data were congruent with those obtained from GS data and the gene-alignment tree.
Conclusions
Our results show that off-target TCS reads can be recycled to identify repeats for cyto- and phylogenomic investigations. Given the growing availability of TCS reads, driven by global phylogenomic projects, our strategy represents a way to recycle genomic data and contribute to a better characterization of plant biodiversity.
Copepods are important ecologically and represent a large amount of aquatic biomass in both freshwater and marine systems. Despite this, the taxonomy of copepods and other meiofauna is not well ...understood, hampered by tiny sizes, cryptic taxa, intraspecific polymorphisms and total specimen destruction where DNA methods are employed. In this article we highlight these issues and propose a more up-to-date approach for dealing with them. Namely, we recommend non-destructive DNA extraction methods, coupled with high-throughput sequencing (HTS). Whilst DNA yields may be low, they should still be sufficient for HTS library preparation and DNA sequencing. At the same time morphological specimens can be preserved and the crucial link between morphology and DNA sequence is maintained. This is critical for an integrative taxonomy and a fuller understanding of biodiversity patterns as well as evolutionary processes in meiofauna.
Premise
Evolutionary studies require solid phylogenetic frameworks, but increased volumes of phylogenomic data have revealed incongruent topologies among gene trees in many organisms both between and ...within genomes. Some of these incongruences indicate polytomies that may remain impossible to resolve. Here we investigate the degree of gene‐tree discordance in Solanum, one of the largest flowering plant genera that includes the cultivated potato, tomato, and eggplant, as well as 24 minor crop plants.
Methods
A densely sampled species‐level phylogeny of Solanum is built using unpublished and publicly available Sanger sequences comprising 60% of all accepted species (742 spp.) and nine regions (ITS, waxy, and seven plastid markers). The robustness of this topology is tested by examining a full plastome dataset with 140 species and a nuclear target‐capture dataset with 39 species of Solanum (Angiosperms353 probe set).
Results
While the taxonomic framework of Solanum remained stable, gene tree conflicts and discordance between phylogenetic trees generated from the target‐capture and plastome datasets were observed. The latter correspond to regions with short internodal branches, and network analysis and polytomy tests suggest the backbone is composed of three polytomies found at different evolutionary depths. The strongest area of discordance, near the crown node of Solanum, could potentially represent a hard polytomy.
Conclusions
We argue that incomplete lineage sorting due to rapid diversification is the most likely cause for these polytomies, and that embracing the uncertainty that underlies them is crucial to understand the evolution of large and rapidly radiating lineages.
PREMISE
The inference of evolutionary relationships in the species‐rich family Orchidaceae has hitherto relied heavily on plastid DNA sequences and limited taxon sampling. Previous studies have ...provided a robust plastid phylogenetic framework, which was used to classify orchids and investigate the drivers of orchid diversification. However, the extent to which phylogenetic inference based on the plastid genome is congruent with the nuclear genome has been only poorly assessed.
METHODS
We inferred higher‐level phylogenetic relationships of orchids based on likelihood and ASTRAL analyses of 294 low‐copy nuclear genes sequenced using the Angiosperms353 universal probe set for 75 species (representing 69 genera, 16 tribes, 24 subtribes) and a concatenated analysis of 78 plastid genes for 264 species (117 genera, 18 tribes, 28 subtribes). We compared phylogenetic informativeness and support for the nuclear and plastid phylogenetic hypotheses.
RESULTS
Phylogenetic inference using nuclear data sets provides well‐supported orchid relationships that are highly congruent between analyses. Comparisons of nuclear gene trees and a plastid supermatrix tree showed that the trees are mostly congruent, but revealed instances of strongly supported phylogenetic incongruence in both shallow and deep time. The phylogenetic informativeness of individual Angiosperms353 genes is in general better than that of most plastid genes.
CONCLUSIONS
Our study provides the first robust nuclear phylogenomic framework for Orchidaceae and an assessment of intragenomic nuclear discordance, plastid‐nuclear tree incongruence, and phylogenetic informativeness across the family. Our results also demonstrate what has long been known but rarely thoroughly documented: nuclear and plastid phylogenetic trees can contain strongly supported discordances, and this incongruence must be reconciled prior to interpretation in evolutionary studies, such as taxonomy, biogeography, and character evolution.
Premise
To further advance the understanding of the species‐rich, economically and ecologically important angiosperm order Myrtales in the rosid clade, comprising nine families, approximately 400 ...genera and almost 14,000 species occurring on all continents (except Antarctica), we tested the Angiosperms353 probe kit.
Methods
We combined high‐throughput sequencing and target enrichment with the Angiosperms353 probe kit to evaluate a sample of 485 species across 305 genera (76% of all genera in the order).
Results
Results provide the most comprehensive phylogenetic hypothesis for the order to date. Relationships at all ranks, such as the relationship of the early‐diverging families, often reflect previous studies, but gene conflict is evident, and relationships previously found to be uncertain often remain so. Technical considerations for processing HTS data are also discussed.
Conclusions
High‐throughput sequencing and the Angiosperms353 probe kit are powerful tools for phylogenomic analysis, but better understanding of the genetic data available is required to identify genes and gene trees that account for likely incomplete lineage sorting and/or hybridization events.
Display omitted
•The genus Urera (Urticaceae) is polyphyletic, with 3 other genera nested within it.•Sanger & High-throughput sequencing resolved relationships among 6 ingroup clades.•Morphological & ...Geographic data aligned with 6 clades, enabling new delimitations.•Poikilospermum &Obetia unchanged; Touchardia gains Hawaiian endemic U. glabra.•Scepocarpus is resurrected for Afrotropical taxa; Urera s.s. only Neotropical.•Minimises taxonomic disruption, while reflecting evolutionary relationships.
Urera Gaudich, s.l. is a pantropical genus comprising c. 35 species of trees, shrubs, and vines. It has a long history of taxonomic uncertainty, and is repeatedly recovered as polyphyletic within a poorly resolved complex of genera in the Urticeae tribe of the nettle family (Urticaceae). To provide generic delimitations concordant with evolutionary history, we use increased taxonomic and genomic sampling to investigate phylogenetic relationships among Urera and associated genera. A cost-effective two-tier genome-sampling approach provides good phylogenetic resolution by using (i) a taxon-dense sample of Sanger sequence data from two barcoding regions to recover clades of putative generic rank, and (ii) a genome-dense sample of target-enrichment data for a subset of representative species from each well-supported clade to resolve relationships among them. The results confirm the polyphyly of Urera s.l. with respect to the morphologically distinct genera Obetia, Poikilospermum and Touchardia. Afrotropic members of Urera s.l. are recovered in a clade sister to the xerophytic African shrubs Obetia; and Hawaiian ones with Touchardia, also from Hawaii. Combined with distinctive morphological differences between Neotropical and African members of Urera s.l., these results lead us to resurrect the previously synonymised name Scepocarpus Wedd. for the latter. The new species epiphet Touchardia oahuensis T.Wells & A.K. Monro is offered as a replacement name for Touchardia glabra non H.St.John, and subgenera are created within Urera s.s. to account for the two morphologically distinct Neotropical clades. This new classification minimises taxonomic and nomenclatural disruption, while more accurately reflecting evolutionary relationships within the group.
The extent to which genome size and chromosome numbers evolve in concert is little understood, particularly after polyploidy (whole-genome duplication), when a genome returns to a diploid-like ...condition (diploidization). We study this phenomenon in 46 species of allotetraploid Nicotiana section Suaveolentes (Solanaceae), which formed <6 million years ago and radiated in the arid centre of Australia.
We analysed newly assessed genome sizes and chromosome numbers within the context of a restriction site-associated nuclear DNA (RADseq) phylogenetic framework.
RADseq generated a well-supported phylogenetic tree, in which multiple accessions from each species formed unique genetic clusters. Chromosome numbers and genome sizes vary from n = 2x = 15 to 24 and 2.7 to 5.8 pg/1C nucleus, respectively. Decreases in both genome size and chromosome number occur, although neither consistently nor in parallel. Species with the lowest chromosome numbers (n = 15-18) do not possess the smallest genome sizes and, although N. heterantha has retained the ancestral chromosome complement, n = 2x = 24, it nonetheless has the smallest genome size, even smaller than that of the modern representatives of ancestral diploids.
The results indicate that decreases in genome size and chromosome number occur in parallel down to a chromosome number threshold, n = 20, below which genome size increases, a phenomenon potentially explained by decreasing rates of recombination over fewer chromosomes. We hypothesize that, more generally in plants, major decreases in genome size post-polyploidization take place while chromosome numbers are still high because in these stages elimination of retrotransposons and other repetitive elements is more efficient. Once such major genome size change has been accomplished, then dysploid chromosome reductions take place to reorganize these smaller genomes, producing species with small genomes and low chromosome numbers such as those observed in many annual angiosperms, including Arabidopsis.
Given the 2,400-fold range of genome sizes (0.06-148.9 Gbp (gigabase pair)) of seed plants (angiosperms and gymnosperms) with a broadly similar gene content (amounting to approximately 0.03 Gbp), the ...repeat-sequence content of the genome might be expected to increase with genome size, resulting in the largest genomes consisting almost entirely of repetitive sequences. Here we test this prediction, using the same bioinformatic approach for 101 species to ensure consistency in what constitutes a repeat. We reveal a fundamental change in repeat turnover in genomes above around 10 Gbp, such that species with the largest genomes are only about 55% repetitive. Given that genome size influences many plant traits, habits and life strategies, this fundamental shift in repeat dynamics is likely to affect the evolutionary trajectory of species lineages.