Plastid phylogenomic analysis of green plants Gitzendanner, Matthew A.; Soltis, Pamela S.; Wong, Gane K.-S. ...
American journal of botany,
03/2018, Volume:
105, Issue:
3
Journal Article
Peer reviewed
Open access
Premise of the Study
For the past one billion years, green plants (Viridiplantae) have dominated global ecosystems, yet many key branches in their evolutionary history remain poorly resolved. Using ...the largest analysis of Viridiplantae based on plastid genome sequences to date, we examined the phylogeny and implications for morphological evolution at key nodes.
Methods
We analyzed amino acid sequences from protein‐coding genes from complete (or nearly complete) plastomes for 1879 taxa, including representatives across all major clades of Viridiplantae. Much of the data used was derived from transcriptomes from the One Thousand Plants Project (1KP); other data were taken from GenBank.
Key Results
Our results largely agree with previous plastid‐based analyses. Noteworthy results include (1) the position of Zygnematophyceae as sister to land plants (Embryophyta), (2) a bryophyte clade (hornworts, mosses + liverworts), (3) Equisetum + Psilotaceae as sister to Marattiales + leptosporangiate ferns, (4) cycads + Ginkgo as sister to the remaining extant gymnosperms, within which Gnetophyta are placed within conifers as sister to non‐Pinaceae (Gne‐Cup hypothesis), and (5) Amborella, followed by water lilies (Nymphaeales), as successive sisters to all other extant angiosperms. Within angiosperms, there is support for Mesangiospermae, a clade that comprises magnoliids, Chloranthales, monocots, Ceratophyllum, and eudicots. The placements of Ceratophyllum and Dilleniaceae remain problematic. Within Pentapetalae, two major clades (superasterids and superrosids) are recovered.
Conclusions
This plastid data set provides an important resource for elucidating morphological evolution, dating divergence times in Viridiplantae, comparisons with emerging nuclear phylogenies, and analyses of molecular evolutionary patterns and dynamics of the plastid genome.
Full text
Available for:
FZAB, GIS, IJS, KILJ, NLZOH, NUK, OILJ, SBCE, SBMB, UL, UM, UPUK
Next-generation sequencing has provided a wealth of plastid genome sequence data from an increasingly diverse set of green plants (Viridiplantae). Although these data have helped resolve the ...phylogeny of numerous clades (e.g., green algae, angiosperms, and gymnosperms), their utility for inferring relationships across all green plants is uncertain. Viridiplantae originated 700-1500 million years ago and may comprise as many as 500,000 species. This clade represents a major source of photosynthetic carbon and contains an immense diversity of life forms, including some of the smallest and largest eukaryotes. Here we explore the limits and challenges of inferring a comprehensive green plant phylogeny from available complete or nearly complete plastid genome sequence data.
We assembled protein-coding sequence data for 78 genes from 360 diverse green plant taxa with complete or nearly complete plastid genome sequences available from GenBank. Phylogenetic analyses of the plastid data recovered well-supported backbone relationships and strong support for relationships that were not observed in previous analyses of major subclades within Viridiplantae. However, there also is evidence of systematic error in some analyses. In several instances we obtained strongly supported but conflicting topologies from analyses of nucleotides versus amino acid characters, and the considerable variation in GC content among lineages and within single genomes affected the phylogenetic placement of several taxa.
Analyses of the plastid sequence data recovered a strongly supported framework of relationships for green plants. This framework includes: i) the placement of Zygnematophyceace as sister to land plants (Embryophyta), ii) a clade of extant gymnosperms (Acrogymnospermae) with cycads + Ginkgo sister to remaining extant gymnosperms and with gnetophytes (Gnetophyta) sister to non-Pinaceae conifers (Gnecup trees), and iii) within the monilophyte clade (Monilophyta), Equisetales + Psilotales are sister to Marattiales + leptosporangiate ferns. Our analyses also highlight the challenges of using plastid genome sequences in deep-level phylogenomic analyses, and we provide suggestions for future analyses that will likely incorporate plastid genome sequence data for thousands of species. We particularly emphasize the importance of exploring the effects of different partitioning and character coding strategies.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Conflicting relationships have been found between diversification rate and temperature across disparate clades of life. Here, we use a supermatrix comprising nearly 20,000 species of rosids-a clade ...of ~25% of all angiosperm species-to understand global patterns of diversification and its climatic association. Our approach incorporates historical global temperature, assessment of species' temperature niche, and two broad-scale characterizations of tropical versus non-tropical niche occupancy. We find the diversification rates of most subclades dramatically increased over the last 15 million years (Myr) during cooling associated with global expansion of temperate habitats. Climatic niche is negatively associated with diversification rates, with tropical rosids forming older communities and experiencing speciation rates ~2-fold below rosids in cooler climates. Our results suggest long-term cooling had a disproportionate effect on non-tropical diversification rates, leading to dynamic young communities outside of the tropics, while relative stability in tropical climes led to older, slower-evolving but still species-rich communities.
Angiosperms are by far the most species-rich clade of land plants, but their origin and early evolutionary history remain poorly understood. We reconstructed angiosperm phylogeny based on 80 genes ...from 2,881 plastid genomes representing 85% of extant families and all orders. With a well-resolved plastid tree and 62 fossil calibrations, we dated the origin of the crown angiosperms to the Upper Triassic, with major angiosperm radiations occurring in the Jurassic and Lower Cretaceous. This estimated crown age is substantially earlier than that of unequivocal angiosperm fossils, and the difference is here termed the 'Jurassic angiosperm gap'. Our time-calibrated plastid phylogenomic tree provides a highly relevant framework for future comparative studies of flowering plant evolution.
Full text
Available for:
EMUNI, FIS, FZAB, GEOZS, GIS, IJS, IMTLJ, KILJ, KISLJ, MFDPS, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, SBMB, SBNM, UKNU, UL, UM, UPUK, VKSCE, ZAGLJ
Flowering plants (angiosperms) are dominant components of global terrestrial ecosystems, but phylogenetic relationships at the familial level and above remain only partially resolved, greatly ...impeding our full understanding of their evolution and early diversification. The plastome, typically mapped as a circular genome, has been the most important molecular data source for plant phylogeny reconstruction for decades.
Here, we assembled by far the largest plastid dataset of angiosperms, composed of 80 genes from 4792 plastomes of 4660 species in 2024 genera representing all currently recognized families. Our phylogenetic tree (PPA II) is essentially congruent with those of previous plastid phylogenomic analyses but generally provides greater clade support. In the PPA II tree, 75% of nodes at or above the ordinal level and 78% at or above the familial level were resolved with high bootstrap support (BP ≥ 90). We obtained strong support for many interordinal and interfamilial relationships that were poorly resolved previously within the core eudicots, such as Dilleniales, Saxifragales, and Vitales being resolved as successive sisters to the remaining rosids, and Santalales, Berberidopsidales, and Caryophyllales as successive sisters to the asterids. However, the placement of magnoliids, although resolved as sister to all other Mesangiospermae, is not well supported and disagrees with topologies inferred from nuclear data. Relationships among the five major clades of Mesangiospermae remain intractable despite increased sampling, probably due to an ancient rapid radiation.
We provide the most comprehensive dataset of plastomes to date and a well-resolved phylogenetic tree, which together provide a strong foundation for future evolutionary studies of flowering plants.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Summary
Free‐living cyanobacteria were entrapped by eukaryotic cells ~2 billion years ago, ultimately giving rise to chloroplasts. After a century of debate, the presence of chloroplast DNA was ...demonstrated in the 1960s. The first chloroplast genomes were sequenced in the 1980s, followed by ~100 vegetable, fruit, cereal, beverage, oil and starch/sugar crop chloroplast genomes in the past three decades. Foreign genes were expressed in isolated chloroplasts or intact plant cells in the late 1980s and stably integrated into chloroplast genomes, with typically maternal inheritance shown in the 1990s. Since then, chloroplast genomes conferred the highest reported levels of tolerance or resistance to biotic or abiotic stress. Although launching products with agronomic traits in important crops using this concept has been elusive, commercial products developed include enzymes used in everyday life from processing fruit juice, to enhancing water absorption of cotton fibre or removal of stains as laundry detergents and in dye removal in the textile industry. Plastid genome sequences have revealed the framework of green plant phylogeny as well as the intricate history of plastid genome transfer events to other eukaryotes. Discordant historical signals among plastid genes suggest possible variable constraints across the plastome and further understanding and mitigation of these constraints may yield new opportunities for bioengineering. In this review, we trace the evolutionary history of chloroplasts, status of autonomy and recent advances in products developed for everyday use or those advanced to the clinic, including treatment of COVID‐19 patients and SARS‐CoV‐2 vaccine.
Full text
Available for:
BFBNIB, DOBA, FZAB, GIS, IJS, IZUM, KILJ, NLZOH, NUK, OILJ, PILJ, PNG, SAZU, SBCE, SBMB, UILJ, UKNU, UL, UM, UPUK
Premise
Discordance between nuclear and organellar phylogenies (cytonuclear discordance) is a well‐documented phenomenon at shallow evolutionary levels but has been poorly investigated at deep levels ...of plant phylogeny. Determining the extent of cytonuclear discordance across major plant lineages is essential not only for elucidating evolutionary processes, but also for evaluating the currently used framework of plant phylogeny, which is largely based on the plastid genome.
Methods
We present a phylogenomic examination of a major angiosperm clade (Asteridae) based on sequence data from the nuclear, plastid, and mitochondrial genomes as a means of evaluating currently accepted relationships inferred from the plastome and exploring potential sources of genomic conflict in this group.
Results
We recovered at least five instances of well‐supported cytonuclear discordance concerning the placements of major asterid lineages (i.e., Ericales, Oncothecaceae, Aquifoliales, Cassinopsis, and Icacinaceae). We attribute this conflict to a combination of incomplete lineage sorting and hybridization, the latter supported in part by previously inferred whole‐genome duplications.
Conclusions
Our results challenge several long‐standing hypotheses of asterid relationships and have implications for morphological character evolution and for the importance of ancient whole‐genome duplications in early asterid evolution. These findings also highlight the value of reevaluating broad‐scale angiosperm and green‐plant phylogeny with nuclear genomic data.
Full text
Available for:
FZAB, GIS, IJS, KILJ, NLZOH, NUK, OILJ, SBCE, SBMB, UL, UM, UPUK
The 1,000 plants (1KP) project is an international multi-disciplinary consortium that has generated transcriptome data from over 1,000 plant species, with exemplars for all of the major lineages ...across the Viridiplantae (green plants) clade. Here, we describe how to access the data used in a phylogenomics analysis of the first 85 species, and how to visualize our gene and species trees. Users can develop computational pipelines to analyse these data, in conjunction with data of their own that they can upload. Computationally estimated protein-protein interactions and biochemical pathways can be visualized at another site. Finally, we comment on our future plans and how they fit within this scalable system for the dissemination, visualization, and analysis of large multi-species data sets.
Significance Early branching events in the diversification of land plants and closely related algal lineages remain fundamental and unresolved questions in plant evolutionary biology. Accurate ...reconstructions of these relationships are critical for testing hypotheses of character evolution: for example, the origins of the embryo, vascular tissue, seeds, and flowers. We investigated relationships among streptophyte algae and land plants using the largest set of nuclear genes that has been applied to this problem to date. Hypothesized relationships were rigorously tested through a series of analyses to assess systematic errors in phylogenetic inference caused by sampling artifacts and model misspecification. Results support some generally accepted phylogenetic hypotheses, while rejecting others. This work provides a new framework for studies of land plant evolution.
Reconstructing the origin and evolution of land plants and their algal relatives is a fundamental problem in plant phylogenetics, and is essential for understanding how critical adaptations arose, including the embryo, vascular tissue, seeds, and flowers. Despite advances in molecular systematics, some hypotheses of relationships remain weakly resolved. Inferring deep phylogenies with bouts of rapid diversification can be problematic; however, genome-scale data should significantly increase the number of informative characters for analyses. Recent phylogenomic reconstructions focused on the major divergences of plants have resulted in promising but inconsistent results. One limitation is sparse taxon sampling, likely resulting from the difficulty and cost of data generation. To address this limitation, transcriptome data for 92 streptophyte taxa were generated and analyzed along with 11 published plant genome sequences. Phylogenetic reconstructions were conducted using up to 852 nuclear genes and 1,701,170 aligned sites. Sixty-nine analyses were performed to test the robustness of phylogenetic inferences to permutations of the data matrix or to phylogenetic method, including supermatrix, supertree, and coalescent-based approaches, maximum-likelihood and Bayesian methods, partitioned and unpartitioned analyses, and amino acid versus DNA alignments. Among other results, we find robust support for a sister-group relationship between land plants and one group of streptophyte green algae, the Zygnematophyceae. Strong and robust support for a clade comprising liverworts and mosses is inconsistent with a widely accepted view of early land plant evolution, and suggests that phylogenetic hypotheses used to understand the evolution of fundamental plant traits should be reevaluated.
Full text
Available for:
BFBNIB, NMLJ, NUK, PNG, SAZU, UL, UM, UPUK