Sequencing of target-enriched libraries is an efficient and cost-effective method for obtaining DNA sequence data from hundreds of nuclear loci for phylogeny reconstruction. Much of the cost of ...developing targeted sequencing approaches is associated with the generation of preliminary data needed for the identification of orthologous loci for probe design. In plants, identifying orthologous loci has proven difficult due to a large number of whole-genome duplication events, especially in the angiosperms (flowering plants). We used multiple sequence alignments from over 600 angiosperms for 353 putatively single-copy protein-coding genes identified by the One Thousand Plant Transcriptomes Initiative to design a set of targeted sequencing probes for phylogenetic studies of any angiosperm group. To maximize the phylogenetic potential of the probes, while minimizing the cost of production, we introduce a k-medoids clustering approach to identify the minimum number of sequences necessary to represent each coding sequence in the final probe set. Using this method, 5–15 representative sequences were selected per orthologous locus, representing the sequence diversity of angiosperms more efficiently than if probes were designed using available sequenced genomes alone. To test our approximately 80,000 probes, we hybridized libraries from 42 species spanning all higher-order groups of angiosperms, with a focus on taxa not present in the sequence alignments used to design the probes. Out of a possible 353 coding sequences, we recovered an average of 283 per species and at least 100 in all species. Differences among taxa in sequence recovery could not be explained by relatedness to the representative taxa selected for probe design, suggesting that there is no phylogenetic bias in the probe set. Our probe set, which targeted 260 kbp of coding sequence, achieved a median recovery of 137 kbp per taxon in coding regions, a maximum recovery of 250 kbp, and an additional median of 212 kbp per taxon in flanking non-coding regions across all species. These results suggest that the Angiosperms353 probe set described here is effective for any group of flowering plants and would be useful for phylogenetic studies from the species level to higher-order groups, including the entire angiosperm clade itself.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Herbarium collections are potentially an enormous resource for DNA studies, but the use of herbarium specimens in molecular studies has thus far been slowed down by difficulty in obtaining ...amplifiable DNA. Here we compare a set of commercially available DNA extraction protocols and their performance in terms of DNA purity and yield, and PCR amplification success as measured by using three differentially sized markers, the rbcL barcoding marker (cpDNA), the LEAFY exon 3 (nrDNA), and the trnL((UAA)) P6 loop (cpDNA). Results reveal large differences between extraction methods, where DNA purity rather than yield is shown to be strongly correlated with PCR success. Amplicon size shows similarly strong correlation with PCR success, with the shortest fragment showing the highest success rate (78%, P6 loop, 10-143 base pairs (bp)) and the largest fragment the lowest success (10%, rbcL, 670 bp). The effect of specimen preparation method on PCR success was also tested. Results show that drying method strongly affects PCR success, especially the availability of fragments longer than 250 bp, where longer fragments are more available for PCR amplification in air dried material compared to alcohol dried specimens. Results from our study indicate that projects relying on poor-quality starting material such as herbarium or scat samples should focus on extracting pure DNA and aim to amplify short target regions (<200-300 bp) in order to maximise outcomes. Development of shorter barcoding regions, or mini-barcodes within existing ones should be of high importance as only a few options are currently available; this is particularly important if we hope to incorporate the millions of herbarium samples available into barcoding initiatives and other molecular studies.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
An international consortium of major natural history museums, herbaria and other organizations has launched an ambitious project, the 'Barcode of Life Initiative', to promote a process enabling the ...rapid and inexpensive identification of the estimated 10 million species on Earth. DNA barcoding is a diagnostic technique in which short DNA sequence(s) can be used for species identification. The first international scientific conference on Barcoding of Life was held at the Natural History Museum in London in February 2005, and here we review the scientific challenges discussed during this conference and in previous publications. Although still controversial, the scientific benefits of DNA barcoding include: (i) enabling species identification, including any life stage or fragment, (ii) facilitating species discoveries based on cluster analyses of gene sequences (e.g. cox1=CO1, in animals), (iii) promoting development of handheld DNA sequencing technology that can be applied in the field for biodiversity inventories and (iv) providing insight into the diversity of life.
Abstract
The tree of life is the fundamental biological roadmap for navigating the evolution and properties of life on Earth, and yet remains largely unknown. Even angiosperms (flowering plants) are ...fraught with data gaps, despite their critical role in sustaining terrestrial life. Today, high-throughput sequencing promises to significantly deepen our understanding of evolutionary relationships. Here, we describe a comprehensive phylogenomic platform for exploring the angiosperm tree of life, comprising a set of open tools and data based on the 353 nuclear genes targeted by the universal Angiosperms353 sequence capture probes. The primary goals of this article are to (i) document our methods, (ii) describe our first data release, and (iii) present a novel open data portal, the Kew Tree of Life Explorer (https://treeoflife.kew.org). We aim to generate novel target sequence capture data for all genera of flowering plants, exploiting natural history collections such as herbarium specimens, and augment it with mined public data. Our first data release, described here, is the most extensive nuclear phylogenomic data set for angiosperms to date, comprising 3099 samples validated by DNA barcode and phylogenetic tests, representing all 64 orders, 404 families (96$\%$) and 2333 genera (17$\%$). A “first pass” angiosperm tree of life was inferred from the data, which totaled 824,878 sequences, 489,086,049 base pairs, and 532,260 alignment columns, for interactive presentation in the Kew Tree of Life Explorer. This species tree was generated using methods that were rigorous, yet tractable at our scale of operation. Despite limitations pertaining to taxon and gene sampling, gene recovery, models of sequence evolution and paralogy, the tree strongly supports existing taxonomy, while challenging numerous hypothesized relationships among orders and placing many genera for the first time. The validated data set, species tree and all intermediates are openly accessible via the Kew Tree of Life Explorer and will be updated as further data become available. This major milestone toward a complete tree of life for all flowering plant species opens doors to a highly integrated future for angiosperm phylogenomics through the systematic sequencing of standardized nuclear markers. Our approach has the potential to serve as a much-needed bridge between the growing movement to sequence the genomes of all life on Earth and the vast phylogenomic potential of the world’s natural history collections. Angiosperms; Angiosperms353; genomics; herbariomics; museomics; nuclear phylogenomics; open access; target sequence capture; tree of life.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
The world’s herbaria collectively house millions of diverse plant specimens, including endangered or extinct species and type specimens. Unlocking genetic data from the typically highly degraded DNA ...obtained from herbarium specimens was difficult until the arrival of high-throughput sequencing approaches, which can be applied to low quantities of severely fragmented DNA. Target enrichment involves using short molecular probes that hybridise and capture genomic regions of interest for high-throughput sequencing. In this study on herbariomics, we used this targeted sequencing approach and the Angiosperms353 universal probe set to recover up to 351 nuclear genes from 435 herbarium specimens that are up to 204 years old and span the breadth of angiosperm diversity. We show that on average 207 genes were successfully retrieved from herbarium specimens, although the mean number of genes retrieved and target enrichment efficiency is significantly higher for silica gel-dried specimens. Forty-seven target nuclear genes were recovered from a herbarium specimen of the critically endangered St Helena boxwood,
Mellissia begoniifolia
, collected in 1815. Herbarium specimens yield significantly less high-molecular-weight DNA than silica gel-dried specimens, and genomic DNA quality declines with sample age, which is negatively correlated with target enrichment efficiency. Climate, taxon-specific traits, and collection strategies additionally impact target sequence recovery. We also detected taxonomic bias in targeted sequencing outcomes for the 10 most numerous angiosperm families that were investigated in depth. We recommend that (1) for species distributed in wet tropical climates, silica gel-dried specimens should be used preferentially; (2) for species distributed in seasonally dry tropical climates, herbarium and silica gel-dried specimens yield similar results, and either collection can be used; (3) taxon-specific traits should be explored and established for effective optimisation of taxon-specific studies using herbarium specimens; (4) all herbarium sheets should, in future, be annotated with details of the preservation method used; (5) long-term storage of herbarium specimens should be in stable, low-humidity, and low-temperature environments; and (6) targeted sequencing with universal probes, such as Angiosperms353, should be investigated closely as a new approach for DNA barcoding that will ensure better exploitation of herbarium specimens than traditional Sanger sequencing approaches.
Display omitted
•Our results highlight the importance of the number of locules in Myrcia classification.•The widespread and taxonomically complex Myrcia guianensis is non-monophyletic.•Myrcia sect. ...Aguava appears to have originated in the Cerrado.•Transitions from Cerrado to humid forests occurred in the evolution of M. sect. Aguava.•Combination of NGS and Sanger data is possible and generates robust phylogenetic trees.
Myrcia is one of the largest exclusively Neotropical angiosperm genera, including ca. 800 species divided into nine sections. Myrcia sect. Aguava is one of most complex sections of Myrcia due to high morphological variation and wide distribution range of some species, including M. guianensis, with distribution throughout South America and a complex taxonomic history. We used complete plastid DNA sequences data generated using next-generation sequencing of 45 terminals, mostly from Myrcia sect. Aguava. These data were combined with five target DNA regions (ITS, psbA-trnH, trnL-trnF, trnQ-rps16, ndhF) of additional terminals to increase taxonomic coverage. Phylogenetic analyses were conducted using a maximum likelihood approach, and divergence times and ancestral range distributions were estimated. Myrcia sect. Aguava is monophyletic and exclusively comprises species with trilocular ovaries but has no relationship with other groups within Myrcia that possess trilocular ovaries. Three main lineages that correspond to geographical distribution are recognized within Myrcia sect. Aguava. Multiple accessions reveal a non-monophyletic Myrcia guianensis and stress the biogeographical structure inside the group. Myrcia sect. Aguava had a probable mid-Miocene origin in the Cerrado, but lineages that persisted there diversified only more recently, when the present-day vegetation started to stabilize. Posterior migrations to Atlantic Forest, Amazon and Caribbean occurred at the end of Miocene, evidencing transitions from open and dry to forested and more humid areas that are less frequent in the Neotropics. Overall, it is observed that related lineages remained in ecologically similar environments. Future perspectives on Myrcia and Myrteae in the phylogenomic era are also discussed.
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has caused a historic pandemic of respiratory disease (coronavirus disease 2019 COVID-19), and current evidence suggests that severe ...disease is associated with dysregulated immunity within the respiratory tract. However, the innate immune mechanisms that mediate protection during COVID-19 are not well defined. Here, we characterize a mouse model of SARS-CoV-2 infection and find that early CCR2 signaling restricts the viral burden in the lung. We find that a recently developed mouse-adapted SARS-CoV-2 (MA-SARS-CoV-2) strain as well as the emerging B.1.351 variant trigger an inflammatory response in the lung characterized by the expression of proinflammatory cytokines and interferon-stimulated genes. Using intravital antibody labeling, we demonstrate that MA-SARS-CoV-2 infection leads to increases in circulating monocytes and an influx of CD45
cells into the lung parenchyma that is dominated by monocyte-derived cells. Single-cell RNA sequencing (scRNA-Seq) analysis of lung homogenates identified a hyperinflammatory monocyte profile. We utilize this model to demonstrate that mechanistically, CCR2 signaling promotes the infiltration of classical monocytes into the lung and the expansion of monocyte-derived cells. Parenchymal monocyte-derived cells appear to play a protective role against MA-SARS-CoV-2, as mice lacking CCR2 showed higher viral loads in the lungs, increased lung viral dissemination, and elevated inflammatory cytokine responses. These studies have identified a potential CCR2-monocyte axis that is critical for promoting viral control and restricting inflammation within the respiratory tract during SARS-CoV-2 infection.
SARS-CoV-2 has caused a historic pandemic of respiratory disease (COVID-19), and current evidence suggests that severe disease is associated with dysregulated immunity within the respiratory tract. However, the innate immune mechanisms that mediate protection during COVID-19 are not well defined. Here, we characterize a mouse model of SARS-CoV-2 infection and find that early CCR2-dependent infiltration of monocytes restricts the viral burden in the lung. We find that SARS-CoV-2 triggers an inflammatory response in the lung characterized by the expression of proinflammatory cytokines and interferon-stimulated genes. Using RNA sequencing and flow cytometry approaches, we demonstrate that SARS-CoV-2 infection leads to increases in circulating monocytes and an influx of CD45
cells into the lung parenchyma that is dominated by monocyte-derived cells. Mechanistically, CCR2 signaling promoted the infiltration of classical monocytes into the lung and the expansion of monocyte-derived cells. Parenchymal monocyte-derived cells appear to play a protective role against MA-SARS-CoV-2, as mice lacking CCR2 showed higher viral loads in the lungs, increased lung viral dissemination, and elevated inflammatory cytokine responses. These studies have identified that the CCR2 pathway is critical for promoting viral control and restricting inflammation within the respiratory tract during SARS-CoV-2 infection.
Premise
To further advance the understanding of the species‐rich, economically and ecologically important angiosperm order Myrtales in the rosid clade, comprising nine families, approximately 400 ...genera and almost 14,000 species occurring on all continents (except Antarctica), we tested the Angiosperms353 probe kit.
Methods
We combined high‐throughput sequencing and target enrichment with the Angiosperms353 probe kit to evaluate a sample of 485 species across 305 genera (76% of all genera in the order).
Results
Results provide the most comprehensive phylogenetic hypothesis for the order to date. Relationships at all ranks, such as the relationship of the early‐diverging families, often reflect previous studies, but gene conflict is evident, and relationships previously found to be uncertain often remain so. Technical considerations for processing HTS data are also discussed.
Conclusions
High‐throughput sequencing and the Angiosperms353 probe kit are powerful tools for phylogenomic analysis, but better understanding of the genetic data available is required to identify genes and gene trees that account for likely incomplete lineage sorting and/or hybridization events.
Abstract
The infrageneric classification of the genus Clerodendrum (Lamiaceae) has been highly problematic due to different concepts and subdivisions being applied to the treatment of Asian, ...Australian, and African species. Earlier molecular phylogenetic studies based on Sanger sequencing and limited sampling have indicated that previous morphology-based infrageneric classifications are not congruent with the phylogeny due to morphological convergence in many lineages. Advances in high-throughput DNA sequencing provide more information allowing more robust phylogenetic reconstruction at larger scale. We present the first comprehensive phylogenetic study of Clerodendrum that includes representatives of all previously recognized infrageneric taxa and using targeted sequencing data obtained from the Angiosperms353 enrichment to resolve the phylogenetic relationships. In agreement with previous phylogenetic work, our phylogeny shows that Asian and Australian species form a sister clade to an African Clade. Clerodendrum is monophyletic with the exclusion of some tropical coastal species, including some Australian species, which fall within a clade containing Volkameria and New World genera. We recognize two subgenera: subgen. Clerodendrum for Asian and Australian species and the new subgen. Afroclerodendrum for African and Malagasy species. Our findings support an infrageneric classification of Clerodendrum with a total of 13 sections based on molecular phylogenetic evidence and morphology that clearly accommodate the Asian, remaining Australian, and African species. Of these, we propose three new sections: sect. Albiflora, sect. Fortunata and sect. Megaflora for clades presenting unique morphological characters. The sectional classification and taxonomic consequences are discussed.
Premise
The genetic structure of hybrid zones provides insight into the potential for gene flow to occur between plant taxa. Four closely related European orchid species (Orchis anthropophora, O. ...militaris, O. purpurea, and O. simia) hybridize when they co‐occur. We aimed to characterize patterns of hybridization in O. militaris–O. purpurea, O. purpurea–O. simia, and O. anthropophora–O. simia hybrid zones using molecular and morphological data.
Methods
We used 11 newly isolated nuclear microsatellites to genotype 695 individuals collected from seven hybrid zones and six allopatric parental populations in France. Geometric morphometric analysis was conducted using 15 labellum landmarks to capture the main aspects of petal shape.
Results
Backcrossing was asymmetric toward O. militaris in multiple O. militaris–O. purpurea hybrid zones. Hybrids in O. purpurea–O. simia and O. anthropophora–O. simia hybrid zones were largely limited to F1 and F2 generations, but further admixture had occurred. These patterns were reflected in labellum geometric morphometric data, which correlated strongly with nuclear microsatellite data in all three species combinations.
Conclusions
The coexistence of parental and admixed individuals in these Orchis hybrid zones implies they are likely to be tension zones being maintained by a balance between gene flow into the hybrid zone and selection acting against admixed individuals. The pattern of admixture in the three species combinations suggests intrinsic selection acting on the hybrids is weaker in more closely related taxa.