Compensatory base changes (CBCs) in internal transcribed spacer 2 (ITS2) rDNA secondary structures correlate with Ernst Mayr's biological species concept. This hypothesis also referred to as the CBC ...species concept recently was subjected to large-scale testing, indicating two distinct probabilities. (1) If there is a CBC then there are two different species with a probability of ∼0.93. (2) If there is no CBC then there is the same species with a probability of ∼0.76. In ITS2 research, however, the main problem is the multicopy nature of ITS2 sequences. Most recently, 454 pyrosequencing data have been used to characterize more than 5000 intragenomic variations of ITS2 regions from 178 plant species, demonstrating that mutation of ITS2 is frequent, with a mean of 35 variants per species, respectively per individual organism. In this study, using those 454 data, the CBC criterion is reconsidered in the light of intragenomic variability, a proof of concept, a necessary criterion, expecting no intragenomic CBCs in variant ITS2 copies. In accordance with the CBC species concept, we could demonstrate that the probability that there is no intragenomic CBC is ∼0.99.
As coronavirus disease 2019 (COVID-19) pandemic poses a substantial global public health threat, traditional Chinese medicine (TCM) was used in 91.50% of the COVID-19 cases in China, showing ...encouraging results in improving symptom management and reducing the deterioration, mortality, and recurrence rates. A total of 166 modified herbal formulae consisting of 179 single herbal medicines were collected for treating COVID-19 in China. Glycyrrhizae Radix et Rhizome, Scutellariae Radix, and Armeniacae Semen Amarum are the most frequently utilized in clinics, most of which are antipyretic (47, 26.26%), expectorant and cough-suppressing (22, 12.29%), and dampness-resolving (21, 11.73%) from traditional descriptions. A total of 1212 chemical components containing β-sitosterol, stigmasterol, and quercetin were primarily selected. Additionally, using complex system entropy and unsupervised hierarchical clustering, 8 core herbal combinations and 10 new formulae emerged as potentially useful candidates for COVID-19. Finally, following scaffold analysis, self-organizing mapping (SOM) and cluster analysis, 12 clusters of molecules yielded 8 pharmacophore families of structures that were further screened as pharmacological targets in human metabolic pathways for inhibiting coronavirus. This article aims to make more easily accessible and share historical herbal knowledge used in contemporary treatments in a modern manner to assist researchers contain the global spread of COVID-19.
Herbal formulae were collected from 26 protocols for treating COVID-19. Using complex system entropy and unsupervised hierarchical clustering, 8 core combinations and 10 formulae emerged as potential candidates. Following scaffold analysis and self organizing mapping, 12 clusters of molecules yielded 8 pharmacophore structures screened as pharmacological targets for inhibiting corona virus. Display omitted
The internal transcribed spacer 2 (ITS2) region of nuclear ribosomal DNA is regarded as one of the candidate DNA barcodes because it possesses a number of valuable characteristics, such as the ...availability of conserved regions for designing universal primers, the ease of its amplification, and sufficient variability to distinguish even closely related species. However, a general analysis of its ability to discriminate species in a comprehensive sample set is lacking.
In the current study, 50,790 plant and 12,221 animal ITS2 sequences downloaded from GenBank were evaluated according to sequence length, GC content, intra- and inter-specific divergence, and efficiency of identification. The results show that the inter-specific divergence of congeneric species in plants and animals was greater than its corresponding intra-specific variations. The success rates for using the ITS2 region to identify dicotyledons, monocotyledons, gymnosperms, ferns, mosses, and animals were 76.1%, 74.2%, 67.1%, 88.1%, 77.4%, and 91.7% at the species level, respectively. The ITS2 region unveiled a different ability to identify closely related species within different families and genera. The secondary structure of the ITS2 region could provide useful information for species identification and could be considered as a molecular morphological characteristic.
As one of the most popular phylogenetic markers for eukaryota, we propose that the ITS2 locus should be used as a universal DNA barcode for identifying plant species and as a complementary locus for CO1 to identify animal species. We have also developed a web application to facilitate ITS2-based cross-kingdom species identification (http://its2-plantidit.dnsalias.org).
The complete chloroplast (cp) genome of
, a common ornamental and medicinal plant in North America and East Asia, was sequenced and analyzed. The length of the
cp genome is 155,078 bp, contains a ...pair of inverted repeat regions (IRa and IRb), of 23,774 bp each, as well as large (LSC, 88,858 bp) and small (SSC, 18,672 bp) single-copy regions. A total of 129 genes were identified in the cp genome, 16 of which were duplicated within the IR regions. Relative to other plant cp genomes, the
cp genome had a unique rearrangement between trnI-CAU and trnN-GUU. In
cpDNA,
,
, and
move to the LSC region, from the IR region. The
pesudogene in the IR region is lost, and only one copy locates in the SSC region. Comparative cp DNA sequence analyses of
with other cp genomes reveal that the gene order, and the gene and intron contents, are slightly different. The introns in
and
genes are found for the first time. Four genes (
,
,
, and
) lost introns. However, its genome structure, GC content, and codon usage were similar to those of typical angiosperm cp genomes. All preferred synonymous codons were found to use codons ending with A/T. The AT-rich sequences were less abundant in the coding regions than in the non-coding ones. A phylogenetic analysis based on 71 protein-coding genes supported the idea that
is a sister of the Araliaceae species. This study identified unique characteristics of the
cp genome that contribute to our understanding of the cpDNA evolution. It offers valuable information for the phylogenetic and specific barcoding of this medicinal plant.
Salvia miltiorrhiza Bunge, which contains tanshinones and phenolic acids as major classes of bioactive components, is one of the most widely used herbs in traditional Chinese medicine. Production of ...tanshinones and phenolic acids is enhanced by methyl jasmonate (MeJA). Transcription factor MYC2 is the switch of jasmontes signaling in plants. Here, we focused on two novel JA-inducible genes in S. miltiorrhiza, designated as SmMYC2a and SmMYC2b, which were localized in the nucleus. SmMYC2a and SmMYC2b were also discovered to interact with SmJAZ1 and SmJAZ2, implying that the two MYC2s might function as direct targets of JAZ proteins. Ectopic RNA interference (RNAi)-mediated knockdown experiments suggested that SmMYC2a/b affected multiple genes in tanshinone and phenolic acid biosynthetic pathway. Besides, the accumulation of tanshinones and phenolic acids was impaired by the loss of function in SmMYC2a/b. Meanwhile, SmMYC2a could bind with an E-box motif within SmHCT6 and SmCYP98A14 promoters, while SmMYC2b bound with an E-box motif within SmCYP98A14 promoter, through which the regulation of phenolic acid biosynthetic pathway might achieve. Together, these results suggest that SmMYC2a and SmMYC2b are JAZ-interacting transcription factors that positively regulate the biosynthesis of tanshinones and Sal B with similar but irreplaceable effects.
Terpenoids are the largest class of plant secondary metabolites and have attracted widespread interest. Salvia miltiorrhiza, belonging to the largest and most widely distributed genus in the mint ...family, is a model medicinal plant with great economic and medicinal value. Diterpenoid tanshinones are the major lipophilic bioactive components in S. miltiorrhiza. Systematic analysis of genes involved in terpenoid biosynthesis has not been reported to date. Searching the recently available working draft of the S. miltiorrhiza genome, 40 terpenoid biosynthesis-related genes were identified, of which 27 are novel. These genes are members of 19 families, which encode all of the enzymes involved in the biosynthesis of the universal isoprene precursor isopentenyl diphosphate and its isomer dimethylallyl diphosphate, and two enzymes associated with the biosynthesis of labdane-related diterpenoids. Through a systematic analysis, it was found that 20 of the 40 genes could be involved in tanshinone biosynthesis. Using a comprehensive approach, the intron/exon structures and expression patterns of all identified genes and their responses to methyl jasmonate treatment were analysed. The conserved domains and phylogenetic relationships among the deduced S. miltiorrhiza proteins and their homologues isolated from other plant species were revealed. It was discovered that some of the key enzymes, such as 1-deoxy-D-xylulose 5-phosphate synthase, 4-hydroxy-3-methylbut-2-enyl diphosphate reductase, hydroxymethylglutaryl-CoA reductase, and geranylgeranyl diphosphate synthase, are encoded by multiple gene members with different expression patterns and subcellular localizations, and both homomeric and heteromeric geranyl diphosphate synthases exist in S. miltiorrhiza. The results suggest the complexity of terpenoid biosynthesis and the existence of metabolic channels for diverse terpenoids in S. miltiorrhiza and provide useful information for improving tanshinone production through genetic engineering.
Panax notoginseng (Burk) F.H. Chen is important medicinal plant of the Araliacease family. Triterpene saponins are the bioactive constituents in P. notoginseng. However, available genomic information ...regarding this plant is limited. Moreover, details of triterpene saponin biosynthesis in the Panax species are largely unknown.
Using the 454 pyrosequencing technology, a one-quarter GS FLX titanium run resulted in 188,185 reads with an average length of 410 bases for P. notoginseng root. These reads were processed and assembled by 454 GS De Novo Assembler software into 30,852 unique sequences. A total of 70.2% of unique sequences were annotated by Basic Local Alignment Search Tool (BLAST) similarity searches against public sequence databases. The Kyoto Encyclopedia of Genes and Genomes (KEGG) assignment discovered 41 unique sequences representing 11 genes involved in triterpene saponin backbone biosynthesis in the 454-EST dataset. In particular, the transcript encoding dammarenediol synthase (DS), which is the first committed enzyme in the biosynthetic pathway of major triterpene saponins, is highly expressed in the root of four-year-old P. notoginseng. It is worth emphasizing that the candidate cytochrome P450 (Pn02132 and Pn00158) and UDP-glycosyltransferase (Pn00082) gene most likely to be involved in hydroxylation or glycosylation of aglycones for triterpene saponin biosynthesis were discovered from 174 cytochrome P450s and 242 glycosyltransferases by phylogenetic analysis, respectively. Putative transcription factors were detected in 906 unique sequences, including Myb, homeobox, WRKY, basic helix-loop-helix (bHLH), and other family proteins. Additionally, a total of 2,772 simple sequence repeat (SSR) were identified from 2,361 unique sequences, of which, di-nucleotide motifs were the most abundant motif.
This study is the first to present a large-scale EST dataset for P. notoginseng root acquired by next-generation sequencing (NGS) technology. The candidate genes involved in triterpene saponin biosynthesis, including the putative CYP450s and UGTs, were obtained in this study. Additionally, the identification of SSRs provided plenty of genetic makers for molecular breeding and genetics applications in this species. These data will provide information on gene discovery, transcriptional regulation and marker-assisted selection for P. notoginseng. The dataset establishes an important foundation for the study with the purpose of ensuring adequate drug resources for this species.
Members of
Polygonatum
are perennial herbs that have been widely used in traditional Chinese medicine to invigorate Qi, moisten the lung, and benefit the kidney and spleen among patients. However, ...the phylogenetic relationships and intrageneric taxonomy within
Polygonatum
have long been controversial because of the complexity of their morphological variations and lack of high-resolution molecular markers. The chloroplast (cp) genome is an optimal model for deciphering phylogenetic relationships in related families. In the present study, the complete cp genome of 26 species of Trib. Polygonateae were
de novo
assembled and characterized; all species exhibited a conserved quadripartite structure, that is, two inverted repeats (IR) containing most of the ribosomal RNA genes, and two unique regions, large single sequence (LSC) and small single sequence (SSC). A total of 8 highly variable regions (
rps16-trnQ-UUG, trnS-GCU-trnG-UCC
,
rpl32-trnL-UAG
,
matK-rps16
,
petA-psbJ, trnT-UGU-trnL-UAA
,
accD-psaI
, and
trnC-GCA-petN
) that might be useful as potential molecular markers for identifying
Polygonatum
species were identified. The molecular clock analysis results showed that the divergence time of
Polygonatum
might occur at ∼14.71 Ma, and the verticillate leaf might be the ancestral state of this genus. Moreover, phylogenetic analysis based on 88 cp genomes strongly supported the monophyly of
Polygonatum
. The phylogenetic analysis also suggested that
Heteropolygonatum
may be the sister group of the
Polygonatum
, but the
Disporopsis, Maianthemum
, and
Disporum
may have diverged earlier. This study provides valuable information for further species identification, evolution, and phylogenetic research of
Polygonatum
.
Chinese goldthread (Coptis chinensis Franch.), a member of the Ranunculales, represents an important early-diverging eudicot lineage with diverse medicinal applications. Here, we present a ...high-quality chromosome-scale genome assembly and annotation of C. chinensis. Phylogenetic and comparative genomic analyses reveal the phylogenetic placement of this species and identify a single round of ancient whole-genome duplication (WGD) shared by the Ranunculaceae. We characterize genes involved in the biosynthesis of protoberberine-type alkaloids in C. chinensis. In particular, local genomic tandem duplications contribute to member amplification of a Ranunculales clade-specific gene family of the cytochrome P450 (CYP) 719. The functional versatility of a key CYP719 gene that encodes the (S)-canadine synthase enzyme involved in the berberine biosynthesis pathway may play critical roles in the diversification of other berberine-related alkaloids in C. chinensis. Our study provides insights into the genomic landscape of early-diverging eudicots and provides a valuable model genome for genetic and applied studies of Ranunculales.
Plants have evolved a panoply of specialized metabolites that increase their environmental fitness. Two examples are caffeine, a purine psychotropic alkaloid, and crocins, a group of glycosylated ...apocarotenoid pigments. Both classes of compounds are found in a handful of distantly related plant genera (Coffea, Camellia, Paullinia, and Ilex for caffeine; Crocus, Buddleja, and Gardenia for crocins) wherein they presumably evolved through convergent evolution. The closely related Coffea and Gardenia genera belong to the Rubiaceae family and synthesize, respectively, caffeine and crocins in their fruits. Here, we report a chromosomal-level genome assembly of Gardenia jasminoides, a crocin-producing species, obtained using Oxford Nanopore sequencing and Hi-C technology. Through genomic and functional assays, we completely deciphered for the first time in any plant the dedicated pathway of crocin biosynthesis. Through comparative analyses with Coffea canephora and other eudicot genomes, we show that Coffea caffeine synthases and the first dedicated gene in the Gardenia crocin pathway, GjCCD4a, evolved through recent tandem gene duplications in the two different genera, respectively. In contrast, genes encoding later steps of the Gardenia crocin pathway, ALDH and UGT, evolved through more ancient gene duplications and were presumably recruited into the crocin biosynthetic pathway only after the evolution of the GjCCD4a gene. This study shows duplication-based divergent evolution within the coffee family (Rubiaceae) of two characteristic secondary metabolic pathways, caffeine and crocin biosynthesis, from a common ancestor that possessed neither complete pathway. These findings provide significant insights on the role of tandem duplications in the evolution of plant specialized metabolism.