(Hope) (Coleoptera: Cerambycidae) is an important forest pest in China that mainly infests timber and economic forests. This pest primarily causes plant tissue to necrotize, rot, and eventually die ...by feeding on the woody parts of tree trunks. To gain a deeper understanding of the genetic mechanism of
, this study employed single-molecule real-time sequencing (SMRT) and Illumina RNA-seq technologies to conduct full-length transcriptome sequencing of the insect. Total RNA extracted from male and female adults was mixed and subjected to SMRT sequencing, generating a complete transcriptome. Transcriptome analysis, prediction of long non-coding RNA (lncRNA), coding sequences (CDs), analysis of simple sequence repeats (SSR), prediction of transcription factors, and functional annotation of transcripts were performed in this study. The collective 20,356,793 subreads (38.26 G, clean reads) were generated, including 432,091 circular consensus sequences and 395,851 full-length non-chimera reads. The full-length non-chimera reads (FLNC) were clustered and redundancies were removed, resulting in 39,912 consensus reads. SSR and ANGEL software v3.0 were used for predicting SSR and CDs. In addition, four tools were used for annotating 6058 lncRNAs, identifying 636 transcription factors. Furthermore, a total of 84,650 transcripts were functionally annotated in seven different databases. This is the first time that the full-length transcriptome of
has been obtained using SMRT sequencing. This provides an important foundation for investigating the gene regulation underlying the interaction between
and its host plants through gene editing in the future and provides a scientific basis for the prevention and control of
.
Nitraria tangutorum Bobrov is a halophyte that is resistant to salt and alkali and is widely distributed in northwestern China. However, its genome has not been sequenced, thereby limiting studies on ...this particular species. For species without a reference genome, the full-length transcriptome is a convenient and rapid way to obtain reference gene information. To better study N. tangutorum, we used PacBio single-molecule real-time technology to perform full-length transcriptome analysis of this halophyte. In this study, a total of 21.83 Gb of data were obtained, and 198,300 transcripts, 51,875 SSRs (simple sequence repeats), 55,574 CDS (coding sequence), and 74,913 lncRNAs (long non-coding RNA) were identified. In addition, using this full-length transcriptome, we identified the key Na+/H+ antiporter (NHX) genes that maintain ion balance in plants and found that these are induced to express under salt stress. The results indicate that the full-length transcriptome of N. tangutorum can be used as a database and be utilized in elucidating the salt tolerance mechanism of N. tangutorum.
Simultaneous expression of highly homologous RLN1 and RLN2 genes in prostate impairs their accurate delineation. We used PacBio SMRT sequencing and RNA-Seq in LNCaP cells in order to dissect the ...expression of RLN1 and RLN2 variants. We identified a novel fusion transcript comprising the RLN1 and RLN2 genes and found evidence of its expression in the normal and prostate cancer tissues. The RLN1-RLN2 fusion putatively encodes RLN2 isoform with the deleted secretory signal peptide. The identification of the fusion transcript provided information to determine unique RLN1-RLN2 fusion and RLN1 regions. The RLN1-RLN2 fusion was co-expressed with RLN1 in LNCaP cells, but the two gene products were inversely regulated by androgens. We showed that RLN1 is underrepresented in common PCa cell lines in comparison to normal and PCa tissue. The current study brings a highly relevant update to the relaxin field, and will encourage further studies of RLN1 and RLN2 in PCa and broader.
•A novel RLN1-RLN2 fusion was observed in LNCaP cells and normal and prostate cancer tissues.•The fusion putatively encodes a RLN2 isoform with a deleted secretory signal peptide.•RLN1 is underrepresented in PCa cell lines.•Androgens inversely regulate RLN1 and the RLN1-RLN2 fusion transcript.
The highly versatile group of Herpesviruses cause disease in a wide range of hosts. In invertebrates, only two herpesviruses are known: the malacoherpesviruses HaHV-1 and OsHV-1 infecting gastropods ...and bivalves, respectively. To understand viral transcript architecture and diversity we first reconstructed full-length viral genomes of HaHV-1 infecting
and OsHV-1 infecting
by DNA-seq. We then used RNA-seq over the time-course of experimental infections to establish viral transcriptional dynamics, followed by PacBio long-read sequencing of full-length transcripts to untangle viral transcript architectures at two selected time points. Despite similarities in genome structure, in the number of genes and in the diverse transcriptomic architectures, we measured a ten-fold higher transcript variability in HaHV-1, with more extended antisense gene transcription. Transcriptional dynamics also appeared different, both in timing and expression trends. Both viruses were heavily affected by post-transcriptional modifications performed by ADAR1 affecting sense-antisense gene pairs forming dsRNAs. However, OsHV-1 concentrated these modifications in a few genomic hotspots, whereas HaHV-1 diluted ADAR1 impact by elongated and polycistronic transcripts distributed over its whole genome. These transcriptional strategies might thus provide alternative potential roles for sense-antisense transcription in viral transcriptomes to evade the host's immune response in different virus-host combinations.
•First complete genome sequence of a Bacillus glycinifermentans strain established with SMRT sequencing.•Annotation based on well-established reference genome sequences.•In silico identification of ...secondary metabolite cluster.•First RAPD-PCR profile of a Bacillus glycinifermentans strain.•Experimental data showing growth on bile containing media facilitating the survival of this strain across the gut.
The first complete genome sequence of Bacillus glycinifermentans B-27 was determined by SMRT sequencing generating a genome sequence with a total length of 4,607,442 bases. Based on this sequence 4738 protein-coding sequences were predicted and used to identify gene clusters that are related to the production of secondary metabolites such as Lichenysin, Bacillibactin and Bacitracin. This genomic potential combined with the ability of B. glycinifermentans B-27 to grown in bile containing media might contribute to a future application of this strain as probiotic in productive livestock potentially inhibiting competing and pathogenic organisms.
Proteomic analyses facilitate the interpretation of molecular biomarker probes which are very helpful in diagnosing schizophrenia (SZ). In the current study, we attempt to test whether potential ...differences in plasma protein expressions in SZ and bipolar disorder (BD) are associated with cognitive deficits and their underlying brain structures. Forty-two plasma proteins of 29 SZ patients, 25 BD patients and 93 non-clinical controls were quantified and analysed using multiple reaction monitoring-based triple quadrupole mass spectrometry approach. We also computed group comparisons of protein expressions between patients and controls, and between SZ and BD patients, as well. Potential associations of protein levels with cognitive functioning (psychomotor speed, executive functioning, crystallised intelligence) as well as underlying brain volume in the hippocampus were explored, using bivariate correlation analyses. The main finding of this study was that apolipoprotein expression differed between patients and controls and that these alterations in both disease groups were putatively related to cognitive impairments as well as to hippocampus volumes. However, none of the protein level differences were related to clinical symptom severity. In summary, altered apolipoprotein expression in BD and SZ was linked to cognitive decline and underlying morphological changes in both disorders. Our results suggest that the detection of molecular patterns in association with cognitive performance and its underlying brain morphology is of great importance for understanding of the pathological mechanisms of SZ and BD, as well as for supporting the diagnosis and treatment of both disorders.
Members of the genus Streptococcus inhabit a variety of sites in humans and other animals and some species are prolific producers of proteinaceous antibiotics (bacteriocins). As little is known about ...(i) streptococci inhabiting domestic pets, and (ii) whether novel bacteriocin-producing streptococci can be isolated from domestic pets, the aim of this study is to address these gaps in the research literature. In this study, Streptococcus equinus MDC1, isolated from a healthy dog, was found to exhibit potent antibacterial activity against Micrococcus luteus in a simultaneous antagonism assay, suggesting that strain MDC1 produces a lantibiotic bacteriocin. The inhibitory activity spectrum of S. equinus MDC1, determined using agar-based deferred antagonism assays against >70 indicator strains, was found to be similar to that of nisin U (a lantibiotic produced by Streptococcus uberis). However, the spectra of the two bacteriocins differed by 23 strains, mainly with the MDC1 bacteriocin having no inhibitory activity towards certain streptococci of human origin (e.g., Streptococcus gordonii, Streptococcus anginosus, Streptococcus salivarius). The genome of S. equinus MDC1, which was sequenced completely using single-molecule real-time (SMRT) next-generation DNA sequencing technology, comprises a single 1,936,555-basepair chromosome containing seven copies of the ribosomal RNA operon, 69 tRNA genes and nearly 1900 putative coding sequences. Analysis of the MDC1 genome sequence using the bacteriocin detection algorithms BAGEL4 and antiSMASH revealed the location of a 13,164-basepair 11-gene locus, designated nmd, which encoded a mature nisin E peptide that differed from nisin U by only two amino acids (Ile15→Ala and Leu21→Ile) and an extra C-terminal asparagine residue, and the proteins required for post-translational modification of the bacteriocin, processing, export, and producer immunity. Despite the high homology (90.6% identity, 93.8% similarity) between nisin E and nisin U, there was considerably less homology (47.4–76.3% identity, 68.4–88.8% similarity) between the other proteins encoded by their respective biosynthetic loci. This new natural variant of nisin, called nisin E, represents the first nisin variant to be reported for S. equinus; additionally, its differences with nisin U may provide some insight into the amino acids that influence bacteriocin potency and killing spectrum.
In the last two decades, the field of metagenomics has greatly expanded due to improvement in sequencing technologies allowing for a more comprehensive characterization of microbial communities. The ...use of these technologies has led to an unprecedented understanding of human, animal, and environmental microbiomes and have shown that the gut microbiota are comparable to an organ that is intrinsically linked with a variety of diseases. Characterization of microbial communities using next-generation sequencing-by-synthesis approaches have revealed important shifts in microbiota associated with debilitating diseases such as Clostridium difficile infection. But due to limitations in sequence read length, primer biases, and the quality of databases, genus- and species-level classification have been difficult. Third-generation technologies, such as Pacific Biosciences' single molecule, real-time (SMRT) approach, allow for unbiased, more specific identification of species that are likely clinically relevant. Comparison of Illumina next-generation characterization and SMRT sequencing of samples from patients treated for C. difficile infection revealed similarities in community composition at the phylum and family levels, but SMRT sequencing further allowed for species-level characterization - permitting a better understanding of the microbial ecology of this disease. Thus, as sequencing technologies continue to advance, new species-level insights can be gained in the study of complex and clinically-relevant microbial communities.
Abstract
Background
Papilio bianor Cramer, 1777 (commonly known as the Chinese peacock butterfly) (Insecta, Lepidoptera, Papilionidae) is a widely distributed swallowtail butterfly with a wide number ...of geographic populations ranging from the southeast of Russia to China, Japan, India, Vietnam, Myanmar, and Thailand. Its wing color consists of both pigmentary colored scales (black, reddish) and structural colored scales (iridescent blue or green dust). A high-quality reference genome of P. bianor is an important foundation for investigating iridescent color evolution, phylogeography, and the evolution of swallowtail butterflies.
Findings
We obtained a chromosome-level de novo genome assembly of the highly heterozygous P. bianor using long Pacific Biosciences sequencing reads and high-throughput chromosome conformation capture technology. The final assembly is 421.52 Mb on 30 chromosomes (29 autosomes and 1 Z sex chromosome) with 13.12 Mb scaffold N50. In total, 15,375 protein-coding genes and 233.09 Mb of repetitive sequences were identified. Phylogenetic analyses indicated that P. bianor separated from a common ancestor of swallowtails ∼23.69–36.04 million years ago. Demographic history suggested that the population expansion of this species from the last interglacial period to the last glacial maximum possibly resulted from its decreased natural enemies and its adaptation to climate change during the glacial period.
Conclusions
We present a high-quality chromosome-level reference genome of P. bianor using long-read single-molecule sequencing and Hi-C–based chromatin interaction maps. Our results lay the foundation for exploring the genetic basis of special biological features of P. bianor and also provide a useful data source for comparative genomics and phylogenomics among butterflies and moths.