Cells release RNA-carrying vesicles and membrane-free RNA/protein complexes into the extracellular milieu. Horizontal vesicle-mediated transfer of such shuttle RNA between cells allows dissemination ...of genetically encoded messages, which may modify the function of target cells. Other studies used array analysis to establish the presence of microRNAs and mRNA in cell-derived vesicles from many sources. Here, we used an unbiased approach by deep sequencing of small RNA released by immune cells. We found a large variety of small non-coding RNA species representing pervasive transcripts or RNA cleavage products overlapping with protein coding regions, repeat sequences or structural RNAs. Many of these RNAs were enriched relative to cellular RNA, indicating that cells destine specific RNAs for extracellular release. Among the most abundant small RNAs in shuttle RNA were sequences derived from vault RNA, Y-RNA and specific tRNAs. Many of the highly abundant small non-coding transcripts in shuttle RNA are evolutionary well-conserved and have previously been associated to gene regulatory functions. These findings allude to a wider range of biological effects that could be mediated by shuttle RNA than previously expected. Moreover, the data present leads for unraveling how cells modify the function of other cells via transfer of specific non-coding RNA species.
Highlights • RNA sequencing uncovers mechanisms regulating gene expression. • Use of alternative TSSs, PASs, and exons is the rule. • Alternative translation initiation at 5′-UTRs and downstream ...codons is widespread. • Transcription, RNA processing, and translation are often interdependent processes.
The release and uptake of nano-sized extracellular vesicles (EV) is a highly conserved means of intercellular communication. The molecular composition of EV, and thereby their signaling function to ...target cells, is regulated by cellular activation and differentiation stimuli. EV are regarded as snapshots of cells and are, therefore, in the limelight as biomarkers for disease. Although research on EV-associated RNA has predominantly focused on microRNAs, the transcriptome of EV consists of multiple classes of small non-coding RNAs with potential gene-regulatory functions. It is not known whether environmental cues imposed on cells induce specific changes in a broad range of EV-associated RNA classes. Here, we investigated whether immune-activating or -suppressing stimuli imposed on primary dendritic cells affected the release of various small non-coding RNAs via EV. The small RNA transcriptomes of highly pure EV populations free from ribonucleoprotein particles were analyzed by RNA sequencing and RT-qPCR. Immune stimulus-specific changes were found in the miRNA, snoRNA, and Y-RNA content of EV from dendritic cells, whereas tRNA and snRNA levels were much less affected. Only part of the changes in EV-RNA content reflected changes in cellular RNA, which urges caution in interpreting EV as snapshots of cells. By comprehensive analysis of RNA obtained from highly purified EV, we demonstrate that multiple RNA classes contribute to genetic messages conveyed via EV. The identification of multiple RNA classes that display cell stimulation-dependent association with EV is the prelude to unraveling the function and biomarker potential of these EV-RNAs.
The hippocampal expression profiles of wild-type mice and mice transgenic for δC-doublecortin-like kinase were compared with Solexa/Illumina deep sequencing technology and five different microarray ...platforms. With Illumina's digital gene expression assay, we obtained ∼2.4 million sequence tags per sample, their abundance spanning four orders of magnitude. Results were highly reproducible, even across laboratories. With a dedicated Bayesian model, we found differential expression of 3179 transcripts with an estimated false-discovery rate of 8.5%. This is a much higher figure than found for microarrays. The overlap in differentially expressed transcripts found with deep sequencing and microarrays was most significant for Affymetrix. The changes in expression observed by deep sequencing were larger than observed by microarrays or quantitative PCR. Relevant processes such as calmodulin-dependent protein kinase activity and vesicle transport along microtubules were found affected by deep sequencing but not by microarrays. While undetectable by microarrays, antisense transcription was found for 51% of all genes and alternative polyadenylation for 47%. We conclude that deep sequencing provides a major advance in robustness, comparability and richness of expression profiling data and is expected to boost collaborative, comparative and integrative genomics studies.
The multifaceted control of gene expression requires tight coordination of regulatory mechanisms at transcriptional and post-transcriptional level. Here, we studied the interdependence of ...transcription initiation, splicing and polyadenylation events on single mRNA molecules by full-length mRNA sequencing.
In MCF-7 breast cancer cells, we find 2700 genes with interdependent alternative transcription initiation, splicing and polyadenylation events, both in proximal and distant parts of mRNA molecules, including examples of coupling between transcription start sites and polyadenylation sites. The analysis of three human primary tissues (brain, heart and liver) reveals similar patterns of interdependency between transcription initiation and mRNA processing events. We predict thousands of novel open reading frames from full-length mRNA sequences and obtained evidence for their translation by shotgun proteomics. The mapping database rescues 358 previously unassigned peptides and improves the assignment of others. By recognizing sample-specific amino-acid changes and novel splicing patterns, full-length mRNA sequencing improves proteogenomics analysis of MCF-7 cells.
Our findings demonstrate that our understanding of transcriptome complexity is far from complete and provides a basis to reveal largely unresolved mechanisms that coordinate transcription initiation and mRNA processing.
Hereditary disorders are frequently caused by genetic variants that affect pre‐messenger RNA splicing. Though genetic variants in the canonical splice motifs are almost always disrupting splicing, ...the pathogenicity of variants in the noncanonical splice sites (NCSS) and deep intronic (DI) regions are difficult to predict. Multiple splice prediction tools have been developed for this purpose, with the latest tools employing deep learning algorithms. We benchmarked established and deep learning splice prediction tools on published gold standard sets of 71 NCSS and 81 DI variants in the ABCA4 gene and 61 NCSS variants in the MYBPC3 gene with functional assessment in midigene and minigene splice assays. The selection of splice prediction tools included CADD, DSSP, GeneSplicer, MaxEntScan, MMSplice, NNSPLICE, SPIDEX, SpliceAI, SpliceRover, and SpliceSiteFinder‐like. The best‐performing splice prediction tool for the different variants was SpliceRover for ABCA4 NCSS variants, SpliceAI for ABCA4 DI variants, and the Alamut 3/4 consensus approach (GeneSplicer, MaxEntScacn, NNSPLICE and SpliceSiteFinder‐like) for NCSS variants in MYBPC3 based on the area under the receiver operator curve. Overall, the performance in a real‐time clinical setting is much more modest than reported by the developers of the tools.
The choice of splice prediction tool depends on the dataset. However, deep learning tools show promising results for variants tested in midigene splice assays.
Gene coexpression network analysis is a powerful "data-driven" approach essential for understanding cancer biology and mechanisms of tumor development. Yet, despite the completion of thousands of ...studies on cancer gene expression, there have been few attempts to normalize and integrate co-expression data from scattered sources in a concise "meta-analysis" framework. We generated such a resource by exploring gene coexpression networks in 82 microarray datasets from 9 major human cancer types. The analysis was conducted using an elaborate weighted gene coexpression network (WGCNA) methodology and identified over 3,000 robust gene coexpression modules. The modules covered a range of known tumor features, such as proliferation, extracellular matrix remodeling, hypoxia, inflammation, angiogenesis, tumor differentiation programs, specific signaling pathways, genomic alterations, and biomarkers of individual tumor subtypes. To prioritize genes with respect to those tumor features, we ranked genes within each module by connectivity, leading to identification of module-specific functionally prominent hub genes. To showcase the utility of this network information, we positioned known cancer drug targets within the coexpression networks and predicted that Anakinra, an anti-rheumatoid therapeutic agent, may be promising for development in colorectal cancer. We offer a comprehensive, normalized and well documented collection of >3000 gene coexpression modules in a variety of cancers as a rich data resource to facilitate further progress in cancer research.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
MicroRNAs are small non-coding RNA transcripts that regulate post-transcriptional gene expression. The millions of short sequence reads generated by next generation sequencing technologies make this ...technique explicitly suitable for profiling of known and novel microRNAs. A modification to the small-RNA expression kit (SREK, Ambion) library preparation method for the SOLiD sequencing platform is described to generate microRNA sequencing libraries that are compatible with the Illumina Genome Analyzer.
High quality sequencing libraries can successfully be prepared from as little as 100 ng small RNA enriched RNA. An easy to use perl-based analysis pipeline called E-miR was developed to handle the sequencing data in several automated steps including data format conversion, 3' adapter removal, genome alignment and annotation to non-coding RNA transcripts. The sample preparation and E-miR pipeline were used to identify 37 cardiac enriched microRNAs in stage 16 chicken embryos. Isomir expression profiles between the heart and embryo were highly correlated for all miRNAs suggesting that tissue or cell specific miRNA modifications do not occur.
In conclusion, our alternative sample preparation method can successfully be applied to generate high quality miRNA sequencing libraries for the Illumina genome analyzer.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Tumor cell migration is a key process for cancer cell dissemination and metastasis that is controlled by signal-mediated cytoskeletal and cell matrix adhesion remodeling. Using a phagokinetic track ...assay with migratory H1299 cells, we performed an siRNA screen of almost 1,500 genes encoding kinases/phosphatases and adhesome- and migration-related proteins to identify genes that affect tumor cell migration speed and persistence. Thirty candidate genes that altered cell migration were validated in live tumor cell migration assays. Eight were associated with metastasis-free survival in breast cancer patients, with integrin β3-binding protein (ITGB3BP), MAP3K8, NIMA-related kinase (NEK2), and SHC-transforming protein 1 (SHC1) being the most predictive. Examination of genes that modulate migration indicated that SRPK1, encoding the splicing factor kinase SRSF protein kinase 1, is relevant to breast cancer outcomes, as it was highly expressed in basal breast cancer. Furthermore, high SRPK1 expression correlated with poor breast cancer disease outcome and preferential metastasis to the lungs and brain. In 2 independent murine models of breast tumor metastasis, stable shRNA-based SRPK1 knockdown suppressed metastasis to distant organs, including lung, liver, and spleen, and inhibited focal adhesion reorganization. Our study provides comprehensive information on the molecular determinants of tumor cell migration and suggests that SRPK1 has potential as a drug target for limiting breast cancer metastasis.
The FANTOM5 project investigates transcription initiation activities in more than 1,000 human and mouse primary cells, cell lines and tissues using CAGE. Based on manual curation of sample ...information and development of an ontology for sample classification, we assemble the resulting data into a centralized data resource (http://fantom.gsc.riken.jp/5/). This resource contains web-based tools and data-access points for the research community to search and extract data related to samples, genes, promoter activities, transcription factors and enhancers across the FANTOM5 atlas.