Mutations in the splicing factor SF3B1 are found in several cancer types and have been associated with various splicing defects. Using transcriptome sequencing data from chronic lymphocytic leukemia, ...breast cancer and uveal melanoma tumor samples, we show that hundreds of cryptic 3' splice sites (3'SSs) are used in cancers with SF3B1 mutations. We define the necessary sequence context for the observed cryptic 3' SSs and propose that cryptic 3'SS selection is a result of SF3B1 mutations causing a shift in the sterically protected region downstream of the branch point. While most cryptic 3'SSs are present at low frequency (<10%) relative to nearby canonical 3'SSs, we identified ten genes that preferred out-of-frame cryptic 3'SSs. We show that cancers with mutations in the SF3B1 HEAT 5-9 repeats use cryptic 3'SSs downstream of the branch point and provide both a mechanistic model consistent with published experimental data and affected targets that will guide further research into the oncogenic effects of SF3B1 mutation.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Bacterial genomes are organized by structural and functional elements, including promoters, transcription start and termination sites, open reading frames, regulatory noncoding regions, untranslated ...regions and transcription units. Here, we iteratively integrate high-throughput, genome-wide measurements of RNA polymerase binding locations and mRNA transcript abundance, 5' sequences and translation into proteins to determine the organizational structure of the Escherichia coli K-12 MG1655 genome. Integration of the organizational elements provides an experimentally annotated transcription unit architecture, including alternative transcription start sites, 5' untranslated region, boundaries and open reading frames of each transcription unit. A total of 4,661 transcription units were identified, representing an increase of >530% over current knowledge. This comprehensive transcription unit architecture allows for the elucidation of condition-specific uses of alternative sigma factors at the genome scale. Furthermore, the transcription unit architecture provides a foundation on which to construct genome-scale transcriptional and translational regulatory networks.
Adaptive laboratory evolution (ALE) under controlled conditions has become a valuable approach for the study of the genetic and biochemical basis for microbial adaptation under a given selection ...pressure. Conventionally, the timescale in ALE experiments has been set in terms of number of generations. As mutations are believed to occur primarily during cell division in growing cultures, the cumulative number of cell divisions (CCD) would be an alternative way to set the timescale for ALE. Here we show that in short-term ALE (up to 40-50 days), Escherichia coli, under growth rate selection pressure, was found to undergo approximately 10(11.2) total cumulative cell divisions in the population to produce a new stable growth phenotype that results from 2 to 8 mutations. Continuous exposure to a low level of the mutagen N-methyl-N'-nitro-N-nitrosoguanidine was found to accelerate this timescale and led to a superior growth rate phenotype with a much larger number of mutations as determined with whole-genome sequencing. These results would be useful for the fundamental kinetics of the ALE process in designing ALE experiments and provide a basis for its quantitative description.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Significance Identifying molecules that are specific to tumors for use in early detection, diagnosis, prognosis, and therapy is both a primary goal and a key discovery challenge across diverse areas ...of oncology. To discover ovarian tumor-specific molecules, we developed custom bioinformatics algorithms to analyze transcriptome sequence data of 296 ovarian cancer and 1,839 normal tissues and validated putative tumor-specific mRNA isoforms by RT–quantitative PCR. The results revealed multiple candidate diagnostic and therapeutic targets with unique sequences that were expressed in most of the cancers examined but not in normal tissues. The process we developed can be readily applied to identify diagnostic and therapeutic targets for any of the 30 or more tumor types for which large amounts of transcriptome data now exist.
Tumor-specific molecules are needed across diverse areas of oncology for use in early detection, diagnosis, prognosis and therapy. Large and growing public databases of transcriptome sequencing data (RNA-seq) derived from tumors and normal tissues hold the potential of yielding tumor-specific molecules, but because the data are new they have not been fully explored for this purpose. We have developed custom bioinformatic algorithms and used them with 296 high-grade serous ovarian (HGS-OvCa) tumor and 1,839 normal RNA-seq datasets to identify mRNA isoforms with tumor-specific expression. We rank prioritized isoforms by likelihood of being expressed in HGS-OvCa tumors and not in normal tissues and analyzed 671 top-ranked isoforms by high-throughput RT-qPCR. Six of these isoforms were expressed in a majority of the 12 tumors examined but not in 18 normal tissues. An additional 11 were expressed in most tumors and only one normal tissue, which in most cases was fallopian or colon. Of the 671 isoforms, the topmost 5% ( n = 33) ranked based on having tumor-specific or highly restricted normal tissue expression by RT-qPCR analysis are enriched for oncogenic, stem cell/cancer stem cell, and early development loci—including ETV4, FOXM1, LSR, CD9, RAB11FIP4, and FGFRL1. Many of the 33 isoforms are predicted to encode proteins with unique amino acid sequences, which would allow them to be specifically targeted for one or more therapeutic strategies—including monoclonal antibodies and T-cell–based vaccines. The systematic process described herein is readily and rapidly applicable to the more than 30 additional tumor types for which sufficient amounts of RNA-seq already exist.
Full text
Available for:
BFBNIB, NMLJ, NUK, PNG, SAZU, UL, UM, UPUK
Leukemia stem cells (LSCs) play a pivotal role in the resistance of chronic myeloid leukemia (CML) to tyrosine kinase inhibitors (TKIs) and its progression to blast crisis (BC), in part, through the ...alternative splicing of self-renewal and survival genes. To elucidate splice-isoform regulators of human BC LSC maintenance, we performed whole-transcriptome RNA sequencing, splice-isoform-specific quantitative RT-PCR (qRT-PCR), nanoproteomics, stromal coculture, and BC LSC xenotransplantation analyses. Cumulatively, these studies show that the alternative splicing of multiple prosurvival BCL2 family genes promotes malignant transformation of myeloid progenitors into BC LSCS that are quiescent in the marrow niche and that contribute to therapeutic resistance. Notably, sabutoclax, a pan-BCL2 inhibitor, renders marrow-niche-resident BC LSCs sensitive to TKIs at doses that spare normal progenitors. These findings underscore the importance of alternative BCL2 family splice-isoform expression in BC LSC maintenance and suggest that the combinatorial inhibition of prosurvival BCL2 family proteins and BCR-ABL may eliminate dormant LSCs and obviate resistance.
Display omitted
► Splice-isoform switching favors prosurvival BCL2 family expression in human BC ► BC LSCs are quiescent and TKI-resistant in the marrow niche ► Sabutoclax, a pan-BCL2 inhibitor, enhances TKI sensitivity of bone marrow BC LSCs
BCL2 isoform switching in human leukemia stem cells contributes to therapeutic resistance but can be overcome by combining tyrosine kinase inhibition with a pan-BCL2 inhibitor.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UILJ, UL, UM, UPCLJ, UPUK, ZAGLJ, ZRSKP
The molecular etiology of human progenitor reprogramming into self-renewing leukemia stem cells (LSC) has remained elusive. Although DNA sequencing has uncovered spliceosome gene mutations that ...promote alternative splicing and portend leukemic transformation, isoform diversity also may be generated by RNA editing mediated by adenosine deaminase acting on RNA (ADAR) enzymes that regulate stem cell maintenance. In this study, whole-transcriptome sequencing of normal, chronic phase, and serially transplantable blast crisis chronic myeloid leukemia (CML) progenitors revealed increased IFN-γ pathway gene expression in concert with BCR-ABL amplification, enhanced expression of the IFN-responsive ADAR1 p150 isoform, and a propensity for increased adenosine-to-inosine RNA editing during CML progression. Lentiviral overexpression experiments demonstrate that ADAR1 p150 promotes expression of the myeloid transcription factor PU.1 and induces malignant reprogramming of myeloid progenitors. Moreover, enforced ADAR1 p150 expression was associated with production of a misspliced form of GSK3β implicated in LSC self-renewal. Finally, functional serial transplantation and shRNA studies demonstrate that ADAR1 knockdown impaired in vivo self-renewal capacity of blast crisis CML progenitors. Together these data provide a compelling rationale for developing ADAR1-based LSC detection and eradication strategies.
Full text
Available for:
BFBNIB, NMLJ, NUK, PNG, SAZU, UL, UM, UPUK
Short-term laboratory evolution of bacteria followed by genomic sequencing provides insight into the mechanism of adaptive evolution, such as the number of mutations needed for adaptation, ...genotype-phenotype relationships, and the reproducibility of adaptive outcomes.
In the present study, we describe the genome sequencing of 11 endpoints of Escherichia coli that underwent 60-day laboratory adaptive evolution under growth rate selection pressure in lactate minimal media. Two to eight mutations were identified per endpoint. Generally, each endpoint acquired mutations to different genes. The most notable exception was an 82 base-pair deletion in the rph-pyrE operon that appeared in 7 of the 11 adapted strains. This mutation conferred an approximately 15% increase to the growth rate when experimentally introduced to the wild-type background and resulted in an approximately 30% increase to growth rate when introduced to a background already harboring two adaptive mutations. Additionally, most endpoints had a mutation in a regulatory gene (crp or relA, for example) or the RNA polymerase.
The 82 base-pair deletion found in the rph-pyrE operon of many endpoints may function to relieve a pyrimidine biosynthesis defect present in MG1655. In contrast, a variety of regulators acquire mutations in the different endpoints, suggesting flexibility in overcoming regulatory challenges in the adaptation.
Broad-acting transcription factors (TFs) in bacteria form regulons. Here, we present a 4-step method to fully reconstruct the leucine-responsive protein (Lrp) regulon in Escherichia coli K-12 MG 1655 ...that regulates nitrogen metabolism. Step 1 is composed of obtaining high-resolution ChIP-chip data for Lrp, the RNA polymerase and expression profiles under multiple environmental conditions. We identified 138 unique and reproducible Lrp-binding regions and classified their binding state under different conditions. In the second step, the analysis of these data revealed 6 distinct regulatory modes for individual ORFs. In the third step, we used the functional assignment of the regulated ORFs to reconstruct 4 types of regulatory network motifs around the metabolites that are affected by the corresponding gene products. In the fourth step, we determined how leucine, as a signaling molecule, shifts the regulatory motifs for particular metabolites. The physiological structure that emerges shows the regulatory motifs for different amino acid fall into the traditional classification of amino acid families, thus elucidating the structure and physiological functions of the Lrp-regulon. The same procedure can be applied to other broad-acting TFs, opening the way to full bottom-up reconstruction of the transcriptional regulatory network in bacterial cells.
Full text
Available for:
BFBNIB, NMLJ, NUK, PNG, SAZU, UL, UM, UPUK
We determined the genome-wide distribution of the nucleoid-associated protein Fis in Escherichia coli using chromatin immunoprecipitation coupled with high-resolution whole genome-tiling microarrays. ...We identified 894 Fis-associated regions across the E. coli genome. A significant number of these binding sites were found within open reading frames (33%) and between divergently transcribed transcripts (5%). Analysis indicates that A-tracts and AT-tracts are an important signal for preferred Fis-binding sites, and that A(6)-tracts in particular constitute a high-affinity signal that dictates Fis phasing in stretches of DNA containing multiple and variably spaced A-tracts and AT-tracts. Furthermore, we find evidence for an average of two Fis-binding regions per supercoiling domain in the chromosome of exponentially growing cells. Transcriptome analysis shows that approximately 21% of genes are affected by the deletion of fis; however, the changes in magnitude are small. To address the differential Fis bindings under growth environment perturbation, ChIP-chip analysis was performed using cells grown under aerobic and anaerobic growth conditions. Interestingly, the Fis-binding regions are almost identical in aerobic and anaerobic growth conditions-indicating that the E. coli genome topology mediated by Fis is superficially identical in the two conditions. These novel results provide new insight into how Fis modulates DNA topology at a genome scale and thus advance our understanding of the architectural bases of the E. coli nucleoid.
Formative research suggests that a human embryonic stem cell-specific alternative splicing gene regulatory network, which is repressed by Muscleblind-like (MBNL) RNA binding proteins, is involved in ...cell reprogramming. In this study, RNA sequencing, splice isoform-specific quantitative RT-PCR, lentiviral transduction, and in vivo humanized mouse model studies demonstrated that malignant reprogramming of progenitors into self-renewing blast crisis chronic myeloid leukemia stem cells (BC LSCs) was partially driven by decreased MBNL3. Lentiviral knockdown of MBNL3 resulted in reversion to an embryonic alternative splice isoform program typified by overexpression of CD44 transcript variant 3, containing variant exons 8–10, and BC LSC proliferation. Although isoform-specific lentiviral CD44v3 overexpression enhanced chronic phase chronic myeloid leukemia (CML) progenitor replating capacity, lentiviral shRNA knockdown abrogated these effects. Combined treatment with a humanized pan-CD44 monoclonal antibody and a breakpoint cluster region - ABL proto-oncogene 1, nonreceptor tyrosine kinase (BCR-ABL1) antagonist inhibited LSC maintenance in a niche-dependent manner. In summary, MBNL3 down-regulation–related reversion to an embryonic alternative splicing program, typified by CD44v3 overexpression, represents a previously unidentified mechanism governing malignant progenitor reprogramming in malignant microenvironments and provides a pivotal opportunity for selective BC LSC detection and therapeutic elimination.
Full text
Available for:
BFBNIB, NMLJ, NUK, PNG, SAZU, UL, UM, UPUK