Abstract
With the rapid increase of sequenced metazoan mitochondrial genomes, a detailed manual annotation is becoming more and more infeasible. While it is easy to identify the approximate location ...of protein-coding genes within mitogenomes, the peculiar processing of mitochondrial transcripts, however, makes the determination of precise gene boundaries a surprisingly difficult problem. We have analyzed the properties of annotated start and stop codon positions in detail, and use the inferred patterns to devise a new method for predicting gene boundaries in de novo annotations. Our method benefits from empirically observed prevalances of start/stop codons and gene lengths, and considers the dependence of these features on variations of genetic codes. Albeit not being perfect, our new approach yields a drastic improvement in the accuracy of gene boundaries and upgrades the mitochondrial genome annotation server MITOS to an even more sophisticated tool for fully automatic annotation of metazoan mitochondrial genomes.
The detection of differentially methylated regions (DMRs) is a necessary prerequisite for characterizing different epigenetic states. We present a novel program, metilene, to identify DMRs within ...whole-genome and targeted data with unrivaled specificity and sensitivity. A binary segmentation algorithm combined with a two-dimensional statistical test allows the detection of DMRs in large methylation experiments with multiple groups of samples in minutes rather than days using off-the-shelf hardware. metilene outperforms other state-of-the-art tools for low coverage data and can estimate missing data. Hence, metilene is a versatile tool to study the effect of epigenetic modifications in differentiation/development, tumorigenesis, and systems biology on a global, genome-wide level. Whether in the framework of international consortia with dozens of samples per group, or even without biological replicates, it produces highly significant and reliable results.
ViennaRNA Package 2.0 Lorenz, Ronny; Bernhart, Stephan H; Höner Zu Siederdissen, Christian ...
Algorithms for molecular biology,
11/2011, Letnik:
6, Številka:
1
Journal Article
Recenzirano
Odprti dostop
Secondary structure forms an important intermediate level of description of nucleic acids that encapsulates the dominating part of the folding energy, is often well conserved in evolution, and is ...routinely used as a basis to explain experimental findings. Based on carefully measured thermodynamic parameters, exact dynamic programming algorithms can be used to compute ground states, base pairing probabilities, as well as thermodynamic properties.
The ViennaRNA Package has been a widely used compilation of RNA secondary structure related computer programs for nearly two decades. Major changes in the structure of the standard energy model, the Turner 2004 parameters, the pervasive use of multi-core CPUs, and an increasing number of algorithmic variants prompted a major technical overhaul of both the underlying RNAlib and the interactive user programs. New features include an expanded repertoire of tools to assess RNA-RNA interactions and restricted ensembles of structures, additional output information such as centroid structures and maximum expected accuracy structures derived from base pairing probabilities, or z-scores for locally stable secondary structures, and support for input in fasta format. Updates were implemented without compromising the computational efficiency of the core algorithms and ensuring compatibility with earlier versions.
The ViennaRNA Package 2.0, supporting concurrent computations via OpenMP, can be downloaded from http://www.tbi.univie.ac.at/RNA.
The prediction of a consensus structure for a set of related RNAs is an important first step for subsequent analyses. RNAalifold, which computes the minimum energy structure that is simultaneously ...formed by a set of aligned sequences, is one of the oldest and most widely used tools for this task. In recent years, several alternative approaches have been advocated, pointing to several shortcomings of the original RNAalifold approach.
We show that the accuracy of RNAalifold predictions can be improved substantially by introducing a different, more rational handling of alignment gaps, and by replacing the rather simplistic model of covariance scoring with more sophisticated RIBOSUM-like scoring matrices. These improvements are achieved without compromising the computational efficiency of the algorithm. We show here that the new version of RNAalifold not only outperforms the old one, but also several other tools recently developed, on different datasets.
The new version of RNAalifold not only can replace the old one for almost any application but it is also competitive with other approaches including those based on SCFGs, maximum expected accuracy, or hierarchical nearest neighbor classifiers.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Abstract
Despite its prominence, the mechanisms through which the tumor suppressor p53 regulates most genes remain unclear. Recently, the regulatory factor X 7 (RFX7) emerged as a suppressor of ...lymphoid neoplasms, but its regulation and target genes mediating tumor suppression remain unknown. Here, we identify a novel p53-RFX7 signaling axis. Integrative analysis of the RFX7 DNA binding landscape and the RFX7-regulated transcriptome in three distinct cell systems reveals that RFX7 directly controls multiple established tumor suppressors, including PDCD4, PIK3IP1, MXD4, and PNRC1, across cell types and is the missing link for their activation in response to p53 and stress. RFX7 target gene expression correlates with cell differentiation and better prognosis in numerous cancer types. Interestingly, we find that RFX7 sensitizes cells to Doxorubicin by promoting apoptosis. Together, our work establishes RFX7’s role as a ubiquitous regulator of cell growth and fate determination and a key node in the p53 transcriptional program.
Graphical Abstract
Graphical Abstract
p53 induction by MDM2-inhibition, DNA-damage, or ribosomal stress activates the transcription factor RFX7. In turn, RFX7 up-regulates multiple tumor suppressor genes.
Vienna RNA Websuite Gruber, Andreas R; Lorenz, Ronny; Bernhart, Stephan H ...
Nucleic acids research,
07/2008, Letnik:
36, Številka:
suppl-2
Journal Article
Recenzirano
Odprti dostop
The Vienna RNA Websuite is a comprehensive collection of tools for folding, design and analysis of RNA sequences. It provides a web interface to the most commonly used programs of the Vienna RNA ...package. Among them, we find folding of single and aligned sequences, prediction of RNA-RNA interactions, and design of sequences with a given structure. Additionally, we provide analysis of folding landscapes using the barriers program and structural RNA alignments using LocARNA. The web server together with software packages for download is freely accessible at http://rna.tbi.univie.ac.at/.
Motivation: Many classes of functional RNA molecules are characterized by highly conserved secondary structures but little detectable sequence similarity. Reliable multiple alignments can therefore ...be constructed only when the shared structural features are taken into account. Since multiple alignments are used as input for many subsequent methods of data analysis, structure-based alignments are an indispensable necessity in RNA bioinformatics. Results: We present here a method to compute pairwise and progressive multiple alignments from the direct comparison of base pairing probability matrices. Instead of attempting to solve the folding and the alignment problem simultaneously as in the classical Sankoff's algorithm, we use McCaskill's approach to compute base pairing probability matrices which effectively incorporate the information on the energetics of each sequences. A novel, simplified variant of Sankoff's algorithms can then be employed to extract the maximum-weight common secondary structure and an associated alignment. Availability: The programs pmcomp and pmmulti described in this contribution are implemented in Perl and can be downloaded together with the example datasets from http://www.tbi.univie.ac.at/RNA/PMcomp/. A web server is available at http://rna.tbi.univie.ac.at/cgi-bin/pmcgi.pl
MicroRNAs (miRNAs) are small non coding RNAs responsible for posttranscriptional regulation of gene expression. Even though almost 2000 precursors have been described so far, additional miRNAs are ...still being discovered in normal as well as malignant cells. Alike protein coding genes, miRNAs may acquire oncogenic properties in consequence of altered expression or presence of gain or loss of function mutations. In this study we mined datasets from miRNA expression profiling (miRNA-seq) of 7 classic Hodgkin Lymphoma (cHL) cell lines, 10 non-Hodgkin lymphoma (NHL) cell lines and 56 samples of germinal center derived B-cell lymphomas. Our aim was to discover potential novel cHL oncomiRs not reported in miRBase (release 22.1) and expressed in cHL cell lines but no other B-cell lymphomas. We identified six such miRNA candidates in cHL cell lines and verified the expression of two of them encoded at chr2:212678788-212678849 and chr5:168090507-168090561 (GRCh38). Interestingly, we showed that one of the validated miRNAs (located in an intron of the TENM2 gene) is expressed together with its host gene. TENM2 is characterized by hypomethylation and open chromatin around its TSS in cHL cell lines in contrast to NHL cell lines and germinal centre B-cells respectively. It indicates an epigenetic mechanism responsible for aberrant expression of both, the TENM2 gene and the novel miRNA in cHL cell lines. Despite the GO analysis performed with the input of the in silico predicted novel miRNA target genes did not reveal ontologies typically associated with cHL pathogenesis, it pointed to several interesting candidates involved in i.e. lymphopoiesis. These include the lymphoma related BCL11A gene, the IKZF2 gene involved in lymphocyte development or the transcription initiator GTF2H1.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Bivalent (poised or paused) chromatin comprises activating and repressing histone modifications at the same location. This combination of epigenetic marks at promoter or enhancer regions keeps genes ...expressed at low levels but poised for rapid activation. Typically, DNA at bivalent promoters is only lowly methylated in normal cells, but frequently shows elevated methylation levels in cancer samples. Here, we developed a universal classifier built from chromatin data that can identify cancer samples solely from hypermethylation of bivalent chromatin. Tested on over 7,000 DNA methylation data sets from several cancer types, it reaches an AUC of 0.92. Although higher levels of DNA methylation are often associated with transcriptional silencing, counter-intuitive positive statistical dependencies between DNA methylation and expression levels have been recently reported for two cancer types. Here, we re-analyze combined expression and DNA methylation data sets, comprising over 5,000 samples, and demonstrate that the conjunction of hypermethylation of bivalent chromatin and up-regulation of the corresponding genes is a general phenomenon in cancer. This up-regulation affects many developmental genes and transcription factors, including dozens of homeobox genes and other genes implicated in cancer. Thus, we reason that the disturbance of bivalent chromatin may be intimately linked to tumorigenesis.
Abstract
The human
SBF1
(SET binding factor 1) gene, alternatively known as
MTMR5
, is predominantly expressed in the brain, and its epigenetic dysregulation is linked to late-onset neurocognitive ...disorders (NCDs), such as Alzheimer’s disease. This gene contains a (GCC)-repeat at the interval between + 1 and + 60 of the transcription start site (SBF1-202 ENST00000380817.8). We sequenced the
SBF1
(GCC)-repeat in a sample of 542 Iranian individuals, consisting of late-onset NCDs (N = 260) and controls (N = 282). While multiple alleles were detected at this locus, the 8 and 9 repeats were predominantly abundant, forming > 95% of the allele pool across the two groups. Among a number of anomalies, the allele distribution was significantly different in the NCD group versus controls (Fisher’s exact
p
= 0.006), primarily as a result of enrichment of the 8-repeat in the former. The genotype distribution departed from the Hardy–Weinberg principle in both groups (
p
< 0.001), and was significantly different between the two groups (Fisher’s exact
p
= 0.001). We detected significantly low frequency of the 8/9 genotype in both groups, higher frequency of this genotype in the NCD group, and reverse order of 8/8 versus 9/9 genotypes in the NCD group versus controls. Biased heterozygous/heterozygous ratios were also detected for the 6/8 versus 6/9 genotypes (in favor of 6/8) across the human samples studied (Fisher’s exact
p
= 0.0001). Bioinformatics studies revealed that the number of (GCC)-repeats may change the RNA secondary structure and interaction sites at least across human exon 1. This STR was specifically expanded beyond 2-repeats in primates. In conclusion, we report indication of a novel biological phenomenon, in which there is selection against certain heterozygous genotypes at a STR locus in human. We also report different allele and genotype distribution at this STR locus in late-onset NCD versus controls. In view of the location of this STR in the 5′ untranslated region, RNA/RNA or RNA/DNA heterodimer formation of the involved genotypes and alternative RNA processing and/or translation should be considered.