Knowledge of the various interactions between molecules in the cell is crucial for understanding cellular processes in health and disease. Currently available interaction databases, being largely complementary to each other, must be integrated to obtain a comprehensive global map of the different types of interactions. We have previously reported the development of an integrative interaction database called ConsensusPathDB (http://ConsensusPathDB.org) that aims to fulfill this task. In this update article, we report its significant progress in terms of interaction content and web interface tools. ConsensusPathDB has grown mainly due to the integration of 12 further databases; it now contains 215 541 unique interactions and 4601 pathways from 30 databases overall. Binary protein interactions are scored with our confidence assessment tool, IntScore. The ConsensusPathDB web interface allows users to take advantage of these integrated interaction and pathway data in different contexts. Recent developments include pathway analysis of metabolite lists, visualization of functional gene/metabolite sets as overlap graphs, gene set analysis based on protein complexes and induced network modules analysis that connects a list of genes through various interaction types. To facilitate the interactive, visual interpretation of interaction and pathway data, we have re-implemented the graph visualization feature of ConsensusPathDB using the Cytoscape.js library.
ConsensusPathDB is a database system for the integration of human functional interactions. Current knowledge of these interactions is dispersed in more than 200 databases, each having a specific focus and data format. ConsensusPathDB currently integrates the content of 12 different interaction databases with heterogeneous foci comprising a total of 26 133 distinct physical entities and 74 289 distinct functional interactions (protein-protein interactions, biochemical reactions, gene regulatory interactions), and covering 1738 pathways. We describe the database schema and the methods used for data integration. Furthermore, we describe the functionality of the ConsensusPathDB web interface, where users can search and visualize interaction networks, upload, modify and expand networks in BioPAX, SBML or PSI-MI format, or carry out over-representation analysis with uploaded identifier lists with respect to substructures derived from the integrated interaction network. The ConsensusPathDB database is available at: http://cpdb.molgen.mpg.de
The ability of stem cells to propagate indefinitely is believed to occur via the fine modulation of pathways commonly involved in cellular senescence, including the telomerase, the p53, and the mitochondrial/oxidative stress pathways. Induced pluripotent stem cells (iPSCs) are a novel stem cell population obtained from somatic cells through forced expression of a set of genes normally expressed in embryonic stem cells (ESCs). These reprogrammed cells acquire self‐renewal properties and appear almost indistinguishable from ESCs in terms of morphology, gene expression, and differentiation potential. Accordingly, iPSCs exhibit alterations of the senescence‐related telomerase and p53 signaling pathways. However, although treatments with antioxidants have been recently shown to enhance cellular reprogramming, detailed information regarding the state of the mitochondrial/oxidative stress pathway in iPSCs is still lacking. Mitochondria undergo specific changes during organismal development and aging. Thus, addressing whether somatic mitochondria within iPSCs acquire ESC‐like features or retain the phenotype of the parental cell is an unanswered but relevant question. Herein, we demonstrate that somatic mitochondria within human iPSCs revert to an immature ESC‐like state with respect to organelle morphology and distribution, expression of nuclear factors involved in mitochondrial biogenesis, content of mitochondrial DNA, intracellular ATP level, oxidative damage, and lactate generation. Upon differentiation, mitochondria within iPSCs and ESCs exhibited analogous maturation and anaerobic‐to‐aerobic metabolic modifications. Overall, the data highlight that human iPSCs and ESCs, although not identical, share similar mitochondrial properties and suggest that cellular reprogramming can modulate the mitochondrial/oxidative stress pathway, thus inducing a rejuvenated state capable of escaping cellular senescence. STEM CELLS 2010;28:721–733
ConsensusPathDB is a meta-database that integrates different types of functional interactions from heterogeneous interaction data resources. Physical protein interactions, metabolic and signaling reactions and gene regulatory interactions are integrated in a seamless functional association network that simultaneously describes multiple functional aspects of genes, proteins, complexes, metabolites, etc. With 155 432 human, 194 480 yeast and 13 648 mouse complex functional interactions (originating from 18 databases on human and eight databases on yeast and mouse interactions each), ConsensusPathDB currently constitutes the most comprehensive publicly available interaction repository for these species. The Web interface at http://cpdb.molgen.mpg.de offers different ways of utilizing these integrated interaction data, in particular with tools for visualization, analysis and interpretation of high-throughput expression data in the light of functional interactions and biological pathways.
High-throughput complementary DNA sequencing (RNA-Seq) is a powerful tool for whole-transcriptome analysis, supplying information about a transcript's expression level and structure. However, it is difficult to determine the polarity of transcripts, and therefore identify which strand is transcribed. Here, we present a simple cDNA sequencing protocol that preserves information about a transcript's direction. Using Saccharomyces cerevisiae and mouse brain transcriptomes as models, we demonstrate that knowing the transcript's orientation allows more accurate determination of the structure and expression of genes. It also helps to identify new genes and enables studying promoter-associated and antisense transcription. The transcriptional landscapes we obtained are available online.
Colorectal carcinoma represents a heterogeneous entity, with only a fraction of the tumours responding to available therapies, requiring a better molecular understanding of the disease in precision oncology. To address this challenge, the OncoTrack consortium recruited 106 CRC patients (stages I-IV) and developed a pre-clinical platform generating a compendium of drug sensitivity data totalling >4,000 assays testing 16 clinical drugs on patient-derived in vivo and in vitro models. This large biobank of 106 tumours, 35 organoids and 59 xenografts, with extensive omics data comparing donor tumours and derived models provides a resource for advancing our understanding of CRC. Models recapitulate many of the genetic and transcriptomic features of the donors, but defined less complex molecular sub-groups because of the loss of human stroma. Linking molecular profiles with drug sensitivity patterns identifies novel biomarkers, including a signature outperforming RAS/RAF mutations in predicting sensitivity to the EGFR inhibitor cetuximab.
Epigenetic alterations, that is, disruption of DNA methylation and chromatin architecture, are now acknowledged as a universal feature of tumorigenesis. Medulloblastoma, a clinically challenging, malignant childhood brain tumour, is no exception. Despite much progress from recent genomics studies, with recurrent changes identified in each of the four distinct tumour subgroups (WNT-pathway-activated, SHH-pathway-activated, and the less-well-characterized Group 3 and Group 4), many cases still lack an obvious genetic driver. Here we present whole-genome bisulphite-sequencing data from thirty-four human and five murine tumours plus eight human and three murine normal controls, augmented with matched whole-genome, RNA and chromatin immunoprecipitation sequencing data. This comprehensive data set allowed us to decipher several features underlying the interplay between the genome, epigenome and transcriptome, and its effects on medulloblastoma pathophysiology. Most notable were highly prevalent regions of hypomethylation correlating with increased gene expression, extending tens of kilobases downstream of transcription start sites. Focal regions of low methylation linked to transcription-factor-binding sites shed light on differential transcriptional networks between subgroups, whereas increased methylation due to re-normalization of repressed chromatin in DNA methylation valleys was positively correlated with gene expression. Large, partially methylated domains affecting up to one-third of the genome showed increased mutation rates and gene silencing in a subgroup-specific fashion. Epigenetic alterations also affected novel medulloblastoma candidate genes (for example, LIN28B), resulting in alternative promoter usage and/or differential messenger RNA/microRNA expression. Analysis of mouse medulloblastoma and precursor-cell methylation demonstrated a somatic origin for many alterations.
Our data provide insights into the epigenetic regulation of transcription and genome organization in medulloblastoma pathogenesis, which are probably also of importance in a wider developmental and disease context.
Sugar beet (Beta vulgaris ssp. vulgaris) is an important crop of temperate climates which provides nearly 30% of the world's annual sugar production and is a source for bioethanol and animal feed. The species belongs to the order of Caryophyllales, is diploid with 2n = 18 chromosomes, has an estimated genome size of 714-758 megabases and shares an ancient genome triplication with other eudicot plants. Leafy beets have been cultivated since Roman times, but sugar beet is one of the most recently domesticated crops. It arose in the late eighteenth century when lines accumulating sugar in the storage root were selected from crosses made with chard and fodder beet. Here we present a reference genome sequence for sugar beet as the first non-rosid, non-asterid eudicot genome, advancing comparative genomics and phylogenetic reconstructions. The genome sequence comprises 567 megabases, of which 85% could be assigned to chromosomes. The assembly covers a large proportion of the repetitive sequence content that was estimated to be 63%. We predicted 27,421 protein-coding genes supported by transcript data and annotated them on the basis of sequence homology. Phylogenetic analyses provided evidence for the separation of Caryophyllales before the split of asterids and rosids, and revealed lineage-specific gene family expansions and losses. We sequenced spinach (Spinacia oleracea), another Caryophyllales species, and validated features that separate this clade from rosids and asterids. Intraspecific genomic variation was analysed based on the genome sequences of sea beet (Beta vulgaris ssp. maritima; progenitor of all beet crops) and four additional sugar beet accessions. We identified seven million variant positions in the reference genome, and also large regions of low variability, indicating artificial selection.
The sugar beet genome sequence enables the identification of genes affecting agronomically relevant traits, supports molecular breeding and maximizes the plant's potential in energy biotechnology.
Our lives (and deaths) have by now been dominated for two years by COVID-19, a pandemic that has caused hundreds of millions of disease cases, millions of deaths, trillions in economic costs, and major restrictions on our freedom. Here we suggest a novel tool for controlling the COVID-19 pandemic. The key element is a method for population-scale PCR-based testing, applied on a systematic and repeated basis. For this we have developed a low-cost, highly sensitive virus-genome-based test. Using Germany as an example, we use a mathematical model to demonstrate how useful this strategy could have been in controlling the pandemic. We show, using real-world examples, how this might be implemented on a mass scale and discuss the feasibility of this approach.