Massively parallel RNA sequencing (RNA-seq) has rapidly become the assay of choice for interrogating RNA transcript abundance and diversity. This article provides a detailed introduction to fundamental RNA-seq molecular biology and informatics concepts. We make available open-access RNA-seq tutorials that cover cloud computing, tool installation, relevant file formats, reference genomes, transcriptome annotations, quality-control strategies, expression, differential expression, and alternative splicing analysis methods. These tutorials and additional training resources are accompanied by complete analysis pipelines and test datasets made available without encumbrance at www.rnaseq.wiki.
Efficient tools for data management and integration are essential for many aspects of high-throughput biology. In particular, annotations of genes and human genetic variants are commonly used but highly fragmented across many resources. Here, we describe MyGene.info and MyVariant.info, high-performance web services for querying gene and variant annotation information. These web services are currently accessed more than three million times per month. They also demonstrate a generalizable cloud-based model for organizing and querying biological annotation information. MyGene.info and MyVariant.info are provided as high-performance web services, accessible at http://mygene.info and http://myvariant.info. Both are offered free of charge to the research community.
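As a rough illustration of what querying such a gene annotation web service can look like, the sketch below builds a request URL and, optionally, fetches the matching records. It is a minimal sketch, not the authors' client: the `/v3/query` path, the `q`, `species`, `fields`, and `size` parameters, and the `hits` key reflect the public MyGene.info API as we understand it and should be checked against the current documentation. The function names are hypothetical.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumption: the v3 query endpoint; confirm against the MyGene.info docs.
MYGENE_QUERY_URL = "https://mygene.info/v3/query"


def build_gene_query_url(term, species="human",
                         fields=("symbol", "name", "entrezgene"), size=5):
    """Return the URL for a gene query (hypothetical helper, no network I/O)."""
    params = {
        "q": term,                    # free-text or fielded query, e.g. "CDK2"
        "species": species,           # restrict hits to one species
        "fields": ",".join(fields),   # annotation fields to return
        "size": size,                 # maximum number of hits
    }
    return f"{MYGENE_QUERY_URL}?{urlencode(params)}"


def fetch_gene_hits(term, timeout=10):
    """Send the query and return the list of hits (requires network access)."""
    with urlopen(build_gene_query_url(term), timeout=timeout) as response:
        return json.load(response).get("hits", [])


if __name__ == "__main__":
    # Only the URL is printed here; call fetch_gene_hits("CDK2") to query live.
    print(build_gene_query_url("CDK2"))
```

Separating URL construction from the network call keeps the query logic easy to inspect and test offline.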
The Drug-Gene Interaction Database (DGIdb, www.dgidb.org) is a web resource that consolidates disparate data sources describing drug-gene interactions and gene druggability. It provides an intuitive graphical user interface and a documented application programming interface (API) for querying these data. DGIdb was assembled through an extensive manual curation effort, reflecting the combined information of twenty-seven sources. For DGIdb 2.0, substantial updates have been made to increase content and improve its usefulness as a resource for mining clinically actionable drug targets. Specifically, nine new sources of drug-gene interactions have been added, including seven resources specifically focused on interactions linked to clinical trials. These additions have more than doubled the overall count of drug-gene interactions. The total number of druggable gene claims has also increased by 30%. Importantly, a majority of the unrestricted, publicly accessible sources used in DGIdb are now automatically updated on a weekly basis, providing the most current information for these sources. Finally, a new web view and API have been developed to allow searching for interactions by drug identifiers, complementing the existing gene-based search functionality. With these updates, DGIdb represents a comprehensive and user-friendly tool for mining the druggable genome for precision medicine hypothesis generation.
Cerebrospinal fluid (CSF) 42 amino acid species of amyloid beta (Aβ42) and tau levels are strongly correlated with the presence of Alzheimer's disease (AD) neuropathology including amyloid plaques and neurodegeneration and have been successfully used as endophenotypes for genetic studies of AD. Additional CSF analytes may also serve as useful endophenotypes that capture other aspects of AD pathophysiology. Here we have conducted a genome-wide association study of CSF levels of 59 AD-related analytes. All analytes were measured using the Rules Based Medicine Human DiscoveryMAP Panel, which includes analytes relevant to several disease-related processes. Data from two independently collected and measured datasets, the Knight Alzheimer's Disease Research Center (ADRC) and Alzheimer's Disease Neuroimaging Initiative (ADNI), were analyzed separately, and combined results were obtained using meta-analysis. We identified genetic associations with CSF levels of 5 proteins (Angiotensin-converting enzyme (ACE), Chemokine (C-C motif) ligand 2 (CCL2), Chemokine (C-C motif) ligand 4 (CCL4), Interleukin 6 receptor (IL6R) and Matrix metalloproteinase-3 (MMP3)) with study-wide significant p-values (p < 1.46×10⁻¹⁰) and significant, consistent evidence for association in both the Knight ADRC and the ADNI samples. These proteins are involved in amyloid processing and pro-inflammatory signaling. SNPs associated with ACE, IL6R and MMP3 protein levels are located within the coding regions of the corresponding structural gene. The SNPs associated with CSF levels of CCL4 and CCL2 are located in known chemokine binding proteins. The genetic associations reported here are novel and suggest mechanisms for genetic control of CSF and plasma levels of these disease-related proteins. Significant SNPs in ACE and MMP3 also showed association with AD risk. Our findings suggest that these proteins/pathways may be valuable therapeutic targets for AD.
Robust associations in cognitively normal individuals suggest that these SNPs also influence regulation of these proteins more generally and may therefore be relevant to other diseases.
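The combination step mentioned in the abstract above (two cohorts analyzed separately, then meta-analyzed) can be illustrated with a fixed-effect, inverse-variance-weighted meta-analysis. This is only a sketch of one common approach: the abstract does not state which meta-analysis method was used, and the function name and the example effect sizes are hypothetical.

```python
import math


def fixed_effect_meta(betas, standard_errors):
    """Combine per-cohort effect estimates for one SNP.

    Assumption: fixed-effect inverse-variance weighting; the source abstract
    does not name the method it used.
    Returns (combined_beta, combined_se, z_score, two_sided_p).
    """
    weights = [1.0 / se ** 2 for se in standard_errors]  # precision weights
    total_weight = sum(weights)
    beta = sum(w * b for w, b in zip(weights, betas)) / total_weight
    se = math.sqrt(1.0 / total_weight)
    z = beta / se
    # Two-sided p-value from the standard normal distribution.
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return beta, se, z, p


if __name__ == "__main__":
    # Made-up per-cohort estimates, for illustration only.
    beta, se, z, p = fixed_effect_meta([0.2, 0.4], [0.1, 0.1])
    print(f"beta={beta:.3f} se={se:.4f} z={z:.2f} p={p:.2e}")
```

Cohorts with smaller standard errors receive more weight, so a larger or less noisy dataset contributes proportionally more to the combined estimate.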
Broad and deep tumour genome sequencing has shed new light on tumour heterogeneity and provided important insights into the evolution of metastases arising from different clones. There is an additional layer of complexity, in that tumour evolution may be influenced by selective pressure provided by therapy, in a similar fashion to that occurring in infectious diseases. Here we studied tumour genomic evolution in a patient (index patient) with metastatic breast cancer bearing an activating PIK3CA (phosphatidylinositol-4,5-bisphosphate 3-kinase, catalytic subunit alpha, PI(3)Kα) mutation. The patient was treated with the PI(3)Kα inhibitor BYL719, which achieved a lasting clinical response, but the patient eventually became resistant to this drug (emergence of lung metastases) and died shortly thereafter. A rapid autopsy was performed and material from a total of 14 metastatic sites was collected and sequenced. All metastatic lesions, when compared to the pre-treatment tumour, had a copy loss of PTEN (phosphatase and tensin homolog) and those lesions that became refractory to BYL719 had additional and different PTEN genetic alterations, resulting in the loss of PTEN expression. To put these results in context, we examined six other patients also treated with BYL719. Acquired bi-allelic loss of PTEN was found in one of these patients, whereas in two others PIK3CA mutations present in the primary tumour were no longer detected at the time of progression. To characterize our findings functionally, we examined the effects of PTEN knockdown in several preclinical models (both in cell lines intrinsically sensitive to BYL719 and in PTEN-null xenografts derived from our index patient), which we found resulted in resistance to BYL719, whereas simultaneous PI(3)K p110β blockade reverted this resistance phenotype.
We conclude that parallel genetic evolution of separate metastatic sites with different PTEN genomic alterations leads to a convergent PTEN-null phenotype resistant to PI(3)Kα inhibition.
Interpretation of the clinical significance of genomic alterations remains the most severe bottleneck preventing the realization of personalized medicine in cancer. We propose a knowledge commons to facilitate collaborative contributions and open discussion of clinical decision-making based on genomic events in cancer.
Background: Identification of the physiologic changes that occur during the early stages of Alzheimer’s disease (AD) may provide critical insights for the diagnosis, prognosis, and treatment of disease. Cerebrospinal fluid (CSF) biomarkers are a rich source of information that reflect the brain proteome. Methods: A novel approach was applied to screen a panel of ~190 CSF analytes quantified by multiplex immunoassay, and common associations were detected in the Knight Alzheimer’s Disease Research Center (N = 311) and the Alzheimer’s Disease Neuroimaging Initiative (N = 293) cohorts. Rather than case-control status, the ratio of CSF levels of tau phosphorylated at threonine 181 (ptau181) and Aβ42 was used as a continuous trait in these analyses. Results: The ptau181-Aβ42 ratio has more statistical power than traditional modeling approaches, and the levels of CSF heart-type fatty acid binding protein (FABP) and 12 other correlated analytes increase as AD progresses. These results were validated using the traditional case-control status model. Stratification of the dataset demonstrated that increases in these analytes occur very early in the disease course and were apparent even in nondemented individuals with AD pathology (low ptau181, low Aβ42) compared with elderly control subjects with no pathology (low ptau181, high Aβ42). The FABP-Aβ42 ratio demonstrates a similar hazard ratio for disease conversion to ptau181-Aβ42 even though the overlap in classification is incomplete, suggesting that FABP contributes independent information as a predictor of AD. Conclusions: Our results indicate that the approach presented here can be used to correctly identify novel biomarkers for AD and that CSF heart FABP levels start to increase at very early stages of AD.
Here we report targeted sequencing of 83 genes using DNA from primary breast cancer samples from 625 postmenopausal (UBC-TAM series) and 328 premenopausal (MA12 trial) hormone receptor-positive (HR+) patients to determine interactions between somatic mutation and prognosis. Independent validation of prognostic interactions was achieved using data from the METABRIC study. Previously established associations between MAP3K1 and PIK3CA mutations with luminal A status/favorable prognosis and TP53 mutations with luminal B/non-luminal tumors/poor prognosis were observed, validating the methodological approach. In UBC-TAM, NF1 frame-shift nonsense (FS/NS) mutations were also a poor outcome driver that was validated in METABRIC. For MA12, poor outcome associated with PIK3R1 mutation was also reproducible. DDR1 mutations were strongly associated with poor prognosis in UBC-TAM despite stringent false discovery correction (q = 0.0003). In conclusion, uncommon recurrent somatic mutations should be further explored to create a more complete explanation of the highly variable outcomes that typify ER+ breast cancer.
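The q-value reported in the abstract above comes from a false discovery correction across many gene-level tests. As an illustration of how such adjusted values can be computed, the sketch below implements the Benjamini-Hochberg procedure. This is an assumption made for the example: the abstract does not say which correction method was used, and the input p-values are made up.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values (q-values), in input order.

    Assumption: BH is used here as one common FDR procedure; the source
    abstract does not specify its method.
    """
    m = len(pvalues)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: pvalues[i])
    qvalues = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotone q-values.
    for rank in range(m, 0, -1):
        index = order[rank - 1]
        running_min = min(running_min, pvalues[index] * m / rank)
        qvalues[index] = running_min
    return qvalues


if __name__ == "__main__":
    # Made-up p-values, for illustration only.
    for p, q in zip([0.01, 0.04, 0.03, 0.005],
                    benjamini_hochberg([0.01, 0.04, 0.03, 0.005])):
        print(f"p={p:.3f} q={q:.3f}")
```

A gene would then be called significant when its q-value falls below the chosen false discovery rate, for example 0.05.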