Endophenotypes can include phenomena at diverse biological scales; some examples include protein catalytic rate or melting temperature (stability), cell growth rate, and blood pressure. Classifiers ...that are gene and endophenotype specific--such as those produced by the ePOSE algorithm--will benefit from learning which variants (or genes) contribute to specific components of disease: as an example, CFTR variants that effect processing versus chloride conductance, or complex heart disease phenotypes that can result from varying combinations of LDLR-specific cholesterol plaques 27 or LPA-specific calcium plaques 56.
Cooperative dysregulation of gene sequence and expression may contribute to cancer formation and progression. The Cancer Genome Atlas (TCGA) Network recently catalogued gene sequence and expression ...data for a collection of glioblastoma multiforme (GBM) tumors. We developed an automated, model-free method to rapidly and exhaustively examine the correlation among somatic mutation and gene expression and interrogated 149 GBM tumor samples from the TCGA. The method identified 41 genes whose mutation status is highly correlated with drastic changes in the expression (z-score ± 2.0), across tumor samples, of other genes. Some of the 41 genes have been previously implicated in GBM pathogenesis (e.g., NF1, TP53, RB1, and IDH1) and others, while implicated in cancer, had not previously been highlighted in studies using TCGA data (e.g., SYNE1, KLF6, FGFR4, and EPHB4). The method also predicted that known oncogenes and tumor suppressors participate in GBM via drastic over- and underexpression, respectively. In addition, the method identified a known synthetic lethal interaction between TP53 and PLK1, other potential synthetic lethal interactions with TP53, and correlations between IDH1 mutation status and the overexpression of known GBM survival genes.
Allelic heterogeneity in disease-causing genes presents a substantial challenge to the translation of genomic variation into clinical practice. Few of the almost 2,000 variants in the cystic fibrosis ...transmembrane conductance regulator gene CFTR have empirical evidence that they cause cystic fibrosis. To address this gap, we collected both genotype and phenotype data for 39,696 individuals with cystic fibrosis in registries and clinics in North America and Europe. In these individuals, 159 CFTR variants had an allele frequency of ł0.01%. These variants were evaluated for both clinical severity and functional consequence, with 127 (80%) meeting both clinical and functional criteria consistent with disease. Assessment of disease penetrance in 2,188 fathers of individuals with cystic fibrosis enabled assignment of 12 of the remaining 32 variants as neutral, whereas the other 20 variants remained of indeterminate effect. This study illustrates that sourcing data directly from well-phenotyped subjects can address the gap in our ability to interpret clinically relevant genomic variation.
Cancer sequencing studies are increasingly comprehensive and well powered, returning long lists of somatic mutations that can be difficult to sort and interpret. Diligent analysis and quality control ...can require multiple computational tools of distinct utility and producing disparate output, creating additional challenges for the investigator. The Cancer-Related Analysis of Variants Toolkit (CRAVAT) is an evolving suite of informatics tools for mutation interpretation that includes mutation mapping and quality control, impact prediction and extensive annotation, gene- and mutation-level interpretation, including joint prioritization of all nonsilent mutation consequence types, and structural and mechanistic visualization. Results from CRAVAT submissions are explored in an interactive, user-friendly web environment with dynamic filtering and sorting designed to highlight the most informative mutations, even in the context of very large studies. CRAVAT can be run on a public web portal, in the cloud, or downloaded for local use, and is easily integrated with other methods for cancer omics analysis.
.
Many variants of uncertain significance (VUS) have been identified in BRCA2 through clinical genetic testing. VUS pose a significant clinical challenge because the contribution of these variants to ...cancer risk has not been determined. We conducted a comprehensive assessment of VUS in the BRCA2 C-terminal DNA binding domain (DBD) by using a validated functional assay of BRCA2 homologous recombination (HR) DNA-repair activity and defined a classifier of variant pathogenicity. Among 139 variants evaluated, 54 had ≥99% probability of pathogenicity, and 73 had ≥95% probability of neutrality. Functional assay results were compared with predictions of variant pathogenicity from the Align-GVGD protein-sequence-based prediction algorithm, which has been used for variant classification. Relative to the HR assay, Align-GVGD significantly (p < 0.05) over-predicted pathogenic variants. We subsequently combined functional and Align-GVGD prediction results in a Bayesian hierarchical model (VarCall) to estimate the overall probability of pathogenicity for each VUS. In addition, to predict the effects of all other BRCA2 DBD variants and to prioritize variants for functional studies, we used the endoPhenotype-Optimized Sequence Ensemble (ePOSE) algorithm to train classifiers for BRCA2 variants by using data from the HR functional assay. Together, the results show that systematic functional assays in combination with in silico predictors of pathogenicity provide robust tools for clinical annotation of BRCA2 VUS.
There are conflicting data on the effects of antipsychotic medications on delirium in patients in the intensive care unit (ICU).
In a randomized, double-blind, placebo-controlled trial, we assigned ...patients with acute respiratory failure or shock and hypoactive or hyperactive delirium to receive intravenous boluses of haloperidol (maximum dose, 20 mg daily), ziprasidone (maximum dose, 40 mg daily), or placebo. The volume and dose of a trial drug or placebo was halved or doubled at 12-hour intervals on the basis of the presence or absence of delirium, as detected with the use of the Confusion Assessment Method for the ICU, and of side effects of the intervention. The primary end point was the number of days alive without delirium or coma during the 14-day intervention period. Secondary end points included 30-day and 90-day survival, time to freedom from mechanical ventilation, and time to ICU and hospital discharge. Safety end points included extrapyramidal symptoms and excessive sedation.
Written informed consent was obtained from 1183 patients or their authorized representatives. Delirium developed in 566 patients (48%), of whom 89% had hypoactive delirium and 11% had hyperactive delirium. Of the 566 patients, 184 were randomly assigned to receive placebo, 192 to receive haloperidol, and 190 to receive ziprasidone. The median duration of exposure to a trial drug or placebo was 4 days (interquartile range, 3 to 7). The median number of days alive without delirium or coma was 8.5 (95% confidence interval CI, 5.6 to 9.9) in the placebo group, 7.9 (95% CI, 4.4 to 9.6) in the haloperidol group, and 8.7 (95% CI, 5.9 to 10.0) in the ziprasidone group (P=0.26 for overall effect across trial groups). The use of haloperidol or ziprasidone, as compared with placebo, had no significant effect on the primary end point (odds ratios, 0.88 95% CI, 0.64 to 1.21 and 1.04 95% CI, 0.73 to 1.48, respectively). There were no significant between-group differences with respect to the secondary end points or the frequency of extrapyramidal symptoms.
The use of haloperidol or ziprasidone, as compared with placebo, in patients with acute respiratory failure or shock and hypoactive or hyperactive delirium in the ICU did not significantly alter the duration of delirium. (Funded by the National Institutes of Health and the VA Geriatric Research Education and Clinical Center; MIND-USA ClinicalTrials.gov number, NCT01211522 .).
Computational analysis of cancer pharmacogenomics data has resulted in biomarkers predictive of drug response, but the majority of response is not captured by current methods. Methods typically ...select single biomarkers or groups of related biomarkers but do not account for response that is strictly dependent on many simultaneous genetic alterations. This shortcoming reflects the combinatorics and multiple-testing problem associated with many-body biologic interactions. We developed a novel approach, Multivariate Organization of Combinatorial Alterations (MOCA), to partially address these challenges. Extending on previous work that accounts for pairwise interactions, the approach rapidly combines many genomic alterations into biomarkers of drug response, using Boolean set operations coupled with optimization; in this framework, the union, intersection, and difference Boolean set operations are proxies of molecular redundancy, synergy, and resistance, respectively. The algorithm is fast, broadly applicable to cancer genomics data, is of immediate use for prioritizing cancer pharmacogenomics experiments, and recovers known clinical findings without bias. Furthermore, the results presented here connect many important, previously isolated observations.
ABSTRACT
Insertion/deletion variants (indels) alter protein sequence and length, yet are highly prevalent in healthy populations, presenting a challenge to bioinformatics classifiers. Commonly used ...features—DNA and protein sequence conservation, indel length, and occurrence in repeat regions—are useful for inference of protein damage. However, these features can cause false positives when predicting the impact of indels on disease. Existing methods for indel classification suffer from low specificities, severely limiting clinical utility. Here, we further develop our variant effect scoring tool (VEST) to include the classification of in‐frame and frameshift indels (VEST‐indel) as pathogenic or benign. We apply 24 features, including a new “PubMed” feature, to estimate a gene's importance in human disease. When compared with four existing indel classifiers, our method achieves a drastically reduced false‐positive rate, improving specificity by as much as 90%. This approach of estimating gene importance might be generally applicable to missense and other bioinformatics pathogenicity predictors, which often fail to achieve high specificity. Finally, we tested all possible meta‐predictors that can be obtained from combining the four different indel classifiers using Boolean conjunctions and disjunctions, and derived a meta‐predictor with improved performance over any individual method.
We further developed our Variant Effect Scoring Tool (VEST) to include classification of in‐frame and frameshift indels (VEST‐indel) as pathogenic or benign. VEST‐indel includes a new feature for estimating the importance of a gene in human disease, and achieves improved specificity and balanced accuracy relative to existing methods. VEST is available as a standalone application, and as part of the CRAVAT webserver (cravat.us).