The COVID-19 pandemic demands assimilation of all biomedical knowledge to decode mechanisms of pathogenesis. Despite the recent renaissance in neural networks, a platform for the real-time synthesis ...of the exponentially growing biomedical literature and deep omics insights is unavailable. Here, we present the nferX platform for dynamic inference from over 45 quadrillion possible conceptual associations from unstructured text, and triangulation with insights from single-cell RNA-sequencing, bulk RNA-seq and proteomics from diverse tissue types. A hypothesis-free profiling of ACE2 suggests tongue keratinocytes, olfactory epithelial cells, airway club cells and respiratory ciliated cells as potential reservoirs of the SARS-CoV-2 receptor. We find the gut as the putative hotspot of COVID-19, where a maturation correlated transcriptional signature is shared in small intestine enterocytes among coronavirus receptors (ACE2, DPP4, ANPEP). A holistic data science platform triangulating insights from structured and unstructured data holds potential for accelerating the generation of impactful biological insights and hypotheses.
Rare monogenic disorders often share molecular etiologies involved in the pathogenesis of common diseases. Congenital disorders of glycosylation (CDG) and deglycosylation (CDDG) are rare pediatric ...disorders with symptoms that range from mild to life threatening. A biological mechanism shared among CDG and CDDG as well as more common neurodegenerative diseases such as Alzheimer's disease and amyotrophic lateral sclerosis, is endoplasmic reticulum (ER) stress. We developed isogenic human cellular models of two types of CDG and the only known CDDG to discover drugs that can alleviate ER stress. Systematic phenotyping confirmed ER stress and identified elevated autophagy among other phenotypes in each model. We screened 1049 compounds and scored their ability to correct aberrant morphology in each model using an agnostic cell-painting assay based on >300 cellular features. This primary screen identified multiple compounds able to correct morphological phenotypes. Independent validation shows they also correct cellular phenotypes and alleviate each of the ER stress markers identified in each model. Many of the active compounds are associated with microtubule dynamics, which points to new therapeutic opportunities for both rare and more common disorders presenting with ER stress, such as Alzheimer's disease and amyotrophic lateral sclerosis.
The COVID-19 pandemic demands assimilation of all available biomedical knowledge to decode its mechanisms of pathogenicity and transmission. Despite the recent renaissance in unsupervised neural ...networks for decoding unstructured natural languages, a platform for the real-time synthesis of the exponentially growing biomedical literature and its comprehensive triangulation with deep omic insights is not available. Here, we present the nferX platform for dynamic inference from over 45 quadrillion possible conceptual associations extracted from unstructured biomedical text, and their triangulation with Single Cell RNA-sequencing based insights from over 25 tissues. Using this platform, we identify intersections between the pathologic manifestations of COVID-19 and the comprehensive expression profile of the SARS-CoV-2 receptor ACE2. We find that tongue keratinocytes and olfactory epithelial cells are likely under-appreciated targets of SARS-CoV-2 infection, correlating with reported loss of sense of taste and smell as early indicators of COVID-19 infection, including in otherwise asymptomatic patients. Airway club cells, ciliated cells and type II pneumocytes in the lung, and enterocytes of the gut also express ACE2. This study demonstrates how a holistic data science platform can leverage unprecedented quantities of structured and unstructured publicly available data to accelerate the generation of impactful biological insights and hypotheses.