Long non-coding RNAs (lncRNAs) are largely heterogeneous and functionally uncharacterized. Here, using FANTOM5 cap analysis of gene expression (CAGE) data, we integrate multiple transcript ...collections to generate a comprehensive atlas of 27,919 human lncRNA genes with high-confidence 5' ends and expression profiles across 1,829 samples from the major human primary cell types and tissues. Genomic and epigenomic classification of these lncRNAs reveals that most intergenic lncRNAs originate from enhancers rather than from promoters. Incorporating genetic and expression data, we show that lncRNAs overlapping trait-associated single nucleotide polymorphisms are specifically expressed in cell types relevant to the traits, implicating these lncRNAs in multiple diseases. We further demonstrate that lncRNAs overlapping expression quantitative trait loci (eQTL)-associated single nucleotide polymorphisms of messenger RNAs are co-expressed with the corresponding messenger RNAs, suggesting their potential roles in transcriptional regulation. Combining these findings with conservation data, we identify 19,175 potentially functional lncRNAs in the human genome.
Transdifferentiation, the process of converting from one cell type to another without going through a pluripotent state, has great promise for regenerative medicine. The identification of key ...transcription factors for reprogramming is currently limited by the cost of exhaustive experimental testing of plausible sets of factors, an approach that is inefficient and unscalable. Here we present a predictive system (Mogrify) that combines gene expression data with regulatory network information to predict the reprogramming factors necessary to induce cell conversion. We have applied Mogrify to 173 human cell types and 134 tissues, defining an atlas of cellular reprogramming. Mogrify correctly predicts the transcription factors used in known transdifferentiations. Furthermore, we validated two new transdifferentiations predicted by Mogrify. We provide a practical and efficient mechanism for systematically implementing novel cell conversions, facilitating the generalization of reprogramming of human cells. Predictions are made available to help rapidly further the field of cell conversion.
Mito-SEPs are small open reading frame-encoded peptides that localize to the mitochondria to regulate metabolism. Motivated by an intriguing negative association between mito-SEPs and inflammation, ...here we screen for mito-SEPs that modify inflammatory outcomes and report a mito-SEP named "Modulator of cytochrome C oxidase during Inflammation" (MOCCI) that is upregulated during inflammation and infection to promote host-protective resolution. MOCCI, a paralog of the NDUFA4 subunit of cytochrome C oxidase (Complex IV), replaces NDUFA4 in Complex IV during inflammation to lower mitochondrial membrane potential and reduce ROS production, leading to cyto-protection and dampened immune response. The MOCCI transcript also generates miR-147b, which targets the NDUFA4 mRNA with similar immune dampening effects as MOCCI, but simultaneously enhances RIG-I/MDA-5-mediated viral immunity. Our work uncovers a dual-component pleiotropic regulation of host inflammation and immunity by MOCCI (C15ORF48) for safeguarding the host during infection and inflammation.
Human pluripotent and trophoblast stem cells have been essential alternatives to blastocysts for understanding early human development
. However, these simple culture systems lack the complexity to ...adequately model the spatiotemporal cellular and molecular dynamics that occur during early embryonic development. Here we describe the reprogramming of fibroblasts into in vitro three-dimensional models of the human blastocyst, termed iBlastoids. Characterization of iBlastoids shows that they model the overall architecture of blastocysts, presenting an inner cell mass-like structure, with epiblast- and primitive endoderm-like cells, a blastocoel-like cavity and a trophectoderm-like outer layer of cells. Single-cell transcriptomics further confirmed the presence of epiblast-, primitive endoderm-, and trophectoderm-like cells. Moreover, iBlastoids can give rise to pluripotent and trophoblast stem cells and are capable of modelling, in vitro, several aspects of the early stage of implantation. In summary, we have developed a scalable and tractable system to model human blastocyst biology; we envision that this will facilitate the study of early human development and the effects of gene mutations and toxins during early embryogenesis, as well as aiding in the development of new therapies associated with in vitro fertilization.
Titin-truncating variants (TTNtv) commonly cause dilated cardiomyopathy (DCM). TTNtv are also encountered in ∼1% of the general population, where they may be silent, perhaps reflecting allelic ...factors. To better understand TTNtv, we integrated TTN allelic series, cardiac imaging and genomic data in humans and studied rat models with disparate TTNtv. In patients with DCM, TTNtv throughout titin were significantly associated with DCM. Ribosomal profiling in rat showed the translational footprint of premature stop codons in Ttn, TTNtv-position-independent nonsense-mediated degradation of the mutant allele and a signature of perturbed cardiac metabolism. Heart physiology in rats with TTNtv was unremarkable at baseline but became impaired during cardiac stress. In healthy humans, machine-learning-based analysis of high-resolution cardiac imaging showed TTNtv to be associated with eccentric cardiac remodeling. These data show that TTNtv have molecular and physiological effects on the heart across species, with a continuum of expressivity in health and disease.
Abstract The role of microglia cells in Alzheimer’s disease (AD) is well recognized, however their molecular and functional diversity remain unclear. Here, we isolated amyloid plaque-containing ...(using labelling with methoxy-XO4, XO4 + ) and non-containing (XO4 − ) microglia from an AD mouse model. Transcriptomics analysis identified different transcriptional trajectories in ageing and AD mice. XO4 + microglial transcriptomes demonstrated dysregulated expression of genes associated with late onset AD. We further showed that the transcriptional program associated with XO4 + microglia from mice is present in a subset of human microglia isolated from brains of individuals with AD. XO4 − microglia displayed transcriptional signatures associated with accelerated ageing and contained more intracellular post-synaptic material than XO4 + microglia, despite reduced active synaptosome phagocytosis. We identified HIF1α as potentially regulating synaptosome phagocytosis in vitro using primary human microglia, and BV2 mouse microglial cells. Together, these findings provide insight into molecular mechanisms underpinning the functional diversity of microglia in AD.
Profiling tumors at single-cell resolution provides an opportunity to understand complexities underpinning lymph-node metastases in head and neck squamous-cell carcinoma. Single-cell RNAseq ...(scRNAseq) analysis of cancer-cell trajectories identifies a subpopulation of pre-metastatic cells, driven by actionable pathways including AXL and AURK. Blocking these two proteins blunts tumor invasion in patient-derived cultures. Furthermore, scRNAseq analyses of tumor-infiltrating CD8 + T-lymphocytes show two distinct trajectories to T-cell dysfunction, corroborated by their clonal architecture based on single-cell T-cell receptor sequencing. By determining key modulators of these trajectories, followed by validation using external datasets and functional experiments, we uncover a role for SOX4 in mediating T-cell exhaustion. Finally, interactome analyses between pre-metastatic tumor cells and CD8 + T-lymphocytes uncover a putative role for the Midkine pathway in immune-modulation and this is confirmed by scRNAseq of tumors from humanized mice. Aside from specific findings, this study demonstrates the importance of tumor heterogeneity analyses in identifying key vulnerabilities during early metastasis.
The SUPERFAMILY resource provides protein domain assignments at the structural classification of protein (SCOP) superfamily level for over 1400 completely sequenced genomes, over 120 metagenomes and ...other gene collections such as UniProt. All models and assignments are available to browse and download at http://supfam.org. A new hidden Markov model library based on SCOP 1.75 has been created and a previously ignored class of SCOP, coiled coils, is now included. Our scoring component now uses HMMER3, which is in orders of magnitude faster and produces superior results. A cloud-based pipeline was implemented and is publicly available at Amazon web services elastic computer cloud. The SUPERFAMILY reference tree of life has been improved allowing the user to highlight a chosen superfamily, family or domain architecture on the tree of life. The most significant advance in SUPERFAMILY is that now it contains a domain-based gene ontology (GO) at the superfamily and family levels. A new methodology was developed to ensure a high quality GO annotation. The new methodology is general purpose and has been used to produce domain-based phenotypic ontologies in addition to GO.
Untranslated regions (UTRs) are important mediators of post-transcriptional regulation. The length of UTRs and the composition of regulatory elements within them are known to vary substantially ...across genes, but little is known about the reasons for this variation in humans. Here, we set out to determine whether this variation, specifically in 5'UTRs, correlates with gene dosage sensitivity.
We investigate 5'UTR length, the number of alternative transcription start sites, the potential for alternative splicing, the number and type of upstream open reading frames (uORFs) and the propensity of 5'UTRs to form secondary structures. We explore how these elements vary by gene tolerance to loss-of-function (LoF; using the LOEUF metric), and in genes where changes in dosage are known to cause disease. We show that LOEUF correlates with 5'UTR length and complexity. Genes that are most intolerant to LoF have longer 5'UTRs, greater TSS diversity, and more upstream regulatory elements than their LoF tolerant counterparts. We show that these differences are evident in disease gene-sets, but not in recessive developmental disorder genes where LoF of a single allele is tolerated.
Our results confirm the importance of post-transcriptional regulation through 5'UTRs in tight regulation of mRNA and protein levels, particularly for genes where changes in dosage are deleterious and lead to disease. Finally, to support gene-based investigation we release a web-based browser tool, VuTR, that supports exploration of the composition of individual 5'UTRs and the impact of genetic variation within them.
Disruptions in the ubiquitin protein ligase E3A (
) gene cause Angelman syndrome (AS). Whereas AS model mice have associated synaptic dysfunction and altered plasticity with abnormal behavior, ...whether similar or other mechanisms contribute to network hyperactivity and epilepsy susceptibility in AS patients remains unclear. Using human neurons and brain organoids, we demonstrate that UBE3A suppresses neuronal hyperexcitability via ubiquitin-mediated degradation of calcium- and voltage-dependent big potassium (BK) channels. We provide evidence that augmented BK channel activity manifests as increased intrinsic excitability in individual neurons and subsequent network synchronization. BK antagonists normalized neuronal excitability in both human and mouse neurons and ameliorated seizure susceptibility in an AS mouse model. Our findings suggest that BK channelopathy underlies epilepsy in AS and support the use of human cells to model human developmental diseases.