In response to various stimuli, vascular smooth muscle cells (SMCs) can de-differentiate, proliferate and migrate in a process known as phenotypic modulation. However, the phenotype of modulated SMCs ...in vivo during atherosclerosis and the influence of this process on coronary artery disease (CAD) risk have not been clearly established. Using single-cell RNA sequencing, we comprehensively characterized the transcriptomic phenotype of modulated SMCs in vivo in atherosclerotic lesions of both mouse and human arteries and found that these cells transform into unique fibroblast-like cells, termed 'fibromyocytes', rather than into a classical macrophage phenotype. SMC-specific knockout of TCF21-a causal CAD gene-markedly inhibited SMC phenotypic modulation in mice, leading to the presence of fewer fibromyocytes within lesions as well as within the protective fibrous cap of the lesions. Moreover, TCF21 expression was strongly associated with SMC phenotypic modulation in diseased human coronary arteries, and higher levels of TCF21 expression were associated with decreased CAD risk in human CAD-relevant tissues. These results establish a protective role for both TCF21 and SMC phenotypic modulation in this disease.
Embryonic gene expression intricately reflects anatomical context, developmental stage, and cell type. To address whether the precise spatial origins of cardiac cells can be deduced solely from their ...transcriptional profiles, we established a genome-wide expression database from 118, 949, and 1,166 single murine heart cells at embryonic day 8.5 (e8.5), e9.5, and e10.5, respectively. We segregated these cells by type using unsupervised bioinformatics analysis and identified chamber-specific genes. Using a random forest algorithm, we reconstructed the spatial origin of single e9.5 and e10.5 cardiomyocytes with 92.0% ± 3.2% and 91.2% ± 2.8% accuracy, respectively (99.4% ± 1.0% and 99.1% ± 1.1% if a ±1 zone margin is permitted) and predicted the second heart field distribution of Isl-1-lineage descendants. When applied to Nkx2-5−/− cardiomyocytes from murine e9.5 hearts, we showed their transcriptional alteration and lack of ventricular phenotype. Our database and zone classification algorithm will enable the discovery of novel mechanisms in early cardiac development and disease.
Display omitted
•Single-cell RNA-seq uncovers chamber-specific genes in the embryonic mouse heart•Machine learning can infer anatomical context from single-cell transcriptional data•Nkx2-5−/− embryonic mouse cardiomyocytes lack a ventricular transcriptional profile•Embryonic ventricular myocardium display trabecular-compact expression gradients
Cardiogenesis is orchestrated by cell-type- and chamber-specific transcription. Li et al. collected 2,233 single-cell RNA-seq samples from embryonic mouse hearts. This data resource uncovers anatomical patterns of gene expression that enable the deduction of a single-cell sample's anatomical origin, providing insight into developmental perturbations in congenital heart defect models.
The role of long noncoding RNA (lncRNA) in adult hearts is unknown; also unclear is how lncRNA modulates nucleosome remodelling. An estimated 70% of mouse genes undergo antisense transcription, ...including myosin heavy chain 7 (Myh7), which encodes molecular motor proteins for heart contraction. Here we identify a cluster of lncRNA transcripts from Myh7 loci and demonstrate a new lncRNA-chromatin mechanism for heart failure. In mice, these transcripts, which we named myosin heavy-chain-associated RNA transcripts (Myheart, or Mhrt), are cardiac-specific and abundant in adult hearts. Pathological stress activates the Brg1-Hdac-Parp chromatin repressor complex to inhibit Mhrt transcription in the heart. Such stress-induced Mhrt repression is essential for cardiomyopathy to develop: restoring Mhrt to the pre-stress level protects the heart from hypertrophy and failure. Mhrt antagonizes the function of Brg1, a chromatin-remodelling factor that is activated by stress to trigger aberrant gene expression and cardiac myopathy. Mhrt prevents Brg1 from recognizing its genomic DNA targets, thus inhibiting chromatin targeting and gene regulation by Brg1. It does so by binding to the helicase domain of Brg1, a domain that is crucial for tethering Brg1 to chromatinized DNA targets. Brg1 helicase has dual nucleic-acid-binding specificities: it is capable of binding lncRNA (Mhrt) and chromatinized--but not naked--DNA. This dual-binding feature of helicase enables a competitive inhibition mechanism by which Mhrt sequesters Brg1 from its genomic DNA targets to prevent chromatin remodelling. A Mhrt-Brg1 feedback circuit is thus crucial for heart function. Human MHRT also originates from MYH7 loci and is repressed in various types of myopathic hearts, suggesting a conserved lncRNA mechanism in human cardiomyopathy. Our studies identify a cardioprotective lncRNA, define a new targeting mechanism for ATP-dependent chromatin-remodelling factors, and establish a new paradigm for lncRNA-chromatin interaction.
The challenge of linking intergenic mutations to target genes has limited molecular understanding of human diseases. Here we show that H3K27ac HiChIP generates high-resolution contact maps of active ...enhancers and target genes in rare primary human T cell subtypes and coronary artery smooth muscle cells. Differentiation of naive T cells into T helper 17 cells or regulatory T cells creates subtype-specific enhancer-promoter interactions, specifically at regions of shared DNA accessibility. These data provide a principled means of assigning molecular functions to autoimmune and cardiovascular disease risk variants, linking hundreds of noncoding variants to putative gene targets. Target genes identified with HiChIP are further supported by CRISPR interference and activation at linked enhancers, by the presence of expression quantitative trait loci, and by allele-specific enhancer loops in patient-derived primary cells. The majority of disease-associated enhancers contact genes beyond the nearest gene in the linear genome, leading to a fourfold increase in the number of potential target genes for autoimmune and cardiovascular diseases.
Variability in induced pluripotent stem cell (iPSC) lines remains a concern for disease modeling and regenerative medicine. We have used RNA-sequencing analysis and linear mixed models to examine the ...sources of gene expression variability in 317 human iPSC lines from 101 individuals. We found that ∼50% of genome-wide expression variability is explained by variation across individuals and identified a set of expression quantitative trait loci that contribute to this variation. These analyses coupled with allele-specific expression show that iPSCs retain a donor-specific gene expression pattern. Network, pathway, and key driver analyses showed that Polycomb targets contribute significantly to the non-genetic variability seen within and across individuals, highlighting this chromatin regulator as a likely source of reprogramming-based variability. Our findings therefore shed light on variation between iPSC lines and illustrate the potential for our dataset and other similar large-scale analyses to identify underlying drivers relevant to iPSC applications.
Display omitted
•Gene expression analysis characterizes 317 human iPSC lines from 101 individuals•eQTLs contribute significantly to a cross individual variation in iPSC lines•Polycomb target genes are a significant source of non-genetic variation•Predictive networks highlight candidate key drivers of differentiation efficiency
Using large-scale analyses of over 300 iPSC lines, Chang, Quertermous, Lemischka, and colleagues of the NHLBI NextGen consortium examine sources of gene expression variation between lines and illustrate how this approach can identify genetic and non-genetic drivers relevant to line variation with implications for iPSC characterization and disease modeling.
Interstitial fibrosis plays a key role in the development and progression of heart failure. Here, we show that an enzyme that crosslinks collagen-Lysyl oxidase-like 2 (Loxl2)-is essential for ...interstitial fibrosis and mechanical dysfunction of pathologically stressed hearts. In mice, cardiac stress activates fibroblasts to express and secrete Loxl2 into the interstitium, triggering fibrosis, systolic and diastolic dysfunction of stressed hearts. Antibody-mediated inhibition or genetic disruption of Loxl2 greatly reduces stress-induced cardiac fibrosis and chamber dilatation, improving systolic and diastolic functions. Loxl2 stimulates cardiac fibroblasts through PI3K/AKT to produce TGF-β2, promoting fibroblast-to-myofibroblast transformation; Loxl2 also acts downstream of TGF-β2 to stimulate myofibroblast migration. In diseased human hearts, LOXL2 is upregulated in cardiac interstitium; its levels correlate with collagen crosslinking and cardiac dysfunction. LOXL2 is also elevated in the serum of heart failure (HF) patients, correlating with other HF biomarkers, suggesting a conserved LOXL2-mediated mechanism of human HF.
Human-induced pluripotent stem cell-derived endothelial cells (iPSC-ECs) have risen as a useful tool in cardiovascular research, offering a wide gamut of translational and clinical applications. ...However, inefficiency of the currently available iPSC-EC differentiation protocol and underlying heterogeneity of derived iPSC-ECs remain as major limitations of iPSC-EC technology.
Here, we performed droplet-based single-cell RNA sequencing (scRNA-seq) of the human iPSCs after iPSC-EC differentiation. Droplet-based scRNA-seq enables analysis of thousands of cells in parallel, allowing comprehensive analysis of transcriptional heterogeneity.
Bona fide iPSC-EC cluster was identified by scRNA-seq, which expressed high levels of endothelial-specific genes. iPSC-ECs, sorted by CD144 antibody-conjugated magnetic sorting, exhibited standard endothelial morphology and function including tube formation, response to inflammatory signals, and production of NO. Nonendothelial cell populations resulting from the differentiation protocol were identified, which included immature cardiomyocytes, hepatic-like cells, and vascular smooth muscle cells. Furthermore, scRNA-seq analysis of purified iPSC-ECs revealed transcriptional heterogeneity with 4 major subpopulations, marked by robust enrichment of CLDN5, APLNR, GJA5, and ESM1 genes, respectively.
Massively parallel, droplet-based scRNA-seq allowed meticulous analysis of thousands of human iPSCs subjected to iPSC-EC differentiation. Results showed inefficiency of the differentiation technique, which can be improved with further studies based on identification of molecular signatures that inhibit expansion of nonendothelial cell types. Subtypes of bona fide human iPSC-ECs were also identified, allowing us to sort for iPSC-ECs with specific biological function and identity.
Human induced pluripotent stem cells (hiPSCs) generated by de-differentiation of adult somatic cells offer potential solutions for the ethical issues surrounding human embryonic stem cells (hESCs), ...as well as their immunologic rejection after cellular transplantation. However, although hiPSCs have been described as "embryonic stem cell-like", these cells have a distinct gene expression pattern compared to hESCs, making incomplete reprogramming a potential pitfall. It is unclear to what degree the difference in tissue of origin may contribute to these gene expression differences. To answer these important questions, a careful transcriptional profiling analysis is necessary to investigate the exact reprogramming state of hiPSCs, as well as analysis of the impression, if any, of the tissue of origin on the resulting hiPSCs. In this study, we compare the gene profiles of hiPSCs derived from fetal fibroblasts, neonatal fibroblasts, adipose stem cells, and keratinocytes to their corresponding donor cells and hESCs. Our analysis elucidates the overall degree of reprogramming within each hiPSC line, as well as the "distance" between each hiPSC line and its donor cell. We further identify genes that have a similar mode of regulation in hiPSCs and their corresponding donor cells compared to hESCs, allowing us to specify core sets of donor genes that continue to be expressed in each hiPSC line. We report that residual gene expression of the donor cell type contributes significantly to the differences among hiPSCs and hESCs, and adds to the incompleteness in reprogramming. Specifically, our analysis reveals that fetal fibroblast-derived hiPSCs are closer to hESCs, followed by adipose, neonatal fibroblast, and keratinocyte-derived hiPSCs.
Whole-genome sequencing (WGS) is increasingly applied in clinical medicine and is expected to uncover clinically significant findings regardless of sequencing indication.
To examine coverage and ...concordance of clinically relevant genetic variation provided by WGS technologies; to quantitate inherited disease risk and pharmacogenomic findings in WGS data and resources required for their discovery and interpretation; and to evaluate clinical action prompted by WGS findings.
An exploratory study of 12 adult participants recruited at Stanford University Medical Center who underwent WGS between November 2011 and March 2012. A multidisciplinary team reviewed all potentially reportable genetic findings. Five physicians proposed initial clinical follow-up based on the genetic findings.
Genome coverage and sequencing platform concordance in different categories of genetic disease risk, person-hours spent curating candidate disease-risk variants, interpretation agreement between trained curators and disease genetics databases, burden of inherited disease risk and pharmacogenomic findings, and burden and interrater agreement of proposed clinical follow-up.
Depending on sequencing platform, 10% to 19% of inherited disease genes were not covered to accepted standards for single nucleotide variant discovery. Genotype concordance was high for previously described single nucleotide genetic variants (99%-100%) but low for small insertion/deletion variants (53%-59%). Curation of 90 to 127 genetic variants in each participant required a median of 54 minutes (range, 5-223 minutes) per genetic variant, resulted in moderate classification agreement between professionals (Gross κ, 0.52; 95% CI, 0.40-0.64), and reclassified 69% of genetic variants cataloged as disease causing in mutation databases to variants of uncertain or lesser significance. Two to 6 personal disease-risk findings were discovered in each participant, including 1 frameshift deletion in the BRCA1 gene implicated in hereditary breast and ovarian cancer. Physician review of sequencing findings prompted consideration of a median of 1 to 3 initial diagnostic tests and referrals per participant, with fair interrater agreement about the suitability of WGS findings for clinical follow-up (Fleiss κ, 0.24; P < 001).
In this exploratory study of 12 volunteer adults, the use of WGS was associated with incomplete coverage of inherited disease genes, low reproducibility of detection of genetic variation with the highest potential clinical effects, and uncertainty about clinically reportable findings. In certain cases, WGS will identify clinically actionable genetic variants warranting early medical intervention. These issues should be considered when determining the role of WGS in clinical medicine.