The Trans-Omics for Precision Medicine (TOPMed) programme seeks to elucidate the genetic architecture and biology of heart, lung, blood and sleep disorders, with the ultimate goal of improving ...diagnosis, treatment and prevention of these diseases. The initial phases of the programme focused on whole-genome sequencing of individuals with rich phenotypic data and diverse backgrounds. Here we describe the TOPMed goals and design as well as the available resources and early insights obtained from the sequence data. The resources include a variant browser, a genotype imputation server, and genomic and phenotypic data that are available through dbGaP (Database of Genotypes and Phenotypes)
. In the first 53,831 TOPMed samples, we detected more than 400 million single-nucleotide and insertion or deletion variants after alignment with the reference genome. Additional previously undescribed variants were detected through assembly of unmapped reads and customized analysis in highly variable loci. Among the more than 400 million detected variants, 97% have frequencies of less than 1% and 46% are singletons that are present in only one individual (53% among unrelated individuals). These rare variants provide insights into mutational processes and recent human evolutionary history. The extensive catalogue of genetic variation in TOPMed studies provides unique opportunities for exploring the contributions of rare and noncoding sequence variants to phenotypic variation. Furthermore, combining TOPMed haplotypes with modern imputation methods improves the power and reach of genome-wide association studies to include variants down to a frequency of approximately 0.01%.
Age is the dominant risk factor for most chronic human diseases, but the mechanisms through which ageing confers this risk are largely unknown
. The age-related acquisition of somatic mutations that ...lead to clonal expansion in regenerating haematopoietic stem cell populations has recently been associated with both haematological cancer
and coronary heart disease
-this phenomenon is termed clonal haematopoiesis of indeterminate potential (CHIP)
. Simultaneous analyses of germline and somatic whole-genome sequences provide the opportunity to identify root causes of CHIP. Here we analyse high-coverage whole-genome sequences from 97,691 participants of diverse ancestries in the National Heart, Lung, and Blood Institute Trans-omics for Precision Medicine (TOPMed) programme, and identify 4,229 individuals with CHIP. We identify associations with blood cell, lipid and inflammatory traits that are specific to different CHIP driver genes. Association of a genome-wide set of germline genetic variants enabled the identification of three genetic loci associated with CHIP status, including one locus at TET2 that was specific to individuals of African ancestry. In silico-informed in vitro evaluation of the TET2 germline locus enabled the identification of a causal variant that disrupts a TET2 distal enhancer, resulting in increased self-renewal of haematopoietic stem cells. Overall, we observe that germline genetic variation shapes haematopoietic stem cell function, leading to CHIP through mechanisms that are specific to clonal haematopoiesis as well as shared mechanisms that lead to somatic mutations across tissues.
Analyses of data from genome-wide association studies on unrelated individuals have shown that, for human traits and diseases, approximately one-third to two-thirds of heritability is captured by ...common SNPs. However, it is not known whether the remaining heritability is due to the imperfect tagging of causal variants by common SNPs, in particular whether the causal variants are rare, or whether it is overestimated due to bias in inference from pedigree data. Here we estimated heritability for height and body mass index (BMI) from whole-genome sequence data on 25,465 unrelated individuals of European ancestry. The estimated heritability was 0.68 (standard error 0.10) for height and 0.30 (standard error 0.10) for body mass index. Low minor allele frequency variants in low linkage disequilibrium (LD) with neighboring variants were enriched for heritability, to a greater extent for protein-altering variants, consistent with negative selection. Our results imply that rare variants, in particular those in regions of low linkage disequilibrium, are a major source of the still missing heritability of complex traits and disease.
Angina pectoris is associated with adverse cardiovascular events. In this study, a Bi-directional Long Short-Term Memory (Bi-LSTM) prediction model with the Attention layer was established to explore ...the predictive value of the resting-state RR interval time series on the occurrence of angina pectoris. The data of this cohort study were from the Sleep Heart Health Study database, 2,977 people were included with the follow-up of 15 years. We used the RR interval time series of electrocardiogram signals in the resting state. The outcome variables were any angina events during the follow-up. We randomly divided 2,977 participants into training (
n
= 2680) and testing sets (
n
= 297) with a partition ratio of 9:1. The prediction model of Bi-LSTM with Attention layer was developed and the predictive performance was assessed. 1,236 had angina pectoris and 1,741 patients did not have angina pectoris during the follow-up period. The predictive performance of the Bi-LSTM model was great with the value of accuracy = 0.913, area under the curve (AUC) = 0.922, precision = 0.913 in the testing set. RR intervals may be the potential predictors of angina events. It is more and more convenient to obtain heart rate with the development of wearable devices; the Bi-LSTM prediction model established in this study is expected to provide support for the intelligent prediction of angina pectoris events.
Previous genome-wide association studies (GWASs) of stroke - the second leading cause of death worldwide - were conducted predominantly in populations of European ancestry
. Here, in cross-ancestry ...GWAS meta-analyses of 110,182 patients who have had a stroke (five ancestries, 33% non-European) and 1,503,898 control individuals, we identify association signals for stroke and its subtypes at 89 (61 new) independent loci: 60 in primary inverse-variance-weighted analyses and 29 in secondary meta-regression and multitrait analyses. On the basis of internal cross-ancestry validation and an independent follow-up in 89,084 additional cases of stroke (30% non-European) and 1,013,843 control individuals, 87% of the primary stroke risk loci and 60% of the secondary stroke risk loci were replicated (P < 0.05). Effect sizes were highly correlated across ancestries. Cross-ancestry fine-mapping, in silico mutagenesis analysis
, and transcriptome-wide and proteome-wide association analyses revealed putative causal genes (such as SH3PXD2A and FURIN) and variants (such as at GRK5 and NOS3). Using a three-pronged approach
, we provide genetic evidence for putative drug effects, highlighting F11, KLKB1, PROC, GP1BA, LAMC2 and VCAM1 as possible targets, with drugs already under investigation for stroke for F11 and PROC. A polygenic score integrating cross-ancestry and ancestry-specific stroke GWASs with vascular-risk factor GWASs (integrative polygenic scores) strongly predicted ischaemic stroke in populations of European, East Asian and African ancestry
. Stroke genetic risk scores were predictive of ischaemic stroke independent of clinical risk factors in 52,600 clinical-trial participants with cardiometabolic disease. Our results provide insights to inform biology, reveal potential drug targets and derive genetic risk prediction tools across ancestries.
Tobacco and alcohol use are heritable behaviours associated with 15% and 5.3% of worldwide deaths, respectively, due largely to broad increased risk for disease and injury
. These substances are used ...across the globe, yet genome-wide association studies have focused largely on individuals of European ancestries
. Here we leveraged global genetic diversity across 3.4 million individuals from four major clines of global ancestry (approximately 21% non-European) to power the discovery and fine-mapping of genomic loci associated with tobacco and alcohol use, to inform function of these loci via ancestry-aware transcriptome-wide association studies, and to evaluate the genetic architecture and predictive power of polygenic risk within and across populations. We found that increases in sample size and genetic diversity improved locus identification and fine-mapping resolution, and that a large majority of the 3,823 associated variants (from 2,143 loci) showed consistent effect sizes across ancestry dimensions. However, polygenic risk scores developed in one ancestry performed poorly in others, highlighting the continued need to increase sample sizes of diverse ancestries to realize any potential benefit of polygenic prediction.
Autozygosity, the proportion of the genome that is homozygous by descent, has been associated with variation in multiple health-related traits impacting evolutionary fitness. Autozygosity (FROH) is ...typically measured from runs of homozygosity (ROHs) that arise when identical-by-descent (IBD) haplotypes are inherited from each parent. Population isolates with a small set of common founders have elevated autozygosity relative to outbred populations.
In this study, we examined whether degree of autozygosity was associated with variation in 96 cardiometabolic traits among 7221 Old Order Amish individuals residing in Lancaster County, PA. We estimated the average length of an ROH segment to be 6350 KB, with each individual having on average 17.2 segments 1.5 KB or larger. Measurements of genome-wide and regional FROH were used as the primary predictors of trait variation in association analysis.
In genome-wide FROH analysis, we did not identify any associations that withstood Bonferroni-correction (p = 0.0005). However, on regional FROH analysis, we identified associations exceeding genome-wide thresholds for two traits: serum bilirubin levels, which were significantly associated with a region on chromosome 2 localized to a region surrounding UGT1A10 (p = 1 × 10- 43), and HbA1c levels, which were significantly associated with a region on chromosome 8 localized near CHRNB3 (p = 8 × 10- 10).
These analyses highlight the potential value of autozygosity mapping in founder populations.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Fine-mapping to plausible causal variation may be more effective in multi-ancestry cohorts, particularly in the MHC, which has population-specific structure. To enable such studies, we constructed a ...large (n = 21,546) HLA reference panel spanning five global populations based on whole-genome sequences. Despite population-specific long-range haplotypes, we demonstrated accurate imputation at G-group resolution (94.2%, 93.7%, 97.8% and 93.7% in admixed African (AA), East Asian (EAS), European (EUR) and Latino (LAT) populations). Applying HLA imputation to genome-wide association study data for HIV-1 viral load in three populations (EUR, AA and LAT), we obviated effects of previously reported associations from population-specific HIV studies and discovered a novel association at position 156 in HLA-B. We pinpointed the MHC association to three amino acid positions (97, 67 and 156) marking three consecutive pockets (C, B and D) within the HLA-B peptide-binding groove, explaining 12.9% of trait variance.
MicroRNAs (miRNAs) regulate gene expression and have a critical role in many biologic and pathologic processes. We hypothesized that miRNA expression profiles in injured brain (hippocampus) would ...show common as well as unique profiles when compared with those of blood. Adult, untouched, control rats were compared with rats with sham surgeries, ischemic strokes, brain hemorrhage (lysed blood, fresh blood, or thrombin), and kainate-induced seizures. Brain and whole-blood miRNA expression profiles were assessed 24 h later using TaqMan rodent miRNA arrays. MicroRNA response profiles were different for each condition. Many miRNAs changed more than 1.5-fold in brain and blood after each experimental manipulation, and several miRNAs were upregulated or downregulated in both brain and blood after a given injury. A few miRNAs (e.g., miR-298, miR-155, and miR-362-3p) were upregulated or downregulated more than twofold in both brain and blood after several different injuries. The results show the possible use of blood miRNAs as biomarkers for brain injury; that selected blood miRNAs may correlate with miRNA changes in the brain; and that many of the mRNAs, previously shown to be regulated in brain and blood after brain injury, are likely accounted for by changes in miRNA expression.
RATIONALE:Cardiac progenitor cells are an attractive cell type for tissue regeneration but their mechanism for myocardial remodeling is still unclear.
OBJECTIVE:This investigation determines how ...chronological age influences the phenotypic characteristics and the secretome of human cardiac progenitor cells (CPCs), as well as their potential to recover injured myocardium.
METHODS AND RESULTS:Adult (aCPCs) and neonatal (nCPCs) cells were derived from patients more than 40 years or less than one month of age, respectively, and their functional potential was determined in a rodent myocardial infarction (MI) model. A more robust in vitro proliferative capacity of nCPCs, compared to aCPCs, correlated with significantly greater myocardial recovery mediated by nCPCs in vivo. Strikingly, a single injection of nCPC-derived total conditioned media (nTCM) was significantly more effective than nCPCs, aCPC-derived TCM (aTCM), or nCPC-derived exosomes in recovering cardiac function, stimulating neovascularization, and promoting myocardial remodeling. High resolution accurate mass spectrometry (HRAMS) with reverse phase liquid chromatography fractionation and mass spectrometry (LC-MS/MS) was employed to identify proteins in the secretome of aCPCs and nCPCs, and literature-based networking software identified specific pathways affected by the secretome of CPCs in the setting of MI. Examining the TCM, we quantified changes in the expression pattern of 804 proteins in nTCM and 513 proteins in aTCM. Literature-based proteomic network analysis identified that 46 and 6 canonical signaling pathways were significantly targeted by nTCM and aTCM, respectively. One leading candidate pathway is heat shock factor-1 (HSF-1), potentially affecting 8 identified pathways for nTCM but none for aTCM. To validate this prediction, we demonstrated that modulation of HSF-1 by knockdown in nCPCs or overexpression in aCPCs significantly altered the quality of their secretome.
CONCLUSIONS:In conclusion, a deep proteomic analysis revealed both detailed and global mechanisms underlying the chronological age-based differences in the ability of CPCs to promote myocardial recovery via the components of their secretome.