Copy Number Variation Macé, Aurélien; Kutalik, Zoltán; Valsesia, Armand
Methods in molecular biology (Clifton, N.J.),
01/2018, Letnik:
1793
Journal Article
Differences between genomes can be due to single nucleotide variants (SNPs), translocations, inversions and copy number variants (CNVs, gain or loss of DNA). The latter can range from sub-microscopic ...events to complete chromosomal aneuploidies. Small CNVs are often benign but those larger than 250 kb are strongly associated with morbid consequences such as developmental disorders and cancer. Detecting CNVs within and between populations is essential to better understand the plasticity of our genome and to elucidate its possible contribution to disease or phenotypic traits.While the link between SNPs and disease susceptibility has been well studied, to date there are still very few published CNV genome-wide association studies; probably owing to the fact that CNV analysis remains a slightly more complex task than SNP analysis (both in term of bioinformatics workflow and uncertainty in the CNV calling leading to high false positive rates and unknown false negative rates). This chapter aims at explaining computational methods for the analysis of CNVs, ranging from study design, data processing and quality control, up to genome-wide association study with clinical traits.
We established a robust capillary-flow data-independent acquisition MS platform capable of measuring 31 plasma proteomes per day without the need of repeated acquisition of the same sample. We ...acquired 1508 samples of the DiOGenes study (multicentered, Europa-wide caloric restriction weight loss and maintenance study of overweight and obese, non-diabetic participants). This was achieved using a single analytical column. Comprehensive biological reactions to weight loss and maintenance were observed.
Display omitted
Highlights
•Robust capillary-flow DIA was established at 31 clinical samples per day.•1508 samples of dietary intervention study DiOGenes were measured on a single column.•Comprehensive biological reactions to weight loss and maintenance were observed.•Comparison to independent studies shows high reproducibility of potential biomarkers.
Comprehensive, high throughput analysis of the plasma proteome has the potential to enable holistic analysis of the health state of an individual. Based on our own experience and the evaluation of recent large-scale plasma mass spectrometry (MS) based proteomic studies, we identified two outstanding challenges: slow and delicate nano-flow liquid chromatography (LC) and irreproducibility of identification of data-dependent acquisition (DDA). We determined an optimal solution reducing these limitations with robust capillary-flow data-independent acquisition (DIA) MS. This platform can measure 31 plasma proteomes per day. Using this setup, we acquired a large-scale plasma study of the diet, obesity and genes dietary (DiOGenes) comprising 1508 samples. Proving the robustness, the complete acquisition was achieved on a single analytical column. Totally, 565 proteins (459 identified with two or more peptide sequences) were profiled with 74% data set completeness. On average 408 proteins (5246 peptides) were identified per acquisition (319 proteins in 90% of all acquisitions). The workflow reproducibility was assessed using 34 quality control pools acquired at regular intervals, resulting in 92% data set completeness with CVs for protein measurements of 10.9%.
The profiles of 20 apolipoproteins could be profiled revealing distinct changes. The weight loss and weight maintenance resulted in sustained effects on low-grade inflammation, as well as steroid hormone and lipid metabolism, indicating beneficial effects. Comparison to other large-scale plasma weight loss studies demonstrated high robustness and quality of biomarker candidates identified. Tracking of nonenzymatic glycation indicated a delayed, slight reduction of glycation in the weight maintenance phase. Using stable-isotope-references, we could directly and absolutely quantify 60 proteins in the DIA.
In conclusion, we present herein the first large-scale plasma DIA study and one of the largest clinical research proteomic studies to date. Application of this fast and robust workflow has great potential to advance biomarker discovery in plasma.
We performed exome sequencing to detect somatic mutations in protein-coding regions in seven melanoma cell lines and donor-matched germline cells. All melanoma samples had high numbers of somatic ...mutations, which showed the hallmark of UV-induced DNA repair. Such a hallmark was absent in tumor sample-specific mutations in two metastases derived from the same individual. Two melanomas with non-canonical BRAF mutations harbored gain-of-function MAP2K1 and MAP2K2 (MEK1 and MEK2, respectively) mutations, resulting in constitutive ERK phosphorylation and higher resistance to MEK inhibitors. Screening a larger cohort of individuals with melanoma revealed the presence of recurring somatic MAP2K1 and MAP2K2 mutations, which occurred at an overall frequency of 8%. Furthermore, missense and nonsense somatic mutations were frequently found in three candidate melanoma genes, FAT4, LRP1B and DSC1.
We evaluated 25 protocol variants of 14 independent computational methods for exon identification, transcript reconstruction and expression-level quantification from RNA-seq data. Our results show ...that most algorithms are able to identify discrete transcript components with high success rates but that assembly of complete isoform structures poses a major challenge even when all constituent elements are identified. Expression-level estimates also varied widely across methods, even when based on similar transcript models. Consequently, the complexity of higher eukaryotic genomes imposes severe limitations on transcript recall and splice product discrimination that are likely to remain limiting factors for the analysis of current-generation RNA-seq data.
An aim of weight loss is to reduce the risk of type 2 diabetes (T2D) in obese subjects. However, the relation with long-term glycemic improvement remains unknown.
We evaluated the changes in lipid ...composition during weight loss and their association with long-term glycemic improvement.
We investigated the plasma lipidome of 383 obese, nondiabetic patients within a randomized, controlled dietary intervention in 8 European countries at baseline, after an 8-wk low-caloric diet (LCD) (800-1000 kcal/d), and after 6 mo of weight maintenance.
After weight loss, a lipid signature identified 2 groups of patients who were comparable at baseline but who differed in their capacities to lose weight and improve glycemic control. Six months after the LCD, one group had significant glycemic improvement homeostasis model assessment of insulin resistance (HOMA-IR) mean change: -0.92; 95% CI: -1.17, -0.67). The other group showed no improvement in glycemic control (HOMA-IR mean change: -0.26; 95% CI: -0.64, 0.13). These differences were sustained for ≥1 y after the LCD. The same conclusions were obtained with other endpoints (Matsuda index and fasting insulin and glucose concentrations). Significant differences between the 2 groups were shown in leptin gene expression in adipose tissue biopsies. Significant differences were also observed in weight-related endpoints (body mass index, weight, and fat mass). The lipid signature allowed prediction of which subjects would be considered to be insulin resistant after 6 mo of weight maintenance validation's receiver operating characteristic (ROC) area under the curve (AUC): 71%; 95% CI: 62%, 81%. This model outperformed a clinical data-only model (validation's ROC AUC: 61%; 95% CI: 50%, 71%; P = 0.01).
In this study, we report a lipid signature of LCD success (for weight and glycemic outcome) in obese, nondiabetic patients. Lipid changes during an 8-wk LCD allowed us to predict insulin-resistant patients after 6 mo of weight maintenance. The determination of the lipid composition during an LCD enables the identification of nonresponders and may help clinicians manage metabolic outcomes with further intervention, thereby improving the long-term outcome and preventing T2D. This trial was registered at clinicaltrials.gov as NCT00390637.
Weight loss in obese individuals aims to reduce the risk of type 2 diabetes by improving glycemic control. Yet, significant intersubject variability is observed and the outcomes remain poorly ...predictable.
The aim of the study was to predict whether an individual will show improvements in insulin sensitivity above or below the median population change at 6 mo after a low-calorie-diet (LCD) intervention.
With the use of plasma lipidomics and metabolomics for 433 subjects from the Diet, Obesity, and Genes (DiOGenes) Study, we attempted to predict good or poor Matsuda index improvements 6 mo after an 8-wk LCD intervention (800 kcal/d). Three independent analysis groups were defined: “training” (n = 119) for model construction, “testing” (n = 162) for model comparison, and “validation” (n = 152) to validate the final model.
Initial modeling with baseline clinical variables (body mass index, Matsuda index, total lipid concentrations, sex, age) showed limited performance area under the curve (AUC) on the “testing dataset” = 0.69; 95% CI: 0.61, 0.77. Significantly better performance was achieved with an omics model based on 27 variables (AUC = 0.77; 95% CI: 0.70, 0.85; P = 0.0297). This model could be greatly simplified while keeping the same performance. The simplified model relied on baseline Matsuda index, proline, and phosphatidylcholine 0-34:1. It successfully replicated on the validation set (AUC = 0.75; 95% CI: 0.67, 0.83) with the following characteristics: specificity = 0.73, sensitivity = 0.68, negative predictive value = 0.60, and positive predictive value = 0.80. Marginally lower performance was obtained when replacing the Matsuda index with homeostasis model assessment of insulin resistance (AUC = 0.72; 95% CI: 0.64, 0.80; P = 0.08).
Our study proposes a model to predict insulin sensitivity improvements, 6 mo after LCD completion in a large population of overweight or obese nondiabetic subjects. It relies on baseline information from 3 variables, accessible from blood samples. This model may help clinicians assessing the large variability in dietary interventions and predict outcomes before an intervention. This trial was registered at www.clinicaltrials.gov as NCT00390637.
The adipose tissue (AT) is a secretory organ producing a wide variety of factors that participate in the genesis of metabolic disorders linked to excess fat mass. Weight loss improves obesity-related ...disorders.
Transcriptomic studies on human AT, and a combination of analyses of transcriptome and proteome profiling of conditioned media from adipocytes and stromal cells isolated from human AT, have led to the identification of apolipoprotein M (apoM) as a putative adipokine. We aimed to validate apoM as novel adipokine, investigate the relation of AT APOM expression with metabolic syndrome and insulin sensitivity, and study the regulation of its expression in AT and secretion during calorie restriction–induced weight loss.
We examined APOM mRNA level and secretion in AT from 485 individuals enrolled in 5 independent clinical trials, and in vitro in human multipotent adipose-derived stem cell adipocytes. APOM expression and secretion were measured during dieting.
APOM was expressed in human subcutaneous and visceral AT, mainly by adipocytes. ApoM was released into circulation from AT, and plasma apoM concentrations correlate with AT APOM mRNA levels. In AT, APOM expression inversely correlated with adipocyte size, was lower in obese compared to lean individuals, and reduced in subjects with metabolic syndrome and type 2 diabetes. Regardless of fat depot, there was a positive relation between AT APOM expression and systemic insulin sensitivity, independently of fat mass and plasma HDL cholesterol. In human multipotent adipose-derived stem cell adipocytes, APOM expression was enhanced by insulin-sensitizing peroxisome proliferator–activated receptor agonists and inhibited by tumor necrosis factor α, a cytokine that causes insulin resistance. In obese individuals, calorie restriction increased AT APOM expression and secretion.
ApoM is a novel adipokine, the expression of which is a hallmark of healthy AT and is upregulated by calorie restriction. AT apoM deserves further investigation as a potential biomarker of risk for diabetes and cardiovascular diseases.
A low-calorie diet (LCD) reduces fat mass excess, improves insulin sensitivity, and alters adipose tissue (AT) gene expression, yet the relation with clinical outcomes remains unclear.
We evaluated ...AT transcriptome alterations during an LCD and the association with weight and glycemic outcomes both at LCD termination and 6 mo after the LCD.
Using RNA sequencing (RNAseq), we analyzed transcriptome changes in AT from 191 obese, nondiabetic patients within a multicenter, controlled dietary intervention. Expression changes were associated with outcomes after an 8-wk LCD (800–1000 kcal/d) and 6 mo after the LCD. Results were validated by using quantitative reverse transcriptase-polymerase chain reaction in 350 subjects from the same cohort. Statistical models were constructed to classify weight maintainers or glycemic improvers.
With RNAseq analyses, we identified 1173 genes that were differentially expressed after the LCD, of which 350 and 33 were associated with changes in body mass index (BMI; in kg/m2) and Matsuda index values, respectively, whereas 29 genes were associated with both endpoints. Pathway analyses highlighted enrichment in lipid and glucose metabolism. Classification models were constructed to identify weight maintainers. A model based on clinical baseline variables could not achieve any classification (validation AUC: 0.50; 95% CI: 0.36, 0.64). However, clinical changes during the LCD yielded better performance of the model (AUC: 0.73; 95% CI: 0.60, 0.87). Adding baseline expression to this model improved the performance significantly (AUC: 0.87; 95% CI: 0.77, 0.96; Delong's P = 0.012). Similar analyses were performed to classify subjects with good glycemic improvements. Baseline- and LCD-based clinical models yielded similar performance (best AUC: 0.73; 95% CI: 0.60, 0.86). The addition of expression changes during the LCD improved the performance substantially (AUC: 0.80; 95% CI: 0.69, 0.92; P = 0.058).
This study investigated AT transcriptome alterations after an LCD in a large cohort of obese, nondiabetic patients. Gene expression combined with clinical variables enabled us to distinguish weight and glycemic responders from nonresponders. These potential biomarkers may help clinicians understand intersubject variability and better predict the success of dietary interventions. This trial was registered at clinicaltrials.gov as NCT00390637.
The overall impact of proteomics on clinical research and its translation has lagged behind expectations. One recognized caveat is the limited size (subject numbers) of (pre)clinical studies ...performed at the discovery stage, the findings of which fail to be replicated in larger verification/validation trials. Compromised study designs and insufficient statistical power are consequences of the to-date still limited capacity of mass spectrometry (MS)-based workflows to handle large numbers of samples in a realistic time frame, while delivering comprehensive proteome coverages. We developed a highly automated proteomic biomarker discovery workflow. Herein, we have applied this approach to analyze 1000 plasma samples from the multicentered human dietary intervention study “DiOGenes”. Study design, sample randomization, tracking, and logistics were the foundations of our large-scale study. We checked the quality of the MS data and provided descriptive statistics. The data set was interrogated for proteins with most stable expression levels in that set of plasma samples. We evaluated standard clinical variables that typically impact forthcoming results and assessed body mass index-associated and gender-specific proteins at two time points. We demonstrate that analyzing a large number of human plasma samples for biomarker discovery with MS using isobaric tagging is feasible, providing robust and consistent biological results.
To date, single-cell studies of human white adipose tissue (WAT) have been based on small cohort sizes and no cellular consensus nomenclature exists. Herein, we performed a comprehensive ...meta-analysis of publicly available and newly generated single-cell, single-nucleus, and spatial transcriptomic results from human subcutaneous, omental, and perivascular WAT. Our high-resolution map is built on data from ten studies and allowed us to robustly identify >60 subpopulations of adipocytes, fibroblast and adipogenic progenitors, vascular, and immune cells. Using these results, we deconvolved spatial and bulk transcriptomic data from nine additional cohorts to provide spatial and clinical dimensions to the map. This identified cell-cell interactions as well as relationships between specific cell subtypes and insulin resistance, dyslipidemia, adipocyte volume, and lipolysis upon long-term weight changes. Altogether, our meta-map provides a rich resource defining the cellular and microarchitectural landscape of human WAT and describes the associations between specific cell types and metabolic states.