Triple-negative breast cancer (TNBC) is a heterogeneous disease with limited treatment options. To characterize TNBC heterogeneity, we defined transcriptional, epigenetic, and metabolic subtypes and ...subtype-driving super-enhancers and transcription factors by combining functional and molecular profiling with computational analyses. Single-cell RNA sequencing revealed relative homogeneity of the major transcriptional subtypes (luminal, basal, and mesenchymal) within samples. We found that mesenchymal TNBCs share features with mesenchymal neuroblastoma and rhabdoid tumors and that the PRRX1 transcription factor is a key driver of these tumors. PRRX1 is sufficient for inducing mesenchymal features in basal but not in luminal TNBC cells via reprogramming super-enhancer landscapes, but it is not required for mesenchymal state maintenance or for cellular viability. Our comprehensive, large-scale, multiplatform, multiomics study of both experimental and clinical TNBC is an important resource for the scientific and clinical research communities and opens venues for future investigation.
Display omitted
•Major transcriptional and super-enhancer subtypes of triple-negative breast cancer (TNBC)•Multiomics factor analysis of TNBC epigenetic, functional, and clinical heterogeneity•PRRX1 is a main transcriptional regulator of mesenchymal TNBC subtype
Jovanović et al. define the cellular, molecular, and functional heterogeneity of TNBC and integrated these using multiomics factor analysis into clinically relevant subtypes. Luminal, basal, and mesenchymal gene expression and super-enhancer subtypes do not match DNA methylation, histone modification, and metabolic patterns. PRRX1 is a central transcription factor in mesenchymal TNBC.
Several challenges remain in data-independent acquisition (DIA) data analysis, such as to confidently identify peptides, define integration boundaries, remove interferences, and control false ...discovery rates. In practice, a visual inspection of the signals is still required, which is impractical with large datasets. We present Avant-garde as a tool to refine DIA (and parallel reaction monitoring) data. Avant-garde uses a novel data-driven scoring strategy: signals are refined by learning from the dataset itself, using all measurements in all samples to achieve the best optimization. We evaluate the performance of Avant-garde using benchmark DIA datasets and show that it can determine the quantitative suitability of a peptide peak, and reach the same levels of selectivity, accuracy, and reproducibility as manual validation. Avant-garde is complementary to existing DIA analysis engines and aims to establish a strong foundation for subsequent analysis of quantitative mass spectrometry data.
Cyanobacteria, and the viruses (phages) that infect them, are significant contributors to the oceanic 'gene pool'. This pool is dynamic, and the transfer of genetic material between hosts and their ...phages probably influences the genetic and functional diversity of both. For example, photosynthesis genes of cyanobacterial origin have been found in phages that infect Prochlorococcus and Synechococcus, the numerically dominant phototrophs in ocean ecosystems. These genes include psbA, which encodes the photosystem II core reaction centre protein D1, and high-light-inducible (hli) genes. Here we show that phage psbA and hli genes are expressed during infection of Prochlorococcus and are co-transcribed with essential phage capsid genes, and that the amount of phage D1 protein increases steadily over the infective period. We also show that the expression of host photosynthesis genes declines over the course of infection and that replication of the phage genome is a function of photosynthesis. We thus propose that the phage genes are functional in photosynthesis and that they may be increasing phage fitness by supplementing the host production of these proteins.
Human gammaherpesviruses are associated with malignancies in HIV infected individuals; in macaques used in non-human primate models of HIV infection, gammaherpesvirus infections also occur. Limited ...data on prevalence and tumorigenicity of macaque gammaherpesviruses, mostly cross-sectional analyses of small series, are available. We comprehensively examine all three-rhesus macaque gammaherpesviruses -Rhesus rhadinovirus (RRV), Rhesus Lymphocryptovirus (RLCV) and Retroperitoneal Fibromatosis Herpesvirus (RFHV) in macaques experimentally infected with Simian Immunodeficiency Virus or Simian Human Immunodeficiency Virus (SIV/SHIV) in studies spanning 15 years at the AIDS and Cancer Virus Program of the Frederick National Laboratory for Cancer Research. We evaluated 18 animals with malignancies (16 lymphomas, one fibrosarcoma and one carcinoma) and 32 controls. We developed real time quantitative PCR assays for each gammaherpesvirus DNA viral load (VL) in malignant and non-tumor tissues; we also characterized the tumors using immunohistochemistry and in situ hybridization. Furthermore, we retrospectively quantified gammaherpesvirus DNA VL and SIV/SHIV RNA VL in longitudinally-collected PBMCs and plasma, respectively. One or more gammaherpesviruses were detected in 17 tumors; generally, one was predominant, and the relevant DNA VL in the tumor was very high compared to surrounding tissues. RLCV was predominant in tumors resembling diffuse large B cell lymphomas; in a Burkitt-like lymphoma, RRV was predominant; and in the fibrosarcoma, RFHV was predominant. Median RRV and RLCV PBMC DNA VL were significantly higher in cases than controls; SIV/SHIV VL and RLCV VL were independently associated with cancer. Local regressions showed that longitudinal VL patterns in cases and controls, from SIV infection to necropsy, differed for each gammaherpesvirus: while RFHV VL increased only slightly in all animals, RLCV and RRV VL increased significantly and continued to increase steeply in cases; in controls, VL flattened. In conclusion, the data suggest that gammaherpesviruses may play a significant role in tumorogenesis in macaques infected with immunodeficiency viruses.
A major unmet need in LC-MS/MS-based proteomics analyses is a set of tools for quantitative assessment of system performance and evaluation of technical variability. Here we describe 46 system ...performance metrics for monitoring chromatographic performance, electrospray source stability, MS1 and MS2 signals, dynamic sampling of ions for MS/MS, and peptide identification. Applied to data sets from replicate LC-MS/MS analyses, these metrics displayed consistent, reasonable responses to controlled perturbations. The metrics typically displayed variations less than 10% and thus can reveal even subtle differences in performance of system components. Analyses of data from interlaboratory studies conducted under a common standard operating procedure identified outlier data and provided clues to specific causes. Moreover, interlaboratory variation reflected by the metrics indicates which system components vary the most between laboratories. Application of these metrics enables rational, quantitative quality assessment for proteomics and other LC-MS/MS analytical applications.
•Quantitative mass spectrometry-based assay of histone modifications.•Robust sample preparation workflow with optimized chemistry.•Data analysis templates and spectral libraries for complete method ...replication provided.
Epigenetic control of genome function is an important regulatory mechanism in diverse processes such as lineage commitment and environmental sensing, and in disease etiologies ranging from neuropsychiatric disorders to cancer. Here we report a robust, high-throughput targeted, quantitative mass spectrometry (MS) method to rapidly profile modifications of the core histones of chromatin that compose the epigenetic landscape, enabling comparisons among cells with differing genetic backgrounds, genomic perturbations, and drug treatments.
Immunoaffinity enrichment of peptides coupled with analysis by stable isotope dilution multiple reaction mass spectrometry has been shown to have analytical performance and detection limits suitable ...for many biomarker verification studies and biological applications. Prior studies have shown that antipeptide antibodies can be multiplexed up to 50 in a single assay without significant loss of performance. Achieving higher multiplex levels is relevant to all studies involving precious biological material as this minimizes the amount of sample that must be consumed to measure a given set of analytes and reduces the assay cost per analyte. Here we developed automated methods employing the Agilent AssayMAP Bravo microchromatography platform and used these methods to characterize the performance of immunoaffinity enrichment of peptides up to multiplex levels of 172. Median capture efficiency for the target peptides remained high (88%) even at levels of 150-plex and declined to 70% at 172-plex compared to antibody performance observed at standard lower multiplex levels (n = 25). Subsequently, we developed and analytically characterized a multiplexed immuno-multiple reaction monitoring-mass spectrometry (immuno-MRM-MS) assay (n = 110) and applied it to measure candidate protein biomarkers of cardiovascular disease in plasma of patients undergoing planned myocardial infarction. The median lower limit of detection of all peptides was 71.5 amol/μL (nM), and the coefficient of variation (CV) was less than 15% at the lower limit of quantification. The results demonstrate that high multiplexed immuno-MRM-MS assays are readily achievable using the optimized sample processing and peptide capture methods described here.
Epigenetic dysregulation is an emerging hallmark of cancers. We developed a high-information-content mass spectrometry approach to profile global histone modifications in human cancers. When applied ...to 115 lines from the Cancer Cell Line Encyclopedia, this approach identified distinct molecular chromatin signatures. One signature was characterized by increased histone 3 lysine 36 (H3K36) dimethylation, exhibited by several lines harboring translocations in NSD2, which encodes a methyltransferase. A previously unknown NSD2 p.Glu1099Lys (p.E1099K) variant was identified in nontranslocated acute lymphoblastic leukemia (ALL) cell lines sharing this signature. Ectopic expression of the variant induced a chromatin signature characteristic of NSD2 hyperactivation and promoted transformation. NSD2 knockdown selectively inhibited the proliferation of NSD2-mutant lines and impaired the in vivo growth of an NSD2-mutant ALL xenograft. Sequencing analysis of >1,000 pediatric cancer genomes identified the NSD2 p.E1099K alteration in 14% of t(12;21) ETV6-RUNX1-containing ALLs. These findings identify NSD2 as a potential therapeutic target for pediatric ALL and provide a general framework for the functional annotation of cancer epigenomes.