Untargeted metabolomics aims to quantify the complete set of metabolites within a biological system, most commonly by liquid chromatography/mass spectrometry (LC/MS). Since nearly the inception of ...the field, compound identification has been widely recognized as the rate-limiting step of the experimental workflow. In spite of exponential increases in the size of metabolomic databases, which now contain experimental MS/MS spectra for over a half a million reference compounds, chemical structures still cannot be confidently assigned to many signals in a typical LC/MS dataset. The purpose of this Perspective is to consider why identification rates continue to be low in untargeted metabolomics. One rationalization is that many naturally occurring metabolites detected by LC/MS are true “novel” compounds that have yet to be incorporated into metabolomic databases. An alternative possibility, however, is that research data do not provide database matches because of informatic artifacts, chemical contaminants, and signal redundancies. Increasing evidence suggests that, for at least some sample types, many unidentifiable signals in untargeted metabolomics result from the latter rather than new compounds originating from the specimen being measured. The implications of these observations on chemical discovery in untargeted metabolomics are discussed.
In rodents, obesity and aging impair nicotinamide adenine dinucleotide (NAD
) biosynthesis, which contributes to metabolic dysfunction. Nicotinamide mononucleotide (NMN) availability is a ...rate-limiting factor in mammalian NAD
biosynthesis. We conducted a 10-week, randomized, placebo-controlled, double-blind trial to evaluate the effect of NMN supplementation on metabolic function in postmenopausal women with prediabetes who were overweight or obese. Insulin-stimulated glucose disposal, assessed by using the hyperinsulinemic-euglycemic clamp, and skeletal muscle insulin signaling phosphorylation of protein kinase AKT and mechanistic target of rapamycin (mTOR) increased after NMN supplementation but did not change after placebo treatment. NMN supplementation up-regulated the expression of platelet-derived growth factor receptor β and other genes related to muscle remodeling. These results demonstrate that NMN increases muscle insulin sensitivity, insulin signaling, and remodeling in women with prediabetes who are overweight or obese (clinicaltrial.gov NCT03151239).
Display omitted
•13C-13C J-couplings cause unwanted dephasing in C{N} REDOR experiments on 13C-spin clusters.•We show that a simple modification of an existing REDOR pulse sequence for clusters ...solves the problem.•Each 13C of the cluster has the same C{N} REDOR behavior as an isolated C-N pair.•We apply this new sequence to examine labeling of lyophilized whole kidney cells by L-13C5 -15N2-glutamine.
By using only half of the total evolution time for dephasing pulses, C{N} rotational-echo double resonance (REDOR) for clusters of 13C spins (RDX) results in the same universal REDOR behavior as observed for isolated 13C-15N pairs. RDX combines Hahn echoes with solid echoes to suppress interference from scalar J couplings. This is crucial for long evolution times. The modified version (which we call RDX24) makes RDX quantitative for 13C clusters. We apply this scheme to human embryonic kidney cells labeled in culture by L-13C5 -15N2-glutamine. We quantitatively characterize three separate nitrogen isotopic enrichments for: (i) the alpha nitrogens of glutamine residues in proteins (including the residues of the five amino acids synthesized from glutamine); (ii) the alpha nitrogens of the five amino-acid residues synthesized from glucose, together with those of the nine essential amino acids added to the growth medium; and (iii) the side-chain nitrogens of glutamine (and of asparagine derived from glutamine).
There is an urgent need to identify which COVID-19 patients will develop life-threatening illness so that medical resources can be optimally allocated and rapid treatment can be administered early in ...the disease course, when clinical management is most effective. To aid in the prognostic classification of disease severity, we perform untargeted metabolomics on plasma from 339 patients, with samples collected at six longitudinal time points. Using the temporal metabolic profiles and machine learning, we build a predictive model of disease severity. We discover that a panel of metabolites measured at the time of study entry successfully determines disease severity. Through analysis of longitudinal samples, we confirm that most of these markers are directly related to disease progression and that their levels return to baseline upon disease recovery. Finally, we validate that these metabolites are also altered in a hamster model of COVID-19.
Display omitted
Longitudinal profiling of plasma during COVID-19 reveals dynamic metabolic changesDecreases in LPC and PC lipids early in the disease course predict severe COVID-19Plasma LPCs and PCs are decreased in hamsters infected with SARS-CoV-2
Sindelar et al. combine untargeted metabolomics analysis of human plasma and machine learning to construct a model that predicts COVID-19 disease severity from the levels of 22 prognostic metabolites. The authors show that these metabolites change early in the disease course and are ultimately restored to control levels upon recovery.
The thousands of features commonly observed when performing untargeted metabolomics with quadrupole time-of-flight (QTOF) and Orbitrap mass spectrometers often correspond to only a few hundred unique ...metabolites of biological origin, which is in the range of what can be assayed in a single targeted metabolomics experiment by using a triple quadrupole (QqQ) mass spectrometer. A major benefit of performing targeted metabolomics with QqQ mass spectrometry is the affordability of the instruments relative to high-resolution QTOF and Orbitrap platforms. Optimizing targeted methods to profile hundreds of metabolites on a QqQ mass spectrometer, however, has historically been limited by the availability of authentic standards, particularly for "unknowns" that have yet to be structurally identified. Here, we report a strategy to develop multiple reaction monitoring (MRM) methods for QqQ instruments on the basis of high-resolution spectra, thereby enabling us to use data from untargeted metabolomics to design targeted experiments without the need for authentic standards. We demonstrate that using high-resolution fragmentation data alone to design MRM methods results in the same quantitative performance as when methods are optimized by measuring authentic standards on QqQ instruments, as is conventionally done. The approach was validated by showing that Orbitrap ID-X data can be used to establish MRM methods on a Thermo TSQ Altis and two Agilent QqQs for hundreds of metabolites, including unknowns, without a dependence on standards. Finally, we highlight an application where metabolite profiling was performed on an ID-X and a QqQ by using the strategy introduced here, with both data sets yielding the same result. The described approach therefore allows us to use QqQ instruments, which are often associated with targeted metabolomics, to profile knowns and unknowns at a comprehensive scale that is typical of untargeted metabolomics.
Abstract
N
1
-methyladenosine (m
1
A) was proposed to be a highly prevalent modification in mRNA 5’UTRs based on mapping studies using an m
1
A-binding antibody. We developed a bioinformatic approach ...to discover m
1
A and other modifications in mRNA throughout the transcriptome by analyzing preexisting ultra-deep RNA-Seq data for modification-induced misincorporations. Using this approach, we detected appreciable levels of m
1
A only in one mRNA: the mitochondrial
MT-ND5
transcript. As an alternative approach, we also developed an antibody-based m
1
A-mapping approach to detect m
1
A at single-nucleotide resolution, and confirmed that the commonly used m
1
A antibody maps sites to the transcription-start site in mRNA 5’UTRs. However, further analysis revealed that these were false-positives caused by binding of the antibody to the m
7
G-cap. A different m
1
A antibody that lacks cap-binding cross-reactivity does not show enriched binding in 5’UTRs. These results demonstrate that high-stoichiometry m
1
A sites are exceedingly rare in mRNAs and that previous mappings of m
1
A to 5’UTRs were the result of antibody cross-reactivity to the 5’ cap.
Chimeric MS/MS spectra contain fragments from multiple precursor ions and therefore hinder compound identification in metabolomics. Historically, deconvolution of these chimeric spectra has been ...challenging and relied on specific experimental methods that introduce variation in the ratios of precursor ions between multiple tandem mass spectrometry (MS/MS) scans. DecoID provides a complementary, method-independent approach where database spectra are computationally mixed to match an experimentally acquired spectrum by using LASSO regression. We validated that DecoID increases the number of identified metabolites in MS/MS datasets from both data-independent and data-dependent acquisition without increasing the false discovery rate. We applied DecoID to publicly available data from the MetaboLights repository and to data from human plasma, where DecoID increased the number of identified metabolites from data-dependent acquisition data by over 30% compared to direct spectral matching. DecoID is compatible with any user-defined MS/MS database and provides automated searching for some of the largest MS/MS databases currently available.
When using liquid chromatography/mass spectrometry (LC/MS) to perform untargeted metabolomics, it is common to detect thousands of features from a biological extract. Although it is impractical to ...collect non-chimeric MS/MS data for each in a single chromatographic run, this is generally unnecessary because most features do not correspond to unique metabolites of biological relevance. Here we show that relatively simple data-processing strategies that can be applied on the fly during acquisition of data with an Orbitrap ID-X, such as blank subtraction and well-established adduct or isotope calculations, decrease the number of features to target for MS/MS analysis by up to an order of magnitude for various types of biological matrices. We demonstrate that annotating these non-biological contaminants and redundancies in real time during data acquisition enables comprehensive MS/MS data to be acquired on each remaining feature at a single collision energy. To ensure that an appropriate collision energy is applied, we introduce a method using a series of hidden ion-trap scans in an Orbitrap ID-X to find an optimal value for each feature that can then be applied in a subsequent high-resolution Orbitrap scan. Data from 100 metabolite standards indicate that this real-time optimization of collision energies leads to more informative MS/MS patterns compared to using a single fixed collision energy alone. As a benchmark to evaluate the overall workflow, we manually annotated unique biological features by independently subjecting E. coli samples to a credentialing analysis. While credentialing led to a more rigorous reduction in feature number, on-the-fly annotation with blank subtraction on an Orbitrap ID-X did not inappropriately discard unique biological metabolites. Taken together, our results reveal that optimal fragmentation data can be obtained in a single LC/MS/MS run for >90% of the unique biological metabolites in a sample when features are annotated during acquisition and collision energies are selected by using parallel mass spectrometry detection.
Display omitted
•Blank subtraction does not unfaithfully remove credentialed features.•Filtering common redundancies and contaminants reduces the MS/MS burden up to 90%.•Collision energies can be optimized in real time with dual MS detection.•Optimal MS/MS data for >90% of credentialed metabolites can be acquired in one run.
The success of precision medicine relies upon collecting data from many individuals at the population level. Although advancing technologies have made such large-scale studies increasingly feasible ...in some disciplines such as genomics, the standard workflows currently implemented in untargeted metabolomics were developed for small sample numbers and are limited by the processing of liquid chromatography/mass spectrometry data. Here we present an untargeted metabolomics workflow that is designed to support large-scale projects with thousands of biospecimens. Our strategy is to first evaluate a reference sample created by pooling aliquots of biospecimens from the cohort. The reference sample captures the chemical complexity of the biological matrix in a small number of analytical runs, which can subsequently be processed with conventional software such as XCMS. Although this generates thousands of so-called features, most do not correspond to unique compounds from the samples and can be filtered with established informatics tools. The features remaining represent a comprehensive set of biologically relevant reference chemicals that can then be extracted from the entire cohort’s raw data on the basis of m/z values and retention times by using Skyline. To demonstrate applicability to large cohorts, we evaluated >2000 human plasma samples with our workflow. We focused our analysis on 360 identified compounds, but we also profiled >3000 unknowns from the plasma samples. As part of our workflow, we tested 14 different computational approaches for batch correction and found that a random forest-based approach outperformed the others. The corrected data revealed distinct profiles that were associated with the geographic location of participants.
Small-molecule drugs and toxicants commonly interact with more than a single protein target, each of which may have unique effects on cellular phenotype. Although untargeted metabolomics is often ...applied to understand the mode of action of these chemicals, simple pairwise comparisons of treated and untreated samples are insufficient to resolve the effects of disrupting two or more independent protein targets. Here, we introduce a workflow for dose-response metabolomics to evaluate chemicals that potentially affect multiple proteins with different potencies. Our approach relies on treating samples with various concentrations of compound prior to analysis with mass spectrometry-based metabolomics. Data are then processed with software we developed called TOXcms, which statistically evaluates dose-response trends for each metabolomic signal according to user-defined tolerances and subsequently groups those that follow the same pattern. Although TOXcms was built upon the XCMS framework, it is compatible with any metabolomic data-processing software. Additionally, to enable correlation of dose responses beyond those that can be measured by metabolomics, TOXcms also accepts data from respirometry, cell death assays, other omic platforms, etc. In this work, we primarily focus on applying dose-response metabolomics to find off-target effects of drugs. Using metformin and etomoxir as examples, we demonstrate that each group of dose-response patterns identified by TOXcms signifies a metabolic response to a different protein target with a unique drug binding affinity. TOXcms is freely available on our laboratory website at http://pattilab.wustl.edu/software/toxcms.