•No exhaustive experimental characterisation of an organism’s metabolome reported.•Deep metabolome annotation strategies based on multi-platform assays.•International task group formed to help ...coordinate metabolome annotation.•Entering new era of annotation focused on model organisms.
The metabolome describes the full complement of the tens to hundreds of thousands of low molecular weight metabolites present within a biological system. Identification of the metabolome is critical for discovering the maximum amount of biochemical knowledge from metabolomics datasets. Yet no exhaustive experimental characterisation of any organismal metabolome has been reported to date, dramatically contrasting with the genome sequencing of thousands of plants, animals and microbes. Here, we review the status of metabolome annotation and describe advances in the analytical methodologies being applied. In part through new international coordination, we conclude that we are now entering a new era of metabolome annotation.
Missing values in mass spectrometry metabolomic datasets occur widely and can originate from a number of sources, including for both technical and biological reasons. Currently, little is known about ...these data, i.e. about their distributions across datasets, the need (or not) to consider them in the data processing pipeline, and most importantly, the optimal way of assigning them values prior to univariate or multivariate data analysis. Here, we address all of these issues using direct infusion Fourier transform ion cyclotron resonance mass spectrometry data. We have shown that missing data are widespread, accounting for ca. 20% of data and affecting up to 80% of all variables, and that they do not occur randomly but rather as a function of signal intensity and mass-to-charge ratio. We have demonstrated that missing data estimation algorithms have a major effect on the outcome of data analysis when comparing the differences between biological sample groups, including by t test, ANOVA and principal component analysis. Furthermore, results varied significantly across the eight algorithms that we assessed for their ability to impute known, but labelled as missing, entries. Based on all of our findings we identified the k-nearest neighbour imputation method (
KNN
) as the optimal missing value estimation approach for our direct infusion mass spectrometry datasets. However, we believe the wider significance of this study is that it highlights the importance of missing metabolite levels in the data processing pipeline and offers an approach to identify optimal ways of treating missing data in metabolomics experiments.
Untargeted metabolomics is an established approach in toxicology for characterising endogenous metabolic responses to xenobiotic exposure. Detecting the xenobiotic and its biotransformation products ...as part of the metabolomics analysis provides an opportunity to simultaneously gain deep insights into its fate and metabolism, and to associate the internal relative dose directly with endogenous metabolic responses. This integration of untargeted exposure and response measurements into a single assay has yet to be fully demonstrated. Here we assemble a workflow to discover and analyse pharmaceutical-related measurements from routine untargeted UHPLC-MS metabolomics datasets, derived from in vivo (rat plasma and cardiac tissue, and human plasma) and in vitro (human cardiomyocytes) studies that were principally designed to investigate endogenous metabolic responses to drug exposure. Our findings clearly demonstrate how untargeted metabolomics can discover extensive biotransformation maps, temporally-changing relative systemic exposure, and direct associations of endogenous biochemical responses to the internal dose.
In metabolomics, tissues typically are extracted by grinding in liquid nitrogen followed by the stepwise addition of solvents. This is time-consuming and difficult to automate, and the multiple steps ...can introduce variability. Here we optimize tissue extraction methods compatible with high-throughput, reproducible nuclear magnetic resonance (NMR) spectroscopy- and mass spectrometry (MS)-based metabolomics. Previously, we concluded that methanol/chloroform/water extraction is preferable for metabolomics, and we further optimized this here using fish liver and an automated Precellys 24 bead-based homogenizer, allowing rapid extraction of multiple samples without carryover. We compared three solvent addition strategies: stepwise, two-step, and all solvents simultaneously. Then we evaluated strategies for improved partitioning of metabolites between solvent phases, including the addition of extra water and different partition times. Polar extracts were analyzed by NMR and principal components analysis, and the two-step approach was preferable based on lipid partitioning, reproducibility, yield, and throughput. Longer partitioning or extra water increased yield and decreased lipids in the polar phase but caused metabolic decay in these extracts. Overall, we conclude that the two-step method with extra water provides good quality data but that the two-step method with 10 min partitioning provides a more accurate snapshot of the metabolome. Finally, when validating the two-step strategy using NMR and MS metabolomics, we showed that technical variability was considerably smaller than biological variability.
Metabolomic and lipidomic studies measure and discover metabolic and lipid profiles in biological samples, enabling a better understanding of the metabolism of specific biological phenotypes. ...Accurate biological interpretations require high analytical reproducibility and sensitivity, and standardized and transparent data processing. Here we describe a complete workflow for nanoelectrospray ionization (nESI) direct-infusion mass spectrometry (DIMS) metabolomics and lipidomics. After metabolite and lipid extraction from tissues and biofluids, samples are directly infused into a high-resolution mass spectrometer (e.g., Orbitrap) using a chip-based nESI sample delivery system. nESI functions to minimize ionization suppression or enhancement effects as compared with standard electrospray ionization (ESI). Our analytical technique-named spectral stitching-measures data as several overlapping mass-to-charge (m/z) windows that are subsequently 'stitched' together, creating a complete mass spectrum. This considerably increases the dynamic range and detection sensitivity-about a fivefold increase in peak detection-as compared with the collection of DIMS data as a single wide mass-to-charge (m/z ratio) window. Data processing, statistical analysis and metabolite annotation are executed as a workflow within the user-friendly, transparent and freely available Galaxy platform (galaxyproject.org). Generated data have high mass accuracy that enables molecular formulae peak annotations. The workflow is compatible with any sample-extraction method; in this protocol, the examples are extracted using a biphasic method, with methanol, chloroform and water as the solvents. The complete workflow is reproducible, rapid and automated, which enables cost-effective analysis of >10,000 samples per year, making it ideal for high-throughput metabolomics and lipidomics screening-e.g., for clinical phenotyping, drug screening and toxicity testing.
Metabolomics is a widely used technology in academic research, yet its application to regulatory science has been limited. The most commonly cited barrier to its translation is lack of performance ...and reporting standards. The MEtabolomics standaRds Initiative in Toxicology (MERIT) project brings together international experts from multiple sectors to address this need. Here, we identify the most relevant applications for metabolomics in regulatory toxicology and develop best practice guidelines, performance and reporting standards for acquiring and analysing untargeted metabolomics and targeted metabolite data. We recommend that these guidelines are evaluated and implemented for several regulatory use cases.
Certain strains of the intracellular endosymbiont Wolbachia can strongly inhibit or block the transmission of viruses such as dengue virus (DENV) by Aedes mosquitoes, and the mechanisms responsible ...are still not well understood. Direct infusion and liquid chromatography-Fourier transform ion cyclotron resonance (FT-ICR) mass spectrometry-based lipidomics analyses were conducted using Aedes albopictus Aa23 cells that were infected with the wMel and wMelPop strains of Wolbachia in comparison to uninfected Aa23-T cells. Substantial shifts in the cellular lipid profile were apparent in the presence of Wolbachia Most significantly, almost all sphingolipid classes were depleted, and some reductions in diacylglycerols and phosphatidylcholines were also observed. These lipid classes have previously been shown to be selectively enriched in DENV-infected mosquito cells, suggesting that Wolbachia may produce a cellular lipid environment that is antagonistic to viral replication. The data improve our understanding of the intracellular interactions between Wolbachia and mosquitoes.
Mosquitoes transmit a variety of important viruses to humans, such as dengue virus and Zika virus. Certain strains of the intracellular bacterial genus called Wolbachia found in or introduced into mosquitoes can block the transmission of viruses, including dengue virus, but the mechanisms responsible are not well understood. We found substantial shifts in the cellular lipid profiles in the presence of these bacteria. Some lipid classes previously shown to be enriched in dengue virus-infected mosquito cells were depleted in the presence of Wolbachia, suggesting that Wolbachia may produce a cellular lipid environment that inhibits mosquito-borne viruses.
Physiological responses to temperature are known to be a major determinant of species distributions and can dictate the sensitivity of populations to global warming. In contrast, little is known ...about how other major global change drivers, such as ocean acidification (OA), will shape species distributions in the future. Here, by integrating population genetics with experimental data for growth and mineralization, physiology and metabolomics, we demonstrate that the sensitivity of populations of the gastropod Littorina littorea to future OA is shaped by regional adaptation. Individuals from populations towards the edges of the natural latitudinal range in the Northeast Atlantic exhibit greater shell dissolution and the inability to upregulate their metabolism when exposed to low pH, thus appearing most sensitive to low seawater pH. Our results suggest that future levels of OA could mediate temperature-driven shifts in species distributions, thereby influencing future biogeography and the functioning of marine ecosystems.
The high spectral congestion typically observed in one-dimensional (1D)
1H nuclear magnetic resonance (NMR) spectra of tissue extracts and biofluids limits the metabolic information that can be ...extracted. This study evaluates the application of two-dimensional J-resolved (JRES) spectroscopy for metabolomics, which can provide proton-decoupled projected 1D spectra (p-JRES). This approach is illustrated by an investigation of embryogenesis in Japanese medaka (
Oryzias latipes), an established fish model for developmental toxicology. When combined with optimized spectral pre-processing,
2
A copy of the MATLAB code for spectral pre-processing is available from the author.
2
including a 0.005-ppm bin width for data segmentation and a logarithmic transformation, the reduced congestion in the p-JRES spectra increases the likelihood that a specific metabolite can be accurately integrated and thus increases the extractable information content of the spectra. Principal components analysis of the p-JRES spectra reveals the concept of a developmental trajectory that summarizes the changes in the NMR-visible metabolome throughout medaka embryogenesis. Advantages and potential disadvantages of the p-JRES approach are discussed.