The Rank Product (RP) is a statistical technique widely used to detect differentially expressed features in molecular profiling experiments such as transcriptomics, metabolomics and proteomics studies. An implementation of the RP and the closely related Rank Sum (RS) statistics has been available in the RankProd Bioconductor package for several years. However, several recent advances in the understanding of the statistical foundations of the method have made a complete refactoring of the existing package desirable.
We implemented a completely refactored version of the RankProd package, which provides a more principled implementation of the statistics for unpaired datasets. Moreover, the permutation-based P-value estimation methods have been replaced by exact methods, providing faster and more accurate results.
RankProd 2.0 is available at Bioconductor (https://www.bioconductor.org/packages/devel/bioc/html/RankProd.html) and as part of the mzMatch pipeline (http://www.mzmatch.sourceforge.net).
rainer.breitling@manchester.ac.uk.
Supplementary data are available at Bioinformatics online.
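As a rough illustration of the two statistics named above, the following Python sketch computes a rank product (the geometric mean of a feature's ranks across replicates) and a rank sum (here taken as the mean rank). It is not the RankProd package's implementation and performs no P-value estimation. The function names, the toy data, the choice to give rank 1 to the largest value, and the average-rank handling of ties are all assumptions made for the example.

```python
import math


def ranks(values, descending=True):
    """Rank values from 1..n; tied values share their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i], reverse=descending)
    result = [0.0] * len(values)
    pos = 0
    while pos < len(order):
        end = pos
        # Extend the block while the next value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[pos]]:
            end += 1
        avg = (pos + end) / 2 + 1
        for j in range(pos, end + 1):
            result[order[j]] = avg
        pos = end + 1
    return result


def rank_product(replicates):
    """Geometric mean of each feature's ranks across replicates."""
    k = len(replicates)
    per_rep = [ranks(rep) for rep in replicates]
    n = len(replicates[0])
    return [
        math.exp(sum(math.log(per_rep[r][g]) for r in range(k)) / k)
        for g in range(n)
    ]


def rank_sum(replicates):
    """Arithmetic mean of each feature's ranks across replicates."""
    k = len(replicates)
    per_rep = [ranks(rep) for rep in replicates]
    n = len(replicates[0])
    return [sum(per_rep[r][g] for r in range(k)) / k for g in range(n)]


# Hypothetical fold changes: two replicates, three features.
toy = [
    [2.0, 0.5, 1.0],
    [0.2, 0.1, 3.0],
]
rp = rank_product(toy)
rs = rank_sum(toy)
```

Features with a small rank product rank consistently near the top across replicates, which is the pattern the statistic is meant to pick out.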
The recent proliferation of high-resolution mass spectrometers has generated a wealth of new data analysis methods. However, flexible integration of these methods into configurations best suited to the research question is hampered by heterogeneous file formats and monolithic software development. The mzXML, mzData, and mzML file formats have enabled uniform access to unprocessed raw data. In this paper we present our efforts to produce an equally simple and powerful format, PeakML, to uniformly exchange processed intermediary and result data. To demonstrate the versatility of PeakML, we have developed an open source Java toolkit for processing, filtering, and annotating mass spectra in a customizable pipeline (mzMatch), as well as a user-friendly data visualization environment (PeakML Viewer). The PeakML format in particular enables the flexible exchange of processed data between software created by different groups or companies, as we illustrate by providing a PeakML-based integration of the widely used XCMS package with mzMatch data processing tools. As an added advantage, downstream analysis can benefit from direct access to the full mass trace information underlying summarized mass spectrometry results, providing the user with the means to rapidly verify results. The PeakML/mzMatch software is freely available at http://mzmatch.sourceforge.net, with documentation, tutorials, and a community forum.
Cancer cells exhibit several unique metabolic phenotypes that are critical for cell growth and proliferation. Specifically, they overexpress the M2 isoform of the tightly regulated enzyme pyruvate kinase (PKM2), which controls glycolytic flux, and are highly dependent on de novo biosynthesis of serine and glycine. Here we describe a new rheostat-like mechanistic relationship between PKM2 activity and serine biosynthesis. We show that serine can bind to and activate human PKM2, and that PKM2 activity in cells is reduced in response to serine deprivation. This reduction in PKM2 activity shifts cells to a fuel-efficient mode in which more pyruvate is diverted to the mitochondria and more glucose-derived carbon is channelled into serine biosynthesis to support cell proliferation.
An outstanding problem in the understanding of poxvirus biology is the molecular structure of the mature virion. Via deep learning methods combined with chemical cross-linking mass spectrometry, we have addressed the structure and assembly pathway of P4a, a key poxvirus virion core component.
With ever-increasing amounts of data produced by mass spectrometry (MS) proteomics and metabolomics, and the sheer volume of samples now analyzed, the need for a common open format possessing both file size efficiency and faster read/write speeds has become paramount to drive the next generation of data analysis pipelines. The Proteomics Standards Initiative (PSI) has established a clear and precise extensible markup language (XML) representation for data interchange, mzML, receiving substantial uptake; nevertheless, storage and file access efficiency has not been the main focus. We propose an HDF5 file format “mzMLb” that is optimized for both read/write speed and storage of the raw mass spectrometry data. We provide an extensive validation of the write speed, random read speed, and storage size, demonstrating a flexible format that with or without compression is faster than all existing approaches in virtually all cases, while with compression is comparable in size to proprietary vendor file formats. Since our approach uniquely preserves the XML encoding of the metadata, the format implicitly supports future versions of mzML and is straightforward to implement: mzMLb’s design adheres to both HDF5 and NetCDF4 standard implementations, which allows it to be easily utilized by third parties due to their widespread programming language support. A reference implementation within the established ProteoWizard toolkit is provided.
Many microbial secondary metabolites are of high biotechnological value for medicine, agriculture, and the food industry. Bacterial genome mining has revealed numerous novel secondary metabolite biosynthetic gene clusters, which encode the potential to synthesize a large diversity of compounds that have never been observed before. The stimulation or “awakening” of this cryptic microbial secondary metabolism has naturally attracted the attention of synthetic microbiologists, who exploit recent advances in DNA sequencing and synthesis to achieve unprecedented control over metabolic pathways. One of the indispensable tools in the synthetic biology toolbox is metabolomics, the global quantification of small biomolecules. This review illustrates the pivotal role of metabolomics for the synthetic microbiology of secondary metabolism, including its crucial role in novel compound discovery in microbes, the examination of side products of engineered metabolic pathways, as well as the identification of major bottlenecks for the overproduction of compounds of interest, especially in combination with metabolic modeling. We conclude by highlighting remaining challenges and recent technological advances that will drive metabolomics towards fulfilling its potential as a cornerstone technology of synthetic microbiology.
In this study, we evaluated the lipidome alterations caused by type 1 diabetes (T1D) and type 2 diabetes (T2D), by determining lipids significantly associated with diabetes overall and in both sexes, and lipids associated with the glycaemic state.
An untargeted lipidomic analysis was performed to measure the lipid profiles of 360 subjects (91 T1D, 91 T2D, 74 with prediabetes and 104 controls (CT)) without cardiovascular and/or chronic kidney disease. Ultra-high performance liquid chromatography-electrospray ionization mass spectrometry (UHPLC-ESI-MS) was conducted in two ion modes (positive and negative). We used multiple linear regression models to (1) assess the association between each lipid feature and each condition, (2) determine sex-specific differences related to diabetes, and (3) identify lipids associated with the glycaemic state by considering the prediabetes stage. The models were adjusted for sex, age, hypertension, dyslipidaemia, body mass index, glucose, smoking, systolic blood pressure, triglycerides, HDL cholesterol, LDL cholesterol, alternate Mediterranean diet score (aMED) and estimated glomerular filtration rate (eGFR); diabetes duration and glycated haemoglobin (HbA1c) were also included in the comparison between T1D and T2D.
A total of 54 unique lipid subspecies from 15 unique lipid classes were annotated. Lysophosphatidylcholines (LPC) and ceramides (Cer) showed opposite effects in subjects with T1D and subjects with T2D, LPCs being mainly up-regulated in T1D and down-regulated in T2D, and Cer being up-regulated in T2D and down-regulated in T1D. Phosphatidylcholines were also clearly down-regulated in subjects with T1D. Regarding sex-specific differences, ceramides and phosphatidylcholines exhibited important diabetes-associated differences due to sex. Concerning the glycaemic state, we found a gradual increase of a panel of 1-deoxyceramides from normoglycaemia to prediabetes to T2D.
Our findings revealed an extensive disruption of lipid metabolism in both T1D and T2D. Additionally, we found sex-specific lipidome changes associated with diabetes, and lipids associated with the glycaemic state that can be linked to previously described molecular mechanisms in diabetes.
The tricarboxylic acid cycle is the central pathway of energy production in eukaryotic cells and plays a key part in aerobic respiration throughout all kingdoms of life. One of the pivotal enzymes in this cycle is 2-oxoglutarate dehydrogenase complex (OGDHC), which generates NADH by oxidative decarboxylation of 2-oxoglutarate to succinyl-CoA. OGDHC is a megadalton protein complex originally thought to be assembled from three catalytically active subunits (E1o, E2o, E3). In fungi and animals, however, the protein MRPS36 has more recently been proposed as a putative additional component. Based on extensive cross-linking mass spectrometry data supported by phylogenetic analyses, we provide evidence that MRPS36 is an important member of the eukaryotic OGDHC, with no prokaryotic orthologues. Comparative sequence analysis and computational structure predictions reveal that, in contrast with bacteria and archaea, eukaryotic E2o does not contain the peripheral subunit-binding domain (PSBD). We therefore propose that MRPS36 evolved as an E3 adaptor protein, functionally replacing the PSBD. We further provide a refined structural model of the complete eukaryotic OGDHC of approximately 3.45 MDa with novel mechanistic insights.
Leishmaniasis is a debilitating disease caused by the parasite Leishmania. There is extensive clinical polymorphism, including variable responsiveness to treatment. We study Leishmania donovani parasites isolated from visceral leishmaniasis patients in Nepal that responded differently to antimonial treatment due to differing intrinsic drug sensitivity of the parasites. Here, we present a proof-of-principle study in which we applied a metabolomics pipeline specifically developed for L. donovani to characterize the global metabolic differences between antimonial-sensitive and antimonial-resistant L. donovani isolates. Clones of drug-sensitive and drug-resistant parasite isolates from clinical samples were cultured in vitro and harvested for metabolomics analysis. The relative abundance of 340 metabolites was determined by ZIC-HILIC chromatography coupled to LTQ-Orbitrap mass spectrometry. Our measurements cover approximately 20% of the predicted core metabolome of Leishmania and additionally detected a large number of lipids. Drug-sensitive and drug-resistant parasites showed distinct metabolic profiles, and unsupervised clustering and principal component analysis clearly distinguished the two phenotypes. For 100 metabolites, the detected intensity differed more than three-fold between the two phenotypes. Many of these were in specific areas of lipid metabolism, suggesting that the membrane composition of the drug-resistant parasites is extensively modified. Untargeted metabolomics has been applied to clinical Leishmania isolates to uncover major metabolic differences between drug-sensitive and drug-resistant isolates. The identified major differences provide novel insights into the mechanisms involved in resistance to antimonial drugs, and facilitate investigations using targeted approaches to unravel the key changes mediating drug resistance.
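The three-fold intensity criterion mentioned above can be sketched as a simple filter on the ratio of group means. This is only an illustration under stated assumptions: the abstract does not say how intensities were summarized per phenotype, so the use of arithmetic means, the function name, the metabolite names and the intensity values are all hypothetical.

```python
from statistics import mean


def fold_change_hits(sensitive, resistant, threshold=3.0):
    """Return (name, ratio) for features whose mean intensity differs
    by more than `threshold`-fold in either direction.

    Both arguments map a feature name to a list of replicate intensities.
    The ratio is resistant mean divided by sensitive mean.
    """
    hits = []
    for name in sensitive:
        a = mean(sensitive[name])
        b = mean(resistant[name])
        if a <= 0 or b <= 0:
            # A ratio is not meaningful for non-positive intensities.
            continue
        ratio = b / a
        if ratio > threshold or ratio < 1.0 / threshold:
            hits.append((name, ratio))
    return hits


# Hypothetical intensities for three made-up metabolites.
sensitive = {
    "metabolite_A": [100.0, 120.0, 110.0],
    "metabolite_B": [200.0, 210.0, 190.0],
    "metabolite_C": [900.0, 1000.0, 1100.0],
}
resistant = {
    "metabolite_A": [400.0, 380.0, 420.0],
    "metabolite_B": [220.0, 180.0, 200.0],
    "metabolite_C": [250.0, 300.0, 200.0],
}
hits = fold_change_hits(sensitive, resistant)
hit_names = [name for name, _ in hits]
```

Testing the ratio against both `threshold` and its reciprocal keeps the filter symmetric, so increases and decreases of the same magnitude are treated alike.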
Metabolomics, the global characterization of metabolite profiles, is becoming an increasingly powerful tool for research on secondary metabolite discovery and production. In this review we discuss examples of recent technological advances and biological applications of metabolomics in the search for chemical novelty and the engineered production of bioactive secondary metabolites.