Color vision deficiency (CVD) affects more than 4% of the population and leads to a different visual perception of colors. Though this has been known for decades, colormaps with many colors across ...the visual spectra are often used to represent data, leading to the potential for misinterpretation or difficulty with interpretation by someone with this deficiency. Until the creation of the module presented here, there were no colormaps mathematically optimized for CVD using modern color appearance models. While there have been some attempts to make aesthetically pleasing or subjectively tolerable colormaps for those with CVD, our goal was to make optimized colormaps for the most accurate perception of scientific data by as many viewers as possible. We developed a Python module, cmaputil, to create CVD-optimized colormaps, which imports colormaps and modifies them to be perceptually uniform in CVD-safe colorspace while linearizing and maximizing the brightness range. The module is made available to the science community to enable others to easily create their own CVD-optimized colormaps. Here, we present an example CVD-optimized colormap created with this module that is optimized for viewing by those without a CVD as well as those with red-green colorblindness. This colormap, cividis, enables nearly-identical visual-data interpretation to both groups, is perceptually uniform in hue and brightness, and increases in brightness linearly.
High-throughput, comprehensive, and confident identifications of metabolites and other chemicals in biological and environmental samples will revolutionize our understanding of the role these ...chemically diverse molecules play in biological systems. Despite recent technological advances, metabolomics studies still result in the detection of a disproportionate number of features that cannot be confidently assigned to a chemical structure. This inadequacy is driven by the single most significant limitation in metabolomics, the reliance on reference libraries constructed by analysis of authentic reference materials with limited commercial availability. To this end, we have developed the in silico chemical library engine (ISiCLE), a high-performance computing-friendly cheminformatics workflow for generating libraries of chemical properties. In the instantiation described here, we predict probable three-dimensional molecular conformers (i.e., conformational isomers) using chemical identifiers as input, from which collision cross sections (CCS) are derived. The approach employs first-principles simulation, distinguished by the use of molecular dynamics, quantum chemistry, and ion mobility calculations, to generate structures and chemical property libraries, all without training data. Importantly, optimization of ISiCLE included a refactoring of the popular MOBCAL code for trajectory-based mobility calculations, improving its computational efficiency by over 2 orders of magnitude. Calculated CCS values were validated against 1983 experimentally measured CCS values and compared to previously reported CCS calculation approaches. Average calculated CCS error for the validation set is 3.2% using standard parameters, outperforming other density functional theory (DFT)-based methods and machine learning methods (e.g., MetCCS). An online database is introduced for sharing both calculated and experimental CCS values (metabolomics.pnnl.gov), initially including a CCS library with over 1 million entries. Finally, three successful applications of molecule characterization using calculated CCS are described, including providing evidence for the presence of an environmental degradation product, the separation of molecular isomers, and an initial characterization of complex blinded mixtures of exposure chemicals. This work represents a method to address the limitations of small molecule identification and offers an alternative to generating chemical identification libraries experimentally by analyzing authentic reference materials. All code is available at github.com/pnnl.
Mass-spectrometry based omics technologies - namely proteomics, metabolomics and lipidomics - have enabled the molecular level systems biology investigation of organisms in unprecedented detail. ...There has been increasing interest for gaining a thorough, functional understanding of the biological consequences associated with cellular heterogeneity in a wide variety of research areas such as developmental biology, precision medicine, cancer research and microbiome science. Recent advances in mass spectrometry (MS) instrumentation and sample handling strategies are quickly making comprehensive omics analyses of single cells feasible, but key breakthroughs are still required to push through remaining bottlenecks. In this review, we discuss the challenges faced by single cell MS-based omics analyses and highlight recent technological advances that collectively can contribute to comprehensive and high throughput omics analyses in single cells. We provide a vision of the potential of integrating pioneering technologies such as Structures for Lossless Ion Manipulations (SLIM) for improved sensitivity and resolution, novel peptide identification tactics and standards free metabolomics approaches for future applications in single cell analysis.
Relatively small changes to gene expression data dramatically affect co-expression networks inferred from that data which, in turn, can significantly alter the subsequent biological interpretation. ...This error propagation is an underappreciated problem that, while hinted at in the literature, has not yet been thoroughly explored. Resampling methods (e.g. bootstrap aggregation, random subspace method) are hypothesized to alleviate variability in network inference methods by minimizing outlier effects and distilling persistent associations in the data. But the efficacy of the approach assumes the generalization from statistical theory holds true in biological network inference applications.
We evaluated the effect of bootstrap aggregation on inferred networks using commonly applied network inference methods in terms of stability, or resilience to perturbations in the underlying expression data, a metric for accuracy, and functional enrichment of edge interactions.
Bootstrap aggregation results in improved stability and, depending on the size of the input dataset, a marginal improvement to accuracy assessed by each method's ability to link genes in the same functional pathway.
The majority of primary and secondary metabolites in nature have yet to be identified, representing a major challenge for metabolomics studies that currently require reference libraries from analyses ...of authentic compounds. Using currently available analytical methods, complete chemical characterization of metabolomes is infeasible for both technical and economic reasons. For example, unambiguous identification of metabolites is limited by the availability of authentic chemical standards, which, for the majority of molecules, do not exist. Computationally predicted or calculated data are a viable solution to expand the currently limited metabolite reference libraries, if such methods are shown to be sufficiently accurate. For example, determining nuclear magnetic resonance (NMR) spectroscopy spectra in silico has shown promise in the identification and delineation of metabolite structures. Many researchers have been taking advantage of density functional theory (DFT), a computationally inexpensive yet reputable method for the prediction of carbon and proton NMR spectra of metabolites. However, such methods are expected to have some error in predicted
13
C and
1
H NMR spectra with respect to experimentally measured values. This leads us to the question–what accuracy is required in predicted
13
C and
1
H NMR chemical shifts for confident metabolite identification? Using the set of 11,716 small molecules found in the Human Metabolome Database (HMDB), we simulated both experimental and theoretical NMR chemical shift databases. We investigated the level of accuracy required for identification of metabolites in simulated pure and impure samples by matching predicted chemical shifts to experimental data. We found 90% or more of molecules in simulated pure samples can be successfully identified when errors of
1
H and
13
C chemical shifts in water are below 0.6 and 7.1 ppm, respectively, and below 0.5 and 4.6 ppm in chloroform solvation, respectively. In simulated complex mixtures, as the complexity of the mixture increased, greater accuracy of the calculated chemical shifts was required, as expected. However, if the number of molecules in the mixture is known, e.g., when NMR is combined with MS and sample complexity is low, the likelihood of confident molecular identification increased by 90%.
A primary goal of metabolomics studies is to fully characterize the small-molecule composition of complex biological and environmental samples. However, despite advances in analytical technologies ...over the past two decades, the majority of small molecules in complex samples are not readily identifiable due to the immense structural and chemical diversity present within the metabolome. Current gold-standard identification methods rely on reference libraries built using authentic chemical materials ("standards"), which are not available for most molecules. Computational quantum chemistry methods, which can be used to calculate chemical properties that are then measured by analytical platforms, offer an alternative route for building reference libraries,
,
libraries for "standards-free" identification. In this review, we cover the major roadblocks currently facing metabolomics and discuss applications where quantum chemistry calculations offer a solution. Several successful examples for nuclear magnetic resonance spectroscopy, ion mobility spectrometry, infrared spectroscopy, and mass spectrometry methods are reviewed. Finally, we consider current best practices, sources of error, and provide an outlook for quantum chemistry calculations in metabolomics studies. We expect this review will inspire researchers in the field of small-molecule identification to accelerate adoption of
methods for generation of reference libraries and to add quantum chemistry calculations as another tool at their disposal to characterize complex samples.
We report on separations of ion isotopologues and isotopomers using ultrahigh-resolution traveling wave-based Structures for Lossless Ion Manipulations with serpentine ultralong path and extended ...routing ion mobility spectrometry coupled to mass spectrometry (SLIM SUPER IMS-MS). Mobility separations of ions from the naturally occurring ion isotopic envelopes (e.g., M, M+1, M+2, ... ions) showed the first and second isotopic peaks (i.e., M+1 and M+2) for various tetraalkylammonium ions could be resolved from their respective monoisotopic ion peak (M) after SLIM SUPER IMS with resolving powers of ∼400–600. Similar separations were obtained for other compounds (e.g., tetrapeptide ions). Greater separation was obtained using argon versus helium drift gas, as expected from the greater reduced mass contribution to ion mobility described by the Mason–Schamp relationship. To more directly explore the role of isotopic substitutions, we studied a mixture of specific isotopically substituted (15N, 13C, and 2H) protonated arginine isotopologues. While the separations in nitrogen were primarily due to their reduced mass differences, similar to the naturally occurring isotopologues, their separations in helium, where higher resolving powers could also be achieved, revealed distinct additional relative mobility shifts. These shifts appeared correlated, after correction for the reduced mass contribution, with changes in the ion center of mass due to the different locations of heavy atom substitutions. The origin of these apparent mass distribution-induced mobility shifts was then further explored using a mixture of Iodoacetyl Tandem Mass Tag (iodoTMT) isotopomers (i.e., each having the same exact mass, but with different isotopic substitution sites). Again, the observed mobility shifts appeared correlated with changes in the ion center of mass leading to multiple monoisotopic mobilities being observed for some isotopomers (up to a ∼0.04% difference in mobility). These mobility shifts thus appear to reflect details of the ion structure, derived from the changes due to ion rotation impacting collision frequency or momentum transfer, and highlight the potential for new approaches for ion structural characterization.
Many definitions of resilience have been proffered for natural and engineered ecosystems, but a conceptual consensus on resilience in microbial communities is still lacking. We argue that the ...disconnect largely results from the wide variance in microbial community complexity, which range from compositionally simple synthetic consortia to complex natural communities, and divergence between the typical practical outcomes emphasized by ecologists and engineers. Viewing microbial communities as elasto-plastic systems that undergo both recoverable and unrecoverable transitions, we argue that this gap between the engineering and ecological definitions of resilience stems from their respective emphases on elastic and plastic deformation, respectively. We propose that the two concepts may be fundamentally united around the resilience of function rather than state in microbial communities and the regularity in the relationship between environmental variation and a community's functional response. Furthermore, we posit that functional resilience is an intrinsic property of microbial communities and suggest that state changes in response to environmental variation may be a key mechanism driving functional resilience in microbial communities.
A growing number of software tools have been developed for metabolomics data processing and analysis. Many new tools are contributed by metabolomics practitioners who have limited prior experience ...with software development, and the tools are subsequently implemented by users with expertise that ranges from basic point-and-click data analysis to advanced coding. This Perspective is intended to introduce metabolomics software users and developers to important considerations that determine the overall impact of a publicly available tool within the scientific community. The recommendations reflect the collective experience of an NIH-sponsored Metabolomics Consortium working group that was formed with the goal of researching guidelines and best practices for metabolomics tool development. The recommendations are aimed at metabolomics researchers with little formal background in programming and are organized into three stages: (i) preparation, (ii) tool development, and (iii) distribution and maintenance.
Glycomics has become an increasingly important field of research since glycans play critical roles in biology processes ranging from molecular recognition and signaling to cellular communication. ...Glycans often conjugate with other biomolecules, such as proteins and lipids, and alter their properties and functions, so glycan characterization is essential for understanding the effects they have on cellular systems. However, the analysis of glycans is extremely difficult due to their complexity and structural diversity (i.e., the number and identity of monomer units, and configuration of their glycosidic linkages and connectivities). In this work, we coupled ion mobility spectrometry with mass spectrometry (IMS-MS) to characterize glycan standards and biologically important isomers of synthetic αGal-containing
O
-glycans including glycotopes of the protozoan parasite
Trypanosoma cruzi
, which is the causative agent of Chagas disease. IMS-MS results showed significant differences for the glycan structural isomers when analyzed in positive and negative polarity and complexed with different metal cations. These results suggest that specific metal ions or ion polarities could be used to target and baseline separate glycan isomers of interest with IMS-MS.
Graphical abstract
Glycan isomers, such as fructose and glucose, show distinct separations in positive and negative ion mode