Molecular dynamics simulations provide a mechanistic description of molecules by relying on empirical potentials. The quality and transferability of such potentials can be improved leveraging ...data-driven models derived with machine learning approaches. Here, we present TorchMD, a framework for molecular simulations with mixed classical and machine learning potentials. All force computations including bond, angle, dihedral, Lennard-Jones, and Coulomb interactions are expressed as PyTorch arrays and operations. Moreover, TorchMD enables learning and simulating neural network potentials. We validate it using standard Amber all-atom simulations, learning an ab initio potential, performing an end-to-end training, and finally learning and simulating a coarse-grained model for protein folding. We believe that TorchMD provides a useful tool set to support molecular simulations of machine learning potentials. Code and data are freely available at github.com/torchmd.
Prior biological knowledge greatly facilitates the meaningful interpretation of gene-expression data. Causal networks constructed from individual relationships curated from the literature are ...particularly suited for this task, since they create mechanistic hypotheses that explain the expression changes observed in datasets.
We present and discuss a suite of algorithms and tools for inferring and scoring regulator networks upstream of gene-expression data based on a large-scale causal network derived from the Ingenuity Knowledge Base. We extend the method to predict downstream effects on biological functions and diseases and demonstrate the validity of our approach by applying it to example datasets.
The causal analytics tools 'Upstream Regulator Analysis', 'Mechanistic Networks', 'Causal Network Analysis' and 'Downstream Effects Analysis' are implemented and available within Ingenuity Pathway Analysis (IPA, http://www.ingenuity.com).
Supplementary material is available at Bioinformatics online.
We combine replica exchange (parallel tempering) with normalizing flows, a class of deep generative models. These two sampling strategies complement each other, resulting in an efficient method for ...sampling molecular systems characterized by rare events, which we call learned replica exchange (LREX). In LREX, a normalizing flow is trained to map the configurations of the fastest-mixing replica into configurations belonging to the target distribution, allowing direct exchanges between the two without the need to simulate intermediate replicas. This can significantly reduce the computational cost compared to standard replica exchange. The proposed method also offers several advantages with respect to Boltzmann generators that directly use normalizing flows to sample the target distribution. We apply LREX to some prototypical molecular dynamics systems, highlighting the improvements over previous methods.
Long-range Lennard-Jones (LJ) interactions have been incorporated into the CHARMM36 (C36) lipid force field (FF) using the LJ particle-mesh Ewald (LJ-PME) method in order to remove the inconsistency ...of bilayer and monolayer properties arising from the exclusion of long-range dispersion Yu, Y. ; Semi-automated Optimization of the CHARMM36 Lipid Force Field to Include Explicit Treatment of Long-Range Dispersion. J. Chem. Theory Comput. 2021, 10.1021/acs.jctc.0c01326. (preceding article in this issue). The new FF is denoted C36/LJ-PME. While the first optimization was based on three phosphatidylcholines (PCs), this work extends the validation and parametrization to more lipids including PC, phosphatidylethanolamine (PE), phosphatidylglycerol (PG), and ether lipids. The agreement with experimental structure data is excellent for PC, PE, and ether lipids. C36/LJ-PME also compares favorably with scattering data of PG bilayers but less so with NMR deuterium order parameters of 1,2-dimyristoyl-sn-glycero-3-phospho-(1′-rac-glycerol) (DMPG) at 303.15 K, indicating a need for future optimization regarding PG-specific parameters. Frequency dependence of NMR T 1 spin–lattice relaxation times is well-described by C36/LJ-PME, and the overall agreement with experiment is comparable to C36. Lipid diffusion is slower than C36 due to the added long-range dispersion causing a higher viscosity, although it is still too fast compared to experiment after correction for periodic boundary conditions. When using a 10 Å real-space cutoff, the simulation speed of C36/LJ-PME is roughly equal to C36. While more lipids will be incorporated into the FF in the future, C36/LJ-PME can be readily used for common lipids and extends the capability of the CHARMM FF by supporting monolayers and eliminating the cutoff dependence.
Coarse-grained (CG) molecular simulations have become a standard tool to study molecular processes on time and length scales inaccessible to all-atom simulations. Parametrizing CG force fields to ...match all-atom simulations has mainly relied on force-matching or relative entropy minimization, which require many samples from costly simulations with all-atom or CG resolutions, respectively. Here we present flow-matching, a new training method for CG force fields that combines the advantages of both methods by leveraging normalizing flows, a generative deep learning method. Flow-matching first trains a normalizing flow to represent the CG probability density, which is equivalent to minimizing the relative entropy without requiring iterative CG simulations. Subsequently, the flow generates samples and forces according to the learned distribution in order to train the desired CG free energy model via force-matching. Even without requiring forces from the all-atom simulations, flow-matching outperforms classical force-matching by an order of magnitude in terms of data efficiency and produces CG models that can capture the folding and unfolding transitions of small proteins.
Cell type-specific gene expression patterns are represented as memory states of a Hopfield neural network model. It is shown that order parameters of this model can be interpreted as concentrations ...of master transcription regulators that form concurrent positive feedback loops with a large number of downstream regulated genes. The order parameter free energy then defines an epigenetic landscape in which local minima correspond to stable cell states. The model is applied to gene expression data in the context of hematopoiesis.
3-Aminoindazoles are privileged scaffolds for bioactive drug-like molecules. In this study, a microwave-assisted cascade reaction for the synthesis of N-1 substituted 3-aminoindazoles with yields up ...to 81% has been developed. Starting from 3-(2-bromoaryl)-1,2,4-oxadiazol-5(4H)-ones, the reaction exhibits a broad substrate scope including anilines, aliphatic amines, and sulfonamides and bypasses selectivity issues between N-1 and 3-amino group. Furthermore, the Differential Scanning Fluorimetry screen of a kinase panel demonstrated the value of targeting N-1 substituted 3-aminoindazoles as kinase-biased fragments.
Permeation of small molecules through membranes is a fundamental biological process, and molecular dynamics simulations have proven to be a promising tool for studying the permeability of membranes ...by providing a precise characterization of the free energy and diffusivity. In this study, permeation of ethanol through three different membranes of 1-palmitoyl-2-oleoyl-sn-glycero-3-phosphatidylserine (POPS), PO-phosphatidylethanolamine (POPE), and PO-phosphatidylcholine (POPC) is studied. Permeabilities are calculated and compared with two different approaches based on Fick's first law and the inhomogeneous solubility-diffusion model. Microsecond simulation of double bilayers of these membranes provided a direct measurement of permeability by a flux-based counting method. These simulations show that a membrane of POPC has the highest permeability, followed by POPE and POPS. Due to the membrane-modulating properties of ethanol, the permeability increases as functions of concentration and saturation of the inner leaflet in a double bilayer setting, as opposed to the customary definition as a proportionality constant. This concentration dependence is confirmed by single bilayer simulations at different ethanol concentrations ranging from 1% to 18%, where permeability estimates are available from transition-based counting and the inhomogeneous solubility-diffusion model. We show that the free energy and diffusion profiles for ethanol lack accuracy at higher permeant concentrations due to non-Markovian kinetics caused by collective behavior. In contrast, the counting method provides unbiased estimates. Finally, the permeabilities obtained from single bilayer simulations are combined to represent natural gradients felt by a cellular membrane, which accurately models the non-equilibrium effects on ethanol permeability from single bilayer simulations in equilibrium.
Building on the pyrazolopyrimidine CK2 (casein kinase 2) inhibitor scaffold, we designed a small targeted library. Through comprehensive evaluation of inhibitor selectivity, we identified inhibitor ...24 (SGC-CK2-1) as a highly potent and cell-active CK2 chemical probe with exclusive selectivity for both human CK2 isoforms. Remarkably, despite years of research pointing to CK2 as a key driver in cancer, our chemical probe did not elicit a broad antiproliferative phenotype in >90% of >140 cell lines when tested in dose-response. While many publications have reported CK2 functions, CK2 biology is complex and an available high-quality chemical tool such as SGC-CK2-1 will be indispensable in deciphering the relationships between CK2 function and phenotypes.
Display omitted
•We developed a potent, selective, and cell-active chemical probe for CK2•We identified a negative control compound to be used with the CK2 probe•Our selective chemical probe (SGC-CK2-1) is not broadly antiproliferative•SGC-CK2-1 is the best available chemical tool to define the roles of CK2 in cells
Many papers have detailed the function(s) of casein kinase 2 (CK2). Characterization of CK2, however, has been confounded by the use of suboptimal inhibitors. Wells et al. have identified the best available chemical tool to use in illuminating the essential roles of this pleiotropic kinase.