Tinker 8: Software Tools for Molecular Design
Rackers, Joshua A; Wang, Zhi; Lu, Chao ...
Journal of Chemical Theory and Computation, 10/2018, Volume 14, Issue 10
Journal Article
Peer reviewed
Open access
The Tinker software, currently released as version 8, is a modular molecular mechanics and dynamics package written primarily in a standard, easily portable dialect of Fortran 95 with OpenMP extensions. It supports a wide variety of force fields, including polarizable models such as the Atomic Multipole Optimized Energetics for Biomolecular Applications (AMOEBA) force field. The package runs on Linux, macOS, and Windows systems. In addition to canonical Tinker, there are branches, Tinker-HP and Tinker-OpenMM, designed for use on message passing interface (MPI) parallel distributed memory supercomputers and state-of-the-art graphical processing units (GPUs), respectively. The Tinker suite also includes a tightly integrated Java-based graphical user interface called Force Field Explorer (FFE), which provides molecular visualization capabilities as well as the ability to launch and control Tinker calculations.
The classification of genetic variants represents a major challenge in the post-genome era by virtue of their extraordinary number and the complexities associated with ascribing a clinical impact, especially for disorders exhibiting exceptional phenotypic, genetic, and allelic heterogeneity. To address this challenge for hearing loss, we have developed the Deafness Variation Database (DVD), a comprehensive, open-access resource that integrates all available genetic, genomic, and clinical data together with expert curation to generate a single classification for each variant in 152 genes implicated in syndromic and non-syndromic deafness. We evaluate 876,139 variants and classify them as pathogenic or likely pathogenic (more than 8,100 variants), benign or likely benign (more than 172,000 variants), or of uncertain significance (more than 695,000 variants); 1,270 variants are re-categorized based on expert curation and in 300 instances, the change is of medical significance and impacts clinical care. We show that more than 96% of coding variants are rare and novel and that pathogenicity is driven by minor allele frequency thresholds, variant effect, and protein domain. The mutational landscape we define shows complex gene-specific variability, making an understanding of these nuances foundational for improved accuracy in variant interpretation in order to enhance clinical decision making and improve our understanding of deafness biology.
We present Tinker-HP, a massively MPI parallel package dedicated to classical molecular dynamics (MD) and to multiscale simulations, using advanced polarizable force fields (PFF) encompassing distributed multipole electrostatics. Tinker-HP is an evolution of the popular Tinker package code that conserves its simplicity of use and its reference double precision implementation for CPUs. Grounded in interdisciplinary efforts with applied mathematics, Tinker-HP allows for long polarizable MD simulations on large systems of up to millions of atoms. We detail in the paper the newly developed extension of massively parallel 3D spatial decomposition to point dipole polarizable models as well as their coupling to efficient Krylov iterative and non-iterative polarization solvers. The design of the code allows the use of various computer systems ranging from laboratory workstations to modern petascale supercomputers with thousands of cores. Tinker-HP therefore offers the first high-performance scalable CPU computing environment for the development of next generation point dipole PFFs and for production simulations. Strategies linking Tinker-HP to Quantum Mechanics (QM) in the framework of multiscale polarizable self-consistent QM/MD simulations are also provided. The capabilities, performance, and scalability of the software are demonstrated via benchmark calculations using the polarizable AMOEBA force field on systems ranging from large water boxes of increasing size and ionic liquids to (very) large biosystems encompassing several proteins as well as the complete satellite tobacco mosaic virus and ribosome structures. For small systems, Tinker-HP appears to be competitive with the Tinker-OpenMM GPU implementation of Tinker. As the system size grows, Tinker-HP remains operational thanks to its access to distributed memory and takes advantage of new algorithms that enable stable long-timescale polarizable simulations. Overall, a several thousand-fold acceleration over a single-core computation is observed for the largest systems. The extension of the present CPU implementation of Tinker-HP to other computational platforms is discussed.
An understanding of molecular interactions is essential for insight into biological systems at the molecular scale. Among the various components of molecular interactions, electrostatics are of special importance because of their long-range nature and their influence on polar or charged molecules, including water, aqueous ions, proteins, nucleic acids, carbohydrates, and membrane lipids. In particular, robust models of electrostatic interactions are essential for understanding the solvation properties of biomolecules and the effects of solvation upon biomolecular folding, binding, enzyme catalysis, and dynamics. Electrostatics, therefore, are of central importance to understanding biomolecular structure and modeling interactions within and among biological molecules. This review discusses the solvation of biomolecules with a computational biophysics view toward describing the phenomenon. While our main focus lies on the computational aspect of the models, we provide an overview of the basic elements of biomolecular solvation (e.g. solvent structure, polarization, ion binding, and non-polar behavior) in order to provide a background to understand the different types of solvation models.
Molecular force fields have been approaching a generational transition over the past several years, moving away from well-established and well-tuned, but intrinsically limited, fixed point charge models toward more intricate and expensive polarizable models that should allow more accurate description of molecular properties. The recently introduced AMOEBA force field is a leading publicly available example of this next generation of theoretical model, but to date, it has only received relatively limited validation, which we address here. We show that the AMOEBA force field is in fact a significant improvement over fixed charge models for small molecule structural and thermodynamic observables in particular, although further fine-tuning is necessary to describe solvation free energies of drug-like small molecules, dynamical properties away from ambient conditions, and possible improvements in aromatic interactions. State of the art electronic structure calculations reveal generally very good agreement with AMOEBA for demanding problems such as relative conformational energies of the alanine tetrapeptide and isomers of water sulfate complexes. AMOEBA is shown to be especially successful on protein-ligand binding and computational X-ray crystallography where polarization and accurate electrostatics are critical.
The thrombotic microangiopathies (TMAs) and C3 glomerulopathies (C3Gs) include a spectrum of rare diseases such as atypical hemolytic uremic syndrome, thrombotic thrombocytopenic purpura, C3GN, and dense deposit disease, which share phenotypic similarities and underlying genetic commonalities. Variants in several genes contribute to the pathogenesis of these diseases, and identification of these variants may inform the diagnosis and treatment of affected patients. We have developed and validated a comprehensive genetic panel that screens all exons of all genes implicated in TMA and C3G. The closely integrated pipeline implemented includes targeted genomic enrichment, massively parallel sequencing, bioinformatic analysis, and a multidisciplinary conference to analyze identified variants in the context of each patient's specific phenotype. Herein, we present our 1-year experience with this panel, during which time we studied 193 patients. We identified 17 novel and 74 rare variants, which we classified as pathogenic (11), likely pathogenic (12), and of uncertain significance (68). Compared with controls, patients with C3G had a higher frequency of rare and novel variants in C3 convertase (C3 and CFB) and complement regulator (CFH, CFI, CFHR5, and CD46) genes (P<0.05). In contrast, patients with TMA had an increase in rare and novel variants only in complement regulator genes (P<0.01), a distinction consistent with differing sites of complement dysregulation in these two diseases. In summary, we were able to provide a positive genetic diagnosis in 43% and 41% of patients carrying the clinical diagnosis of C3G and TMA, respectively.
High levels of HDL-C are correlated with a decreased risk of cardiovascular disease. HDL-C levels are modulated in part by the secreted phospholipase, endothelial lipase (EL), which hydrolyzes the phospholipids of HDL and decreases circulating HDL-C concentrations. A 584C/T polymorphism in LIPG, the gene which encodes EL, was first identified in individuals with increased HDL levels. This polymorphism results in a T111I point mutation in the EL protein. The association between this variant, HDL levels, and the risk of coronary artery disease (CAD) in humans has been extensively studied, but the findings have been inconsistent. In this study, we took a biochemical approach, investigating how the T111I variant affected EL activity, structure, and stability. Moreover, we tested whether the T111I variant altered the inhibition of phospholipase activity by angiopoietin-like 3 (ANGPTL3) and angiopoietin-like 4 (ANGPTL4), two known EL inhibitors. We found that neither the stability nor enzymatic activity of EL was altered by the T111I variant. Moreover, we found no difference between wild-type and T111I EL in their ability to be inhibited by ANGPTL proteins. These data suggest that any effect this variant may have on HDL-C levels or cardiovascular disease is not mediated through alterations in these functions.
The formulation of active pharmaceutical ingredients involves discovering stable crystal packing arrangements or polymorphs, each of which has distinct pharmaceutically relevant properties. Traditional experimental screening techniques utilizing various conditions are commonly supplemented with in silico crystal structure prediction (CSP) to inform the crystallization process and mitigate risk. Predictions are often based on advanced classical force fields or quantum mechanical calculations that model the crystal potential energy landscape but do not fully incorporate temperature, pressure, or solution conditions during the search procedure. This study proposes an innovative alchemical path that utilizes an advanced polarizable atomic multipole force field to predict crystal structures based on direct sampling of the NPT ensemble. The use of alchemical (i.e., nonphysical) intermediates, a novel Monte Carlo barostat, and an orthogonal space tempering bias combine to enhance the sampling efficiency of the deposition/sublimation phase transition. The proposed algorithm was applied to 2-((4-(2-(3,4-dichlorophenyl)ethyl)phenyl)amino)benzoic acid (Cambridge Crystallographic Data Centre ID: XAFPAY) as a case study. Each experimentally determined polymorph with one molecule in the asymmetric unit was successfully reproduced via approximately 1000 short 1 ns simulations per space group, where each simulation was initiated from random rigid body coordinates and unit cell parameters. Utilizing two threads of a recent Intel CPU (a Xeon Gold 6330 CPU at 2.00 GHz), 1 ns of sampling using the polarizable AMOEBA force field can be acquired in 4 h (equating to more than 300 ns/day using all 112 threads/56 cores of a dual CPU node) within the Force Field X software (https://ffx.biochem.uiowa.edu). These results demonstrate a step forward in the rigorous use of the NPT ensemble during the CSP search process and open the door to future algorithms that incorporate solution conditions using continuum solvation methods.
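The throughput figure quoted in this abstract can be checked with a few lines of arithmetic. The sketch below is only a back-of-the-envelope illustration: the function name and parameters are hypothetical, and it assumes that independent two-thread simulations fill the node with no slowdown from sharing it, which the abstract does not state explicitly.

```python
# Back-of-the-envelope check of the quoted sampling rate.
# Assumption: independent runs scale perfectly across the node's threads.
HOURS_PER_DAY = 24


def aggregate_ns_per_day(ns_per_run, hours_per_run, threads_per_run, total_threads):
    """Aggregate sampling rate when independent runs occupy every thread."""
    concurrent_runs = total_threads // threads_per_run
    ns_per_day_per_run = ns_per_run * HOURS_PER_DAY / hours_per_run
    return concurrent_runs * ns_per_day_per_run


# Figures taken from the abstract: 1 ns in 4 h on 2 threads, 112 threads per node.
rate = aggregate_ns_per_day(
    ns_per_run=1.0, hours_per_run=4.0, threads_per_run=2, total_threads=112
)
print(rate)  # 336.0
```

Under these assumptions, 56 concurrent runs at 6 ns/day each give 336 ns/day, consistent with the "more than 300 ns/day" stated above.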
During in silico crystal structure prediction of organic molecules, millions of candidate structures are often generated. These candidates must be compared to remove duplicates prior to further analysis (e.g. optimization with electronic structure methods) and ultimately compared with structures determined experimentally. The agreement of predicted and experimental structures forms the basis of evaluating the results from the Cambridge Crystallographic Data Centre (CCDC) blind assessment of crystal structure prediction, which further motivates the pursuit of rigorous alignments. Evaluating crystal structure packings using coordinate root‐mean‐square deviation (RMSD) for N molecules (or N asymmetric units) in a reproducible manner requires metrics to describe the shape of the compared molecular clusters to account for alternative approaches used to prioritize selection of molecules. Described here is a flexible algorithm called Progressive Alignment of Crystals (PAC) to evaluate crystal packing similarity using coordinate RMSD and introducing the radius of gyration (Rg) as a metric to quantify the shape of the superimposed clusters. It is shown that the absence of metrics to describe cluster shape adds ambiguity to the results of the CCDC blind assessments because it is not possible to determine whether the superposition algorithm has prioritized tightly packed molecular clusters (i.e. to minimize Rg) or prioritized reduced RMSD (i.e. via possibly elongated clusters with relatively larger Rg). For example, it is shown that when the PAC algorithm described here uses single linkage to prioritize molecules for inclusion in the superimposed clusters, the results are nearly identical to those calculated by the widely used program COMPACK. However, the lower Rg values obtained by the use of average linkage are favored for molecule prioritization because the resulting RMSDs more equally reflect the importance of packing along each dimension. It is shown that the PAC algorithm is faster than COMPACK when using a single process and its utility for biomolecular crystals is demonstrated. Finally, parallel scaling up to 64 processes in the open‐source code Force Field X is presented.
Evaluating crystal structure packings using coordinate root‐mean‐square deviation (RMSD) for N molecules (or N asymmetric units) in a reproducible manner requires metrics to describe the shape of the compared molecular clusters to account for alternative approaches used to prioritize selection of molecules. Described here is a fast algorithm called Progressive Alignment of Crystals (PAC) to evaluate crystal packing similarity using coordinate RMSD and introducing the radius of gyration as a metric to quantify the shape of the superimposed clusters.
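The two metrics named in the PAC abstracts, coordinate RMSD and radius of gyration, can be illustrated with a minimal sketch. This is not the PAC algorithm and not code from Force Field X: it assumes the two coordinate sets are already superimposed and matched atom by atom, it uses an unweighted (equal-mass) radius of gyration, and the function names and toy coordinates are hypothetical.

```python
import math


def centroid(coords):
    """Unweighted geometric center of a list of (x, y, z) tuples."""
    n = len(coords)
    return tuple(sum(c[i] for c in coords) / n for i in range(3))


def rmsd(a, b):
    """Coordinate RMSD between two equally sized, already superimposed sets."""
    if len(a) != len(b) or not a:
        raise ValueError("coordinate lists must be non-empty and equal in length")
    sq = sum((p[i] - q[i]) ** 2 for p, q in zip(a, b) for i in range(3))
    return math.sqrt(sq / len(a))


def radius_of_gyration(coords):
    """Unweighted Rg: root-mean-square distance of the sites from their centroid."""
    c = centroid(coords)
    sq = sum((p[i] - c[i]) ** 2 for p in coords for i in range(3))
    return math.sqrt(sq / len(coords))


# Hypothetical four-site clusters: one compact (square), one elongated (line).
compact = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (1.0, 1.0, 0.0)]
elongated = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (2.0, 0.0, 0.0), (3.0, 0.0, 0.0)]
shifted = [(x + 0.1, y, z) for x, y, z in compact]

print(rmsd(compact, shifted))
print(radius_of_gyration(compact))
print(radius_of_gyration(elongated))
```

In this toy case the elongated cluster has the larger Rg, which is the kind of shape difference an RMSD value alone does not reveal.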
The majority (85%) of nonsyndromic cleft lip with or without cleft palate (nsCL/P) cases occur sporadically, suggesting a role for de novo mutations (DNMs) in the etiology of nsCL/P. To identify high-impact protein-altering DNMs that contribute to the risk of nsCL/P, we conducted whole-genome sequencing (WGS) analyses in 130 African case-parent trios (affected probands and unaffected parents). We identified 162 high-confidence protein-altering DNMs, some of which, based on available evidence, contribute to the risk of nsCL/P. These include novel protein-truncating DNMs in the ACTL6A, ARHGAP10, MINK1, TMEM5 and TTN genes, as well as missense variants in ACAN, DHRS3, DLX6, EPHB2, FKBP10, KMT2D, RECQL4, SEMA3C, SEMA4D, SHH, TP63, and TULP4. Many of these protein-altering DNMs were predicted to be pathogenic. Analysis using mouse transcriptomics data showed that some of these genes are expressed during the development of primary and secondary palate. Gene-set enrichment analysis of the protein-altering DNMs identified palatal development and neural crest migration among the few processes that were significantly enriched. These processes are directly involved in the etiopathogenesis of clefting. The analysis of the coding sequence in the WGS data provides more evidence of the opportunity for novel findings in the African genome.