A proliferation-inducing ligand (APRIL) is a member of the tumor necrosis factor superfamily. APRIL is quite unique in this superfamily for at least for two reasons: (i) it binds to ...glycosaminoglycans (GAGs) via its positively charged N-terminus; (ii) one of its signaling receptor, the transmembrane activator and CAML interactor (TACI), was also reported to bind GAGs. Here, as provided by biochemical evidences with the use of an APRIL deletion mutant linked to computational studies, APRIL-GAG interaction involved other regions than the APRIL N-terminus. Preferential interaction of APRIL with heparin followed by chondroitin sulfate E was confirmed by in silico analysis. Both computational and experimental approaches did not reveal the heparan sulfate binding to TACI. Together, computational results corroborated experiments contributing with atomistic details to the knowledge on this biologically relevant trimolecular system. Additionally, a high-throughput rigorous analysis of the free energy calculations data was performed to critically evaluate the applied computational methodologies.
A reliable representation of local interactions is critical for the accuracy of modeling protein structure and dynamics at both the all-atom and coarse-grained levels. The development of local ...(mainly torsional) potentials was focused on careful parametrization of the predetermined (usually Fourier) formulas rather than on their physics-based derivation. In this Perspective we discuss the state-of-the-art methods for modeling local interactions, including the scale-consistent theory developed in our laboratory, which implies that the coarse-grained torsional potentials inseparably depend on the virtual-bond angles adjacent to a given dihedral and that multitorsional terms should be considered. We extend the treatment to split the residue-based torsional potentials into the site-based regular and improper torsional potentials. These considerations are illustrated with the revised torsional potentials and improper-torsional potentials involving the l-alanine residue and the improper-torsional potential corresponding to serine-residue enantiomerization. Applications of the new approach in coarse-grained modeling and revising all-atom force fields are discussed.
Molecular dynamics with coarse-grained models is nowadays extensively used to simulate biomolecular systems at large time and size scales, compared to those accessible to all-atom molecular dynamics. ...In this review article, we describe the physical basis of coarse-grained molecular dynamics, the coarse-grained force fields, the equations of motion and the respective numerical integration algorithms, and selected practical applications of coarse-grained molecular dynamics. We demonstrate that the motion of coarse-grained sites is governed by the potential of mean force and the friction and stochastic forces, resulting from integrating out the secondary degrees of freedom. Consequently, Langevin dynamics is a natural means of describing the motion of a system at the coarse-grained level and the potential of mean force is the physical basis of the coarse-grained force fields. Moreover, the choice of coarse-grained variables and the fact that coarse-grained sites often do not have spherical symmetry implies a non-diagonal inertia tensor. We describe selected coarse-grained models used in molecular dynamics simulations, including the most popular MARTINI model developed by Marrink's group and the UNICORN model of biological macromolecules developed in our laboratory. We conclude by discussing examples of the application of coarse-grained molecular dynamics to study biologically important processes.
Abstract
Summary
The UNited RESisdue (UNRES) package for coarse-grained simulations, which has recently been optimized to treat large protein systems, has been implemented on Graphical Processor ...Units (GPUs). An over 100-time speed-up of the GPU code (run on an NVIDIA A100) with respect to the sequential code and an 8.5 speed-up with respect to the parallel Open Multi-Processing (OpenMP) code (run on 32 cores of 2 AMD EPYC 7313 Central Processor Units (CPUs)) has been achieved for large proteins (with size over 10 000 residues). Due to the averaging over the fine-grain degrees of freedom, 1 time unit of UNRES simulations is equivalent to about 1000 time units of laboratory time; therefore, millisecond time scale of large protein systems can be reached with the UNRES-GPU code.
Availability and implementation
The source code of UNRES-GPU along with the benchmarks used for tests is available at https://projects.task.gda.pl/eurohpcpl-public/unres.
The recent NEWCT-9P version of the coarse-grained UNRES force field for proteins, with scale-consistent formulas for the local and correlation terms, has been tested in the CASP13 experiment of the ...blind-prediction of protein structure, in the ab initio, contact-assisted, and data-assisted modes. Significant improvement of the performance has been observed with respect to the CASP11 and CASP12 experiments (by over 10 GDT_TS units for the ab initio mode predictions and by over 15 GDT_TS units for the contact-assisted prediction, respectively), which is a result of introducing scale-consistent terms and improved handling of contact-distance restraints. As in previous CASP exercises, UNRES ranked higher in the free modeling category than in the general category that included template based modeling targets. Use of distance restraints from the predicted contacts, albeit many of them were wrong, resulted in the increase of GDT_TS by over 8 units on average and introducing sparse restraints from small-angle X-ray/neutron scattering and chemical cross-link-mass-spectrometry experiments, and ambiguous restraints from nuclear magnetic resonance experiments has also improved the predictions by 8.6, 9.7, and 10.7 GDT_TS units on average, respectively.
Display omitted
•UNRES coarse-grained force field exhibited major performance improvement in CASP13.•The reason of the improvement are scale-consistent terms that embed atomic structure.•Using predicted contacts through a bounded restraint function improves model quality.•Sparse/ambiguous experimental data improve models in the order SAXS/SANS < XLMS < NMR.
The dynamics of the virus like particles (VLPs) corresponding to the GII.4 Houston, GII.2 SMV, and GI.1 Norwalk strains of human noroviruses (HuNoV) that cause gastroenteritis was investigated by ...means of long‐time (about 30 μs in the laboratory timescale) molecular dynamics simulations with the coarse‐grained UNRES force field. The main motion of VLP units turned out to be the bending at the junction between the P1 subdomain (that sits in the VLP shell) and the P2 subdomain (that protrudes outside) of the major VP1 protein, this resulting in a correlated wagging motion of the P2 subdomains with respect to the VLP surface. The fluctuations of the P2 subdomain were found to be more pronounced and the P2 domain made a greater angle with the normal to the VLP surface for the GII.2 strain, which could explain the inability of this strain to bind the histo‐blood group antigens (HBGAs).
Illustration of the dominant motion of the outer P2 domain of the VP1 protein of human norovirus (upper left), correlation between this domain motion, units clustered into two groups with intragroup‐correlated and intergroup‐anticorrelated motion (upper right), and extent of fluctuations (bottom), which is the largest for the GII.2 strain that does not bind to the histo‐blood group antigens, obtained by molecular dynamics simulations with the UNRES coarse‐grained model for three strains of human norovirus.
We report major algorithmic improvements of the UNRES package for physics‐based coarse‐grained simulations of proteins. These include (i) introduction of interaction lists to optimize computations, ...(ii) transforming the inertia matrix to a pentadiagonal form to reduce computing and memory requirements, (iii) removing explicit angles and dihedral angles from energy expressions and recoding the most time‐consuming energy/force terms to minimize the number of operations and to improve numerical stability, (iv) using OpenMP to parallelize those sections of the code for which distributed‐memory parallelization involves unfavorable computing/communication time ratio, and (v) careful memory management to minimize simultaneous access of distant memory sections. The new code enables us to run molecular dynamics simulations of protein systems with size exceeding 100,000 amino‐acid residues, reaching over 1 ns/day (1 μs/day in all‐atom timescale) with 24 cores for proteins of this size. Parallel performance of the code and comparison of its performance with that of AMBER, GROMACS and MARTINI 3 is presented.
Molecular dynamics with the optimized and efficiently parallelized implementation of the highly‐reduced physics‐based UNRES model of polypeptide chains enables us to reach, with moderate computer resources, several nanosecond/day of MD time, for protein systems with size over 100,000 residues, which translates to microsecond/day real time given a 1000 fold faster occurrence of events in the simulations with the UNRES model compared to the all‐atom representation.
In this study, we characterize the interactions between the extracellular matrix protein, procollagen C-proteinase enhancer-1 (PCPE-1), and glycosaminoglycans (GAGs), which are linear anionic ...periodic polysaccharides. We applied molecular modeling approaches to build a structural model of full-length PCPE-1, which is not experimentally available, to predict GAG binding poses for various GAG lengths, types and sulfation patterns, and to determine the effect of calcium ions on the binding. The computational data are analyzed and discussed in the context of the experimental results previously obtained using surface plasmon resonance binding assays. We also provide experimental data on PCPE-1/GAG interactions obtained using inhibition assays with GAG oligosaccharides ranging from disaccharides to octadecasaccharides. Our results predict the localization of GAG-binding sites at the amino acid residue level onto PCPE-1 and is the first attempt to describe the effects of ions on protein-GAG binding using modeling approaches. In addition, this study allows us to get deeper insights into the in silico methodology challenges and limitations when applied to GAG-protein interactions.
The method for protein-structure prediction, which combines the physics-based coarse-grained UNRES force field with knowledge-based modeling, has been developed further and tested in the 13th ...Community Wide Experiment on the Critical Assessment of Techniques for Protein Structure Prediction (CASP13). The method implements restraints from the consensus fragments common to server models. In this work, the server models to derive fragments have been chosen on the basis of quality assessment; a fully automatic fragment-selection procedure has been introduced, and Dynamic Fragment Assembly pseudopotentials have been fully implemented. The Global Distance Test Score (GDT_TS), averaged over our “Model 1” predictions, increased by over 10 units with respect to CASP12 for the free-modeling category to reach 40.82. Our “Model 1” predictions ranked 20 and 14 for all and free-modeling targets, respectively (upper 20.2% and 14.3% of all models submitted to CASP13 in these categories, respectively), compared to 27 (upper 21.1%) and 24 (upper 18.9%) in CASP12, respectively. For oligomeric targets, the Interface Patch Similarity (IPS) and Interface Contact Similarity (ICS) averaged over our best oligomer models increased from 0.28 to 0.36 and from 12.4 to 17.8, respectively, from CASP12 to CASP13, and top-ranking models of 2 targets (H0968 and T0997o) were obtained (none in CASP12). The improvement of our method in CASP13 over CASP12 was ascribed to the combined effect of the overall enhancement of server-model quality, our success in selecting server models and fragments to derive restraints, and improvements of the restraint and potential-energy functions.
Participating as the Cornell-Gdansk group, we have used our physics-based coarse-grained UNited RESidue (UNRES) force field to predict protein structure in the 11th Community Wide Experiment on the ...Critical Assessment of Techniques for Protein Structure Prediction (CASP11). Our methodology involved extensive multiplexed replica exchange simulations of the target proteins with a recently improved UNRES force field to provide better reproductions of the local structures of polypeptide chains. All simulations were started from fully extended polypeptide chains, and no external information was included in the simulation process except for weak restraints on secondary structure to enable us to finish each prediction within the allowed 3-week time window. Because of simplified UNRES representation of polypeptide chains, use of enhanced sampling methods, code optimization and parallelization and sufficient computational resources, we were able to treat, for the first time, all 55 human prediction targets with sizes from 44 to 595 amino acid residues, the average size being 251 residues. Complete structures of six single-domain proteins were predicted accurately, with the highest accuracy being attained for the T0769, for which the CαRMSD was 3.8 Å for 97 residues of the experimental structure. Correct structures were also predicted for 13 domains of multi-domain proteins with accuracy comparable to that of the best template-based modeling methods. With further improvements of the UNRES force field that are now underway, our physics-based coarse-grained approach to protein-structure prediction will eventually reach global prediction capacity and, consequently, reliability in simulating protein structure and dynamics that are important in biochemical processes.
Freely available on the web at http://www.unres.pl/ CONTACT: has5@cornell.edu.