Proteins are dynamic entities that undergo a plethora of conformational changes that may take place on a wide range of time scales. These changes can be as small as the rotation of one or a few ...side-chain dihedral angles or involve concerted motions in larger portions of the three-dimensional structure; both kinds of motions can be important for biological function and allostery. It is becoming increasingly evident that "connector regions" are important components of the dynamic personality of protein structures. These regions may be either disordered loops, i.e., poorly structured regions connecting secondary structural elements, or linkers that connect entire protein domains. Experimental and computational studies have, however, revealed that these regions are not mere connectors, and their role in allostery and conformational changes has been emerging in the last few decades. Here we provide a detailed overview of the structural properties and classification of loops and linkers, as well as a discussion of the main computational methods employed to investigate their function and dynamical properties. We also describe their importance for protein dynamics and allostery using as examples key proteins in cellular biology and human diseases such as kinases, ubiquitinating enzymes, and transcription factors.
The advent of Next-Generation Sequencing (NGS) technologies has opened new perspectives in deciphering the genetic mechanisms underlying complex diseases. Nowadays, the amount of genomic data is ...massive and substantial efforts and new tools are required to unveil the information hidden in the data. The Genomic Data Commons (GDC) Data Portal is a platform that contains different genomic studies including the ones from The Cancer Genome Atlas (TCGA) and the Therapeutically Applicable Research to Generate Effective Treatments (TARGET) initiatives, accounting for more than 40 tumor types originating from nearly 30000 patients. Such platforms, although very attractive, must make sure the stored data are easily accessible and adequately harmonized. Moreover, they have the primary focus on the data storage in a unique place, and they do not provide a comprehensive toolkit for analyses and interpretation of the data. To fulfill this urgent need, comprehensive but easily accessible computational methods for integrative analyses of genomic data that do not renounce a robust statistical and theoretical framework are required. In this context, the R/Bioconductor package TCGAbiolinks was developed, offering a variety of bioinformatics functionalities. Here we introduce new features and enhancements of TCGAbiolinks in terms of i) more accurate and flexible pipelines for differential expression analyses, ii) different methods for tumor purity estimation and filtering, iii) integration of normal samples from other platforms iv) support for other genomics datasets, exemplified here by the TARGET data. Evidence has shown that accounting for tumor purity is essential in the study of tumorigenesis, as these factors promote confounding behavior regarding differential expression analysis. With this in mind, we implemented these filtering procedures in TCGAbiolinks. Moreover, a limitation of some of the TCGA datasets is the unavailability or paucity of corresponding normal samples. We thus integrated into TCGAbiolinks the possibility to use normal samples from the Genotype-Tissue Expression (GTEx) project, which is another large-scale repository cataloging gene expression from healthy individuals. The new functionalities are available in the TCGAbiolinks version 2.8 and higher released in Bioconductor version 3.7.
Intrinsically disordered proteins play important roles throughout biology, yet our understanding of the relationship between their sequences, structural properties, and functions remains incomplete. ...The dynamic nature of these proteins, however, makes them difficult to characterize structurally. Many disordered proteins can attain both compact and expanded conformations, and the level of expansion may be regulated and important for function. Experimentally, the level of compaction and shape is often determined either by small-angle x-ray scattering experiments or pulsed-field-gradient NMR diffusion measurements, which provide ensemble-averaged estimates of the radius of gyration and hydrodynamic radius, respectively. Often, these experiments are interpreted using molecular simulations or are used to validate them. We here provide, to our knowledge, a new and efficient method to calculate the hydrodynamic radius of a disordered protein chain from a model of its structural ensemble. In particular, starting from basic concepts in polymer physics, we derive a relationship between the radius of gyration of a structure and its hydrodynamic ratio, which in turn can be used, for example, to compare a simulated ensemble of conformations to NMR diffusion measurements. The relationship may also be valuable when using NMR diffusion measurements to restrain molecular simulations.
Apoptosis is an essential defensive mechanism against tumorigenesis. Proteins of the B-cell lymphoma-2 (Bcl-2) family regulate programmed cell death by the mitochondrial apoptosis pathway. In ...response to intracellular stress, the apoptotic balance is governed by interactions of three distinct subgroups of proteins; the activator/sensitizer BH3 (Bcl-2 homology 3)-only proteins, the pro-survival, and the pro-apoptotic executioner proteins. Changes in expression levels, stability, and functional impairment of pro-survival proteins can lead to an imbalance in tissue homeostasis. Their overexpression or hyperactivation can result in oncogenic effects. Pro-survival Bcl-2 family members carry out their function by binding the BH3 short linear motif of pro-apoptotic proteins in a modular way, creating a complex network of protein-protein interactions. Their dysfunction enables cancer cells to evade cell death. The critical role of Bcl-2 proteins in homeostasis and tumorigenesis, coupled with mounting insight in their structural properties, make them therapeutic targets of interest. A better understanding of gene expression, mutational profile, and molecular mechanisms of pro-survival Bcl-2 proteins in different cancer types, could help to clarify their role in cancer development and may guide advancement in drug discovery. Here, we shed light on the pro-survival Bcl-2 proteins in breast cancer using different bioinformatic approaches, linking -omics with structural data. We analyzed the changes in the expression of the Bcl-2 proteins and their BH3-containing interactors in breast cancer samples. We then studied, at the structural level, a selection of interactions, accounting for effects induced by mutations found in the breast cancer samples. We find two complexes between the up-regulated Bcl2A1 and two down-regulated BH3-only candidates (i.e., Hrk and Nr4a1) as targets associated with reduced apoptosis in breast cancer samples for future experimental validation. Furthermore, we predict L99R, M75R as damaging mutations altering protein stability, and Y120C as a possible allosteric mutation from an exposed surface to the BH3-binding site.
Abstract
The Database of Protein Disorder (DisProt, URL: https://disprot.org) provides manually curated annotations of intrinsically disordered proteins from the literature. Here we report recent ...developments with DisProt (version 8), including the doubling of protein entries, a new disorder ontology, improvements of the annotation format and a completely new website. The website includes a redesigned graphical interface, a better search engine, a clearer API for programmatic access and a new annotation interface that integrates text mining technologies. The new entry format provides a greater flexibility, simplifies maintenance and allows the capture of more information from the literature. The new disorder ontology has been formalized and made interoperable by adopting the OWL format, as well as its structure and term definitions have been improved. The new annotation interface has made the curation process faster and more effective. We recently showed that new DisProt annotations can be effectively used to train and validate disorder predictors. We believe the growth of DisProt will accelerate, contributing to the improvement of function and disorder predictors and therefore to illuminate the ‘dark’ proteome.
The selective removal of undesired or damaged mitochondria by autophagy, known as mitophagy, is crucial for cellular homoeostasis, and prevents tumour diffusion, neurodegeneration and ageing. The ...pro-autophagic molecule AMBRA1 (autophagy/beclin-1 regulator-1) has been defined as a novel regulator of mitophagy in both PINK1/PARKIN-dependent and -independent systems. Here, we identified the E3 ubiquitin ligase HUWE1 as a key inducing factor in AMBRA1-mediated mitophagy, a process that takes place independently of the main mitophagy receptors. Furthermore, we show that mitophagy function of AMBRA1 is post-translationally controlled, upon HUWE1 activity, by a positive phosphorylation on its serine 1014. This modification is mediated by the IKKα kinase and induces structural changes in AMBRA1, thus promoting its interaction with LC3/GABARAP (mATG8) proteins and its mitophagic activity. Altogether, these results demonstrate that AMBRA1 regulates mitophagy through a novel pathway, in which HUWE1 and IKKα are key factors, shedding new lights on the regulation of mitochondrial quality control and homoeostasis in mammalian cells.
Abstract
The Database of Intrinsically Disordered Proteins (DisProt, URL: https://disprot.org) is the major repository of manually curated annotations of intrinsically disordered proteins and regions ...from the literature. We report here recent updates of DisProt version 9, including a restyled web interface, refactored Intrinsically Disordered Proteins Ontology (IDPO), improvements in the curation process and significant content growth of around 30%. Higher quality and consistency of annotations is provided by a newly implemented reviewing process and training of curators. The increased curation capacity is fostered by the integration of DisProt with APICURON, a dedicated resource for the proper attribution and recognition of biocuration efforts. Better interoperability is provided through the adoption of the Minimum Information About Disorder (MIADE) standard, an active collaboration with the Gene Ontology (GO) and Evidence and Conclusion Ontology (ECO) consortia and the support of the ELIXIR infrastructure.
Proteins are highly dynamic entities attaining a myriad of different conformations. Protein side chains change their states during dynamics, causing clashes that are propagated at distal sites. A ...convenient formalism to analyze protein dynamics is based on network theory using Protein Structure Networks (PSNs). Despite their broad applicability, few efforts have been devoted to benchmarking PSN methods and to provide the community with best practices. In many applications, it is convenient to use the centers of mass of the side chains as nodes. It becomes thus critical to evaluate the minimal distance cutoff between the centers of mass which will provide stable network properties. Moreover, when the PSN is derived from a structural ensemble collected by molecular dynamics (MD), the impact of the MD force field has to be evaluated. We selected a dataset of proteins with different fold and size and assessed the two fundamental properties of the PSN, i.e. hubs and connected components. We identified an optimal cutoff of 5 Å that is robust to changes in the force field and the proteins. Our study builds solid foundations for the harmonization and standardization of the PSN approach.
Salt bridges form between pairs of ionisable residues in close proximity and are important interactions in proteins. While salt bridges are known to be important both for protein stability, ...recognition and regulation, we still do not have fully accurate predictive models to assess the energetic contributions of salt bridges. Molecular dynamics simulation is one technique that may be used study the complex relationship between structure, solvation and energetics of salt bridges, but the accuracy of such simulations depends on the force field used. We have used NMR data on the B1 domain of protein G (GB1) to benchmark molecular dynamics simulations. Using enhanced sampling simulations, we calculated the free energy of forming a salt bridge for three possible lysine-carboxylate ionic interactions in GB1. The NMR experiments showed that these interactions are either not formed, or only very weakly formed, in solution. In contrast, we show that the stability of the salt bridges is overestimated, to different extents, in simulations of GB1 using seven out of eight commonly used combinations of fixed charge force fields and water models. We also find that the Amber ff15ipq force field gives rise to weaker salt bridges in good agreement with the NMR experiments. We conclude that many force fields appear to overstabilize these ionic interactions, and that further work may be needed to refine our ability to model quantitatively the stability of salt bridges through simulations. We also suggest that comparisons between NMR experiments and simulations will play a crucial role in furthering our understanding of this important interaction.
Determining the structures, kinetics, thermodynamics and mechanisms that underlie conformational exchange processes in proteins remains extremely difficult. Only in favourable cases is it possible to ...provide atomic-level descriptions of sparsely populated and transiently formed alternative conformations. Here we benchmark the ability of enhanced-sampling molecular dynamics simulations to determine the free energy landscape of the L99A cavity mutant of T4 lysozyme. We find that the simulations capture key properties previously measured by NMR relaxation dispersion methods including the structure of a minor conformation, the kinetics and thermodynamics of conformational exchange, and the effect of mutations. We discover a new tunnel that involves the transient exposure towards the solvent of an internal cavity, and show it to be relevant for ligand escape. Together, our results provide a comprehensive view of the structural landscape of a protein, and point forward to studies of conformational exchange in systems that are less characterized experimentally.