Abstract
AlphaFold can predict the structure of single- and multiple-chain proteins with very high accuracy. However, the accuracy decreases with the number of chains, and the available GPU memory ...limits the size of protein complexes which can be predicted. Here we show that one can predict the structure of large complexes starting from predictions of subcomponents. We assemble 91 out of 175 complexes with 10–30 chains from predicted subcomponents using Monte Carlo tree search, with a median TM-score of 0.51. There are 30 highly accurate complexes (TM-score ≥0.8, 33% of complete assemblies). We create a scoring function, mpDockQ, that can distinguish if assemblies are complete and predict their accuracy. We find that complexes containing symmetry are accurately assembled, while asymmetrical complexes remain challenging. The method is freely available and accesible as a Colab notebook
https://colab.research.google.com/github/patrickbryant1/MoLPC/blob/master/MoLPC.ipynb
.
Most proteins fold into 3D structures that determine how they function and orchestrate the biological processes of the cell. Recent developments in computational methods for protein structure ...predictions have reached the accuracy of experimentally determined models. Although this has been independently verified, the implementation of these methods across structural-biology applications remains to be tested. Here, we evaluate the use of AlphaFold2 (AF2) predictions in the study of characteristic structural elements; the impact of missense variants; function and ligand binding site predictions; modeling of interactions; and modeling of experimental structural data. For 11 proteomes, an average of 25% additional residues can be confidently modeled when compared with homology modeling, identifying structural features rarely seen in the Protein Data Bank. AF2-based predictions of protein disorder and complexes surpass dedicated tools, and AF2 models can be used across diverse applications equally well compared with experimentally determined structures, when the confidence metrics are critically considered. In summary, we find that these advances are likely to have a transformative impact in structural biology and broader life-science research.
A prerequisite for single cell study is the capture and isolation of individual cells. In microfluidic devices, cell capture is often achieved by means of trapping. While many microfluidic trapping ...techniques exist, hydrodynamic methods are particularly attractive due to their simplicity and scalability. However, current design guidelines for single cell hydrodynamic traps predominantly rely on flow resistance manipulation or qualitative streamline analysis without considering the target particle size. This lack of quantitative design criteria from first principles often leads to non-optimal probabilistic trapping. In this work, we describe an analytical design guideline for deterministic single cell hydrodynamic trapping through the optimization of streamline distributions under laminar flow with cell size as a key parameter. Using this guideline, we demonstrate an example design which can achieve 100% capture efficiency for a given particle size. Finite element modelling was used to determine the design parameters necessary for optimal trapping. The simulation results were subsequently confirmed with on-chip microbead and white blood cell trapping experiments.
Abstract
Motivation
Despite near-experimental accuracy on single-chain predictions, there is still scope for improvement among multimeric predictions. Methods like AlphaFold-Multimer and FoldDock can ...accurately model dimers. However, how well these methods fare on larger complexes is still unclear. Further, evaluation methods of the quality of multimeric complexes are not well established.
Results
We analysed the performance of AlphaFold-Multimer on a homology-reduced dataset of homo- and heteromeric protein complexes. We highlight the differences between the pairwise and multi-interface evaluation of chains within a multimer. We describe why certain complexes perform well on one metric (e.g. TM-score) but poorly on another (e.g. DockQ). We propose a new score, Predicted DockQ version 2 (pDockQ2), to estimate the quality of each interface in a multimer. Finally, we modelled protein complexes (from CORUM) and identified two highly confident structures that do not have sequence homology to any existing structures.
Availability and implementation
All scripts, models, and data used to perform the analysis in this study are freely available at https://gitlab.com/ElofssonLab/afm-benchmark.
Understanding metal-protein interaction can provide structural and functional insights into cellular processes. As the number of protein sequences increases, developing fast yet precise computational ...approaches to predict and annotate metal-binding sites becomes imperative. Quick and resource-efficient pre-trained protein language model (pLM) embeddings have successfully predicted binding sites from protein sequences despite not using structural or evolutionary features (multiple sequence alignments). Using residue-level embeddings from the pLMs, we have developed a sequence-based method (M-Ionic) to identify metal-binding proteins and predict residues involved in metal binding.
On independent validation of recent proteins, M-Ionic reports an area under the curve (AUROC) of 0.83 (recall = 84.6%) in distinguishing metal binding from non-binding proteins compared to AUROC of 0.74 (recall = 61.8%) of the next best method. In addition to comparable performance to the state-of-the-art method for identifying metal-binding residues (Ca2+, Mg2+, Mn2+, Zn2+), M-Ionic provides binding probabilities for six additional ions (i.e. Cu2+, Po43-, So42-, Fe2+, Fe3+, Co2+). We show that the pLM embedding of a single residue contains sufficient information about its neighbours to predict its binding properties.
M-Ionic can be used on your protein of interest using a Google Colab Notebook (https://bit.ly/40FrRbK). The GitHub repository (https://github.com/TeamSundar/m-ionic) contains all code and data.
Cellular functions are governed by molecular machines that assemble through protein-protein interactions. Their atomic details are critical to studying their molecular mechanisms. However, fewer than ...5% of hundreds of thousands of human protein interactions have been structurally characterized. Here we test the potential and limitations of recent progress in deep-learning methods using AlphaFold2 to predict structures for 65,484 human protein interactions. We show that experiments can orthogonally confirm higher-confidence models. We identify 3,137 high-confidence models, of which 1,371 have no homology to a known structure. We identify interface residues harboring disease mutations, suggesting potential mechanisms for pathogenic variants. Groups of interface phosphorylation sites show patterns of co-regulation across conditions, suggestive of coordinated tuning of multiple protein interactions as signaling responses. Finally, we provide examples of how the predicted binary complexes can be used to build larger assemblies helping to expand our understanding of human cell biology.
Abstract
The Database of Intrinsically Disordered Proteins (DisProt, URL: https://disprot.org) is the major repository of manually curated annotations of intrinsically disordered proteins and regions ...from the literature. We report here recent updates of DisProt version 9, including a restyled web interface, refactored Intrinsically Disordered Proteins Ontology (IDPO), improvements in the curation process and significant content growth of around 30%. Higher quality and consistency of annotations is provided by a newly implemented reviewing process and training of curators. The increased curation capacity is fostered by the integration of DisProt with APICURON, a dedicated resource for the proper attribution and recognition of biocuration efforts. Better interoperability is provided through the adoption of the Minimum Information About Disorder (MIADE) standard, an active collaboration with the Gene Ontology (GO) and Evidence and Conclusion Ontology (ECO) consortia and the support of the ELIXIR infrastructure.
▶ Variations in fire severity drive stand recovery patterns in Alaskan boreal forests. ▶ Black spruce self replaces in lightly burned stands. ▶ Aspen dominates severely burned stands and persists for ...at least two decades. ▶ With reduced fire return interval aspen could replace spruce over large extents. ▶ Shift in dominance from coniferous to deciduous will impact climate and wildlife.
There has been a recent increase in the frequency and extent of wildfires in interior Alaska, and this trend is predicted to continue under a warming climate. Although less well documented, corresponding increases in fire severity are expected. Previous research from boreal forests in Alaska and western Canada indicate that severe fire promotes the recruitment of deciduous tree species and decreases the relative abundance of black spruce (Picea mariana) immediately after fire. Here we extend these observations by (1) examining changes in patterns of aspen and spruce density and biomass that occurred during the first two decades of post-fire succession, and (2) comparing patterns of tree composition in relation to variations in post-fire organic layer depth in four burned black spruce forests in interior Alaska after 10–20 years of succession. We found that initial effects of fire severity on recruitment and establishment of aspen and black spruce were maintained by subsequent effects of organic layer depth and initial plant biomass on plant growth during post-fire succession. The proportional contribution of aspen (Populus tremuloides) to total stand biomass remained above 90% during the first and second decades of succession in severely burned sites, while in lightly burned sites the proportional contribution of aspen was reduced due to a 40-fold increase in spruce biomass in these sites. Relationships between organic layer depth and stem density and biomass were consistently negative for aspen, and positive or neutral for black spruce in all four burns. Our results suggest that initial effects of post-fire organic layer depths on deciduous recruitment are likely to translate into a prolonged phase of deciduous dominance during post-fire succession in severely burned stands. This shift in vegetation distribution has important implications for climate-albedo feedbacks, future fire regime, wildlife habitat quality and natural resources for indigenous subsistence activities in interior Alaska.
Toxin-antitoxin (TA) systems are a large group of small genetic modules found in prokaryotes and their mobile genetic elements. Type II TAs are encoded as bicistronic (two-gene) operons that encode ...two proteins: a toxin and a neutralizing antitoxin. Using our tool NetFlax (standing for Network-FlaGs for toxins and antitoxins), we have performed a large-scale bioinformatic analysis of proteinaceous TAs, revealing interconnected clusters constituting a core network of TA-like gene pairs. To understand the structural basis of toxin neutralization by antitoxins, we have predicted the structures of 3,419 complexes with AlphaFold2. Together with mutagenesis and functional assays, our structural predictions provide insights into the neutralizing mechanism of the hyperpromiscuous Panacea antitoxin domain. In antitoxins composed of standalone Panacea, the domain mediates direct toxin neutralization, while in multidomain antitoxins the neutralization is mediated by other domains, such as PAD1, Phd-C, and ZFD. We hypothesize that Panacea acts as a sensor that regulates TA activation. We have experimentally validated 16 NetFlax TA systems and used domain annotations and metabolic labeling assays to predict their potential mechanisms of toxicity (such as membrane disruption, and inhibition of cell division or protein synthesis) as well as biological functions (such as antiphage defense). We have validated the antiphage activity of a RosmerTA system encoded by
phage Kita, and used fluorescence microscopy to confirm its predicted membrane-depolarizing activity. The interactive version of the NetFlax TA network that includes structural predictions can be accessed at http://netflax.webflags.se/.
Fire activity in the North American boreal region is projected to increase under a warming climate and trigger changes in vegetation composition. In black spruce forests of interior Alaska, fire ...severity impacts residual organic layer depth which is strongly linked to the relative dominance of deciduous versus coniferous trees in early succession. These alternate successional pathways may be reinforced by biogeochemical processes that affect the relative ability of deciduous versus coniferous trees to acquire limiting nutrients. To test this hypothesis, we examined changes in soil inorganic nitrogen (N) supply and in situ 15N root uptake by aspen (Populus tremuloides) and black spruce (Picea mariana) saplings regenerating in lightly and severely burned sites, 16 years following fire. Fire severity did not impact the composition or magnitude of N supply, and nitrate represented nearly 40 % of total N supply. Both aspen and spruce took up more N in severely burned than in lightly burned sites. Spruce exhibited only a moderately lower rate of nitrate uptake, and a higher ammonium uptake rate than aspen in severely burned sites. At the stand level, differences in species nutrient uptake were magnified, with aspen taking up nearly an order-of-magnitude more N per m 2 in severely burned than in lightly burned sites. We suggest that differences in nutrient sinks (biomass) established early in succession and effects of post-fire organic layer depth on nutrient uptake, are key mechanisms reinforcing the opposing stand dominance patterns that have developed in response to variations in organic layer depth.