Peroxiredoxins (Prxs or Prdxs) are a large protein superfamily of antioxidant enzymes that rapidly detoxify damaging peroxides and/or affect signal transduction and, thus, have roles in ...proliferation, differentiation, and apoptosis. Prx superfamily members are widespread across phylogeny and multiple methods have been developed to classify them. Here we present an updated atlas of the Prx superfamily identified using a novel method called MISST (Multi-level Iterative Sequence Searching Technique). MISST is an iterative search process developed to be both agglomerative, to add sequences containing similar functional site features, and divisive, to split groups when functional site features suggest distinct functionally-relevant clusters. Superfamily members need not be identified initially-MISST begins with a minimal representative set of known structures and searches GenBank iteratively. Further, the method's novelty lies in the manner in which isofunctional groups are selected; rather than use a single or shifting threshold to identify clusters, the groups are deemed isofunctional when they pass a self-identification criterion, such that the group identifies itself and nothing else in a search of GenBank. The method was preliminarily validated on the Prxs, as the Prxs presented challenges of both agglomeration and division. For example, previous sequence analysis clustered the Prx functional families Prx1 and Prx6 into one group. Subsequent expert analysis clearly identified Prx6 as a distinct functionally relevant group. The MISST process distinguishes these two closely related, though functionally distinct, families. Through MISST search iterations, over 38,000 Prx sequences were identified, which the method divided into six isofunctional clusters, consistent with previous expert analysis. The results represent the most complete computational functional analysis of proteins comprising the Prx superfamily. The feasibility of this novel method is demonstrated by the Prx superfamily results, laying the foundation for potential functionally relevant clustering of the universe of protein sequences.
Display omitted
•Data standard for macromolecular structures from integrative and hybrid modeling (IHM).•Infrastructure for deposition, archiving, and public dissemination of integrative ...structures.•Representation and tools to enable interoperation of structural biology resources.•Community-driven effort that is extensible to handle novel methods.•Supports FAIR data delivery and accelerates scientific discovery.
IHMCIF (github.com/ihmwg/IHMCIF) is a data information framework that supports archiving and disseminating macromolecular structures determined by integrative or hybrid modeling (IHM), and making them Findable, Accessible, Interoperable, and Reusable (FAIR). IHMCIF is an extension of the Protein Data Bank Exchange/macromolecular Crystallographic Information Framework (PDBx/mmCIF) that serves as the framework for the Protein Data Bank (PDB) to archive experimentally determined atomic structures of biological macromolecules and their complexes with one another and small molecule ligands (e.g., enzyme cofactors and drugs). IHMCIF serves as the foundational data standard for the PDB-Dev prototype system, developed for archiving and disseminating integrative structures. It utilizes a flexible data representation to describe integrative structures that span multiple spatiotemporal scales and structural states with definitions for restraints from a variety of experimental methods contributing to integrative structural biology. The IHMCIF extension was created with the benefit of considerable community input and recommendations gathered by the Worldwide Protein Data Bank (wwPDB) Task Force for Integrative or Hybrid Methods (wwpdb.org/task/hybrid). Herein, we describe the development of IHMCIF to support evolving methodologies and ongoing advancements in integrative structural biology. Ultimately, IHMCIF will facilitate the unification of PDB-Dev data and tools with the PDB archive so that integrative structures can be archived and disseminated through PDB.
Many structures of large molecular assemblies such as virus capsids and ribosomes have been experimentally determined to atomic resolution. We consider four software problems that arise in ...interactive visualization and analysis of large assemblies: how to represent multimers efficiently, how to make cartoon representations, how to calculate contacts efficiently, and how to select subassemblies. We describe techniques and algorithms we have developed and give examples of their use. Existing molecular visualization programs work well for single protein and nucleic acid molecules and for small complexes. The methods presented here are proposed as features to add to existing programs or include in next-generation visualization software to allow easy exploration of assemblies containing tens to thousands of macromolecules. Our approach is pragmatic, emphasizing simplicity of code, reliability, and speed. The methods described have been distributed as the Multiscale extension of the UCSF Chimera (
www.cgl.ucsf.edu/chimera) molecular graphics program.
Comparison of human solute carriers Schlessinger, Avner; Matsson, Pär; Shima, James E. ...
Protein science,
March 2010, Volume:
19, Issue:
3
Journal Article
Peer reviewed
Open access
Solute carriers are eukaryotic membrane proteins that control the uptake and efflux of solutes, including essential cellular compounds, environmental toxins, and therapeutic drugs. Solute carriers ...can share similar structural features despite weak sequence similarities. Identification of sequence relationships among solute carriers is needed to enhance our ability to model individual carriers and to elucidate the molecular mechanisms of their substrate specificity and transport. Here, we describe a comprehensive comparison of solute carriers. We link the proteins using sensitive profile–profile alignments and two classification approaches, including similarity networks. The clusters are analyzed in view of substrate type, transport mode, organism conservation, and tissue specificity. Solute carrier families with similar substrates generally cluster together, despite exhibiting relatively weak sequence similarities. In contrast, some families cluster together with no apparent reason, revealing unexplored relationships. We demonstrate computationally and experimentally the functional overlap between representative members of these families. Finally, we identify four putative solute carriers in the human genome. The solute carriers include a biomedically important group of membrane proteins that is diverse in sequence and structure. The proposed classification of solute carriers, combined with experiment, reveals new relationships among the individual families and identifies new solute carriers. The classification scheme will inform future attempts directed at modeling the structures of the solute carriers, a prerequisite for describing the substrate specificities of the individual families.
ABCC4 encodes multidrug resistance protein 4 (MRP4), a member of the ATP-binding cassette family of membrane transporters involved in the efflux of endogenous and xenobiotic molecules. The aims of ...this study were to identify single nucleotide polymorphisms of ABCC4 and to functionally characterize selected nonsynonymous variants. Resequencing was performed in a large ethnically diverse population. Ten nonsynonymous variants were selected for analysis of transport function based on allele frequencies and evolutionary conservation. The reference and variant MRP4 cDNAs were constructed by site-directed mutagenesis and transiently transfected into human embryonic kidney cells (HEK 293T). The function of MRP4 variants was compared by measuring the intracellular accumulation of two antiviral agents, azidothymidine (AZT) and adefovir (PMEA). A total of 98 variants were identified in the coding and flanking intronic regions of ABCC4. Of these, 43 variants are in the coding region, and 22 are nonsynonymous. In a functional screen of ten variants, there was no evidence for a complete loss of function allele. However, two variants (G187W and G487E) showed a significantly reduced function compared to reference with both substrates, as evidenced by higher intracellular accumulation of AZT and PMEA compared to the reference MRP4 (43 and 69% increase in accumulation for G187W compared with the reference MRP4, with AZT and PMEA, respectively). The G187W variant also showed decreased expression following transient transfection of HEK 293T cells. Further studies are required to assess the clinical significance of this altered function and expression and to evaluate substrate specificity of this functional change.
Methotrexate (MTX) is used in patients with malignant and autoimmune diseases. This drug is primarily excreted unchanged in the urine, and its net excretion occurs via active secretory and ...reabsorptive processes. We characterized the interaction of MTX with human organic-anion transporting polypeptide transporter (OATP) 1A2, which is expressed in tissues important for MTX disposition and toxicity, such as the intestine, kidney, liver, and endothelial cells of the blood-brain barrier. In Xenopus laevis oocytes expressing OATP1A2, the uptake of the model substrate, estrone-3-sulfate (ES), was enhanced 30-fold compared with uninjected oocytes. MTX uptake in oocytes expressing OATP1A2 was saturable (Km = 457 +/- 118 microM; Vmax = 17.5 +/- 4.9 pmol/oocyte/60 min) and sensitive to extracellular pH. That is, acidic pHs stimulated MTX uptake by as much as 7-fold. Seven novel protein-altering variants were identified in 270 ethnically diverse DNA samples. Four protein-altering variants in OATP1A2 exhibited altered transport of ES and/or MTX. The common variant, protein reference sequence (p.) Ile13Thr, was hyperfunctional for ES and MTX and showed a 2-fold increase in the V(max) for ES. The common variant, p. Glu172Asp, exhibited reduced maximal transport capacity for ES and MTX. p. Arg168Cys was hypofunctional, and p. Asn277DEL was nonfunctional. Because of its expression on the apical membrane of the distal tubule and in tissues relevant to MTX disposition and toxicity, these findings suggest that OATP1A2 may play a role in active tubular reabsorption of MTX and in MTX-induced toxicities. Furthermore, genetic variation in OATP1A2 may contribute to variation in MTX disposition and response.
The organic cation transporter, OCT1, is a major hepatic transporter that mediates the uptake of many organic cations from the blood into the liver where the compounds may be metabolized or secreted ...into the bile. Because OCT1 interacts with a variety of structurally diverse organic cations, including clinically used drugs as well as toxic substances (e.g., N-methylpyridinium, MPP+), it is an important determinant of systemic exposure to many xenobiotics. To understand the genetic basis of extensive interindividual differences in xenobiotic disposition, we functionally characterized 15 protein-altering variants of the human liver organic cation transporter, OCT1, in Xenopus oocytes. All variants that reduced or eliminated function (OCT1-R61C, OCT1-P341L, OCT1-G220V, OCT1-G401S, and OCT1-G465R) altered evolutionarily conserved amino acid residues. In general, variants with decreased function had amino acid substitutions that resulted in more radical chemical changes (higher Grantham values) and were less evolutionarily favorable (lower Blosum62 values) than variants that maintained function. A variant with increased function (OCT1-S14F) changed an amino acid residue such that the human protein matched the consensus of the OCT1 mammalian orthologs. Our results indicate that changes at evolutionarily conserved positions of OCT1 are strong predictors of decreased function and suggest that a combination of evolutionary conservation and chemical change might be a stronger predictor of function.
Membrane transporters maintain cellular and organismal homeostasis by importing nutrients and exporting toxic compounds. Transporters also play a crucial role in drug response, serving as drug ...targets and setting drug levels. As part of a pharmacogenetics project, we screened exons and flanking intronic regions for variation in a set of 24 membrane transporter genes (96 kb; 57% coding) in 247 DNA samples from ethnically diverse populations. We identified 680 single nucleotide polymorphisms (SNPs), of which 175 were synonymous and 155 caused amino acid changes, and 29 small insertions and deletions. Amino acid diversity (τNS) in transmembrane domains (TMDs) was significantly lower than in loop domains, suggesting that TMDs have special functional constraints. This difference was especially striking in the ATP-binding cassette superfamily and did not parallel evolutionary conservation: there was little variation in the TMDs, even in evolutionarily unconserved residues. We used allele frequency distribution to evaluate different scoring systems (Grantham, Blosum62, SIFT, and evolutionarily conserved/evolutionarily unconserved) for their ability to predict which SNPs affect function. Our underlying assumption was that alleles that are functionally deleterious will be selected against and thus under represented at high frequencies and over represented at low frequencies. We found that evolutionary conservation of orthologous sequences, as assessed by evolutionarily conserved/evolutionarily unconserved and SIFT, was the best predictor of allele frequency distribution and hence of function. European Americans had an excess of high frequency alleles in comparison to African Americans, consistent with a historic bottleneck. In addition, African Americans exhibited a much higher frequency of population specific medium-frequency alleles than did European Americans.
Software for viewing three-dimensional models and maps of viruses, ribosomes, filaments, and other molecular assemblies is advancing on many fronts. New developments include molecular representations ...that offer better control over level of detail, lighting that improves the perception of depth, and two-dimensional projections that simplify data interpretation. Programmable graphics processors offer quality, speed, and visual effects not previously possible, while 3D printers, haptic interaction devices, and auto-stereo displays show promise in more naturally engaging our senses. Visualization methods are developed by diverse groups of researchers with differing goals: experimental biologists, database developers, computer scientists, and package developers. We survey recent developments and problems faced by the developer community in bringing innovative visualization methods into widespread use.
The study of mechanistically diverse enzyme superfamiliescollections of enzymes that perform different overall reactions but share both a common fold and a distinct mechanistic step performed by key ...conserved residueshelps elucidate the structure−function relationships of enzymes. We have developed a resource, the structure−function linkage database (SFLD), to analyze these structure−function relationships. Unique to the SFLD is its hierarchical classification scheme based on linking the specific partial reactions (or other chemical capabilities) that are conserved at the superfamily, subgroup, and family levels with the conserved structural elements that mediate them. We present the results of analyses using the SFLD in correcting misannotations, guiding protein engineering experiments, and elucidating the function of recently solved enzyme structures from the structural genomics initiative. The SFLD is freely accessible at http://sfld.rbvi.ucsf.edu.