The Bioperl project is an international open-source collaboration of biologists, bioinformaticians, and computer scientists that has evolved over the past 7 yr into the most comprehensive library of ...Perl modules available for managing and manipulating life-science information. Bioperl provides an easy-to-use, stable, and consistent programming interface for bioinformatics application programmers. The Bioperl modules have been successfully and repeatedly used to reduce otherwise complex tasks to only a few lines of code. The Bioperl object model has been proven to be flexible enough to support enterprise-level applications such as EnsEMBL, while maintaining an easy learning curve for novice Perl programmers. Bioperl is capable of executing analyses and processing results from programs such as BLAST, ClustalW, or the EMBOSS suite. Interoperation with modules written in Python and Java is supported through the evolving BioCORBA bridge. Bioperl provides access to data stores such as GenBank and SwissProt via a flexible series of sequence input/output modules, and to the emerging common sequence data storage format of the Open Bioinformatics Database Access project. This study describes the overall architecture of the toolkit, the problem domains that it addresses, and gives specific examples of how the toolkit can be used to solve common life-sciences problems. We conclude with a discussion of how the open-source nature of the project has contributed to the development effort.
Comparative Genomics of the Eukaryotes Rubin, Gerald M.; Yandell, Mark D.; Wortman, Jennifer R. ...
Science (American Association for the Advancement of Science),
03/2000, Letnik:
287, Številka:
5461
Journal Article
Recenzirano
Odprti dostop
A comparative analysis of the genomes of Drosophila melanogaster, Caenorhabditis elegans, and Saccharomyces cerevisiae-and the proteins they are predicted to encode-was undertaken in the context of ...cellular, developmental, and evolutionary processes. The nonredundant protein sets of flies and worms are similar in size and are only twice that of yeast, but different gene families are expanded in each genome, and the multidomain proteins and signaling pathways of the fly and worm are far more complex than those of yeast. The fly has orthologs to 177 of the 289 human disease genes examined and provides the foundation for rapid analysis of some of the basic processes involved in human disease.
To facilitate sharing of Omics data, many groups of scientists have been working to establish the relevant data standards. The main components of data sharing standards are experiment description ...standards, data exchange standards, terminology standards, and experiment execution standards. Here we provide a survey of existing and emerging standards that are intended to assist the free and open exchange of large-format data.
Comparative analysis of predicted protein sequences encoded by the genomes of Caenorhabditis elegans and Saccharomyces cerevisiae suggests that most of the core biological functions are carried out ...by orthologous proteins (proteins of different species that can be traced back to a common ancestor) that occur in comparable numbers. The specialized processes of signal transduction and regulatory control that are unique to the multicellular work appear to use novel protein, many of which re-use conserved domains. Major expansion of the number of some of these domains seen in the worm may have contributed to the advant of multicellularity. The proteins conserved in yeast and worm are likely to have orthologs throughout eukaryotes; in contrast, the proteins unique to the worm may well define metazoans.
The chemosensory pathway of bacterial chemotaxis has become a paradigm for
the two-component superfamily of receptor-regulated phosphorylation pathways.
This simple pathway illustrates many of the ...fundamental principles and
unanswered questions in the field of signaling biology. A molecular description
of pathway function has progressed rapidly because it is accessible to diverse
structural, biochemical, and genetic approaches. As a result, structures are
emerging for most of the pathway elements, biochemical studies are elucidating
the mechanisms of key signaling events, and genetic methods are revealing the
intermolecular interactions that transmit information between components.
Recent advances include (
a
) the first molecular picture of a
conformational transmembrane signal in a cell surface receptor, (
b
) four
new structures of kinase domains and adaptation enzymes, and (
c
)
significant new insights into the mechanisms of receptor-mediated kinase
regulation, receptor adaptation, and the phospho-activation of signaling
proteins. Overall, the chemosensory pathway and the propulsion system it
regulates provide an ideal system in which to probe molecular principles
underlying complex cellular signaling and behavior.
Site-directed cysteine and disulfide chemistry is broadly useful in the analysis of protein structure and dynamics, and applications of this chemistry to the bacterial chemotaxis pathway have ...illustrated the kinds of information that can be generated. Notably, in many cases, cysteine and disulfide chemistry can be carried out in the native environment of the protein whether it be aqueous solution, a lipid bilayer, or a multiprotein complex. Moreover, the approach can tackle three types of problems crucial to a molecular understanding of a given protein: (1) it can map out 2 degrees structure, 3 degrees structure, and 4 degrees structure; (2) it can analyze conformational changes and the structural basis of regulation by covalently trapping specific conformational or signaling states; and (3) it can uncover the spatial and temporal aspects of thermal fluctuations by detecting backbone and domain dynamics. The approach can provide structural information for many proteins inaccessible to high-resolution methods. Even when a high-resolution structure is available, the approach provides complementary information about regulatory mechanisms and thermal dynamics in the native environment. Finally, the approach can be applied to an entire protein, or to a specific domain or subdomain within the full-length protein, thereby facilitating a divide-and-conquer strategy in large systems or multiprotein complexes. Rigorous application of the approach to a given protein, domain, or subdomain requires careful experimental design that adequately resolves the structural and dynamical information provided by the method. A full structural and dynamical analysis begins by scanning engineered cysteines throughout the region of interest. To determine 2 degrees structure, the solvent exposure of each cysteine is determined by measuring its chemical reactivity, and the periodicity of exposure is analyzed. To probe 3 degrees structure, 4 degrees structure, and conformational regulation, pairs of cysteines are identified that rapidly form disulfide bonds and that retain function when induced to form a disulfide bond in the folded protein or complex. Finally, to map out thermal fluctuations in a protein of known structure, disulfide formation rates are measured between distal pairs of nonperturbing surface cysteines. This chapter details these methods and illustrates applications to two proteins from the bacterial chemotaxis pathway: the periplasmic galactose binding protein and the transmembrane aspartate receptor.
The aspartate receptor of bacterial chemotaxis is representative of a large class of membrane-spanning receptors found in prokaryotic and eukaryotic organisms. These receptors, which regulate ...histidine kinase pathways and possess two putative transmembrane helices per subunit, appear to control a wide variety of cellular processes. The best characterized subgroup of the two-helix receptor class is the homologous family of chemosensory receptors from Escherichia coli and Salmonella typhimurium, including the aspartate receptor. This receptor binds aspartate, an attractant, in the periplasmic compartment and undergoes an intramolecular, transmembrane conformational change, thereby modulating the autophosphorylation rate of a bound histidine kinase in the cytoplasm. Here, we analyze recent results from x-ray crystallographic, solution 19F NMR, and engineered disulfide studies probing the aspartate-induced structural change within the periplasmic and transmembrane regions of the receptor. Together, these approaches provide evidence that aspartate binding triggers a ``swinging-piston'' displacement of the second membrane-spanning helix, which is proposed to communicate the signal across the bilayer.
The Saccharomyces Genome Database (SGD) provides Internet access to the complete Saccharomyces cerevisiae genomic sequence, its genes and their products, the phenotypes of its mutants, and the ...literature supporting these data. The amount of information and the number of features provided by SGD have increased greatly following the release of the S.cerevisiae genomic sequence, which is currently the only complete sequence of a eukaryotic genome. SGD aids researchers by providing not only basic information, but also tools such as sequence similarity searching that lead to detailed information about features of the genome and relationships between genes. SGD presents information using a variety of user-friendly, dynamically created graphical displays illustrating physical, genetic and sequence feature maps. SGD can be accessed via the World Wide Web at http://genome-www.stanford.edu/Saccharomyces/
The aspartate receptor of the bacterial chemotaxis pathway regulates the autophosphorylation rate of a cytoplasmic histidine kinase in response to ligand binding. The transmembrane signal, which is ...transmitted from the periplasmic aspartate-binding domain to the cytoplasmic regulatory domain, is carried by an intramolecular conformational change within the homodimeric receptor structure. The present work uses engineered cysteines and disulfide bonds to probe the nature of this conformational change, focusing in particular on the role of the second transmembrane α-helix. Altogether 26 modifications, consisting of 13 cysteine pairs and the corresponding disulfide bonds, have been introduced into the contacts between the second transmembrane helix and adjacent helices. The effects of these modifications on the transmembrane signal have been quantified by in vitro assays which measure (i) ligand binding, (ii) receptor-mediated regulation of kinase activity, and (iii) receptor methylation. All three parameters are observed to be highly sensitive to perturbations of the second transmembrane helix. In particular, 13 of the 26 modifications (6 cysteine pairs and 7 disulfides) significantly increase or decrease aspartate affinity, while 15 of the 26 modifications (5 cysteine pairs and 10 disulfides) destroy transmembrane kinase regulation. Importantly, 3 of the perturbing disulfides are found to lock the receptor in the “on” or “off” signaling state by covalently constraining the second transmembrane helix, demonstrating that it is possible to use engineered disulfides to lock the signaling function of a receptor protein. A separate aspect of the study probes the thermal motions of the second transmembrane helix: 4 disulfides designed to trap large amplitude twisting motions are observed to disrupt function but form readily, suggesting that the helix is mobile. Together the results support a model in which the second transmembrane helix is a mobile signaling element responsible for communicating the transmembrane signal.
The Saccharomyces Genome Database (SGD) collects and organizes information about the molecular biology and genetics of the yeast Saccharomyces cerevisiae. The latest protein structure and comparison ...tools available at SGD are presented here. With the completion of the yeast sequence and the Caenorhabditis elegans sequence soon to follow, comparison of proteins from complete eukaryotic proteomes will be an extremely powerful way to learn more about a particular protein's structure, its function, and its relationships with other proteins. SGD can be accessed through the World Wide Web at http://genome-www.stanford.edu/Saccharomyces/