The complete DNA sequence of the aerobic cellulolytic soil bacterium Cytophaga hutchinsonii, which belongs to the phylum Bacteroidetes, is presented. The genome consists of a single, circular, ...4.43-Mb chromosome containing 3,790 open reading frames, 1,986 of which have been assigned a tentative function. Two of the most striking characteristics of C. hutchinsonii are its rapid gliding motility over surfaces and its contact-dependent digestion of crystalline cellulose. The mechanism of C. hutchinsonii motility is not known, but its genome contains homologs for each of the gld genes that are required for gliding of the distantly related bacteroidete Flavobacterium johnsoniae. Cytophaga-Flavobacterium gliding appears to be novel and does not involve well-studied motility organelles such as flagella or type IV pili. Many genes thought to encode proteins involved in cellulose utilization were identified. These include candidate endo-β-1,4-glucanases and β-glucosidases. Surprisingly, obvious homologs of known cellobiohydrolases were not detected. Since such enzymes are needed for efficient cellulose digestion by well-studied cellulolytic bacteria, C. hutchinsonii either has novel cellobiohydrolases or has an unusual method of cellulose utilization. Genes encoding proteins with cohesin domains, which are characteristic of cellulosomes, were absent, but many proteins predicted to be involved in polysaccharide utilization had putative D5 domains, which are thought to be involved in anchoring proteins to the cell surface.
Web services application programming interface (API) was developed to provide a programmatic access to the regulatory interactions accumulated in the RegPrecise database (http://regprecise.lbl.gov), ...a core resource on transcriptional regulation for the microbial domain of the Department of Energy (DOE) Systems Biology Knowledgebase. RegPrecise captures and visualize regulogs, sets of genes controlled by orthologous regulators in several closely related bacterial genomes, that were reconstructed by comparative genomics. The current release of RegPrecise 2.0 includes >1400 regulogs controlled either by protein transcription factors or by conserved ribonucleic acid regulatory motifs in >250 genomes from 24 taxonomic groups of bacteria. The reference regulons accumulated in RegPrecise can serve as a basis for automatic annotation of regulatory interactions in newly sequenced genomes. The developed API provides an efficient access to the RegPrecise data by a comprehensive set of 14 web service resources. The RegPrecise web services API is freely accessible at http://regprecise.lbl.gov/RegPrecise/services.jsp with no login requirements.
The marine bacterium strain MC-1 is a member of the alpha subgroup of the proteobacteria that contains the magnetotactic cocci and was the first member of this group to be cultured axenically. The ...magnetotactic cocci are not closely related to any other known alphaproteobacteria and are only distantly related to other magnetotactic bacteria. The genome of MC-1 contains an extensive (102 kb) magnetosome island that includes numerous genes that are conserved among all known magnetotactic bacteria, as well as some genes that are unique. Interestingly, certain genes that encode proteins considered to be important in magnetosome assembly (mamJ and mamW) are absent from the genome of MC-1. Magnetotactic cocci exhibit polar magneto-aerotaxis, and the MC-1 genome contains a relatively large number of identified chemotaxis genes. Although MC-1 is capable of both autotrophic and heterotrophic growth, it does not appear to be metabolically versatile, with heterotrophic growth confined to the utilization of acetate. Central carbon metabolism is encoded by genes for the citric acid cycle (oxidative and reductive), glycolysis, and gluconeogenesis. The genome also reveals the presence or absence of specific genes involved in the nitrogen, sulfur, iron, and phosphate metabolism of MC-1, allowing us to infer the presence or absence of specific biochemical pathways in strain MC-1. The pathways inferred from the MC-1 genome provide important information regarding central metabolism in this strain that could provide insights useful for the isolation and cultivation of new magnetotactic bacterial strains, in particular strains of other magnetotactic cocci.
Whether Vibrio mimicus is a variant of Vibrio cholerae or a separate species has been the subject of taxonomic controversy. A genomic analysis was undertaken to resolve the issue. The genomes of V. ...mimicus MB451, a clinical isolate, and VM223, an environmental isolate, comprise ca. 4,347,971 and 4,313,453 bp and encode 3,802 and 3,290 ORFs, respectively. As in other vibrios, chromosome I (C-I) predominantly contains genes necessary for growth and viability, whereas chromosome II (C-II) bears genes for adaptation to environmental change. C-I harbors many virulence genes, including some not previously reported in V. mimicus, such as mannose-sensitive hemagglutinin (MSHA), and enterotoxigenic hemolysin (HlyA); C-II encodes a variant of Vibrio pathogenicity island 2 (VPI-2), and Vibrio seventh pandemic island II (VSP-II) cluster of genes. Extensive genomic rearrangement in C-II indicates it is a hot spot for evolution and genesis of speciation for the genus Vibrio. The number of virulence regions discovered in this study (VSP-II, MSHA, HlyA, type IV pilin, PilE, and integron integrase, IntI4) with no notable difference in potential virulence genes between clinical and environmental strains suggests these genes also may play a role in the environment and that pathogenic strains may arise in the environment. Significant genome synteny with prototypic pre-seventh pandemic strains of V. cholerae was observed, and the results of phylogenetic analysis support the hypothesis that, in the course of evolution, V. mimicus and V. cholerae diverged from a common ancestor with a prototypic sixth pandemic genomic backbone.
We have developed two novel methods for Singular Value Decomposition analysis (SVD) of microarray data. The first is a threshold-based method for obtaining gene groups, and the second is a method for ...obtaining a measure of confidence in SVD analysis. Gene groups are obtained by identifying elements of the left singular vectors, or gene coefficient vectors, that are greater in magnitude than the threshold \batchmode \documentclassfleqn,10pt,legalpaper{article} \usepackage{amssymb} \usepackage{amsfonts} \usepackage{amsmath} \pagestyle{empty} \begin{document} \(WN^{{-}1/2}\) \end{document}, where \batchmode \documentclassfleqn,10pt,legalpaper{article} \usepackage{amssymb} \usepackage{amsfonts} \usepackage{amsmath} \pagestyle{empty} \begin{document} \(N\) \end{document}is the number of genes, and \batchmode \documentclassfleqn,10pt,legalpaper{article} \usepackage{amssymb} \usepackage{amsfonts} \usepackage{amsmath} \pagestyle{empty} \begin{document} \(W\) \end{document}is a weight factor whose default value is 3. The groups are non-exclusive and may contain genes of opposite (i.e. inversely correlated) regulatory response. The confidence measure is obtained by systematically deleting assays from the data set, interpolating the SVD of the reduced data set to reconstruct the missing assay, and calculating the Pearson correlation between the reconstructed assay and the original data. This confidence measure is applicable when each experimental assay corresponds to a value of parameter that can be interpolated, such as time, dose or concentration. Algorithms for the grouping method and the confidence measure are available in a software application called SVD Microarray ANalysis (SVDMAN). In addition to calculating the SVD for generic analysis, SVDMAN provides a new means for using microarray data to develop hypotheses for gene associations and provides a measure of confidence in the hypotheses, thus extending current SVD research in the area of global gene expression analysis. Availability: ftp://bpublic.lanl.gov/compbio/software Contact: brettin@lanl.gov Supplementary information: http://home.lanl.gov/svdman * To whom correspondence should be addressed.