Noroviruses are major pathogens associated with acute gastroenteritis worldwide. Their RNA genomes are diverse, with two major genogroups (GI and GII) comprising at least 28 genotypes associated with human disease. To elucidate mechanisms underlying norovirus diversity and evolution, we used a large-scale genomics approach to analyze human norovirus sequences. Comparison of over 2000 nearly full-length ORF2 sequences representing most of the known GI and GII genotypes infecting humans showed a limited number (≤5) of distinct intra-genotypic variants within each genotype, with the exception of GII.4. The non-GII.4 genotypes comprised one or more intra-genotypic variants, with each variant containing strains that differed by only a few residues over several decades (remaining "static") and that have co-circulated with no clear epidemiologic pattern. In contrast, the GII.4 genotype presented the largest number of variants (>10), which have evolved over time with a clear pattern of periodic variant replacement. To expand our understanding of these two patterns of diversification ("static" versus "evolving"), we used next-generation sequencing (NGS) to analyze the nearly full-length norovirus genome in healthy individuals infected with GII.4, GII.6 or GII.17 viruses in different outbreak settings. The GII.4 viruses accumulated mutations rapidly within and between hosts, while the GII.6 and GII.17 viruses remained relatively stable, consistent with their diversification patterns. Further analysis of genetic relationships and natural history patterns identified groupings of certain genotypes into larger related clusters designated here as "immunotypes". We propose that "immunotypes" and their evolutionary patterns influence the prevalence of a particular norovirus genotype in the human population.
The Virus Pathogen Database and Analysis Resource (ViPR, www.ViPRbrc.org) is an integrated repository of data and analysis tools for multiple virus families, supported by the National Institute of Allergy and Infectious Diseases (NIAID) Bioinformatics Resource Centers (BRC) program. ViPR contains information for human pathogenic viruses belonging to the Arenaviridae, Bunyaviridae, Caliciviridae, Coronaviridae, Flaviviridae, Filoviridae, Hepeviridae, Herpesviridae, Paramyxoviridae, Picornaviridae, Poxviridae, Reoviridae, Rhabdoviridae and Togaviridae families, with plans to support additional virus families in the future. ViPR captures various types of information, including sequence records, gene and protein annotations, 3D protein structures, immune epitope locations, clinical and surveillance metadata and novel data derived from comparative genomics analysis. Analytical and visualization tools for metadata-driven statistical sequence analysis, multiple sequence alignment, phylogenetic tree construction, BLAST comparison and sequence variation determination are also provided. Data filtering and analysis workflows can be combined and the results saved in personal 'Workbenches' for future use. ViPR tools and data are available without charge as a service to the virology research community to help facilitate the development of diagnostics, prophylactics and therapeutics for priority pathogens and other viruses.
Antimicrobial peptides (AMPs) are anti-infectives that may represent a novel and untapped class of biotherapeutics. Increasing interest in AMPs means that new peptides (natural and synthetic) are discovered faster than ever before. We describe herein a new version of the Database of Antimicrobial Activity and Structure of Peptides (DBAASPv.2, which is freely accessible at http://dbaasp.org). This iteration of the database reports chemical structures and empirically determined activities (MICs, IC50, etc.) against more than 4200 specific target microbes for more than 2000 ribosomal, 80 non-ribosomal and 5700 synthetic peptides. The vast majority of these peptides are monomeric, but nearly 200 are found as homo- or heterodimers. More than 6100 of the peptides are linear, but about 515 are cyclic and more than 1300 have other intra-chain covalent bonds. More than half of the entries in the database were added after the resource was initially described, which reflects the recent sharp uptick of interest in AMPs. New features of DBAASPv.2 include: (i) user-friendly utilities and reporting functions, (ii) a 'Ranking Search' function to query the database by target species and return a ranked list of peptides with activity against that target and (iii) structural descriptions of the peptides derived from empirical data or calculated by molecular dynamics (MD) simulations. The three-dimensional structural data are critical components for understanding structure-activity relationships and for design of new antimicrobial drugs. We created more than 300 high-throughput MD simulations specifically for inclusion in DBAASP. The resulting structures are described in the database by novel trajectory analysis plots and movies. Another 200+ DBAASP entries have links to the Protein DataBank. All of the structures are easily visualized directly in the web browser.
Rapid sequencing of RNA/DNA from pathogen samples obtained during disease outbreaks provides critical scientific and public health information. However, challenges exist for exporting samples to laboratories or establishing conventional sequencers in remote outbreak regions. We successfully used a novel, pocket-sized nanopore sequencer at a field diagnostic laboratory in Liberia during the current Ebola virus outbreak.
Please cite this paper as: Squires et al. (2012) Influenza research database: an integrated bioinformatics resource for influenza research and surveillance. Influenza and Other Respiratory Viruses 6(6), 404–416.
Background: The recent emergence of the 2009 pandemic influenza A/H1N1 virus has highlighted the value of free and open access to influenza virus genome sequence data integrated with information about other important virus characteristics.
Design: The Influenza Research Database (IRD, http://www.fludb.org) is a free, open, publicly‐accessible resource funded by the U.S. National Institute of Allergy and Infectious Diseases through the Bioinformatics Resource Centers program. IRD provides a comprehensive, integrated database and analysis resource for influenza sequence, surveillance, and research data, including user‐friendly interfaces for data retrieval, visualization and comparative genomics analysis, together with personal login‐protected ‘workbench’ spaces for saving data sets and analysis results. IRD integrates genomic, proteomic, immune epitope, and surveillance data from a variety of sources, including public databases, computational algorithms, external research groups, and the scientific literature.
Results: To demonstrate the utility of the data and analysis tools available in IRD, two scientific use cases are presented. A comparison of hemagglutinin sequence conservation and epitope coverage information revealed highly conserved protein regions that can be recognized by the human adaptive immune system as possible targets for inducing cross‐protective immunity. Phylogenetic and geospatial analysis of sequences from wild bird surveillance samples revealed a possible evolutionary connection between influenza virus from Delaware Bay shorebirds and Alberta ducks.
Conclusions: The IRD provides a wealth of integrated data and information about influenza virus to support research of the genetic determinants dictating virus pathogenicity, host range restriction and transmission, and to facilitate development of vaccines, diagnostics, and therapeutics.
The symptoms of malaria are brought about by blood-stage parasites, which are established when merozoites invade human erythrocytes. Our understanding of the molecular events that underpin erythrocyte invasion remains hampered by the short period of time that merozoites are invasive. To address this challenge, a Plasmodium falciparum gamma-irradiated long-lived merozoite (LLM) line was developed and investigated. Erythrocyte invasion by purified LLMs was increased 10- to 300-fold compared with wild-type (WT) merozoites. Using an integrated omics approach, we investigated the basis for the phenotypic difference. Only a few single nucleotide polymorphisms within the P. falciparum genome were identified and only marginal differences were observed in the merozoite transcriptomes. By contrast, using label-free quantitative mass spectrometry, significant changes in protein abundance were noted, 200 of which involved proteins of unknown function. We determined the relative molar abundance of over 1100 proteins in LLMs and further characterized the major merozoite surface protein complex. A unique processed MSP1 intermediate was identified in LLMs but not observed in WT merozoites, suggesting that delayed processing may be important for the observed phenotype. This integrated approach has demonstrated the significant role of the merozoite proteome during erythrocyte invasion, while identifying numerous unknown proteins likely to be involved in invasion.
• Quantifying sequence similarity-temporal relationships revealed added structure among the sequences most similar to H1N1pdm09 by BLAST.
• One of the subgroups exhibits characteristics of the true evolutionary trajectory of the H1N1pdm09 lineage.
• Suggests that sequence similarity in the absence of a temporal component is not enough to correctly determine evolutionary relationships.
• Utilizing isolation year metadata enables more accurate tracking of the true evolutionary trajectory.
In 2009, a novel pandemic H1N1 influenza virus (H1N1pdm09) emerged, causing the first official influenza pandemic of the 21st century. Early genomic sequence analysis pointed to the swine origin of the virus. Here we report a novel computational approach to determine the evolutionary trajectory of viral sequences that uses data-driven estimations of nucleotide substitution rates to track the gradual accumulation of observed sequence alterations over time. Phylogenetic analysis and multiple sequence alignments show that sequences belonging to the resulting evolutionary trajectory of the H1N1pdm09 lineage exhibit a gradual accumulation of sequence variations and tight temporal correlations in the topological structure of the phylogenetic trees. These results suggest that our evolutionary trajectory analysis (ETA) can more effectively pinpoint the evolutionary history of viruses, including the host and geographical location traversed by each segment, when compared against either BLAST or traditional phylogenetic analysis alone.
Finding our way through phenotypes
Deans, Andrew R; Lewis, Suzanna E; Huala, Eva ...
PLoS Biology, 01/2015, Volume 13, Issue 1
Journal Article
Peer reviewed
Open access
Despite a large and multifaceted effort to understand the vast landscape of phenotypic data, their current form inhibits productive data analysis. The lack of a community-wide, consensus-based, human- and machine-interpretable language for describing phenotypes and their genomic and environmental contexts is perhaps the most pressing scientific bottleneck to integration across many key fields in biology, including genomics, systems biology, development, medicine, evolution, ecology, and systematics. Here we survey the current phenomics landscape, including data resources and handling, and the progress that has been made to accurately capture relevant data descriptions for phenotypes. We present an example of the kind of integration across domains that computable phenotypes would enable, and we call upon the broader biology community, publishers, and relevant funding agencies to support efforts to surmount today's data barriers and facilitate analytical reproducibility.