Abstract
Background
Gonorrhea, caused by the bacterium Neisseria gonorrhoeae, is a globally prevalent sexually transmitted infection. The dynamics of gonococcal population biology have been poorly ...defined due to a lack of resolution in strain typing methods.
Methods
In this study, we assess how the core genome can be used to improve our understanding of gonococcal population structure compared with current typing schemes.
Results
A total of 1668 loci were identified as core to the gonococcal genome. These were organized into a core genome multilocus sequence typing scheme (N gonorrhoeae cgMLST v1.0). A clustering algorithm using a threshold of 400 allelic differences between isolates resolved gonococci into discrete and stable core genome groups, some of which persisted for multiple decades. These groups were associated with antimicrobial genotypes and non-overlapping NG-STAR and NG-MAST sequence types. The MLST-STs were more widely distributed among core genome groups.
Conclusions
Clustering with cgMLST identified globally distributed, persistent, gonococcal lineages improving understanding of the population biology of gonococci and revealing its population structure. These findings have implications for the emergence of antimicrobial resistance in gonococci and how this is associated with lineages, some of which are more predisposed to developing antimicrobial resistance than others.
Availability of gonococcal whole genome sequence data enhances our understanding of the population biology of this increasingly antimicrobial-resistant human pathogen. Here, genes core to the gonococcus are defined, which can be exploited to track emergence and transmission.
Pathogenic Neisseria meningitidis isolates contain a polysaccharide capsule that is the main virulence determinant for this bacterium. Thirteen capsular polysaccharides have been described, and ...nuclear magnetic resonance spectroscopy has enabled determination of the structure of capsular polysaccharides responsible for serogroup specificity. Molecular mechanisms involved in N. meningitidis capsule biosynthesis have also been identified, and genes involved in this process and in cell surface translocation are clustered at a single chromosomal locus termed cps. The use of multiple names for some of the genes involved in capsule synthesis, combined with the need for rapid diagnosis of serogroups commonly associated with invasive meningococcal disease, prompted a requirement for a consistent approach to the nomenclature of capsule genes. In this report, a comprehensive description of all N. meningitidis serogroups is provided, along with a proposed nomenclature, which was presented at the 2012 XVIIIth International Pathogenic Neisseria Conference.
No single genealogical reconstruction or typing method currently encompasses all levels of bacterial diversity, from domain to strain. We propose ribosomal multilocus sequence typing (rMLST), an ...approach which indexes variation of the 53 genes encoding the bacterial ribosome protein subunits (rps genes), as a means of integrating microbial genealogy and typing. As with multilocus sequence typing (MLST), rMLST employs curated reference sequences to identify gene variants efficiently and rapidly. The rps loci are ideal targets for a universal characterization scheme as they are: (i) present in all bacteria; (ii) distributed around the chromosome; and (iii) encode proteins which are under stabilizing selection for functional conservation. Collectively, the rps loci exhibit variation that resolves bacteria into groups at all taxonomic and most typing levels, providing significantly more resolution than 16S small subunit rRNA gene phylogenies. A web-accessible expandable database, comprising whole-genome data from more than 1900 bacterial isolates, including 28 draft genomes assembled de novo from the European Bioinformatics Institute (EBI) sequence read archive, has been assembled. The rps gene variation catalogued in this database permits rapid and computationally non-intensive identification of the phylogenetic position of any bacterial sequence at the domain, phylum, class, order, family, genus, species and strain levels. The groupings generated with rMLST data are consistent with current nomenclature schemes and independent of the clustering algorithm used. This approach is applicable to the other domains of life, potentially providing a rational and universal approach to the classification of life that is based on one of its fundamental features, the translation mechanism.
Many invasive bacterial diseases are caused by organisms that are ordinarily harmless components of the human microbiome. Effective interventions against these microbes require an understanding of ...the processes whereby symbiotic or commensal relationships transition into pathology. Here, we describe bacterial genome-wide association studies (GWAS) of
Neisseria meningitidis
, a common commensal of the human respiratory tract that is nevertheless a leading cause of meningitis and sepsis. An initial GWAS discovered bacterial genetic variants, including single nucleotide polymorphisms (SNPs), associated with invasive meningococcal disease (IMD)
versus
carriage in several loci across the meningococcal genome, encoding antigens and other extracellular components, confirming the polygenic nature of the invasive phenotype. In particular, there was a significant peak of association around the
fHbp
locus, encoding factor H binding protein (fHbp), which promotes bacterial immune evasion of human complement by recruiting complement factor H (CFH) to the meningococcal surface. The association around
fHbp
with IMD was confirmed by a validation GWAS, and we found that the SNPs identified in the validation affected the 5’ region of
fHbp
mRNA, altering secondary RNA structures, thereby increasing fHbp expression and enhancing bacterial escape from complement-mediated killing. This finding is consistent with the known link between complement deficiencies and CFH variation with human susceptibility to IMD. These observations demonstrate the importance of human and bacterial genetic variation across the fHbp:CFH interface in determining IMD susceptibility, the transition from carriage to disease.
(group B streptococcus; GBS) is a colonizer of the gastrointestinal and urogenital tracts, and an opportunistic pathogen of infants and adults. The worldwide population of GBS is characterized by ...clonal complexes (CCs) with different invasive potentials. CC17, for example, is a hypervirulent lineage commonly associated with neonatal sepsis and meningitis, while CC1 is less invasive in neonates and more commonly causes invasive disease in adults with comorbidities. The genetic basis of GBS virulence and the extent to which different CCs have adapted to different host environments remain uncertain. We have therefore applied a pan-genome-wide association study (GWAS) approach to 1,988 GBS strains isolated from different hosts and countries. Our analysis identified 279 CC-specific genes associated with virulence, disease, metabolism, and regulation of cellular mechanisms that may explain the differential virulence potential of particular CCs. In CC17 and CC23, for example, we have identified genes encoding pilus, quorum-sensing proteins, and proteins for the uptake of ions and micronutrients which are absent in less invasive lineages. Moreover, in CC17, carriage and disease strains were distinguished by the allelic variants of 21 of these CC-specific genes. Together our data highlight the lineage-specific basis of GBS niche adaptation and virulence.
GBS is a leading cause of mortality in newborn babies in high- and low-income countries worldwide. Different strains of GBS are characterized by different degrees of virulence, where some are harmlessly carried by humans or animals and others are much more likely to cause disease.The genome sequences of almost 2,000 GBS samples isolated from both animals and humans in high- and low- income countries were analyzed using a pan-genome-wide association study approach. This allowed us to identify 279 genes which are associated with different lineages of GBS, characterized by a different virulence and preferred host. Additionally, we propose that the GBS now carried in humans may have first evolved in animals before expanding clonally once adapted to the human host.These findings are essential to help understand what is causing GBS disease and how the bacteria have evolved and are transmitted.
The human pathogens N. gonorrhoeae and N. meningitidis display robust intra- and interstrain glycan diversity associated with their O-linked protein glycosylation (pgl) systems. In an effort to ...better understand the evolution and function of protein glycosylation operating there, we aimed to determine if other human-restricted, Neisseria species similarly glycosylate proteins and if so, to assess the levels of glycoform diversity. Comparative genomics revealed the conservation of a subset of genes minimally required for O-linked protein glycosylation glycan and established those pgl genes as core genome constituents of the genus. In conjunction with mass spectrometric-based glycan phenotyping, we found that extant glycoform repertoires in N. gonorrhoeae, N. meningitidis and the closely related species N. polysaccharea and N. lactamica reflect the functional replacement of a progenitor glycan biosynthetic pathway. This replacement involved loss of pgl gene components of the primordial pathway coincident with the acquisition of two exogenous glycosyltransferase genes. Critical to this discovery was the identification of a ubiquitous but previously unrecognized glycosyltransferase gene (pglP) that has uniquely undergone parallel but independent pseudogenization in N. gonorrhoeae and N. meningitidis. We suggest that the pseudogenization events are driven by processes of compositional epistasis leading to gene decay. Additionally, we documented instances where inter-species recombination influences pgl gene status and creates discordant genetic interactions due ostensibly to the multi-locus nature of pgl gene networks. In summary, these findings provide a novel perspective on the evolution of protein glycosylation systems and identify phylogenetically informative, genetic differences associated with Neisseria species.
The bacterium
causes the sexually transmitted infection gonorrhoea. Although diverse clinical manifestations are associated with gonorrhoea, ranging from asymptomatic through to localized and ...disseminated infection, very little is known about the bacterial determinants implicated in causing such different clinical symptoms. In particular, virulence factors, although defined and investigated in particular strains, often lack comprehensive analysis of their genetic diversity and how this relates to particular disease states. This review examines the clinical manifestations of gonorrhoea and discusses them in relation to disease severity and association with expression of particular virulence factors including PorB, lipooligosaccharide (LOS) and Opa, both in terms of their mechanisms of action and inter- and intra-strain variation. Particular attention is paid to phase variation as a key mechanism of genetic variation in the gonococcus and the impact of this during infection. We describe how whole-genome-sequence-based approaches that focus on virulence factors can be employed for vaccine development and discuss whether whole-genome-sequence data can be used to predict the severity of gonococcal infection.
Many host-adapted bacterial pathogens contain DNA methyltransferases (mod genes) that are subject to phase-variable expression (high-frequency reversible ON/OFF switching of gene expression). In ...Haemophilus influenzae, the random switching of the modA gene controls expression of a phase-variable regulon of genes (a "phasevarion"), via differential methylation of the genome in the modA ON and OFF states. Phase-variable mod genes are also present in Neisseria meningitidis and Neisseria gonorrhoeae, suggesting that phasevarions may occur in these important human pathogens. Phylogenetic studies on phase-variable mod genes associated with type III restriction modification (R-M) systems revealed that these organisms have two distinct mod genes--modA and modB. There are also distinct alleles of modA (abundant: modA11, 12, 13; minor: modA4, 15, 18) and modB (modB1, 2). These alleles differ only in their DNA recognition domain. ModA11 was only found in N. meningitidis and modA13 only in N. gonorrhoeae. The recognition site for the modA13 methyltransferase in N. gonorrhoeae strain FA1090 was identified as 5'-AGAAA-3'. Mutant strains lacking the modA11, 12 or 13 genes were made in N. meningitidis and N. gonorrhoeae and their phenotype analyzed in comparison to a corresponding mod ON wild-type strain. Microarray analysis revealed that in all three modA alleles multiple genes were either upregulated or downregulated, some of which were virulence-associated. For example, in N. meningitidis MC58 (modA11), differentially expressed genes included those encoding the candidate vaccine antigens lactoferrin binding proteins A and B. Functional studies using N. gonorrhoeae FA1090 and the clinical isolate O1G1370 confirmed that modA13 ON and OFF strains have distinct phenotypes in antimicrobial resistance, in a primary human cervical epithelial cell model of infection, and in biofilm formation. This study, in conjunction with our previous work in H. influenzae, indicates that phasevarions may be a common strategy used by host-adapted bacterial pathogens to randomly switch between "differentiated" cell types.
Summary Background Invasive meningococcal disease (IMD) is a worldwide health issue that is potentially preventable with vaccination. In view of its sporadic nature and the high diversity of ...Neisseria meningitidis , epidemiological surveillance incorporating detailed isolate characterisation is crucial for effective control and understanding the evolving epidemiology of IMD. The Meningitis Research Foundation Meningococcus Genome Library (MRF-MGL) exploits whole-genome sequencing (WGS) for this purpose and presents data on a comprehensive and coherent IMD isolate collection from England and Wales via the internet. We assessed the contribution of these data to investigating IMD epidemiology. Methods WGS data were obtained for all 899 IMD isolates available for England and Wales in epidemiological years 2010–11 and 2011–12. The data had been annotated at 1720 loci, analysed, and disseminated online. Information was also available on meningococcal population structure and vaccine (Bexsero, GlaxoSmithKline, Brentford, Middlesex, UK) antigen variants, which enabled the investigation of IMD-associated genotypes over time and by patients' age groups. Population genomic analyses were done with a hierarchical gene-by-gene approach. Findings The methods used by MRF-MGL efficiently characterised IMD isolates and information was provided in plain language. At least 20 meningococcal lineages were identified, three of which (hyperinvasive clonal complexes 41/44 lineage 3, 269 lineage 2, and 23 lineage 23) were responsible for 528 (59%) of IMD isolates. Lineages were highly diverse and showed evidence of extensive recombination. Specific lineages were associated with IMD in particular age groups, with notable diversity in the youngest and oldest individuals. The increased incidence of IMD from 1984 to 2010 in England and Wales was due to successive and concurrent epidemics of different lineages. Genetically, 74% of isolates were characterised as encoding group B capsules: 16% group Y, 6% group W, and 3% group C. Exact peptide matches for individual Bexsero vaccine antigens were present in up to 26% of isolates. Interpretation The MRF-MGL represents an effective, broadly applicable model for the storage, analysis, and dissemination of WGS data that can facilitate real-time genomic pathogen surveillance. The data revealed information crucial to effective deployment and assessment of vaccines against N meningitidis. Funding Meningitis Research Foundation, Wellcome Trust, Public Health England, European Union.