Anthrax, caused by the bacterium Bacillus anthracis, is a disease of historical and current importance that is found throughout the world. The basis of its historical transmission is anecdotal and ...its true global population structure has remained largely cryptic. Seven diverse B. anthracis strains were whole-genome sequenced to identify rare single nucleotide polymorphisms (SNPs), followed by phylogenetic reconstruction of these characters onto an evolutionary model. This analysis identified SNPs that define the major clonal lineages within the species. These SNPs, in concert with 15 variable number tandem repeat (VNTR) markers, were used to subtype a collection of 1,033 B. anthracis isolates from 42 countries to create an extensive genotype data set. These analyses subdivided the isolates into three previously recognized major lineages (A, B, and C), with further subdivision into 12 clonal sub-lineages or sub-groups and, finally, 221 unique MLVA15 genotypes. This rare genomic variation was used to document the evolutionary progression of B. anthracis and to establish global patterns of diversity. Isolates in the A lineage are widely dispersed globally, whereas the B and C lineages occur on more restricted spatial scales. Molecular clock models based upon genome-wide synonymous substitutions indicate there was a massive radiation of the A lineage that occurred in the mid-Holocene (3,064-6,127 ybp). On more recent temporal scales, the global population structure of B. anthracis reflects colonial-era importation of specific genotypes from the Old World into the New World, as well as the repeated industrial importation of diverse genotypes into developed countries via spore-contaminated animal products. These findings indicate humans have played an important role in the evolution of anthrax by increasing the proliferation and dispersal of this now global disease. Finally, the value of global genotypic analysis for investigating bioterrorist-mediated outbreaks of anthrax is demonstrated.
The three main species of the Bacillus cereus sensu lato, B. cereus, B. thuringiensis, and B. anthracis, were recognized and established by the early 1900 s because they each exhibited distinct ...phenotypic traits. B. thuringiensis isolates and their parasporal crystal proteins have long been established as a natural pesticide and insect pathogen. B. anthracis, the etiological agent for anthrax, was used by Robert Koch in the 19th century as a model to develop the germ theory of disease, and B. cereus, a common soil organism, is also an occasional opportunistic pathogen of humans. In addition to these three historical species designations, are three less-recognized and -understood species: B. mycoides, B. weihenstephanensis, and B. pseudomycoides. All of these "species" combined comprise the Bacillus cereus sensu lato group. Despite these apparently clear phenotypic definitions, early molecular approaches to separate the first three by various DNA hybridization and 16S/23S ribosomal sequence analyses led to some "confusion" because there were limited differences to differentiate between these species. These and other results have led to frequent suggestions that a taxonomic change was warranted to reclassify this group to a single species. But the pathogenic properties of B. anthracis and the biopesticide applications of B. thuringiensis appear to "have outweighed pure taxonomic considerations" and the separate species categories are still being maintained. B. cereus sensu lato represents a classic example of a now common bacterial species taxonomic quandary.
The pangenomic diversity in Burkholderia pseudomallei is high, with approximately 5.8% of the genome consisting of genomic islands. Genomic islands are known hotspots for recombination driven ...primarily by site-specific recombination associated with tRNAs. However, recombination rates in other portions of the genome are also high, a feature we expected to disrupt gene order. We analyzed the pangenome of 37 isolates of B. pseudomallei and demonstrate that the pangenome is 'open', with approximately 136 new genes identified with each new genome sequenced, and that the global core genome consists of 4568±16 homologs. Genes associated with metabolism were statistically overrepresented in the core genome, and genes associated with mobile elements, disease, and motility were primarily associated with accessory portions of the pangenome. The frequency distribution of genes present in between 1 and 37 of the genomes analyzed matches well with a model of genome evolution in which 96% of the genome has very low recombination rates but 4% of the genome recombines readily. Using homologous genes among pairs of genomes, we found that gene order was highly conserved among strains, despite the high recombination rates previously observed. High rates of gene transfer and recombination are incompatible with retaining gene order unless these processes are either highly localized to specific sites within the genome, or are characterized by symmetrical gene gain and loss. Our results demonstrate that both processes occur: localized recombination introduces many new genes at relatively few sites, and recombination throughout the genome generates the novel multi-locus sequence types previously observed while preserving gene order.
In December 2009, two unusual cases of anthrax were diagnosed in heroin users in Scotland. A subsequent anthrax outbreak in heroin users emerged throughout Scotland and expanded into England and ...Germany, sparking concern of nefarious introduction of anthrax spores into the heroin supply. To better understand the outbreak origin, we used established genetic signatures that provided insights about strain origin. Next, we sequenced the whole genome of a representative Bacillus anthracis strain from a heroin user (Ba4599), developed Ba4599-specific single-nucleotide polymorphism assays, and genotyped all available material from other heroin users with anthrax. Of 34 case-patients with B. anthracis-positive PCR results, all shared the Ba4599 single-nucleotide polymorphism genotype. Phylogeographic analysis demonstrated that Ba4599 was closely related to strains from Turkey and not to previously identified isolates from Scotland or Afghanistan, the presumed origin of the heroin. Our results suggest accidental contamination along the drug trafficking route through a cutting agent or animal hides used to smuggle heroin into Europe.
Following detection of putative Francisella species in aerosol samples from Houston, Texas, we surveyed soil and water samples from the area for the agent of tularemia, Francisella tularensis, and ...related species. The initial survey used 16S rRNA gene primers to detect Francisella species and related organisms by PCR amplification of DNA extracts from environmental samples. This analysis indicated that sequences related to Francisella were present in one water and seven soil samples. This is the first report of the detection of Francisella-related species in soil samples by DNA-based methods. Cloning and sequencing of PCR products indicated the presence of a wide variety of Francisella-related species. Sequences from two soil samples were 99.9% similar to previously reported sequences from F. tularensis isolates and may represent new subspecies. Additional analyses with primer sets developed for detection and differentiation of F. tularensis subspecies support the finding of very close relatives to known F. tularensis strains in some samples. While the pathogenicity of these organisms is unknown, they have the potential to be detected in F. tularensis-specific assays. Similarly, a potential new subspecies of Francisella philomiragia was identified. The majority of sequences obtained, while more similar to those of Francisella than to any other genus, were phylogenetically distinct from known species and formed several new clades potentially representing new species or genera. The results of this study revise our understanding of the diversity and distribution of Francisella and have implications for tularemia epidemiology and our ability to detect bioterrorist activities.
The global pattern of distribution of 1033 B. anthracis isolates has previously been defined by a set of 12 conserved canonical single nucleotide polymorphisms (canSNP). These studies reinforced the ...presence of three major lineages and 12 sub-lineages and sub-groups of this anthrax-causing pathogen. Isolates that form the A lineage (unlike the B and C lineages) have become widely dispersed throughout the world and form the basis for the geographical disposition of "modern" anthrax. An archival collection of 191 different B. anthracis isolates from China provides a glimpse into the possible role of Chinese trade and commerce in the spread of certain sub-lineages of this pathogen. Canonical single nucleotide polymorphism (canSNP) and multiple locus VNTR analysis (MLVA) typing has been used to examine this archival collection of isolates.
The canSNP study indicates that there are 5 different sub-lineages/sub-groups in China out of 12 previously described world-wide canSNP genotypes. Three of these canSNP genotypes were only found in the western-most province of China, Xinjiang. These genotypes were A.Br.008/009, a sub-group that is spread across most of Europe and Asia; A.Br.Aust 94, a sub-lineage that is present in Europe and India, and A.Br.Vollum, a lineage that is also present in Europe. The remaining two canSNP genotypes are spread across the whole of China and belong to sub-group A.Br.001/002 and the A.Br.Ames sub-lineage, two closely related genotypes. MLVA typing adds resolution to the isolates in each canSNP genotype and diversity indices for the A.Br.008/009 and A.Br.001/002 sub-groups suggest that these represent older and established clades in China.
B. anthracis isolates were recovered from three canSNP sub-groups (A.Br.008/009, A.Br.Aust94, and A.Br.Vollum) in the western most portion of the large Chinese province of Xinjiang. The city of Kashi in this province appears to have served as a crossroads for not only trade but the movement of diseases such as anthrax along the ancient "silk road". Phylogenetic inference also suggests that the A.Br.Ames sub-lineage, first identified in the original Ames strain isolated from Jim Hogg County, TX, is descended from the A.Br.001/002 sub-group that has a major presence in most of China. These results suggest a genetic discontinuity between the younger Ames sub-lineage in Texas and the large Western North American sub-lineage spread across central Canada and the Dakotas.
Bacillus cereus is ubiquitous in nature, and while most isolates appear to be harmless, some are associated with food-borne illnesses, periodontal diseases, and other more serious infections. In one ...such infection, B. cereus G9241 was identified as the causative agent of a severe pneumonia in a Louisiana welder in 1994. This isolate was found to harbor most of the B. anthracis virulence plasmid pXO1 (13). Here we report the characterization of two clinical and one environmental B. cereus isolate collected during an investigation of two fatal pneumonia cases in Texas metal workers. Molecular subtyping revealed that the two cases were not caused by the same strain. However, one of the three isolates was indistinguishable from B. cereus G9241. PCR analysis demonstrated that both clinical isolates contained B. anthracis pXO1 toxin genes. One clinical isolate and the environmental isolate collected from that victim's worksite contained the cap A, B, and C genes required for capsule biosynthesis in B. anthracis. Both clinical isolates expressed a capsule; however, neither was composed of poly-D-glutamic acid. Although most B. cereus isolates are not opportunistic pathogens and only a limited number cause food-borne illnesses, these results demonstrate that some B. cereus strains can cause severe and even fatal infections in patients who appear to be otherwise healthy.
Phylogenetic hypotheses using whole genome sequences have the potential for unprecedented accuracy, yet a failure to understand issues associated with discovery bias, character sampling, and strain ...sampling can lead to highly erroneous conclusions. For microbial pathogens, phylogenies derived from whole genome sequences are becoming more common, as large numbers of characters distributed across entire genomes can yield extremely accurate phylogenies, particularly for strictly clonal populations. The availability of whole genomes is increasing as new sequencing technologies reduce the cost and time required for genome sequencing. Until entire sample collections can be fully sequenced, harnessing the phylogenetic power from whole genome sequences in more than a small subset of fully sequenced strains requires the integration of whole genome and partial genome genotyping data. Such integration involves discovering evolutionarily stable polymorphic characters by whole genome comparisons, then determining allelic states across a wide panel of isolates using high-throughput genotyping technologies. Here, we demonstrate how such an approach using single nucleotide polymorphisms (SNPs) yields highly accurate, but biased phylogenetic reconstructions and how the accuracy of the resulting tree is compromised by incomplete taxon and character sampling. Despite recent phylogenetic work detailing the strengths and biases of integrating whole genome and partial genome genotype data, these issues are relatively new and remain poorly understood by many researchers. Here, we revisit these biases and provide strategies for maximizing phylogenetic accuracy. Although we write this review with bacterial pathogens in mind, these concepts apply to any clonally reproducing population or indeed to any evolutionarily stable marker that is inherited in a strictly clonal manner. Understanding the ways in which current and emerging technologies can be used to maximize phylogenetic knowledge is advantageous only with a complete understanding of the strengths and weaknesses of these methods.
DNA from over 300 Bacillus thuringiensis, Bacillus cereus, and Bacillus anthracis isolates was analyzed by fluorescent amplified fragment length polymorphism (AFLP). B. thuringiensis and B. cereus ...isolates were from diverse sources and locations, including soil, clinical isolates and food products causing diarrheal and emetic outbreaks, and type strains from the American Type Culture Collection, and over 200 B. thuringiensis isolates representing 36 serovars or subspecies were from the U.S. Department of Agriculture collection. Twenty-four diverse B. anthracis isolates were also included. Phylogenetic analysis of AFLP data revealed extensive diversity within B. thuringiensis and B. cereus compared to the monomorphic nature of B. anthracis. All of the B. anthracis strains were more closely related to each other than to any other Bacillus isolate, while B. cereus and B. thuringiensis strains populated the entire tree. Ten distinct branches were defined, with many branches containing both B. cereus and B. thuringiensis isolates. A single branch contained all the B. anthracis isolates plus an unusual B. thuringiensis isolate that is pathogenic in mice. In contrast, B. thuringiensis subsp. kurstaki (ATCC 33679) and other isolates used to prepare insecticides mapped distal to the B. anthracis isolates. The interspersion of B. cereus and B. thuringiensis isolates within the phylogenetic tree suggests that phenotypic traits used to distinguish between these two species do not reflect the genomic content of the different isolates and that horizontal gene transfer plays an important role in establishing the phenotype of each of these microbes. B. thuringiensis isolates of a particular subspecies tended to cluster together.
Disease introduction into the New World during colonial expansion is well documented and had a major impact on indigenous populations; however, few diseases have been associated with early human ...migrations into North America. During the late Pleistocene epoch, Asia and North America were joined by the Beringian Steppe ecosystem which allowed animals and humans to freely cross what would become a water barrier in the Holocene. Anthrax has clearly been shown to be dispersed by human commerce and trade in animal products contaminated with Bacillus anthracis spores. Humans appear to have brought B. anthracis to this area from Asia and then moved it further south as an ice-free corridor opened in central Canada approximately 13,000 ybp. In this study, we have defined the evolutionary history of Western North American (WNA) anthrax using 2,850 single nucleotide polymorphisms (SNPs) and 285 geographically diverse B. anthracis isolates. Phylogeography of the major WNA B. anthracis clone reveals ancestral populations in northern Canada with progressively derived populations to the south; the most recent ancestor of this clonal lineage is in Eurasia. Our phylogeographic patterns are consistent with B. anthracis arriving with humans via the Bering Land Bridge. This northern-origin hypothesis is highly consistent with our phylogeographic patterns and rates of SNP accumulation observed in current day B. anthracis isolates. Continent-wide dispersal of WNA B. anthracis likely required movement by later European colonizers, but the continent's first inhabitants may have seeded the initial North American populations.