Although de novo assembly graphs contain assembled contigs (nodes), the connections between those contigs (edges) are difficult for users to access. Bandage (a Bioinformatics Application for ...Navigating De novo Assembly Graphs Easily) is a tool for visualizing assembly graphs with connections. Users can zoom in to specific areas of the graph and interact with it by moving nodes, adding labels, changing colors and extracting sequences. BLAST searches can be performed within the Bandage graphical user interface and the hits are displayed as highlights in the graph. By displaying connections between contigs, Bandage presents new possibilities for analyzing de novo assemblies that are not possible through investigation of contigs alone.
Source code and binaries are freely available at https://github.com/rrwick/Bandage. Bandage is implemented in C++ and supported on Linux, OS X and Windows. A full feature list and screenshots are available at http://rrwick.github.io/Bandage.
rrwick@gmail.com
Supplementary data are available at Bioinformatics online.
Rapid molecular typing of bacterial pathogens is critical for public health epidemiology, surveillance and infection control, yet routine use of whole genome sequencing (WGS) for these purposes poses ...significant challenges. Here we present SRST2, a read mapping-based tool for fast and accurate detection of genes, alleles and multi-locus sequence types (MLST) from WGS data. Using >900 genomes from common pathogens, we show SRST2 is highly accurate and outperforms assembly-based methods in terms of both gene detection and allele assignment. We include validation of SRST2 within a public health laboratory, and demonstrate its use for microbial genome surveillance in the hospital setting. In the face of rising threats of antimicrobial resistance and emerging virulence among bacterial pathogens, SRST2 represents a powerful tool for rapidly extracting clinically useful information from raw WGS data. Source code is available from http://katholt.github.io/srst2/.
Klebsiella pneumoniae is now recognized as an urgent threat to human health because of the emergence of multidrug-resistant strains associated with hospital outbreaks and hypervirulent strains ...associated with severe community-acquired infections. K. pneumoniae is ubiquitous in the environment and can colonize and infect both plants and animals. However, little is known about the population structure of K. pneumoniae, so it is difficult to recognize or understand the emergence of clinically important clones within this highly genetically diverse species. Here we present a detailed genomic framework for K. pneumoniae based on whole-genome sequencing of more than 300 human and animal isolates spanning four continents. Our data provide genome-wide support for the splitting of K. pneumoniae into three distinct species, KpI (K. pneumoniae), KpII (K. quasipneumoniae), and KpIII (K. variicola). Further, for K. pneumoniae (KpI), the entity most frequently associated with human infection, we show the existence of >150 deeply branching lineages including numerous multidrug-resistant or hypervirulent clones. We show K. pneumoniae has a large accessory genome approaching 30,000 protein-coding genes, including a number of virulence functions that are significantly associated with invasive community-acquired disease in humans. In our dataset, antimicrobial resistance genes were common among human carriage isolates and hospital-acquired infections, which generally lacked the genes associated with invasive disease. The convergence of virulence and resistance genes potentially could lead to the emergence of untreatable invasive K. pneumoniae infections; our data provide the whole-genome framework against which to track the emergence of such threats.
Genomic sequencing has significant potential to inform public health management for SARS-CoV-2. Here we report high-throughput genomics for SARS-CoV-2, sequencing 80% of cases in Victoria, Australia ...(population 6.24 million) between 6 January and 14 April 2020 (total 1,333 COVID-19 cases). We integrate epidemiological, genomic and phylodynamic data to identify clusters and impact of interventions. The global diversity of SARS-CoV-2 is represented, consistent with multiple importations. Seventy-six distinct genomic clusters were identified, including large clusters associated with social venues, healthcare and cruise ships. Sequencing sequential samples from 98 patients reveals minimal intra-patient SARS-CoV-2 genomic diversity. Phylodynamic modelling indicates a significant reduction in the effective viral reproductive number (R
) from 1.63 to 0.48 after implementing travel restrictions and physical distancing. Our data provide a concrete framework for the use of SARS-CoV-2 genomics in public health responses, including its use to rapidly identify SARS-CoV-2 transmission chains, increasingly important as social restrictions ease globally.
To describe the first isolation and sequencing of SARS-CoV-2 in Australia and rapid sharing of the isolate.
SARS-CoV-2 was isolated from a 58-year-old man from Wuhan, China who arrived in Melbourne ...on 19 January 2020 and was admitted to the Monash Medical Centre, Melbourne from the emergency department on 24 January 2020 with fever, cough, and progressive dyspnoea.
Clinical course and laboratory features of the first reported case of COVID-19 (the illness caused by SARS-CoV-2) in Australia; isolation, whole genome sequencing, imaging, and rapid sharing of virus from the patient.
A nasopharyngeal swab and sputum collected when the patient presented to hospital were each positive for SARS-CoV-2 (reverse transcription polymerase chain reaction). Inoculation of Vero/hSLAM cells with material from the nasopharyngeal swab led to the isolation of SARS-CoV-2 virus in culture. Electron microscopy of the supernatant confirmed the presence of virus particles with morphology characteristic of viruses of the family Coronaviridae. Whole genome sequencing of the viral isolate and phylogenetic analysis indicated the isolate exhibited greater than 99.99% sequence identity with other publicly available SARS-CoV-2 genomes. Within 24 hours of isolation, the first Australian SARS-CoV-2 isolate was shared with local and overseas reference laboratories and major North American and European culture collections.
The ability to rapidly identify, propagate, and internationally share our SARS-CoV-2 isolate is an important step in collaborative scientific efforts to deal effectively with this international public health emergency by developing better diagnostic procedures, vaccine candidates, and antiviral agents.
Realising the promise of genomics to revolutionise identification and surveillance of antimicrobial resistance (AMR) has been a long-standing challenge in clinical and public health microbiology. ...Here, we report the creation and validation of abritAMR, an ISO-certified bioinformatics platform for genomics-based bacterial AMR gene detection. The abritAMR platform utilises NCBI's AMRFinderPlus, as well as additional features that classify AMR determinants into antibiotic classes and provide customised reports. We validate abritAMR by comparing with PCR or reference genomes, representing 1500 different bacteria and 415 resistance alleles. In these analyses, abritAMR displays 99.9% accuracy, 97.9% sensitivity and 100% specificity. We also compared genomic predictions of phenotype for 864 Salmonella spp. against agar dilution results, showing 98.9% accuracy. The implementation of abritAMR in our institution has resulted in streamlined bioinformatics and reporting pathways, and has been readily updated and re-verified. The abritAMR tool and validation datasets are publicly available to assist laboratories everywhere harness the power of AMR genomics in professional practice.
Terrestrial life in Antarctica has been described as some of the simplest on the planet, and mainly confined to soil microfaunal communities. Studies have suggested that the lack of diversity is due ...to extreme environmental conditions and thought to be driven by abiotic factors. In this study we investigated soil microfauna composition, abundance, and distribution in East Antarctica, and assessed correlations with soil geochemistry and environmental variables. We examined 109 soil samples from a wide range of ice-free habitats, spanning 2000 km from Framnes Mountains to Bailey Peninsula. Microfauna across all samples were patchily distributed, from complete absence of invertebrates to over 1600 specimens/gram of dry weight of soil (gdw), with highest microfauna abundance observed in samples with visible vegetation. Bdelloid rotifers were on average the most widespread found in 87% of sampled sites and the most abundant (44 specimens/gdw). Tardigrades occurred in 57% of the sampled sites with an abundance of 12 specimens/gdw. Nematodes occurred in 71% of samples with a total abundance of 3 specimens/gdw. Ciliates and mites were rarely found in soil samples, with an average abundance of 1.3 and 0.04 specimens/gdw, respectively. We found that microfaunal composition and abundance were mostly correlated with the soil geochemical parameters; phosphorus, NO3 (-) and salinity, and likely to be the result of soil properties and historic landscape formation and alteration, rather than the geographic region they were sampled from. Studies focusing on Antarctic biodiversity must take into account soil geochemical and environmental factors that influence population and species heterogeneity.
Although it is possible to recover the complete mitogenome directly from shotgun sequencing data, currently reported methods and pipelines are still relatively time consuming and costly. Using a ...sample of the Australian freshwater crayfish Engaeus lengana, we demonstrate that it is possible to achieve three-day turnaround time (four hours hands-on time) from tissue sample to NCBI-ready submission file through the integration of MiSeq sequencing platform, Nextera sample preparation protocol, MITObim assembly algorithm and MITOS annotation pipeline.
The complete mitochondrial genome of the parastacid freshwater crayfish, Engaeus lengana, was recovered by modest shotgun sequencing (1.2 giga bases) using the Illumina MiSeq benchtop sequencing platform. Genome assembly using the MITObim mitogenome assembler recovered the mitochondrial genome as a single contig with a 97-fold mean coverage (min. = 17; max. = 138). The mitogenome consists of 15,934 base pairs and contains the typical 37 mitochondrial genes and a non-coding AT-rich region. The genome arrangement is similar to the only other published parastacid mitogenome from the Australian genus Cherax.
We infer that the gene order arrangement found in Cherax destructor is common to Australian crayfish and may be a derived feature of the southern hemisphere family Parastacidae. Further, we report to our knowledge, the simplest and fastest protocol for the recovery and assembly of complete mitochondrial genomes using the MiSeq benchtop sequencer.
Display omitted
•MitoPhAST is an automated tool for phylogenetic analysis of mitochondrial proteins.•We report eight new mitogenome sequences for parastacid crayfishes.•One of the new mitogenomes ...includes the first for the genus Gramastacus.•We provide an update of the Decapoda mitogenomic phylogeny comprising 89 decapods.•We identify new mitogenome gene rearrangements for freshwater crayfishes.
The increased rate at which complete mitogenomes are being sequenced and their increasing use for phylogenetic studies have resulted in a bioinformatic bottleneck in preparing and utilising such data for phylogenetic analysis. Hence, we present MitoPhAST, an automated tool that (1) identifies annotated protein-coding gene features and generates a standardised, concatenated and partitioned amino acid alignment directly from complete/partial GenBank/EMBL-format mitogenome flat files, (2) generates a maximum likelihood phylogenetic tree using optimised protein models and (3) reports various mitochondrial genes and sequence information in a table format. To demonstrate the capacity of MitoPhAST in handling a large dataset, we used 81 publicly available decapod mitogenomes, together with eight new complete mitogenomes of Australian freshwater crayfishes, including the first for the genus Gramastacus, to undertake an updated test of the monophyly of the major groups of the order Decapoda and their phylogenetic relationships. The recovered phylogenetic trees using both Bayesian and ML methods support the results of studies using fragments of mtDNA and nuclear markers and other smaller-scale studies using whole mitogenomes. In comparison to the fragment-based phylogenies, nodal support values are generally higher despite reduced taxon sampling suggesting there is value in utilising more fully mitogenomic data. Additionally, the simple table output from MitoPhAST provides an efficient summary and statistical overview of the mitogenomes under study at the gene level, allowing the identification of missing or duplicated genes and gene rearrangements. The finding of new mtDNA gene rearrangements in several genera of Australian freshwater crayfishes indicates that this group has undergone an unusually high rate of evolutionary change for this organelle compared to other major families of decapod crustaceans. As a result, freshwater crayfishes are likely to be a useful model for studies designed to understand the evolution of mtDNA rearrangements. We anticipate that our bioinformatics pipeline will substantially help mitogenome-based studies increase the speed, accuracy and efficiency of phylogenetic studies utilising mitogenome information. MitoPhAST is available for download at https://github.com/mht85/MitoPhAST.