The Oxford Nanopore Technologies (ONT) MinION is used for sequencing a wide variety of sample types with diverse methods of sample extraction. Nanopore sequencers output FAST5 files containing signal ...data subsequently base called to FASTQ format. Optionally, ONT devices can collect data from all sequencing channels simultaneously in a bulk FAST5 file enabling inspection of signal in any channel at any point. We sought to visualize this signal to inspect challenging or difficult to sequence samples.
The BulkVis tool can load a bulk FAST5 file and overlays MinKNOW (the software that controls ONT sequencers) classifications on the signal trace and can show mappings to a reference. Users can navigate to a channel and time or, given a FASTQ header from a read, jump to its specific position. BulkVis can export regions as Nanopore base caller compatible reads. Using BulkVis, we find long reads can be incorrectly divided by MinKNOW resulting in single DNA molecules being split into two or more reads. The longest seen to date is 2 272 580 bases in length and reported in eleven consecutive reads. We provide helper scripts that identify and reconstruct split reads given a sequencing summary file and alignment to a reference. We note that incorrect read splitting appears to vary according to input sample type and is more common in 'ultra-long' read preparations.
The software is available freely under an MIT license at https://github.com/LooseLab/bulkvis.
Supplementary data are available at Bioinformatics online.
Nanopore sequencing has been available to researchers for a little over 3 years. Recently, the milestone of sequencing and assembling a human genome on this platform was achieved for the first time. ...Significant improvements to the platform in yield and accuracy, coupled with higher throughput nanopore sequencers, mean that human genome sequencing at scale is now possible. Here, a brief recent history of the nanopore platform is provided, key papers and innovations are highlighted and some of the challenges for the future are discussed.
After two decades of improvements, the current human reference genome (GRCh38) is the most accurate and complete vertebrate genome ever produced. However, no single chromosome has been finished end ...to end, and hundreds of unresolved gaps persist
. Here we present a human genome assembly that surpasses the continuity of GRCh38
, along with a gapless, telomere-to-telomere assembly of a human chromosome. This was enabled by high-coverage, ultra-long-read nanopore sequencing of the complete hydatidiform mole CHM13 genome, combined with complementary technologies for quality improvement and validation. Focusing our efforts on the human X chromosome
, we reconstructed the centromeric satellite DNA array (approximately 3.1 Mb) and closed the 29 remaining gaps in the current reference, including new sequences from the human pseudoautosomal regions and from cancer-testis ampliconic gene families (CT-X and GAGE). These sequences will be integrated into future human reference genome releases. In addition, the complete chromosome X, combined with the ultra-long nanopore data, allowed us to map methylation patterns across complex tandem repeats and satellite arrays. Our results demonstrate that finishing the entire human genome is now within reach, and the data presented here will facilitate ongoing efforts to complete the other human chromosomes.
Full text
Available for:
FZAB, GEOZS, IJS, IMTLJ, KILJ, KISLJ, MFDPS, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, SBMB, SBNM, UKNU, UL, UM, UPUK, VKSCE, ZAGLJ
Genome sequencing has become a powerful tool for studying emerging infectious diseases; however, genome sequencing directly from clinical samples (i.e., without isolation and culture) remains ...challenging for viruses such as Zika, for which metagenomic sequencing methods may generate insufficient numbers of viral reads. Here we present a protocol for generating coding-sequence-complete genomes, comprising an online primer design tool, a novel multiplex PCR enrichment protocol, optimized library preparation methods for the portable MinION sequencer (Oxford Nanopore Technologies) and the Illumina range of instruments, and a bioinformatics pipeline for generating consensus sequences. The MinION protocol does not require an Internet connection for analysis, making it suitable for field applications with limited connectivity. Our method relies on multiplex PCR for targeted enrichment of viral genomes from samples containing as few as 50 genome copies per reaction. Viral consensus sequences can be achieved in 1-2 d by starting with clinical samples and following a simple laboratory workflow. This method has been successfully used by several groups studying Zika virus evolution and is facilitating an understanding of the spread of the virus in the Americas. The protocol can be used to sequence other viral genomes using the online Primal Scheme primer designer software. It is suitable for sequencing either RNA or DNA viruses in the field during outbreaks or as an inexpensive, convenient method for use in the lab.
Routine full characterization of
is culture based, taking many weeks. Whole-genome sequencing (WGS) can generate antibiotic susceptibility profiles to inform treatment, augmented with strain ...information for global surveillance; such data could be transformative if provided at or near the point of care. We demonstrate a low-cost method of DNA extraction directly from patient samples for
WGS. We initially evaluated the method by using the Illumina MiSeq sequencer (40 smear-positive respiratory samples obtained after routine clinical testing and 27 matched liquid cultures).
was identified in all 39 samples from which DNA was successfully extracted. Sufficient data for antibiotic susceptibility prediction were obtained from 24 (62%) samples; all results were concordant with reference laboratory phenotypes. Phylogenetic placement was concordant between direct and cultured samples. With Illumina MiSeq/MiniSeq, the workflow from patient sample to results can be completed in 44/16 h at a reagent cost of £96/£198 per sample. We then employed a nonspecific PCR-based library preparation method for sequencing on an Oxford Nanopore Technologies MinION sequencer. We applied this to cultured
strain BCG DNA and to combined culture-negative sputum DNA and BCG DNA. For flow cell version R9.4, the estimated turnaround time from patient to identification of BCG, detection of pyrazinamide resistance, and phylogenetic placement was 7.5 h, with full susceptibility results 5 h later. Antibiotic susceptibility predictions were fully concordant. A critical advantage of MinION is the ability to continue sequencing until sufficient coverage is obtained, providing a potential solution to the problem of variable amounts of
DNA in direct samples.
Abstract Polyploidy, the result of whole-genome duplication (WGD), is a major driver of eukaryote evolution. Yet WGDs are hugely disruptive mutations, and we still lack a clear understanding of their ...fitness consequences. Here, we study whether WGDs result in greater diversity of genomic structural variants (SVs) and how they influence evolutionary dynamics in a plant genus, Cochlearia (Brassicaceae). By using long-read sequencing and a graph-based pangenome, we find both negative and positive interactions between WGDs and SVs. Masking of recessive mutations due to WGDs leads to a progressive accumulation of deleterious SVs across four ploidal levels (from diploids to octoploids), likely reducing the adaptive potential of polyploid populations. However, we also discover putative benefits arising from SV accumulation, as more ploidy-specific SVs harbor signals of local adaptation in polyploids than in diploids. Together, our results suggest that SVs play diverse and contrasting roles in the evolutionary trajectories of young polyploids.
Abstract
Nosocomial severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infections have severely affected bed capacity and patient flow. We utilized whole-genome sequencing (WGS) to identify ...outbreaks and focus infection control resources and intervention during the United Kingdom’s second pandemic wave in late 2020. Phylogenetic analysis of WGS and epidemiological data pinpointed an initial transmission event to an admission ward, with immediate prior community infection linkage documented. High incidence of asymptomatic staff infection with genetically identical viral sequences was also observed, which may have contributed to the propagation of the outbreak. WGS allowed timely nosocomial transmission intervention measures, including admissions ward point-of-care testing and introduction of portable HEPA14 filters. Conversely, WGS excluded nosocomial transmission in 2 instances with temporospatial linkage, conserving time and resources. In summary, WGS significantly enhanced understanding of SARS-CoV-2 clusters in a hospital setting, both identifying high-risk areas and conversely validating existing control measures in other units, maintaining clinical service overall.
SARS-CoV-2 whole-genome sequencing, combined with classical epidemiology, identified nosocomial transmission events, informing targeted infection prevention measures during the UK’s second epidemic wave. Conversely, nosocomial transmission was excluded by sequencing in 2 instances with temporospatial linkage, conserving time and resources.
Abstract Cell-fate decisions during mammalian gastrulation are poorly understood outside of rodent embryos. The embryonic disc of pig embryos mirrors humans, making them a useful proxy for studying ...gastrulation. Here we present a single-cell transcriptomic atlas of pig gastrulation, revealing cell-fate emergence dynamics, as well as conserved and divergent gene programs governing early porcine, primate, and murine development. We highlight heterochronicity in extraembryonic cell-types, despite the broad conservation of cell-type-specific transcriptional programs. We apply these findings in combination with functional investigations, to outline conserved spatial, molecular, and temporal events during definitive endoderm specification. We find early FOXA2 + /TBXT- embryonic disc cells directly form definitive endoderm, contrasting later-emerging FOXA2/TBXT+ node/notochord progenitors. Unlike mesoderm, none of these progenitors undergo epithelial-to-mesenchymal transition. Endoderm/Node fate hinges on balanced WNT and hypoblast-derived NODAL, which is extinguished upon endodermal differentiation. These findings emphasise the interplay between temporal and topological signalling in fate determination during gastrulation.
Erythroid cell production results from passage through cellular hierarchies dependent on differential gene expression under the control of transcription factors responsive to changing niches. We have ...constructed Genetic Regulatory Networks (GRNs) describing this process, based predominantly on mouse data. Regulatory network motifs identified in
E. coli and yeast GRNs are found in combination in these GRNs. Feed-forward motifs with autoregulation generate forward momentum and also control its rate, which is at its lowest in hematopoietic stem cells (HSCs). The simultaneous requirement for multiple regulators in multi-input motifs (MIMs) provides tight control over expression of target genes. Combinations of MIMs, exemplified by the SCL/LMO2 complexes, which have variable content and binding sites, explain how individual regulators can have different targets in HSCs and erythroid cells and possibly also how HSCs maintain stem cell functions while expressing lineage-affiliated genes at low level, so-called multi-lineage priming. MIMs combined with cross-antagonism describe the relationship between PU.1 and GATA-1 and between two of their target genes, Fli-1 and EKLF, with victory for GATA-1 and EKLF leading to erythroid lineage specification. These GRNs are useful repositories for current regulatory information, are accessible in interactive form via the internet, enable the consequences of perturbation to be predicted, and can act as seed networks to organize the rapidly accumulating microarray data.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UILJ, UL, UM, UPUK, ZAGLJ, ZRSKP