Abstract
ChIP-Atlas (https://chip-atlas.org) is a web service providing both GUI- and API-based data-mining tools to reveal the architecture of the transcription regulatory landscape. ChIP-Atlas is ...powered by comprehensively integrating all data sets from high-throughput ChIP-seq and DNase-seq, a method for profiling chromatin regions accessible to DNase. In this update, we further collected all the ATAC-seq and whole-genome bisulfite-seq data for six model organisms (human, mouse, rat, fruit fly, nematode, and budding yeast) with the latest genome assemblies. These together with ChIP-seq data can be visualized with the Peak Browser tool and a genome browser to explore the epigenomic landscape of a query genomic locus, such as its chromatin accessibility, DNA methylation status, and protein–genome interactions. This epigenomic landscape can also be characterized for multiple genes and genomic loci by querying with the Enrichment Analysis tool, which, for example, revealed that inflammatory bowel disease-associated SNPs are the most significantly hypo-methylated in neutrophils. Therefore, ChIP-Atlas provides a panoramic view of the whole epigenomic landscape. All datasets are free to download via either a simple button on the web page or an API.
Graphical Abstract
Graphical Abstract
ChIP-Atlas 2021 update integrates all publicly available ATAC-seq and WGBS data besides ChIP-seq data for six representative model organisms, facilitating comprehensive exploration of chromatin accessibility, DNA methylation status, and protein binding around single or multiple genes and genomic regions of users' interest.
Metagenomic next-generation sequencing (NGS) of cerebrospinal fluid (CSF) has the potential to identify a broad range of pathogens in a single test.
In a 1-year, multicenter, prospective study, we ...investigated the usefulness of metagenomic NGS of CSF for the diagnosis of infectious meningitis and encephalitis in hospitalized patients. All positive tests for pathogens on metagenomic NGS were confirmed by orthogonal laboratory testing. Physician feedback was elicited by teleconferences with a clinical microbial sequencing board and by surveys. Clinical effect was evaluated by retrospective chart review.
We enrolled 204 pediatric and adult patients at eight hospitals. Patients were severely ill: 48.5% had been admitted to the intensive care unit, and the 30-day mortality among all study patients was 11.3%. A total of 58 infections of the nervous system were diagnosed in 57 patients (27.9%). Among these 58 infections, metagenomic NGS identified 13 (22%) that were not identified by clinical testing at the source hospital. Among the remaining 45 infections (78%), metagenomic NGS made concurrent diagnoses in 19. Of the 26 infections not identified by metagenomic NGS, 11 were diagnosed by serologic testing only, 7 were diagnosed from tissue samples other than CSF, and 8 were negative on metagenomic NGS owing to low titers of pathogens in CSF. A total of 8 of 13 diagnoses made solely by metagenomic NGS had a likely clinical effect, with 7 of 13 guiding treatment.
Routine microbiologic testing is often insufficient to detect all neuroinvasive pathogens. In this study, metagenomic NGS of CSF obtained from patients with meningitis or encephalitis improved diagnosis of neurologic infections and provided actionable information in some cases. (Funded by the National Institutes of Health and others; PDAID ClinicalTrials.gov number, NCT02910037.).
Understanding why some evolutionary lineages generate exceptionally high species diversity is an important goal in evolutionary biology. Haplochromine cichlid fishes of Africa's Lake Victoria region ...encompass >700 diverse species that all evolved in the last 150,000 years. How this 'Lake Victoria Region Superflock' could evolve on such rapid timescales is an enduring question. Here, we demonstrate that hybridization between two divergent lineages facilitated this process by providing genetic variation that subsequently became recombined and sorted into many new species. Notably, the hybridization event generated exceptional allelic variation at an opsin gene known to be involved in adaptation and speciation. More generally, differentiation between new species is accentuated around variants that were fixed differences between the parental lineages, and that now appear in many new combinations in the radiation species. We conclude that hybridization between divergent lineages, when coincident with ecological opportunity, may facilitate rapid and extensive adaptive radiation.
Isolength restriction site-associated DNA (isoRAD) sequencing is a very simple but powerful approach that was originally developed for genome-wide genotyping at minimal labor and cost, and it has ...recently extended its applicability to allow quantification of DNA methylation levels. The isoRAD method is distinct from other genotyping-by-sequencing (GBS) methods because of its use of special restriction enzymes to produce isolength tags (32-36 bp), and sequencing of these uniform tags can bring many benefits. However, the relatively short tags produced by the original protocol are mostly suited to single-end (SE) sequencing (36-50 bp), and therefore they cannot efficiently match the gradually increased sequencing capacity of next-generation sequencing (NGS) platforms. To address this issue, we describe an advanced protocol that allows the preparation of five concatenated isoRAD tags for Illumina paired-end (PE) sequencing (100-150 bp). The configuration of the five concatenated tags is highly flexible, and can be defined by users to work with a desired combination of samples and/or restriction enzymes to suit specific research purposes. In comparison with the original protocol, the advanced protocol has an additional digestion and ligation step, and library preparation can be completed in ∼8 h.
We present single-cell combinatorial indexed Hi-C (sciHi-C), a method that applies combinatorial cellular indexing to chromosome conformation capture. In this proof of concept, we generate and ...sequence six sciHi-C libraries comprising a total of 10,696 single cells. We use sciHi-C data to separate cells by karyotypic and cell-cycle state differences and identify cell-to-cell heterogeneity in mammalian chromosomal conformation. Our results demonstrate that combinatorial indexing is a generalizable strategy for single-cell genomics.
Determining the pathogenicity of genetic variants is a critical challenge, and functional assessment is often the only option. Experimentally characterizing millions of possible missense variants in ...thousands of clinically important genes requires generalizable, scalable assays. We describe variant abundance by massively parallel sequencing (VAMP-seq), which measures the effects of thousands of missense variants of a protein on intracellular abundance simultaneously. We apply VAMP-seq to quantify the abundance of 7,801 single-amino-acid variants of PTEN and TPMT, proteins in which functional variants are clinically actionable. We identify 1,138 PTEN and 777 TPMT variants that result in low protein abundance, and may be pathogenic or alter drug metabolism, respectively. We observe selection for low-abundance PTEN variants in cancer, and show that p.Pro38Ser, which accounts for ~10% of PTEN missense variants in melanoma, functions via a dominant-negative mechanism. Finally, we demonstrate that VAMP-seq is applicable to other genes, highlighting its generalizability.
Genome sequencing has become a powerful tool for studying emerging infectious diseases; however, genome sequencing directly from clinical samples (i.e., without isolation and culture) remains ...challenging for viruses such as Zika, for which metagenomic sequencing methods may generate insufficient numbers of viral reads. Here we present a protocol for generating coding-sequence-complete genomes, comprising an online primer design tool, a novel multiplex PCR enrichment protocol, optimized library preparation methods for the portable MinION sequencer (Oxford Nanopore Technologies) and the Illumina range of instruments, and a bioinformatics pipeline for generating consensus sequences. The MinION protocol does not require an Internet connection for analysis, making it suitable for field applications with limited connectivity. Our method relies on multiplex PCR for targeted enrichment of viral genomes from samples containing as few as 50 genome copies per reaction. Viral consensus sequences can be achieved in 1-2 d by starting with clinical samples and following a simple laboratory workflow. This method has been successfully used by several groups studying Zika virus evolution and is facilitating an understanding of the spread of the virus in the Americas. The protocol can be used to sequence other viral genomes using the online Primal Scheme primer designer software. It is suitable for sequencing either RNA or DNA viruses in the field during outbreaks or as an inexpensive, convenient method for use in the lab.
Sequence-based methods to delimit species are central to DNA taxonomy, microbial community surveys and DNA metabarcoding studies. Current approaches either rely on simple sequence similarity ...thresholds (OTU-picking) or on complex and compute-intensive evolutionary models. The OTU-picking methods scale well on large datasets, but the results are highly sensitive to the similarity threshold. Coalescent-based species delimitation approaches often rely on Bayesian statistics and Markov Chain Monte Carlo sampling, and can therefore only be applied to small datasets.
We introduce the Poisson tree processes (PTP) model to infer putative species boundaries on a given phylogenetic input tree. We also integrate PTP with our evolutionary placement algorithm (EPA-PTP) to count the number of species in phylogenetic placements. We compare our approaches with popular OTU-picking methods and the General Mixed Yule Coalescent (GMYC) model. For de novo species delimitation, the stand-alone PTP model generally outperforms GYMC as well as OTU-picking methods when evolutionary distances between species are small. PTP neither requires an ultrametric input tree nor a sequence similarity threshold as input. In the open reference species delimitation approach, EPA-PTP yields more accurate results than de novo species delimitation methods. Finally, EPA-PTP scales on large datasets because it relies on the parallel implementations of the EPA and RAxML, thereby allowing to delimit species in high-throughput sequencing data.
The code is freely available at www.exelixis-lab.org/software.html. .
The human microbiome has emerged as a major player in regulating human health and disease. Translational studies of the microbiome have the potential to indicate clinical applications such as fecal ...transplants and probiotics. However, one major issue is accurate identification of microbes constituting the microbiota. Studies of the microbiome have frequently utilized sequencing of the conserved 16S ribosomal RNA (rRNA) gene. We present a comparative study of an alternative approach using whole genome shotgun sequencing (WGS). In the present study, we analyzed the human fecal microbiome compiling a total of 194.1 × 106 reads from a single sample using multiple sequencing methods and platforms. Specifically, after establishing the reproducibility of our methods with extensive multiplexing, we compared: 1) The 16S rRNA amplicon versus the WGS method, 2) the Illumina HiSeq versus MiSeq platforms, 3) the analysis of reads versus de novo assembled contigs, and 4) the effect of shorter versus longer reads. Our study demonstrates that whole genome shotgun sequencing has multiple advantages compared with the 16S amplicon method including enhanced detection of bacterial species, increased detection of diversity and increased prediction of genes. In addition, increased length, either due to longer reads or the assembly of contigs, improved the accuracy of species detection.
•Human microbiome has emerged as a major player in regulating health and disease.•Accurate identification of microbes constituting the microbiome is major challenge.•We report comparative study of whole genome shotgun (WGS) and amplicon sequencing.•WGS enhances detection of bacterial species with high accuracy.•WGS increased detection of microbial diversity and prediction of putative genes.
PurposeThe 2015 American College of Medical Genetics and Genomics-Association for Molecular Pathology (ACMG-AMP) guidelines were a major step toward establishing a common framework for variant ...classification. In practice, however, several aspects of the guidelines lack specificity, are subject to varied interpretations, or fail to capture relevant aspects of clinical molecular genetics. A simple implementation of the guidelines in their current form is insufficient for consistent and comprehensive variant classification.MethodsWe undertook an iterative process of refining the ACMG-AMP guidelines. We used the guidelines to classify more than 40,000 clinically observed variants, assessed the outcome, and refined the classification criteria to capture exceptions and edge cases. During this process, the criteria evolved through eight major and minor revisions.ResultsOur implementation: (i) separated ambiguous ACMG-AMP criteria into a set of discrete but related rules with refined weights; (ii) grouped certain criteria to protect against the overcounting of conceptually related evidence; and (iii) replaced the "clinical criteria" style of the guidelines with additive, semiquantitative criteria.ConclusionSherloc builds on the strong framework of 33 rules established by the ACMG-AMP guidelines and introduces 108 detailed refinements, which support a more consistent and transparent approach to variant classification.