The study of cell-population heterogeneity in a range of biological systems, from viruses to bacterial isolates to tumor samples, has been transformed by recent advances in sequencing throughput. ...While the high-coverage afforded can be used, in principle, to identify very rare variants in a population, existing ad hoc approaches frequently fail to distinguish true variants from sequencing errors. We report a method (LoFreq) that models sequencing run-specific error rates to accurately call variants occurring in <0.05% of a population. Using simulated and real datasets (viral, bacterial and human), we show that LoFreq has near-perfect specificity, with significantly improved sensitivity compared with existing methods and can efficiently analyze deep Illumina sequencing datasets without resorting to approximations or heuristics. We also present experimental validation for LoFreq on two different platforms (Fluidigm and Sequenom) and its application to call rare somatic variants from exome sequencing datasets for gastric cancer. Source code and executables for LoFreq are freely available at http://sourceforge.net/projects/lofreq/.
As humans age, normal tissues, such as the esophageal epithelium, become a patchwork of mutant clones. Some mutations are under positive selection, conferring a competitive advantage over wild-type ...cells. We speculated that altering the selective pressure on mutant cell populations may cause them to expand or contract. We tested this hypothesis by examining the effect of oxidative stress from low-dose ionizing radiation (LDIR) on wild-type and p53 mutant cells in the transgenic mouse esophagus. We found that LDIR drives wild-type cells to stop proliferating and differentiate. p53 mutant cells are insensitive to LDIR and outcompete wild-type cells following exposure. Remarkably, combining antioxidant treatment and LDIR reverses this effect, promoting wild-type cell proliferation and p53 mutant differentiation, reducing the p53 mutant population. Thus, p53-mutant cells can be depleted from the normal esophagus by redox manipulation, showing that external interventions may be used to alter the mutational landscape of an aging tissue.
Display omitted
•Cell tracking shows low-dose ionizing radiation drives differentiation in esophagus•Low-dose radiation acts via redox stress without activating the DNA repair pathway•p53 mutant cells do not differentiate after irradiation and outcompete normal cells•Antioxidant plus radiation depletes p53 mutant cells by improving normal cell fitness
Normal human esophagus contains p53 mutant progenitors. Fernandez-Antoran and colleagues show that p53 mutant progenitors outcompete their wild-type neighbors after low-dose ionizing radiation. This effect is reversed by antioxidant pretreatment. p53 mutant cells can be displaced from normal tissues via the improvement of the competitive fitness of wild-type progenitors.
During aging, progenitor cells acquire mutations, which may generate clones that colonize the surrounding tissue. By middle age, normal human tissues, including the esophageal epithelium (EE), become ...a patchwork of mutant clones. Despite their relevance for understanding aging and cancer, the processes that underpin mutational selection in normal tissues remain poorly understood. Here, we investigated this issue in the esophageal epithelium of mutagen-treated mice. Deep sequencing identified numerous mutant clones with multiple genes under positive selection, including Notch1, Notch2 and Trp53, which are also selected in human esophageal epithelium. Transgenic lineage tracing revealed strong clonal competition that evolved over time. Clone dynamics were consistent with a simple model in which the proliferative advantage conferred by positively selected mutations depends on the nature of the neighboring cells. When clones with similar competitive fitness collide, mutant cell fate reverts towards homeostasis, a constraint that explains how selection operates in normal-appearing epithelium.
An accurate genome assembly from short read sequencing data is critical for downstream analysis, for example allowing investigation of variants within a sequenced population. However, assembling ...sequencing data from virus samples, especially RNA viruses, into a genome sequence is challenging due to the combination of viral population diversity and extremely uneven read depth caused by amplification bias in the inevitable reverse transcription and polymerase chain reaction amplification process of current methods.
We developed a new de novo assembler called IVA (Iterative Virus Assembler) designed specifically for read pairs sequenced at highly variable depth from RNA virus samples. We tested IVA on datasets from 140 sequenced samples from human immunodeficiency virus-1 or influenza-virus-infected people and demonstrated that IVA outperforms all other virus de novo assemblers.
The software runs under Linux, has the GPLv3 licence and is freely available from http://sanger-pathogens.github.io/iva
Genome-wide CRISPR-based knockout (CRISPR-KO) screening is an emerging technique which enables systematic genetic analysis of a cellular or molecular phenotype in question. Continuous improvements, ...such as modifications to the guide RNA (gRNA) scaffold and the development of gRNA on-target prediction algorithms, have since been made to increase their screening performance. We compared the performance of three available second-generation human genome-wide CRISPR-KO libraries that included at least one of the improvements, and examined the effect of gRNA scaffold, number of gRNAs per gene and number of replicates on screen performance. We identified duplicated screens using a library with 6 gRNAs per gene as providing the best trade-off. Despite the improvements, we found that each improved library still has library-specific false negatives and, for the first time, estimated the false negative rates of CRISPR-KO screens, which are between 10% and 20%. Our newly-defined optimal screening parameters would be helpful in designing screens and constructing bespoke gRNA libraries.
The genetic basis of naive pluripotency maintenance and loss is a central question in embryonic stem cell biology. Here, we deploy CRISPR-knockout-based screens in mouse embryonic stem cells to ...interrogate this question through a genome-wide, non-biased approach using the Rex1GFP reporter as a phenotypic readout. This highly sensitive and efficient method identified genes in diverse biological processes and pathways. We uncovered a key role for negative regulators of mTORC1 in maintenance and exit from naive pluripotency and provided an integrated account of how mTORC1 activity influences naive pluripotency through Gsk3. Our study therefore reinforces Gsk3 as the central node and provides a comprehensive, data-rich resource that will improve our understanding of mechanisms regulating pluripotency and stimulate avenues for further mechanistic studies.
Display omitted
•Genome-wide CRISPR screening identifies naive pluripotency regulators in mouse ESCs•mTORC1-negative regulators from two axes show opposing phenotypes•Gator1 is required for proper self-renewal and differentiation via Gsk3 regulation•Tsc2 loss causes Akt-dependent, mTORC1-dependent Gsk3 suppression
Li et al. conducted genome-wide CRISPR screens in mouse ESCs to identify genes affecting maintenance of and exit from naive pluripotency using a Rex1GFP reporter. They show that loss of two mTORC1-negative regulators, Tsc1/2 and Gator1, can cause opposing phenotypes through differential regulation of Gsk3 activity.
Aging normal human oesophagus accumulates TP53 mutant clones. These are the origin of most oesophageal squamous carcinomas, in which biallelic TP53 disruption is almost universal. However, how p53 ...mutant clones expand and contribute to cancer development is unclear. Here we show that inducing the p53
mutant in single oesophageal progenitor cells in transgenic mice confers a proliferative advantage and clonal expansion but does not disrupt normal epithelial structure. Loss of the remaining p53 allele in mutant cells results in genomically unstable p53
epithelium with giant polyaneuploid cells and copy number altered clones. In carcinogenesis, p53 mutation does not initiate tumour formation, but tumours developing from areas with p53 mutation and LOH are larger and show extensive chromosomal instability compared to lesions arising in wild type epithelium. We conclude that p53 has distinct functions at different stages of carcinogenesis and that LOH within p53 mutant clones in normal epithelium is a critical step in malignant transformation.
Histone acetyltransferases (HATs) catalyze the transfer of an acetyl group from acetyl-CoA to lysine residues of histones and play a central role in transcriptional regulation in diverse biological ...processes. Dysregulation of HAT activity can lead to human diseases including developmental disorders and cancer. Through genome-wide CRISPR-Cas9 screens, we identified several HATs of the MYST family as fitness genes for acute myeloid leukemia (AML). Here we investigate the essentiality of lysine acetyltransferase KAT7 in AMLs driven by the MLL-X gene fusions. We found that KAT7 loss leads to a rapid and complete loss of both H3K14ac and H4K12ac marks, in association with reduced proliferation, increased apoptosis, and differentiation of AML cells. Acetyltransferase activity of KAT7 is essential for the proliferation of these cells. Mechanistically, our data propose that acetylated histones provide a platform for the recruitment of MLL-fusion-associated adaptor proteins such as BRD4 and AF4 to gene promoters. Upon KAT7 loss, these factors together with RNA polymerase II rapidly dissociate from several MLL-fusion target genes that are essential for AML cell proliferation, including MEIS1, PBX3, and SENP6. Our findings reveal that KAT7 is a plausible therapeutic target for this poor prognosis AML subtype.
The high throughput and cost-effectiveness afforded by short-read sequencing technologies, in principle, enable researchers to perform 16S rRNA profiling of complex microbial communities at ...unprecedented depth and resolution. Existing Illumina sequencing protocols are, however, limited by the fraction of the 16S rRNA gene that is interrogated and therefore limit the resolution and quality of the profiling. To address this, we present the design of a novel protocol for shotgun Illumina sequencing of the bacterial 16S rRNA gene, optimized to amplify more than 90% of sequences in the Greengenes database and with the ability to distinguish nearly twice as many species-level OTUs compared to existing protocols. Using several in silico and experimental datasets, we demonstrate that despite the presence of multiple variable and conserved regions, the resulting shotgun sequences can be used to accurately quantify the constituents of complex microbial communities. The reconstruction of a significant fraction of the 16S rRNA gene also enabled high precision (>90%) in species-level identification thereby opening up potential application of this approach for clinical microbial characterization.
Norovirus is a highly transmissible infectious agent that causes epidemic gastroenteritis in susceptible children and adults. Norovirus infections can be severe and can be initiated from an ...exceptionally small number of viral particles. Detailed genome sequence data are useful for tracking norovirus transmission and evolution. To address this need, we have developed a whole-genome deep-sequencing method that generates entire genome sequences from small amounts of clinical specimens. This novel approach employs an algorithm for reverse transcription and PCR amplification primer design using all of the publically available norovirus sequence data. Deep sequencing and de novo assembly were used to generate norovirus genomes from a large set of diarrheal patients attending three hospitals in Ho Chi Minh City, Vietnam, over a 2.5-year period. Positive-selection analysis and direct examination of protein changes in the virus over time identified codons in the regions encoding proteins VP1, p48 (NS1-2), and p22 (NS4) under positive selection and expands the known targets of norovirus evolutionary pressure.
The high transmissibility and rapid evolutionary rate of norovirus, combined with a short-lived host immune responses, are thought to be the reasons why the virus causes the majority of pediatric viral diarrhea cases. The evolutionary patterns of this RNA virus have been described in detail for only a portion of the virus genome and never for a virus from a detailed urban tropical setting. We provide a detailed sequence description of the noroviruses circulating in three Ho Chi Minh City hospitals over a 2.5-year period. This study identified patterns of virus change in known sites of host immune response and identified three additional regions of the virus genome under selection that were not previously recognized. In addition, the method described here provides a robust full-genome sequencing platform for community-based virus surveillance.