When choosing a read mapper, one faces the trade off between speed and the ability to map reads in highly polymorphic regions. Here, we report NextGenMap, a fast and accurate read mapper, which ...reduces this dilemma. NextGenMap aligns reads reliably to a reference genome even when the sequence difference between target and reference genome is large, i.e. highly polymorphic genome. At the same time, NextGenMap outperforms current mapping methods with respect to runtime and to the number of correctly mapped reads. NextGenMap efficiently uses the available hardware by exploiting multi-core CPUs as well as graphic cards (GPUs), if available. In addition, NextGenMap handles automatically any read data independent of read length and sequencing technology.
NextGenMap source code and documentation are available at: http://cibiv.github.io/NextGenMap/.
fritz.sedlazeck@univie.ac.at.
Supplementary data are available at Bioinformatics online.
Structural variations are the greatest source of genetic variation, but they remain poorly understood because of technological limitations. Single-molecule long-read sequencing has the potential to ...dramatically advance the field, although high error rates are a challenge with existing methods. Addressing this need, we introduce open-source methods for long-read alignment (NGMLR; https://github.com/philres/ngmlr ) and structural variant identification (Sniffles; https://github.com/fritzsedlazeck/Sniffles ) that provide unprecedented sensitivity and precision for variant detection, even in repeat-rich regions and for complex nested events that can have substantial effects on human health. In several long-read datasets, including healthy and cancerous human genomes, we discovered thousands of novel variants and categorized systematic errors in short-read approaches. NGMLR and Sniffles can automatically filter false events and operate on low-coverage data, thereby reducing the high costs that have hindered the application of long reads in clinical and research settings.
Defining direct targets of transcription factors and regulatory pathways is key to understanding their roles in physiology and disease. We combined SLAM-seq thiol(SH)-linked alkylation for the ...metabolic sequencing of RNA, a method for direct quantification of newly synthesized messenger RNAs (mRNAs), with pharmacological and chemical-genetic perturbation in order to define regulatory functions of two transcriptional hubs in cancer, BRD4 and MYC, and to interrogate direct responses to BET bromodomain inhibitors (BETis). We found that BRD4 acts as general coactivator of RNA polymerase II-dependent transcription, which is broadly repressed upon high-dose BETi treatment. At doses triggering selective effects in leukemia, BETis deregulate a small set of hypersensitive targets including MYC. In contrast to BRD4, MYC primarily acts as a selective transcriptional activator controlling metabolic processes such as ribosome biogenesis and de novo purine synthesis. Our study establishes a simple and scalable strategy to identify direct transcriptional targets of any gene or pathway.
Lamins are components of the peripheral nuclear lamina and interact with heterochromatic genomic regions, termed lamina-associated domains (LADs). In contrast to lamin B1 being primarily present at ...the nuclear periphery, lamin A/C also localizes throughout the nucleus, where it associates with the chromatin-binding protein lamina-associated polypeptide (LAP) 2 alpha. Here, we show that lamin A/C also interacts with euchromatin, as determined by chromatin immunoprecipitation of euchromatin- and heterochromatin-enriched samples. By way of contrast, lamin B1 was only found associated with heterochromatin. Euchromatic regions occupied by lamin A/C overlap with those bound by LAP2alpha, and lack of LAP2alpha in LAP2alpha-deficient cells shifts binding of lamin A/C toward more heterochromatic regions. These alterations in lamin A/C-chromatin interactions correlate with changes in epigenetic histone marks in euchromatin but do not significantly affect gene expression. Loss of lamin A/C in heterochromatic regions in LAP2alpha-deficient cells, however, correlated with increased gene expression. Our data show a novel role of nucleoplasmic lamin A/C and LAP2alpha in regulating euchromatin.
Methods to read out naturally occurring or experimentally introduced nucleic acid modifications are emerging as powerful tools to study dynamic cellular processes. The recovery, quantification and ...interpretation of such events in high-throughput sequencing datasets demands specialized bioinformatics approaches.
Here, we present Digital Unmasking of Nucleotide conversions in K-mers (DUNK), a data analysis pipeline enabling the quantification of nucleotide conversions in high-throughput sequencing datasets. We demonstrate using experimentally generated and simulated datasets that DUNK allows constant mapping rates irrespective of nucleotide-conversion rates, promotes the recovery of multimapping reads and employs Single Nucleotide Polymorphism (SNP) masking to uncouple true SNPs from nucleotide conversions to facilitate a robust and sensitive quantification of nucleotide-conversions. As a first application, we implement this strategy as SLAM-DUNK for the analysis of SLAMseq profiles, in which 4-thiouridine-labeled transcripts are detected based on T > C conversions. SLAM-DUNK provides both raw counts of nucleotide-conversion containing reads as well as a base-content and read coverage normalized approach for estimating the fractions of labeled transcripts as readout.
Beyond providing a readily accessible tool for analyzing SLAMseq and related time-resolved RNA sequencing methods (TimeLapse-seq, TUC-seq), DUNK establishes a broadly applicable strategy for quantifying nucleotide conversions.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
The SK-BR-3 cell line is one of the most important models for HER2+ breast cancers, which affect one in five breast cancer patients. SK-BR-3 is known to be highly rearranged, although much of the ...variation is in complex and repetitive regions that may be underreported. Addressing this, we sequenced SK-BR-3 using long-read single molecule sequencing from Pacific Biosciences and develop one of the most detailed maps of structural variations (SVs) in a cancer genome available, with nearly 20,000 variants present, most of which were missed by short-read sequencing. Surrounding the important
oncogene (also known as
), we discover a complex sequence of nested duplications and translocations, suggesting a punctuated progression. Full-length transcriptome sequencing further revealed several novel gene fusions within the nested genomic variants. Combining long-read genome and transcriptome sequencing enables an in-depth analysis of how SVs disrupt the genome and sheds new light on the complex mechanisms involved in cancer genome evolution.
As the Lyme disease bacterium Borrelia burgdorferi traverses its enzootic cycle, alternating between a tick vector and a vertebrate host, the spirochete must adapt and persist in the tick midgut ...under prolonged nutrient stress between blood meals. In this study, we examined the role of the stringent response in tick persistence and in regulation of gene expression during nutrient limitation. Nutritionally starving B. burgdorferi in vitro increased the levels of guanosine tetraphosphate (ppGpp) and guanosine pentaphosphate (pppGpp), collectively referred to as (p)ppGpp, products of the bifunctional synthetase/hydrolase RelBbu (RelA/SpoT homolog). Conversely, returning B. burgdorferi to a nutrient-rich medium decreased (p)ppGpp levels. B. burgdorferi survival in ticks between the larval and nymph blood meals, and during starvation in vitro, was dependent on RelBbu. Furthermore, normal morphological conversion from a flat-wave shape to a condensed round body (RB) form during starvation was dependent on RelBbu; relBbu mutants more frequently formed RBs, but their membranes were compromised. By differential RNA sequencing analyses, we found that RelBbu regulates an extensive transcriptome, both dependent and independent of nutrient stress. The RelBbu regulon includes the glp operon, which is important for glycerol utilization and persistence in the tick, virulence factors and the late phage operon of the 32-kb circular plasmid (cp32) family. In summary, our data suggest that RelBbu globally modulates transcription in response to nutrient stress by increasing (p)ppGpp levels to facilitate B. burgdorferi persistence in the tick.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
The Lyme disease spirochete
(
)
must tolerate nutrient stress to persist in the tick phase of its enzootic life cycle. We previously found that the stringent response mediated by Rel
globally ...regulates gene expression to facilitate persistence in the tick vector. Here, we show that Rel
regulates the expression of a swath of small RNAs (sRNA), affecting 36% of previously identified sRNAs in
. This is the first sRNA regulatory mechanism identified in any spirochete. Threefold more sRNAs were Rel
-upregulated than downregulated during nutrient stress and included antisense, intergenic and 5' untranslated region sRNAs. Rel
-regulated sRNAs associated with genes known to be important for host infection (
and
) as well as persistence in the tick (
and
) were identified, suggesting potential mechanisms for post-transcriptional regulation of gene expression.
Gene expression profiling by high-throughput sequencing reveals qualitative and quantitative changes in RNA species at steady state but obscures the intracellular dynamics of RNA transcription, ...processing and decay. We developed thiol(SH)-linked alkylation for the metabolic sequencing of RNA (SLAM seq), an orthogonal-chemistry-based RNA sequencing technology that detects 4-thiouridine (s
U) incorporation in RNA species at single-nucleotide resolution. In combination with well-established metabolic RNA labeling protocols and coupled to standard, low-input, high-throughput RNA sequencing methods, SLAM seq enabled rapid access to RNA-polymerase-II-dependent gene expression dynamics in the context of total RNA. We validated the method in mouse embryonic stem cells by showing that the RNA-polymerase-II-dependent transcriptional output scaled with Oct4/Sox2/Nanog-defined enhancer activity, and we provide quantitative and mechanistic evidence for transcript-specific RNA turnover mediated by post-transcriptional gene regulatory pathways initiated by microRNAs and N
-methyladenosine. SLAM seq facilitates the dissection of fundamental mechanisms that control gene expression in an accessible, cost-effective and scalable manner.