Defining the transcriptional dynamics of a temporal process such as cell differentiation is challenging owing to the high variability in gene expression between individual cells. Time-series gene ...expression analyses of bulk cells have difficulty distinguishing early and late phases of a transcriptional cascade or identifying rare subpopulations of cells, and single-cell proteomic methods rely on a priori knowledge of key distinguishing markers. Here we describe Monocle, an unsupervised algorithm that increases the temporal resolution of transcriptome dynamics using single-cell RNA-Seq data collected at multiple time points. Applied to the differentiation of primary human myoblasts, Monocle revealed switch-like changes in expression of key regulatory factors, sequential waves of gene regulation, and expression of regulators that were not known to act in differentiation. We validated some of these predicted regulators in a loss-of function screen. Monocle can in principle be used to recover single-cell gene expression kinetics from a wide array of cellular processes, including differentiation, proliferation and oncogenic transformation.
Celotno besedilo
Dostopno za:
DOBA, IJS, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Estimating an epidemic's trajectory is crucial for developing public health responses to infectious diseases, but case data used for such estimation are confounded by variable testing practices. We ...show that the population distribution of viral loads observed under random or symptom-based surveillance-in the form of cycle threshold (Ct) values obtained from reverse transcription quantitative polymerase chain reaction testing-changes during an epidemic. Thus, Ct values from even limited numbers of random samples can provide improved estimates of an epidemic's trajectory. Combining data from multiple such samples improves the precision and robustness of this estimation. We apply our methods to Ct values from surveillance conducted during the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pandemic in a variety of settings and offer alternative approaches for real-time estimates of epidemic trajectories for outbreak management and response.
BACKGROUND: DNA sequencing technologies deviate from the ideal uniform distribution of reads. These biases impair scientific and medical applications. Accordingly, we have developed computational ...methods for discovering, describing and measuring bias. RESULTS: We applied these methods to the Illumina, Ion Torrent, Pacific Biosciences and Complete Genomics sequencing platforms, using data from human and from a set of microbes with diverse base compositions. As in previous work, library construction conditions significantly influence sequencing bias. Pacific Biosciences coverage levels are the least biased, followed by Illumina, although all technologies exhibit error-rate biases in high- and low-GC regions and at long homopolymer runs. The GC-rich regions prone to low coverage include a number of human promoters, so we therefore catalog 1,000 that were exceptionally resistant to sequencing. Our results indicate that combining data from two technologies can reduce coverage bias if the biases in the component technologies are complementary and of similar magnitude. Analysis of Illumina data representing 120-fold coverage of a well-studied human sample reveals that 0.20% of the autosomal genome was covered at less than 10% of the genome-wide average. Excluding locations that were similar to known bias motifs or likely due to sample-reference variations left only 0.045% of the autosomal genome with unexplained poor coverage. CONCLUSIONS: The assays presented in this paper provide a comprehensive view of sequencing bias, which can be used to drive laboratory improvements and to monitor production processes. Development guided by these assays should result in improved genome assemblies and better coverage of biologically important loci.
While high-throughput sequencing methods are revolutionizing fungal ecology, recovering accurate estimates of species richness and abundance has proven elusive. We sought to design internal ...transcribed spacer (ITS) primers and an Illumina protocol that would maximize coverage of the kingdom Fungi while minimizing nontarget eukaryotes. We inspected alignments of the 5.8S and large subunit (LSU) ribosomal genes and evaluated potential primers using PrimerProspector. We tested the resulting primers using tiered-abundance mock communities and five previously characterized soil samples. We recovered operational taxonomic units (OTUs) belonging to all 8 members in both mock communities, despite DNA abundances spanning 3 orders of magnitude. The expected and observed read counts were strongly correlated (r = 0.94 to 0.97). However, several taxa were consistently over- or underrepresented, likely due to variation in rRNA gene copy numbers. The Illumina data resulted in clustering of soil samples identical to that obtained with Sanger sequence clone library data using different primers. Furthermore, the two methods produced distance matrices with a Mantel correlation of 0.92. Nonfungal sequences comprised less than 0.5% of the soil data set, with most attributable to vascular plants. Our results suggest that high-throughput methods can produce fairly accurate estimates of fungal abundances in complex communities. Further improvements might be achieved through corrections for rRNA copy number and utilization of standardized mock communities.
Fungi play numerous important roles in the environment. Improvements in sequencing methods are providing revolutionary insights into fungal biodiversity, yet accurate estimates of the number of fungal species (i.e., richness) and their relative abundances in an environmental sample (e.g., soil, roots, water, etc.) remain difficult to obtain. We present improved methods for high-throughput Illumina sequencing of the species-diagnostic fungal ribosomal marker gene that improve the accuracy of richness and abundance estimates. The improvements include new PCR primers and library preparation, validation using a known mock community, and bioinformatic parameter tuning.
Here we present an in-depth characterization of the mechanism of sequencer-induced sample contamination due to the phenomenon of index swapping that impacts Illumina sequencers employing patterned ...flow cells with Exclusion Amplification (ExAmp) chemistry (HiSeqX, HiSeq4000, and NovaSeq). We also present a remediation method that minimizes the impact of such swaps.
Leveraging data collected over a two-year period, we demonstrate the widespread prevalence of index swapping in patterned flow cell data. We calculate mean swap rates across multiple sample preparation methods and sequencer models, demonstrating that different library methods can have vastly different swapping rates and that even non-ExAmp chemistry instruments display trace levels of index swapping. We provide methods for eliminating sample data cross contamination by utilizing non-redundant dual indexing for complete filtering of index swapped reads, and share the sequences for 96 non-combinatorial dual indexes we have validated across various library preparation methods and sequencer models. Finally, using computational methods we provide a greater insight into the mechanism of index swapping.
Index swapping in pooled libraries is a prevalent phenomenon that we observe at a rate of 0.2 to 6% in all sequencing runs on HiSeqX, HiSeq 4000/3000, and NovaSeq. Utilizing non-redundant dual indexing allows for the removal (flagging/filtering) of these swapped reads and eliminates swapping induced sample contamination, which is critical for sensitive applications such as RNA-seq, single cell, blood biopsy using circulating tumor DNA, or clinical sequencing.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Fungi play key roles in ecosystems as mutualists, pathogens, and decomposers. Current estimates of global species richness are highly uncertain, and the importance of stochastic vs. deterministic ...forces in the assembly of fungal communities is unknown. Molecular studies have so far failed to reach saturated, comprehensive estimates of fungal diversity. To obtain a more accurate estimate of global fungal diversity, we used a direct molecular approach to census diversity in a boreal ecosystem with precisely known plant diversity, and we carefully evaluated adequacy of sampling and accuracy of species delineation. We achieved the first exhaustive enumeration of fungi in soil, recording 1002 taxa in this system. We show that the fungus:plant ratio in Picea mariana forest soils from interior Alaska is at least 17:1 and is regionally stable. A global extrapolation of this ratio would suggest 6 million species of fungi, as opposed to leading estimates ranging from 616000 to 1.5 million. We also find that closely related fungi often occupy divergent niches. This pattern is seen in fungi spanning all major functional guilds and four phyla, suggesting a major role of deterministic niche partitioning in community assembly. Extinctions and range shifts are reorganizing biodiversity on Earth, yet our results suggest that 98% of fungi remain undescribed and that many of these species occupy unique niches.
Extensive genetic diversity in viral populations within infected hosts and the divergence of variants from existing reference genomes impede the analysis of deep viral sequencing data. A de novo ...population consensus assembly is valuable both as a single linear representation of the population and as a backbone on which intra-host variants can be accurately mapped. The availability of consensus assemblies and robustly mapped variants are crucial to the genetic study of viral disease progression, transmission dynamics, and viral evolution. Existing de novo assembly techniques fail to robustly assemble ultra-deep sequence data from genetically heterogeneous populations such as viruses into full-length genomes due to the presence of extensive genetic variability, contaminants, and variable sequence coverage.
We present VICUNA, a de novo assembly algorithm suitable for generating consensus assemblies from genetically heterogeneous populations. We demonstrate its effectiveness on Dengue, Human Immunodeficiency and West Nile viral populations, representing a range of intra-host diversity. Compared to state-of-the-art assemblers designed for haploid or diploid systems, VICUNA recovers full-length consensus and captures insertion/deletion polymorphisms in diverse samples. Final assemblies maintain a high base calling accuracy. VICUNA program is publicly available at: http://www.broadinstitute.org/scientific-community/science/projects/viral-genomics/ viral-genomics-analysis-software.
We developed VICUNA, a publicly available software tool, that enables consensus assembly of ultra-deep sequence derived from diverse viral populations. While VICUNA was developed for the analysis of viral populations, its application to other heterogeneous sequence data sets such as metagenomic or tumor cell population samples may prove beneficial in these fields of research.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Clonal hematopoiesis (CH) can be transmitted from a donor to a recipient during allogeneic hematopoietic cell transplantation. Exclusion of candidate donors with CH is controversial since its impact ...on recipient outcomes and graft alloimmune function is uncertain.
We performed targeted error-corrected sequencing on samples from 1,727 donors age 40 years or older and assessed the effect of donor CH on recipient clinical outcomes. We measured long-term engraftment of 102 donor clones and cytokine levels in 256 recipients at 3 and 12 months after transplant.
CH was present in 22.5% of donors, with
(14.6%) and
(5.2%) mutations being most common; 85% of donor clones showed long-term engraftment in recipients after transplantation, including clones with a variant allele fraction < 0.01.
CH with a variant allele fraction ≥ 0.01, but not smaller clones, was associated with improved recipient overall (hazard ratio HR, 0.79;
= .042) and progression-free survival (HR, 0.72;
= .003) after adjustment for significant clinical variables. In patients who received calcineurin-based graft-versus-host disease prophylaxis, donor
CH was associated with reduced relapse (subdistribution HR, 0.59;
= .014), increased chronic graft-versus-host disease (subdistribution HR, 1.36;
= .042), and higher interleukin-12p70 levels in recipients. No recipient of sole
or
-CH developed donor cell leukemia (DCL). In seven of eight cases, DCL evolved from donor CH with rare
or splicing factor mutations or from donors carrying germline
mutations.
Donor CH is closely associated with clinical outcomes in transplant recipients, with differential impact on graft alloimmune function and potential for leukemic transformation related to mutated gene and somatic clonal abundance. Donor
-CH is associated with improved recipient survival because of reduced relapse risk and with an augmented network of inflammatory cytokines in recipients. Risk of DCL in allogeneic hematopoietic cell transplantation is driven by somatic myelodysplastic syndrome-associated mutations or germline predisposition in donors.
Rapid diagnostic tests (RDTs) for SARS-CoV-2 antigens (Ag) that can be performed at point of care (POC) can supplement molecular testing and help mitigate the COVID-19 pandemic. Deployment of an Ag ...RDT requires an understanding of its operational and performance characteristics under real-world conditions and in relevant subpopulations. We evaluated the Abbott BinaxNOW COVID-19 Ag card in a high-throughput, drive-through, free community testing site in Massachusetts using anterior nasal (AN) swab reverse transcriptase PCR (RT-PCR) for clinical testing. Individuals presenting for molecular testing in two of seven lanes were offered the opportunity to also receive BinaxNOW testing. Dual AN swabs were collected from symptomatic and asymptomatic children (≤18 years of age) and adults. BinaxNOW testing was performed in a testing pod with temperature/humidity monitoring. One individual performed testing and official result reporting for each test, but most tests had a second independent reading to assess interoperator agreement. Positive BinaxNOW results were scored as faint, medium, or strong. Positive BinaxNOW results were reported to patients by phone, and they were instructed to isolate pending RT-PCR results. The paired RT-PCR result was the reference for sensitivity and specificity calculations. Of 2,482 participants, 1,380 adults and 928 children had paired RT-PCR/BinaxNOW results and complete symptom data. In this study, 974/1,380 (71%) adults and 829/928 (89%) children were asymptomatic. BinaxNOW had 96.5% (95% confidence interval CI, 90.0 to 99.3) sensitivity and 100% (95% CI, 98.6 to 100.0) specificity in adults within 7 days of symptoms and 84.6% (95% CI, 65.1 to 95.6) sensitivity and 100% (95% CI, 94.5 to 100.0) specificity in children within 7 days of symptoms. Sensitivity and specificity in asymptomatic adults were 70.2% (95% CI, 56.6 to 81.6) and 99.6% (95% CI, 98.9 to 99.9), respectively, and in asymptomatic children, they were 65.4% (95% CI, 55.6 to 74.4) and 99.0% (95% CI, 98.0 to 99.6), respectively. By cycle threshold (
) value cutoff, sensitivity in all subgroups combined (
= 292 RT-PCR-positive individuals) was 99.3% with
values of ≤25, 95.8% with
values of ≤30, and 81.2% with
values of ≤35. Twelve false-positive BinaxNOW results (out of 2,308 tests) were observed; in all 12, the test bands were faint but otherwise normal and were noted by both readers. One invalid BinaxNOW result was identified. Interoperator agreement (positive versus negative BinaxNOW result) was 100% (
= 2,230/2,230 double reads). Each operator was able to process 20 RDTs per hour. In a separate set of 30 specimens (from individuals with symptoms ≤7 days) run at temperatures below the manufacturer's recommended range (46 to 58.5°F), sensitivity was 66.7% and specificity 95.2%. BinaxNOW had very high specificity in both adults and children and very high sensitivity in newly symptomatic adults. Overall, 95.8% sensitivity was observed with
values of ≤30. These data support public health recommendations for use of the BinaxNOW test in adults with symptoms for ≤7 days without RT-PCR confirmation. Excellent interoperator agreement indicates that an individual can perform and read the BinaxNOW test alone. A skilled laboratorian can perform and read 20 tests per hour. Careful attention to temperature is critical.
Comprehensive sequence characterization across the MHC is important for successful organ transplantation and genetic association studies. To this end, we have developed an automated sample ...preparation, molecular barcoding and multiplexing protocol for the amplification and sequence-determination of class I HLA loci. We have coupled this process to a novel HLA calling algorithm to determine the most likely pair of alleles at each locus.
We have benchmarked our protocol with 270 HapMap individuals from four worldwide populations with 96.4% accuracy at 4-digit resolution. A variation of this initial protocol, more suitable for large sample sizes, in which molecular barcodes are added during PCR rather than library construction, was tested on 95 HapMap individuals with 98.6% accuracy at 4-digit resolution.
Next-generation sequencing on the 454 FLX Titanium platform is a reliable, efficient, and scalable technology for HLA typing.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK