The domestic ferret (Mustela putorius furo) is an important animal model for multiple human respiratory diseases. It is considered the 'gold standard' for modeling human influenza virus infection and ...transmission. Here we describe the 2.41 Gb draft genome assembly of the domestic ferret, constituting 2.28 Gb of sequence plus gaps. We annotated 19,910 protein-coding genes on this assembly using RNA-seq data from 21 ferret tissues. We characterized the ferret host response to two influenza virus infections by RNA-seq analysis of 42 ferret samples from influenza time-course data and showed distinct signatures in ferret trachea and lung tissues specific to 1918 or 2009 human pandemic influenza virus infections. Using microarray data from 16 ferret samples reflecting cystic fibrosis disease progression, we showed that transcriptional changes in the CFTR-knockout ferret lung reflect pathways of early disease that cannot be readily studied in human infants with cystic fibrosis disease.
The naked mole rat (Heterocephalus glaber) is an exceptionally long-lived and cancer-resistant rodent native to East Africa. Although its genome was previously sequenced, here we report a new ...assembly sequenced by us with substantially higher N50 values for scaffolds and contigs.
We analyzed the annotation of this new improved assembly and identified candidate genomic adaptations which may have contributed to the evolution of the naked mole rat's extraordinary traits, including in regions of p53, and the hyaluronan receptors CD44 and HMMR (RHAMM). Furthermore, we developed a freely available web portal, the Naked Mole Rat Genome Resource (http://www.naked-mole-rat.org), featuring the data and results of our analysis, to assist researchers interested in the genome and genes of the naked mole rat, and also to facilitate further studies on this fascinating species.
Summary
T4‐like myoviruses are ubiquitous, and their genes are among the most abundant documented in ocean systems. Here we compare 26 T4‐like genomes, including 10 from non‐cyanobacterial ...myoviruses, and 16 from marine cyanobacterial myoviruses (cyanophages) isolated on diverse Prochlorococcus or Synechococcus hosts. A core genome of 38 virion construction and DNA replication genes was observed in all 26 genomes, with 32 and 25 additional genes shared among the non‐cyanophage and cyanophage subsets, respectively. These hierarchical cores are highly syntenic across the genomes, and sampled to saturation. The 25 cyanophage core genes include six previously described genes with putative functions (psbA, mazG, phoH, hsp20, hli03, cobS), a hypothetical protein with a potential phytanoyl‐CoA dioxygenase domain, two virion structural genes, and 16 hypothetical genes. Beyond previously described cyanophage‐encoded photosynthesis and phosphate stress genes, we observed core genes that may play a role in nitrogen metabolism during infection through modulation of 2‐oxoglutarate. Patterns among non‐core genes that may drive niche diversification revealed that phosphorus‐related gene content reflects source waters rather than host strain used for isolation, and that carbon metabolism genes appear associated with putative mobile elements. As well, phages isolated on Synechococcus had higher genome‐wide %G+C and often contained different gene subsets (e.g. petE, zwf, gnd, prnA, cpeT) than those isolated on Prochlorococcus. However, no clear diagnostic genes emerged to distinguish these phage groups, suggesting blurred boundaries possibly due to cross‐infection. Finally, genome‐wide comparisons of both diverse and closely related, co‐isolated genomes provide a locus‐to‐locus variability metric that will prove valuable for interpreting metagenomic data sets.
High-grade serous ovarian cancers are characterized by widespread recurrent copy number alterations. Although some regions of copy number change harbor known oncogenes and tumor suppressor genes, the ...genes targeted by the majority of amplified or deleted regions in ovarian cancer remain undefined. Here we systematically tested amplified genes for their ability to promote tumor formation using an in vivo multiplexed transformation assay. We identified the GRB2-associated binding protein 2 (GAB2) as a recurrently amplified gene that potently transforms immortalized ovarian and fallopian tube secretory epithelial cells. Cancer cell lines overexpressing GAB2 require GAB2 for survival and show evidence of phosphatidylinositol 3-kinase (PI3K) pathway activation, which was required for GAB2-induced transformation. Cell lines overexpressing GAB2 were as sensitive to PI3K inhibition as cell lines harboring mutant PIK3CA . Together, these observations nominate GAB2 as an ovarian cancer oncogene, identify an alternative mechanism to activate PI3K signaling, and underscore the importance of PI3K signaling in this cancer.
Bacterial viruses (phages) play a critical role in shaping microbial populations as they influence both host mortality and horizontal gene transfer. As such, they have a significant impact on local ...and global ecosystem function and human health. Despite their importance, little is known about the genomic diversity harbored in phages, as methods to capture complete phage genomes have been hampered by the lack of knowledge about the target genomes, and difficulties in generating sufficient quantities of genomic DNA for sequencing. Of the approximately 550 phage genomes currently available in the public domain, fewer than 5% are marine phage.
To advance the study of phage biology through comparative genomic approaches we used marine cyanophage as a model system. We compared DNA preparation methodologies (DNA extraction directly from either phage lysates or CsCl purified phage particles), and sequencing strategies that utilize either Sanger sequencing of a linker amplification shotgun library (LASL) or of a whole genome shotgun library (WGSL), or 454 pyrosequencing methods. We demonstrate that genomic DNA sample preparation directly from a phage lysate, combined with 454 pyrosequencing, is best suited for phage genome sequencing at scale, as this method is capable of capturing complete continuous genomes with high accuracy. In addition, we describe an automated annotation informatics pipeline that delivers high-quality annotation and yields few false positives and negatives in ORF calling.
These DNA preparation, sequencing and annotation strategies enable a high-throughput approach to the burgeoning field of phage genomics.
RNA viruses are the causative agents for AIDS, influenza, SARS, and other serious health threats. Development of rapid and broadly applicable methods for complete viral genome sequencing is highly ...desirable to fully understand all aspects of these infectious agents as well as for surveillance of viral pandemic threats and emerging pathogens. However, traditional viral detection methods rely on prior sequence or antigen knowledge. In this study, we describe sequence-independent amplification for samples containing ultra-low amounts of viral RNA coupled with Illumina sequencing and de novo assembly optimized for viral genomes. With 5 million reads, we capture 96 to 100% of the viral protein coding region of HIV, respiratory syncytial and West Nile viral samples from as little as 100 copies of viral RNA. The methods presented here are scalable to large numbers of samples and capable of generating full or near full length viral genomes from clone and clinical samples with low amounts of viral RNA, without prior sequence information and in the presence of substantial host contamination.
Abstract
Objectives
Diffuse large B-cell lymphoma (DLBCL) is an aggressive non-Hodgkin lymphoma with a heterogenous genetic landscape that can require multiple assays to characterize. We reviewed a ...1-step RNA-based assay to determine cell of origin (COO), detect translocations, and identify mutations and to assess the role of the assay in diagnosis.
Methods
Using a single custom Archer FusionPlex Lymphoma panel, we performed anchored multiplex polymerase chain reaction–based RNA sequencing on 41 cases of de novo DLBCL. Each case was subclassified by COO, and gene fusions and hotspot mutations were identified. The findings were then compared with COO classification by the Hans immunohistochemical algorithm and NanoString technology, cytogenetics, and fluorescence in situ hybridization results.
Results
Concordant COO classification by the FusionPlex panel and NanoString was observed in 35 of 41 cases (85.3%), with NanoString and Hans concordant in 33 of 41 cases (80.5%) and FusionPlex and Hans concordant in 33 of 41 cases (80.5%). The FusionPlex assay also detected 6 of 11 BCL6 translocations (4 cryptic), 2 of 3 BCL2 translocations, and 2 of 4 MYC translocations. Mutations were detected in lymphoma-related genes in 24 of 41 cases.
Conclusion
This FusionPlex assay offers a single method for COO classification, mutation detection, and identification of important translocations in DLBCL. Although not replacing traditional testing, it could offer useful data when limited tissue is available
Pyrenophora tritici-repentis is a necrotrophic fungus causal to the disease tan spot of wheat, whose contribution to crop loss has increased significantly during the last few decades. Pathogenicity ...by this fungus is attributed to the production of host-selective toxins (HST), which are recognized by their host in a genotype-specific manner. To better understand the mechanisms that have led to the increase in disease incidence related to this pathogen, we sequenced the genomes of three P. tritici-repentis isolates. A pathogenic isolate that produces two known HSTs was used to assemble a reference nuclear genome of approximately 40 Mb composed of 11 chromosomes that encode 12,141 predicted genes. Comparison of the reference genome with those of a pathogenic isolate that produces a third HST, and a nonpathogenic isolate, showed the nonpathogen genome to be more diverged than those of the two pathogens. Examination of gene-coding regions has provided candidate pathogen-specific proteins and revealed gene families that may play a role in a necrotrophic lifestyle. Analysis of transposable elements suggests that their presence in the genome of pathogenic isolates contributes to the creation of novel genes, effector diversification, possible horizontal gene transfer events, identified copy number variation, and the first example of transduplication by DNA transposable elements in fungi. Overall, comparative analysis of these genomes provides evidence that pathogenicity in this species arose through an influx of transposable elements, which created a genetically flexible landscape that can easily respond to environmental changes.
Aims
Endometrial stromal sarcomas (ESSs) are divided into low‐grade and high‐grade subtypes, with the latter showing more aggressive clinical behaviour. Although histology and immunophenotype can aid ...in the diagnosis of these tumours, genetic studies can provide additional diagnostic insights, as low‐grade ESSs frequently harbour fusions involving JAZF1/SUZ12 and/or JAZF1/PHF1, whereas high‐grade ESSs are defined by YWHAE–NUTM2A/B fusions. The aim of this study was to evaluate the utility of a next‐generation sequencing (NGS)‐based assay in identifying ESS fusions in archival formalin‐fixed paraffin‐embedded tumour samples.
Methods and results
We applied an NGS‐based fusion transcript detection assay (Archer FusionPlex Sarcoma Panel) that targets YWHAE and JAZF1 fusions in a series of low‐grade ESSs (n = 11) and high‐grade ESSs (n = 5) that were previously confirmed to harbour genetic rearrangements by fluorescence in‐situ hybridization (FISH) and/or reverse transcription polymerase chain reaction (RT‐PCR) analyses. The fusion assay identified junctional fusion transcript sequences that corresponded to the known FISH/RT‐PCR results in all cases. Four low‐grade ESSs harboured JAZF1–PHF1 fusions with different junctional sequences, and all were correctly identified because of the open‐ended nature of the assay design, using anchored multiplex polymerase chain reaction. Seven non‐ESS sarcomas were also included as negative controls, and no strong ESS fusion candidates were identified in these cases.
Conclusions
Our findings demonstrate good sensitivity and specificity of an NGS‐based gene fusion assay in the detection of ESS fusion transcripts.
Eliminating the bacterial cloning step has been a major factor in the vastly improved efficiency of massively parallel sequencing approaches. However, this also has made it a technical challenge to ...produce the modern equivalent of the Fosmid- or BAC-end sequences that were crucial for assembling and analyzing complex genomes during the Sanger-based sequencing era. To close this technology gap, we developed Fosill, a method for converting Fosmids to Illumina-compatible jumping libraries. We constructed Fosmid libraries in vectors with Illumina primer sequences and specific nicking sites flanking the cloning site. Our family of pFosill vectors allows multiplex Fosmid cloning of end-tagged genomic fragments without physical size selection and is compatible with standard and multiplex paired-end Illumina sequencing. To excise the bulk of each cloned insert, we introduced two nicks in the vector, translated them into the inserts, and cleaved them. Recircularization of the vector via coligation of insert termini followed by inverse PCR generates a jumping library for paired-end sequencing with 101-base reads. The yield of unique Fosmid-sized jumps is sufficiently high, and the background of short, incorrectly spaced and chimeric artifacts sufficiently low, to enable applications such as mapping of structural variation and scaffolding of de novo assemblies. We demonstrate the power of Fosill to map genome rearrangements in a cancer cell line and identified three fusion genes that were corroborated by RNA-seq data. Our Fosill-powered assembly of the mouse genome has an N50 scaffold length of 17.0 Mb, rivaling the connectivity (16.9 Mb) of the Sanger-sequencing based draft assembly.