Obtaining high-quality sequence continuity of complex regions of recent segmental duplication remains one of the major challenges of finishing genome assemblies. In the human and mouse genomes, this was achieved by targeting large-insert clones using costly and laborious capillary-based sequencing approaches. Sanger shotgun sequencing of clone inserts, however, has now been largely abandoned, leaving most of these regions unresolved in newer genome assemblies generated primarily by next-generation sequencing hybrid approaches. Here we show that it is possible to resolve regions that are complex in a genome-wide context but simple in isolation for a fraction of the time and cost of traditional methods using long-read single molecule, real-time (SMRT) sequencing and assembly technology from Pacific Biosciences (PacBio). We sequenced and assembled BAC clones corresponding to a 1.3-Mbp complex region of chromosome 17q21.31, demonstrating 99.994% identity to Sanger assemblies of the same clones. We targeted 44 differences using Illumina sequencing and found that PacBio and Sanger assemblies share a comparable number of validated variants, albeit with different sequence context biases. Finally, we targeted a poorly assembled 766-kbp duplicated region of the chimpanzee genome and resolved the structure and organization for a fraction of the cost and time of traditional finishing approaches. Our data suggest a straightforward path for upgrading genomes to a higher quality finished state.
The Cancer Genome Atlas Network recently cataloged recurrent genomic abnormalities in glioblastoma multiforme (GBM). We describe a robust gene expression-based molecular classification of GBM into Proneural, Neural, Classical, and Mesenchymal subtypes and integrate multidimensional genomic data to establish patterns of somatic mutations and DNA copy number. Aberrations and gene expression of EGFR, NF1, and PDGFRA/IDH1 define the Classical, Mesenchymal, and Proneural subtypes, respectively. Gene signatures of normal brain cell types show a strong relationship between subtypes and different neural lineages. Additionally, response to aggressive therapy differs by subtype, with the greatest benefit in the Classical subtype and no benefit in the Proneural subtype. We provide a framework that unifies transcriptomic and genomic dimensions for GBM molecular stratification with important implications for future studies.
► Four gene expression subtypes of GBM: Proneural, Neural, Classical, and Mesenchymal ► NF1 mutation and loss define Mesenchymal GBM ► Focal EGFR events define Classical GBM ► PDGFRA/IDH1 events define Proneural GBM
We compared the human and mouse X chromosomes to systematically test Ohno's law, which states that the gene content of X chromosomes is conserved across placental mammals. First, we improved the accuracy of the human X-chromosome reference sequence through single-haplotype sequencing of ampliconic regions. The new sequence closed gaps in the reference sequence, corrected previously misassembled regions and identified new palindromic amplicons. Our subsequent analysis led us to conclude that the evolution of human and mouse X chromosomes was bimodal. In accord with Ohno's law, 94-95% of X-linked single-copy genes are shared by humans and mice; most are expressed in both sexes. Notably, most X-ampliconic genes are exceptions to Ohno's law: only 31% of human and 22% of mouse X-ampliconic genes had orthologs in the other species. X-ampliconic genes are expressed predominantly in testicular germ cells, and many were independently acquired since divergence from the common ancestor of humans and mice, specializing portions of their X chromosomes for sperm production.
The identification of small sequence variants remains a challenging but critical step in the analysis of next-generation sequencing data. Our variant calling tool, VarScan 2, employs heuristic and statistical thresholds based on user-defined criteria to call variants using SAMtools mpileup data as input. Here, we provide guidelines for generating that input, and describe protocols for using VarScan 2 to (1) identify germline variants in individual samples; (2) call somatic mutations, copy number alterations, and LOH events in tumor-normal pairs; and (3) identify germline variants, de novo mutations, and Mendelian inheritance errors in family trios. Further, we describe a strategy for variant filtering that removes likely false positives associated with common sequencing- and alignment-related artifacts.
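The idea of calling variants with user-defined heuristic thresholds can be illustrated with a minimal Python sketch. It is not VarScan 2's implementation: the `SiteCounts` record, the `passes_thresholds` function, and the default cutoffs are all hypothetical and chosen only for illustration, and real pileup data carry base qualities and strand information that are ignored here.

```python
from dataclasses import dataclass


@dataclass
class SiteCounts:
    """Hypothetical per-site summary of pileup data (not a VarScan format)."""
    chrom: str
    pos: int
    ref_reads: int  # reads supporting the reference base
    alt_reads: int  # reads supporting the most common non-reference base


def passes_thresholds(site, min_coverage=8, min_alt_reads=2, min_var_freq=0.20):
    """Return True if the site meets all three illustrative cutoffs."""
    depth = site.ref_reads + site.alt_reads
    # Reject sites that are too shallow or have too few supporting reads.
    if depth < min_coverage or site.alt_reads < min_alt_reads:
        return False
    # Require a minimum fraction of reads to carry the variant allele.
    return site.alt_reads / depth >= min_var_freq


sites = [
    SiteCounts("chr1", 1000, ref_reads=18, alt_reads=12),  # well supported
    SiteCounts("chr1", 2000, ref_reads=3, alt_reads=2),    # depth below cutoff
    SiteCounts("chr1", 3000, ref_reads=49, alt_reads=1),   # single variant read
]
calls = [s.pos for s in sites if passes_thresholds(s)]
print(calls)
```

Only the first site passes in this toy input; the other two fail the coverage and supporting-read cutoffs, respectively.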
Colorectal cancer (CRC) is the most common gastrointestinal malignancy in the U.S.A. and approximately 50% of patients develop metastatic disease (mCRC). Despite our understanding of long non-coding RNAs (lncRNAs) in primary colon cancer, their role in mCRC and treatment resistance remains poorly characterized. Therefore, through transcriptome sequencing of normal, primary, and distant mCRC tissues we find 148 differentially expressed RNAs Associated with Metastasis (RAMS). We prioritize RAMS11 due to its association with poor disease-free survival and promotion of aggressive phenotypes in vitro and in vivo. An FDA-approved drug high-throughput viability assay shows that elevated RAMS11 expression increases resistance to topoisomerase inhibitors. Subsequent experiments demonstrate that RAMS11-dependent recruitment of Chromobox protein 4 (CBX4) transcriptionally activates Topoisomerase II alpha (TOP2α). Overall, recent clinical trials using topoisomerase inhibitors, coupled with our findings of RAMS11-dependent regulation of TOP2α, support the potential use of RAMS11 as a biomarker and therapeutic target for mCRC.
Large-scale cancer sequencing data enable discovery of rare germline cancer susceptibility variants. Here we systematically analyse 4,034 cancer cases from The Cancer Genome Atlas representing 12 cancer types. We find that the frequency of rare germline truncations in 114 cancer-susceptibility-associated genes varies widely, from 4% (acute myeloid leukaemia (AML)) to 19% (ovarian cancer), with a notably high frequency of 11% in stomach cancer. Burden testing identifies 13 cancer genes with significant enrichment of rare truncations, some associated with specific cancers (for example, RAD51C, PALB2 and MSH6 in AML, stomach and endometrial cancers, respectively). Significant, tumour-specific loss of heterozygosity occurs in nine genes (ATM, BAP1, BRCA1/2, BRIP1, FANCM, PALB2 and RAD51C/D). Moreover, our homology-directed repair assay of 68 BRCA1 rare missense variants supports the utility of allelic enrichment analysis for characterizing variants of unknown significance. The scale of this analysis and the somatic-germline integration enable the detection of rare variants that may affect individual susceptibility to tumour development, a critical step toward precision medicine.
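One common way to frame a burden test is a one-sided Fisher's exact test on carrier counts. The abstract does not state which statistic was used, so the Python sketch below is an assumption made for illustration only. The function name `fisher_one_sided` and the counts in the usage line are hypothetical.

```python
from math import comb


def fisher_one_sided(carriers_case, total_case, carriers_ctrl, total_ctrl):
    """One-sided Fisher's exact p-value for enrichment of carriers in cases.

    Computes P(X >= carriers_case) where X follows the hypergeometric
    distribution implied by fixed row and column totals.
    """
    total = total_case + total_ctrl
    carriers = carriers_case + carriers_ctrl
    denom = comb(total, total_case)
    upper = min(carriers, total_case)
    # Sum the probabilities of tables at least as extreme as the observed one.
    tail = sum(
        comb(carriers, k) * comb(total - carriers, total_case - k)
        for k in range(carriers_case, upper + 1)
    )
    return tail / denom


# Hypothetical counts: truncation carriers among cases vs. a comparison cohort.
p_value = fisher_one_sided(carriers_case=12, total_case=200,
                           carriers_ctrl=20, total_ctrl=4000)
print(f"one-sided p = {p_value:.3g}")
```

Exact integer arithmetic via `math.comb` avoids floating-point underflow for large cohorts, at the cost of speed; a production analysis would typically also correct for multiple testing across genes.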
B73 Maize Genome: Complexity, Diversity, and Dynamics. Ware, Doreen; Fulton, Robert S; Wei, Fusheng ...
Science (American Association for the Advancement of Science), 11/2009, Volume 326, Issue 5956
Journal Article, peer reviewed, open access
We report an improved draft nucleotide sequence of the 2.3-gigabase genome of maize, an important crop plant and model for biological research. Over 32,000 genes were predicted, of which 99.8% were placed on reference chromosomes. Nearly 85% of the genome is composed of hundreds of families of transposable elements, dispersed nonuniformly across the genome. These were responsible for the capture and amplification of numerous gene fragments and affect the composition, sizes, and positions of centromeres. We also report on the correlation of methylation-poor regions with Mu transposon insertions and recombination, and copy number variants with insertions and/or deletions, as well as how uneven gene losses between duplicated regions were involved in returning an ancient allotetraploid to a genetically diploid state. These analyses inform and set the stage for further investigations to improve our understanding of the domestication and agricultural improvements of maize.
Rhabdomyosarcoma is a soft-tissue sarcoma with molecular and cellular features of developing skeletal muscle. Rhabdomyosarcoma has two major histologic subtypes, embryonal and alveolar, each with distinct clinical, molecular, and genetic features. Genomic analysis shows that embryonal tumors have more structural and copy number variations than alveolar tumors. Mutations in the RAS/NF1 pathway are significantly associated with intermediate- and high-risk embryonal rhabdomyosarcomas (ERMS). In contrast, alveolar rhabdomyosarcomas (ARMS) have fewer genetic lesions overall and no known recurrently mutated cancer consensus genes. To identify therapeutics for ERMS, we developed and characterized orthotopic xenografts of tumors that were sequenced in our study. High-throughput screening of primary cultures derived from those xenografts identified oxidative stress as a pathway of therapeutic relevance for ERMS.
• There are higher rates of mutation in ERMS than in ARMS tumors
• RAS pathway mutations are associated with intermediate- and high-risk ERMS
• ERMS tumor cells have elevated oxidative stress
• ERMS tumors are sensitive to drugs that target oxidative stress
Detection and characterization of genomic structural variation are important for understanding the landscape of genetic variation in human populations and in complex diseases such as cancer. Recent studies demonstrate the feasibility of detecting structural variation using next-generation, short-insert, paired-end sequencing reads. However, the utility of these reads is not entirely clear, nor are the analysis methods with which accurate detection can be achieved. The algorithm BreakDancer predicts a wide variety of structural variants including insertion-deletions (indels), inversions and translocations. We examined BreakDancer's performance in simulation, in comparison with other methods and in analyses of a sample from an individual with acute myeloid leukemia and of samples from the 1000 Genomes trio individuals. BreakDancer sensitively and accurately detected indels ranging from 10 base pairs to 1 megabase pair that are difficult to detect via a single conventional approach.
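The general principle of inferring structural variants from paired-end reads, namely flagging pairs whose mapped distance or orientation is anomalous, can be sketched in Python as follows. This is a deliberately simplified illustration and not BreakDancer's algorithm, which additionally clusters supporting pairs and scores them statistically. The `ReadPair` record, the `classify_pair` function, and the library parameters (mean 300 bp, standard deviation 30 bp, 3-SD cutoff) are hypothetical, and the distance between mapped positions is used as a rough stand-in for insert size.

```python
from dataclasses import dataclass


@dataclass
class ReadPair:
    """Hypothetical mapped read pair: chromosome, position, strand per mate."""
    chrom1: str
    pos1: int
    strand1: str
    chrom2: str
    pos2: int
    strand2: str


def classify_pair(pair, mean_insert=300.0, sd_insert=30.0, n_sd=3.0):
    """Label a read pair by the kind of variant its mapping would suggest."""
    # Mates on different chromosomes hint at a translocation.
    if pair.chrom1 != pair.chrom2:
        return "translocation"
    # Mates on the same strand hint at an inversion.
    if pair.strand1 == pair.strand2:
        return "inversion"
    span = abs(pair.pos2 - pair.pos1)
    # Mates mapping too far apart suggest sequence deleted in the sample.
    if span > mean_insert + n_sd * sd_insert:
        return "deletion"
    # Mates mapping too close together suggest inserted sequence.
    if span < mean_insert - n_sd * sd_insert:
        return "insertion"
    return "normal"


pairs = [
    ReadPair("chr1", 100, "+", "chr1", 400, "-"),
    ReadPair("chr1", 100, "+", "chr1", 5100, "-"),
    ReadPair("chr1", 100, "+", "chr1", 250, "-"),
    ReadPair("chr1", 100, "+", "chr1", 400, "+"),
    ReadPair("chr1", 100, "+", "chr2", 400, "-"),
]
labels = [classify_pair(p) for p in pairs]
print(labels)
```

A single anomalous pair is weak evidence on its own; in practice a call would require several independent pairs supporting the same breakpoint.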
Heterorhabditis bacteriophora are entomopathogenic nematodes that have evolved a mutualism with Photorhabdus luminescens bacteria to function as highly virulent insect pathogens. The nematode provides a safe harbor for intestinal symbionts in soil and delivers the symbiotic bacteria into the insect blood. The symbiont provides virulence and toxins, metabolites essential for nematode reproduction, and antibiotic preservation of the insect cadaver. Approximately half of the 21,250 putative protein coding genes identified in the 77 Mbp high quality draft H. bacteriophora genome sequence encode novel proteins of unknown function lacking homologs in Caenorhabditis elegans or any other sequenced organisms. Similarly, 317 of the 603 predicted secreted proteins are novel with unknown function, in addition to 19 putative peptidases, 9 peptidase inhibitors and 7 C-type lectins that may function in interactions with insect hosts or bacterial symbionts. In total, 134 proteins contained mariner transposase domains, of which there are none in C. elegans, suggesting an invasion and expansion of mariner transposons in H. bacteriophora. Fewer Kyoto Encyclopedia of Genes and Genomes Orthologies in almost all metabolic categories were detected in the genome compared with 9 other sequenced nematode genomes, which may reflect dependence on the symbiont or insect host for these functions. The H. bacteriophora genome sequence will greatly facilitate genetics, genomics and evolutionary studies to gain fundamental knowledge of nematode parasitism and mutualism. It also elevates the utility of H. bacteriophora as a bridge species between vertebrate parasitic nematodes and the C. elegans model.