The human reference genome assembly plays a central role in nearly all aspects of today's basic and clinical research. GRCh38 is the first coordinate-changing assembly update since 2009; it reflects ...the resolution of roughly 1000 issues and encompasses modifications ranging from thousands of single base changes to megabase-scale path reorganizations, gap closures, and localization of previously orphaned sequences. We developed a new approach to sequence generation for targeted base updates and used data from new genome mapping technologies and single haplotype resources to identify and resolve larger assembly issues. For the first time, the reference assembly contains sequence-based representations for the centromeres. We also expanded the number of alternate loci to create a reference that provides a more robust representation of human population variation. We demonstrate that the updates render the reference an improved annotation substrate, alter read alignments in unchanged regions, and impact variant interpretation at clinically relevant loci. We additionally evaluated a collection of new de novo long-read haploid assemblies and conclude that although the new assemblies compare favorably to the reference with respect to continuity, error rate, and gene completeness, the reference still provides the best representation for complex genomic regions and coding sequences. We assert that the collected updates in GRCh38 make the newer assembly a more robust substrate for comprehensive analyses that will promote our understanding of human biology and advance our efforts to improve health.
Decitabine produced responses in patients with acute myeloid leukemia or myelodysplastic syndromes who had cytogenetic abnormalities associated with a poor prognosis, including 21 of 21 patients with ...tumors that contained
TP53
mutations.
Acute myeloid leukemia (AML) and myelodysplastic syndromes (MDS) are clonal disorders of myeloid hematopoiesis.
1
Adult patients with AML who have karyotypes that are associated with unfavorable risk and older patients with AML (≥60 years of age) have poor outcomes, with a median survival of approximately 1 year.
2
,
3
Patients with AML and
TP53
mutations tend to be older (median age, 61 to 67 years), and almost all have karyotypes that are associated with unfavorable risk; if they receive standard cytotoxic chemotherapy, these patients have especially poor outcomes (median survival, 4 to 6 months).
3
–
6
Decitabine (5-aza-2′-deoxycytidine) is commonly used as . . .
The Cancer Genome Atlas (TCGA) has used the latest sequencing and analysis methods to identify somatic variants across thousands of tumours. Here we present data and analytical results for point ...mutations and small insertions/deletions from 3,281 tumours across 12 tumour types as part of the TCGA Pan-Cancer effort. We illustrate the distributions of mutation frequencies, types and contexts across tumour types, and establish their links to tissues of origin, environmental/carcinogen influences, and DNA repair defects. Using the integrated data sets, we identified 127 significantly mutated genes from well-known (for example, mitogen-activated protein kinase, phosphatidylinositol-3-OH kinase, Wnt/β-catenin and receptor tyrosine kinase signalling pathways, and cell cycle control) and emerging (for example, histone, histone modification, splicing, metabolism and proteolysis) cellular processes in cancer. The average number of mutations in these significantly mutated genes varies across tumour types; most tumours have two to six, indicating that the number of driver mutations required during oncogenesis is relatively small. Mutations in transcriptional factors/regulators show tissue specificity, whereas histone modifiers are often mutated across several cancer types. Clinical association analysis identifies genes having a significant effect on survival, and investigations of mutations with respect to clonal/subclonal architecture delineate their temporal orders during tumorigenesis. Taken together, these results lay the groundwork for developing new diagnostics and individualizing cancer treatment.
Celotno besedilo
Dostopno za:
DOBA, IJS, IZUM, KILJ, KISLJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
In an effort to more fully understand the full spectrum of human genetic variation, we generated deep single-molecule, real-time (SMRT) sequencing data from two haploid human genomes. By using an ...assembly-based approach (SMRT-SV), we systematically assessed each genome independently for structural variants (SVs) and indels resolving the sequence structure of 461,553 genetic variants from 2 bp to 28 kbp in length. We find that >89% of these variants have been missed as part of analysis of the 1000 Genomes Project even after adjusting for more common variants (MAF > 1%). We estimate that this theoretical human diploid differs by as much as ∼16 Mbp with respect to the human reference, with long-read sequencing data providing a fivefold increase in sensitivity for genetic variants ranging in size from 7 bp to 1 kbp compared with short-read sequence data. Although a large fraction of genetic variants were not detected by short-read approaches, once the alternate allele is sequence-resolved, we show that 61% of SVs can be genotyped in short-read sequence data sets with high accuracy. Uncoupling discovery from genotyping thus allows for the majority of this missed common variation to be genotyped in the human population. Interestingly, when we repeat SV detection on a pseudodiploid genome constructed in silico by merging the two haploids, we find that ∼59% of the heterozygous SVs are no longer detected by SMRT-SV. These results indicate that haploid resolution of long-read sequencing data will significantly increase sensitivity of SV detection.
We sequenced the MSY (male-specific region of the Y chromosome) of the C57BL/6J strain of the laboratory mouse Mus musculus. In contrast to theories that Y chromosomes are heterochromatic and gene ...poor, the mouse MSY is 99.9% euchromatic and contains about 700 protein-coding genes. Only 2% of the MSY derives from the ancestral autosomes that gave rise to the mammalian sex chromosomes. Instead, all but 45 of the MSY's genes belong to three acquired, massively amplified gene families that have no homologs on primate MSYs but do have acquired, amplified homologs on the mouse X chromosome. The complete mouse MSY sequence brings to light dramatic forces in sex chromosome evolution: lineage-specific convergent acquisition and amplification of X-Y gene families, possibly fueled by antagonism between acquired X-Y homologs. The mouse MSY sequence presents opportunities for experimental studies of a sex-specific chromosome in its entirety, in a genetically tractable model organism.
A constitutive mutation in a cancer-susceptibility gene can have implications for clinical treatment and genetic counseling of family members. This study involving 1120 children and adolescents ...showed that 95 (8.5%) had such a mutation.
The frequency of germline mutations in cancer-predisposition genes in children and adolescents with cancer and the implications of such mutations are largely unknown. Previous studies have relied mainly on candidate-gene approaches, which are, by design, limited. To better determine the contribution of germline predisposition mutations to childhood cancer, we used next-generation sequencing, including whole-genome and whole-exome sequencing, to analyze the genomes of 1120 children and adolescents with cancer. We describe the prevalence and spectrum of germline variants among 565 cancer-associated genes, with an emphasis on the analysis of 60 genes that have been associated with autosomal dominant cancer-predisposition syndromes. We . . .
Massively parallel sequencing technology and the associated rapidly decreasing sequencing costs have enabled systemic analyses of somatic mutations in large cohorts of cancer cases. Here we introduce ...a comprehensive mutational analysis pipeline that uses standardized sequence-based inputs along with multiple types of clinical data to establish correlations among mutation sites, affected genes and pathways, and to ultimately separate the commonly abundant passenger mutations from the truly significant events. In other words, we aim to determine the Mutational Significance in Cancer (MuSiC) for these large data sets. The integration of analytical operations in the MuSiC framework is widely applicable to a broad set of tumor types and offers the benefits of automation as well as standardization. Herein, we describe the computational structure and statistical underpinnings of the MuSiC pipeline and demonstrate its performance using 316 ovarian cancer samples from the TCGA ovarian cancer project. MuSiC correctly confirms many expected results, and identifies several potentially novel avenues for discovery.
Massively parallel sequencing technologies hold incredible promise for the study of DNA sequence variation, particularly the identification of variants affecting human disease. The unprecedented ...throughput and relatively short read lengths of Roche/454, Illumina/Solexa, and other platforms have spurred development of a new generation of sequence alignment algorithms. Yet detection of sequence variants based on short read alignments remains challenging, and most currently available tools are limited to a single platform or aligner type. We present VarScan, an open source tool for variant detection that is compatible with several short read aligners. We demonstrate VarScan's ability to detect SNPs and indels with high sensitivity and specificity, in both Roche/454 sequencing of individuals and deep Illumina/Solexa sequencing of pooled samples. Availability and Implementation: Source code and documentation freely available at http://genome.wustl.edu/tools/cancer-genomics implemented as a Perl package and supported on Linux/UNIX, MS Windows and Mac OSX. Contact: dkoboldt@genome.wustl.edu Supplementary information: Supplementary data are available at Bioinformatics online.
In the developing retina, multipotent neural progenitors undergo unidirectional differentiation in a precise spatiotemporal order. Here we profile the epigenetic and transcriptional changes that ...occur during retinogenesis in mice and humans. Although some progenitor genes and cell cycle genes were epigenetically silenced during retinogenesis, the most dramatic change was derepression of cell-type-specific differentiation programs. We identified developmental-stage-specific super-enhancers and showed that most epigenetic changes are conserved in humans and mice. To determine how the epigenome changes during tumorigenesis and reprogramming, we performed integrated epigenetic analysis of murine and human retinoblastomas and induced pluripotent stem cells (iPSCs) derived from murine rod photoreceptors. The retinoblastoma epigenome mapped to the developmental stage when retinal progenitors switch from neurogenic to terminal patterns of cell division. The epigenome of retinoblastomas was more similar to that of the normal retina than that of retina-derived iPSCs, and we identified retina-specific epigenetic memory.
•Changes in histone modifications were more prominent than those in DNA methylation•Epigenetic changes were more prevalent at differentiation genes than progenitors•The retinoblastoma epigenome resembles the retina at a developmental transition•Histone modifications are important for neuronal epigenetic memory in iPSCs
Aldiri, Xu, and colleagues show in this article how the epigenome of the mouse and human retina changes during development in coordination with transcriptional programs. They also relate those developmental changes to retinoblastoma and epigenetic memory in retina-derived iPSCs.