Human colorectal cancer cell lines are used widely to investigate tumor biology, experimental therapy, and biomarkers. However, to what extent these established cell lines represent and maintain the ...genetic diversity of primary cancers is uncertain. In this study, we profiled 70 colorectal cancer cell lines for mutations and DNA copy number by whole-exome sequencing and SNP microarray analyses, respectively. Gene expression was defined using RNA-Seq. Cell line data were compared with those published for primary colorectal cancers in The Cancer Genome Atlas. Notably, we found that exome mutation and DNA copy-number spectra in colorectal cancer cell lines closely resembled those seen in primary colorectal tumors. Similarities included the presence of two hypermutation phenotypes, as defined by signatures for defective DNA mismatch repair and DNA polymerase ε proofreading deficiency, along with concordant mutation profiles in the broadly altered WNT, MAPK, PI3K, TGFβ, and p53 pathways. Furthermore, we documented mutations enriched in genes involved in chromatin remodeling (ARID1A, CHD6, and SRCAP) and histone methylation or acetylation (ASH1L, EP300, EP400, MLL2, MLL3, PRDM2, and TRRAP). Chromosomal instability was prevalent in nonhypermutated cases, with similar patterns of chromosomal gains and losses. Although paired cell lines derived from the same tumor exhibited considerable mutation and DNA copy-number differences, in silico simulations suggest that these differences mainly reflected a preexisting heterogeneity in the tumor cells. In conclusion, our results establish that human colorectal cancer lines are representative of the main subtypes of primary tumors at the genomic level, further validating their utility as tools to investigate colorectal cancer biology and drug responses.
While genetic mutation is a hallmark of cancer, many cancers also acquire epigenetic alterations during tumorigenesis including aberrant DNA hypermethylation of tumor suppressors, as well as changes ...in chromatin modifications as caused by genetic mutations of the chromatin-modifying machinery. However, the extent of epigenetic alterations in cancer cells has not been fully characterized. Here, we describe complete methylome maps at single nucleotide resolution of a low-passage breast cancer cell line and primary human mammary epithelial cells. We find widespread DNA hypomethylation in the cancer cell, primarily at partially methylated domains (PMDs) in normal breast cells. Unexpectedly, genes within these regions are largely silenced in cancer cells. The loss of DNA methylation in these regions is accompanied by formation of repressive chromatin, with a significant fraction displaying allelic DNA methylation where one allele is DNA methylated while the other allele is occupied by histone modifications H3K9me3 or H3K27me3. Our results show a mutually exclusive relationship between DNA methylation and H3K9me3 or H3K27me3. These results suggest that global DNA hypomethylation in breast cancer is tightly linked to the formation of repressive chromatin domains and gene silencing, thus identifying a potential epigenetic pathway for gene regulation in cancer cells.
Next generation sequencing (NGS) platforms are currently being utilized for targeted sequencing of candidate genes or genomic intervals to perform sequence-based association studies. To evaluate ...these platforms for this application, we analyzed human sequence generated by the Roche 454, Illumina GA, and the ABI SOLiD technologies for the same 260 kb in four individuals.
Local sequence characteristics contribute to systematic variability in sequence coverage (>100-fold difference in per-base coverage), resulting in patterns for each NGS technology that are highly correlated between samples. A comparison of the base calls to 88 kb of overlapping ABI 3730xL Sanger sequence generated for the same samples showed that the NGS platforms all have high sensitivity, identifying >95% of variant sites. At high coverage, depth base calling errors are systematic, resulting from local sequence contexts; as the coverage is lowered additional 'random sampling' errors in base calling occur.
Our study provides important insights into systematic biases and data variability that need to be considered when utilizing NGS platforms for population targeted sequencing studies.
Activation of the canonical TGF-β signaling pathway provides growth inhibitory signals in the normal intestinal epithelium. Colorectal cancers (CRCs) frequently harbor somatic mutations in the ...pathway members TGFBR2 and SMAD4, but to what extent mutations in SMAD2 or SMAD3 contribute to tumorigenesis is unclear. A cohort of 744 primary CRCs and 36 CRC cell lines were sequenced for SMAD4, SMAD2, and SMAD3 and analyzed for allelic loss by single-nucleotide polymorphism (SNP) microarray analysis. Mutation spectra were compared between the genes, the pathogenicity of mutations was assessed, and relationships with clinicopathologic features were examined. The prevalence of SMAD4, SMAD2, and SMAD3 mutations in sporadic CRCs was 8.6% (64 of 744), 3.4% (25 of 744), and 4.3% (32 of 744), respectively. A significant overrepresentation of two genetic hits was detected for SMAD4 and SMAD3, consistent with these genes acting as tumor suppressors. SMAD4 mutations were associated with mucinous histology. The mutation spectra of SMAD2 and SMAD3 were highly similar to that of SMAD4, both in mutation type and location within the encoded proteins. In silico analyses suggested the majority of the mutations were pathogenic, with most missense changes predicted to reduce protein stability or hinder SMAD complex formation. The latter altered interface residues or disrupted the phosphorylation-regulated Ser-Ser-X-Ser motifs within SMAD2 and SMAD3. Functional analyses of selected mutations showed reductions in SMAD3 transcriptional activity and SMAD2-SMAD4 complex formation. Joint biallelic hits in SMAD2 and SMAD3 were overrepresented and mutually exclusive to SMAD4 mutation, underlining the critical roles of these three proteins within the TGF-β signaling pathway.
There is much interest in characterizing the variation in a human individual, because this may elucidate what contributes significantly to a person's phenotype, thereby enabling personalized ...genomics. We focus here on the variants in a person's 'exome,' which is the set of exons in a genome, because the exome is believed to harbor much of the functional variation. We provide an analysis of the approximately 12,500 variants that affect the protein coding portion of an individual's genome. We identified approximately 10,400 nonsynonymous single nucleotide polymorphisms (nsSNPs) in this individual, of which approximately 15-20% are rare in the human population. We predict approximately 1,500 nsSNPs affect protein function and these tend be heterozygous, rare, or novel. Of the approximately 700 coding indels, approximately half tend to have lengths that are a multiple of three, which causes insertions/deletions of amino acids in the corresponding protein, rather than introducing frameshifts. Coding indels also occur frequently at the termini of genes, so even if an indel causes a frameshift, an alternative start or stop site in the gene can still be used to make a functional protein. In summary, we reduced the set of approximately 12,500 nonsilent coding variants by approximately 8-fold to a set of variants that are most likely to have major effects on their proteins' functions. This is our first glimpse of an individual's exome and a snapshot of the current state of personalized genomics. The majority of coding variants in this individual are common and appear to be functionally neutral. Our results also indicate that some variants can be used to improve the current NCBI human reference genome. As more genomes are sequenced, many rare variants and non-SNP variants will be discovered. We present an approach to analyze the coding variation in humans by proposing multiple bioinformatic methods to hone in on possible functional variation.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Detection of circulating tumor DNA (ctDNA) after resection of stage II colon cancer may identify patients at the highest risk of recurrence and help inform adjuvant treatment decisions. We used ...massively parallel sequencing-based assays to evaluate the ability of ctDNA to detect minimal residual disease in 1046 plasma samples from a prospective cohort of 230 patients with resected stage II colon cancer. In patients not treated with adjuvant chemotherapy, ctDNA was detected postoperatively in 14 of 178 (7.9%) patients, 11 (79%) of whom had recurred at a median follow-up of 27 months; recurrence occurred in only 16 (9.8 %) of 164 patients with negative ctDNA hazard ratio (HR), 18; 95% confidence interval (CI), 7.9 to 40; P < 0.001. In patients treated with chemotherapy, the presence of ctDNA after completion of chemotherapy was also associated with an inferior recurrence-free survival (HR, 11; 95% CI, 1.8 to 68; P = 0.001). ctDNA detection after stage II colon cancer resection provides direct evidence of residual disease and identifies patients at very high risk of recurrence.
Glioblastoma multiforme (GBM) is the most common and lethal type of brain cancer. To identify the genetic alterations in GBMs, we sequenced 20,661 protein coding genes, determined the presence of ...amplifications and deletions using high-density oligonucleotide arrays, and performed gene expression analyses using next-generation sequencing technologies in 22 human tumor samples. This comprehensive analysis led to the discovery of a variety of genes that were not known to be altered in GBMs. Most notably, we found recurrent mutations in the active site of isocitrate dehydrogenase 1 (IDH1) in 12% of GBM patients. Mutations in IDH1 occurred in a large fraction of young patients and in most patients with secondary GBMs and were associated with an increase in overall survival. These studies demonstrate the value of unbiased genomic analyses in the characterization of human brain cancer and identify a potentially useful genetic alteration for the classification and targeted therapy of GBMs.
The National Cancer Institute Initiative in Chemical Genetics is designed to encourage the development of small molecular probes. The probes are useful for activating or inactivating protein ...functions, thereby providing resources that help discern the functions of gene products in normal and disease cells, as well as in tissues. This initiative includes "ChemBank," a suite of informatics tools and databases aimed at promoting the development and use of chemical genetics by scientists worldwide. The information generated with such tools should provide a critical link from genomic discovery to drug development.
PIK3CA and PTEN mutations are prevalent in colorectal cancer and potential markers of response to mitogen-activated protein/extracellular signal-regulated kinase inhibitors and anti-EGF receptor ...antibody therapy. Relationships between phosphoinositide 3-kinase (PI3K) pathway mutation, clinicopathologic characteristics, molecular features, and prognosis remain controversial.
A total of 1,093 stage I-IV colorectal cancers were screened for PIK3CA (exons 9 and 20), KRAS (codons 12-13), BRAF (codon 600) mutations, and microsatellite instability (MSI). PTEN (exons 3-8) and CpG island methylator phenotype (CIMP) status were determined in 744 and 489 cases. PIK3CA data were integrated with 17 previous reports (n = 5,594).
PIK3CA and PTEN mutations were identified in 11.9% and 5.8% of colorectal cancers. PTEN mutation was associated with proximal tumors, mucinous histology, MSI-high (MSI-H), CIMP-high (CIMP-H), and BRAF mutation (P < 0.02). PIK3CA mutation was related to older age, proximal tumors, mucinous histology, and KRAS mutation (P < 0.04). In integrated cohort analysis, PIK3CA exon 9 and 20 mutations were overrepresented in proximal, CIMP-low (CIMP-L), and KRAS-mutated cancers (P ≤ 0.011). Comparing PIK3CA exonic mutants, exon 20 mutation was associated with MSI-H, CIMP-H, and BRAF mutation, and exon 9 mutation was associated with KRAS mutation (P ≤ 0.027). Disease-free survival for stage II/III colorectal cancers did not differ by PI3K pathway status.
PI3K pathway mutation is prominent in proximal colon cancers, with PIK3CA exon 20 and PTEN mutations associated with features of the sessile-serrated pathway (MSI-H/CIMP-H/BRAF(mut)), and PIK3CA exon 9 (and to a lesser extent exon 20) mutation associated with features of the traditional serrated pathway (CIMP-L/KRAS(mut)) of tumorigenesis. Our data highlight the PI3K pathway as a therapeutic target in distinct colorectal cancer subtypes.
Owing to their phylogenetic position, cartilaginous fishes (sharks, rays, skates, and chimaeras) provide a critical reference for our understanding of vertebrate genome evolution. The relatively ...small genome of the elephant shark, Callorhinchus milii, a chimaera, makes it an attractive model cartilaginous fish genome for whole-genome sequencing and comparative analysis. Here, the authors describe survey sequencing (1.4x coverage) and comparative analysis of the elephant shark genome, one of the first cartilaginous fish genomes to be sequenced to this depth. Repetitive sequences, represented mainly by a novel family of short interspersed element-like and long interspersed element-like sequences, account for about 28% of the elephant shark genome. Fragments of approximately 15,000 elephant shark genes reveal specific examples of genes that have been lost differentially during the evolution of tetrapod and teleost fish lineages. Interestingly, the degree of conserved synteny and conserved sequences between the human and elephant shark genomes are higher than that between human and teleost fish genomes. Elephant shark contains putative four Hox clusters indicating that, unlike teleost fish genomes, the elephant shark genome has not experienced an additional whole-genome duplication. These findings underscore the importance of the elephant shark as a critical reference vertebrate genome for comparative analysis of the human and other vertebrate genomes. This study also demonstrates that a survey-sequencing approach can be applied productively for comparative analysis of distantly related vertebrate genomes.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK