Recent studies have shown that some silent mutations can be harmful to various processes. In this study, we performed a comprehensive in silico analysis to elucidate the effects of silent mutations ...on cancer pathogenesis using exome sequencing data derived from the Cancer Genome Atlas. We focused on the codon optimality scores of silent mutations, which were defined as the difference between the optimality of synonymous codons, calculated using the codon usage table. The relationship between cancer evolution and silent mutations showed that the codon optimality score of the mutations that occurred later in carcinogenesis was significantly higher than of those that occurred earlier. In addition, mutations with higher scores were enriched in genes involved in the cell cycle and cell division, while those with lower scores were enriched in genes involved in apoptosis and cellular senescence. Our results demonstrate that some silent mutations can be involved in cancer pathogenesis.
Whole-genome doubling (WGD) is a common mutation in cancer. Various studies have suggested that WGD is associated with a poor prognosis in cancer. However, the detailed association between WGD ...occurrence and prognosis remains unclear. In this study, we aimed to elucidate the mechanism by which WGD affects prognosis using sequencing data from the Pan-Cancer Analysis of Whole Genomes (PCAWG) and The Cancer Genome Atlas.
Whole-genome sequencing data of 23 cancer types were downloaded from PCAWG project. We defined the WGD event in each sample using the WGD status annotated using PCAWG. We used MutationTimeR to predict the relative timings of mutations and loss of heterozygosity (LOH) in WGD, thus evaluating their association with WGD. We also analyzed the association between WGD-associated factors and patient prognosis.
WGD was associated with several factors, e.g., length of LOH regions. Survival analysis using WGD-associated factors revealed that longer LOH regions and LOH in chr17 were associated with poor prognosis in samples with WGD (WGD samples) and samples without WGD (nWGD samples). In addition to these two factors, nWGD samples showed that the number of mutations in tumor suppressor genes was associated with prognosis. Moreover, we explored the genes associated with prognosis in both samples separately.
The prognosis-related factors in WGD samples differed significantly compared with those in nWGD samples. This study emphasizes the need for different treatment strategies for WGD and nWGD samples.
Genomic variations are associated with gene expression levels, which are called expression quantitative trait loci (eQTL). Most eQTL may affect the total gene expression levels by regulating ...transcriptional activities of a specific promoter. However, the direct exploration of genomic loci associated with promoter activities using RNA-seq data has been challenging because eQTL analyses treat the total expression levels estimated by summing those of all isoforms transcribed from distinct promoters. Here we propose a new method for identifying genomic loci associated with promoter activities, called promoter usage quantitative trait loci (puQTL), using conventional RNA-seq data. By leveraging public RNA-seq datasets from the lymphoblastoid cell lines of 438 individuals from the GEUVADIS project, we obtained promoter activity estimates and mapped 2,592 puQTL at the 10% FDR level. The results of puQTL mapping enabled us to interpret the manner in which genomic variations regulate gene expression. We found that 310 puQTL genes (16.1%) were not detected by eQTL analysis, suggesting that our pipeline can identify novel variant–gene associations. Furthermore, we identified genomic loci associated with the activity of “hidden” promoters, which the standard eQTL studies have ignored. We found that most puQTL signals were concordant with at least one genome-wide association study (GWAS) signal, enabling novel interpretations of the molecular mechanisms of complex traits. Our results emphasize the importance of the re-analysis of public RNA-seq datasets to obtain novel insights into gene regulation by genomic variations and their contributions to complex traits.
Abstract Genome analysis in cancer has focused mainly on elucidating the function and regulatory mechanisms of genes that exhibit differential expression or mutation in cancer samples compared to ...normal samples. Recently, transcriptome analysis revealed that abnormal splicing events in cancer samples could contribute to cancer pathogenesis. Moreover, splicing variants in cancer reportedly generate diverse cancer antigens. Although abnormal splicing events are expected to be potential targets in cancer immunotherapy, the exploration of such targets and their biological significance in cancer have not been fully understood. In this study, to explore subtype-specific alternative splicing events, we conducted a comprehensive analysis of splicing events for each breast cancer subtype using large-scale splicing data derived from The Cancer Genome Atlas and found subtype-specific alternative splicing patterns. Analyses indicated that genes that produce subtype-specific alternative splicing events are potential novel targets for immunotherapy against breast cancer. The subtype-specific alternative splicing events identified in this study, which were not identified by mutation or differential expression analysis, bring new significance to previously overlooked splicing events.
Histone acetylation, which is regulated by histone acetyltransferases (HATs) and histone deacetylases (HDACs), plays a crucial role in the control of gene expression. HDAC inhibitors (HDACi) have ...shown potential in cancer therapy; however, the specific roles of HDACs in early embryos remain unclear. Moreover, although some pan-HDACi have been used to maintain cellular undifferentiated states in early embryos, the specific mechanisms underlying their effects remain unknown. Thus, there remains a significant knowledge gap regarding the application of selective HDACi in early embryos.
To address this gap, we treated early embryos with two selective HDACi (MGCD0103 and T247). Subsequently, we collected and analyzed their transcriptome data at different developmental stages. Our findings unveiled a significant effect of HDACi treatment during the crucial 2-cell stage of zygotes, leading to a delay in embryonic development after T247 and an arrest at 2-cell stage after MGCD0103 administration. Furthermore, we elucidated the regulatory targets underlying this arrested embryonic development, which pinpointed the G2/M phase as the potential period of embryonic development arrest caused by MGCD0103. Moreover, our investigation provided a comprehensive profile of the biological processes that are affected by HDACi, with their main effects being predominantly localized in four aspects of zygotic gene activation (ZGA): RNA splicing, cell cycle regulation, autophagy, and transcription factor regulation. By exploring the transcriptional regulation and epigenetic features of the genes affected by HDACi, we made inferences regarding the potential main pathways via which HDACs affect gene expression in early embryos. Notably, Hdac7 exhibited a distinct response, highlighting its potential as a key player in early embryonic development.
Our study conducted a comprehensive analysis of the effects of HDACi on early embryonic development at the transcriptional level. The results demonstrated that HDACi significantly affected ZGA in embryos, elucidated the distinct actions of various selective HDACi, and identified specific biological pathways and mechanisms via which these inhibitors modulated early embryonic development.
In an alignment of closely related genomic sequences, the existence of discordant mutation sites, which do not reflect the phylogenetic relationship of the genomes, is often observed. Although these ...discordant mutation sites are thought to have emerged by ancestral polymorphism or gene flow, their frequency and distribution in the genome have not yet been analyzed in detail. Using the genome sequences of all protein coding genes of 25 inbred rat strains, we analyzed the frequency and genome-wide distribution of the discordant mutation sites. From the comparison of different substrains, it was found that these loci are not substrain specific, but are common among different groups of substrains, suggesting that the discordant sites might have mainly emerged through ancestral polymorphism. It was also revealed that the discordant sites are not uniformly distributed along chromosomes, but are concentrated at certain genomic loci, such as
RT1
, major histocompatibility complex of rats, and olfactory receptors, indicating that genes known to be highly polymorphic tend to have more discordant sites. Our results also showed that loci with a high density of discordant sites are also rich in heterozygous variants, even though these are inbred strains.
DNA methylation is globally reprogrammed during mammalian preimplantation development, which is critical for normal development. Recent reduced representation bisulfite sequencing (RRBS) studies ...suggest that the methylome dynamics are essentially conserved between human and mouse early embryos. RRBS is known to cover 5-10% of all genomic CpGs, favoring those contained within CpG-rich regions. To obtain an unbiased and more complete representation of the methylome during early human development, we performed whole genome bisulfite sequencing of human gametes and blastocysts that covered>70% of all genomic CpGs. We found that the maternal genome was demethylated to a much lesser extent in human blastocysts than in mouse blastocysts, which could contribute to an increased number of imprinted differentially methylated regions in the human genome. Global demethylation of the paternal genome was confirmed, but SINE-VNTR-Alu elements and some other tandem repeat-containing regions were found to be specifically protected from this global demethylation. Furthermore, centromeric satellite repeats were hypermethylated in human oocytes but not in mouse oocytes, which might be explained by differential expression of de novo DNA methyltransferases. These data highlight both conserved and species-specific regulation of DNA methylation during early mammalian development. Our work provides further information critical for understanding the epigenetic processes underlying differentiation and pluripotency during early human development.
Background
Open chromatin is associated with gene transcription. Previous studies have shown that the density of mutations in open chromatin regions is lower than that in flanking regions because of ...the higher accessibility of DNA repair machinery. However, in several cancer types, open chromatin regions show an increased local density of mutations in activated regulatory regions. Although the mutation distribution within open chromatin regions in cancer cells has been investigated, only few studies have focused on their functional implications in cancer. To reveal the impact of highly mutated open chromatin regions on cancer, we investigated the association between mutations in open chromatin regions and their possible functions.
Methods
Whole‐genome sequencing data of 18 cancer types were downloaded from the PanCancer Analysis of Whole Genomes and Catalog of Somatic Mutations in Cancer. We quantified the mutations located in open chromatin regions defined by The Cancer Genome Atlas and classified open chromatin regions into three categories based on the number of mutations. Then, we investigated the chromatin state, amplification, and possible target genes of the open chromatin regions with a high number of mutations. We also analyzed the association between the number of mutations in open chromatin regions and patient prognosis.
Results
In some cancer types, the proportion of promoter or enhancer chromatin state in open chromatin regions with a high number of mutations was significantly higher than that in the regions with a low number of mutations. The possible target genes of open chromatin regions with a high number of mutations were more strongly associated with cancer than those of other open chromatin regions. Moreover, a high number of mutations in open chromatin regions was significantly associated with a poor prognosis in some cancer types.
Conclusions
These results suggest that highly mutated open chromatin regions play an important role in cancer pathogenesis and can be effectively used to predict patient prognosis.
In this study, to reveal the impact of highly mutated open chromatin regions on cancer, we analyzed whole‐genome sequencing data of cancer genome. Specifically, we investigated the association between mutations in open chromatin regions and their possible functions.
Although several long noncoding RNAs (lncRNAs) have recently been shown to encode small polypeptides, those in testis remain largely uncharacterized. Here we identify two sperm-specific polypeptides, ...Kastor and Polluks, encoded by a single mouse locus (Gm9999) previously annotated as encoding a lncRNA. Both Kastor and Polluks are inserted in the outer mitochondrial membrane and directly interact with voltage-dependent anion channel (VDAC), despite their different amino acid sequences. Male VDAC3-deficient mice are infertile as a result of reduced sperm motility due to an abnormal mitochondrial sheath in spermatozoa, and deficiency of both Kastor and Polluks also severely impaired male fertility in association with formation of a similarly abnormal mitochondrial sheath. Spermatozoa lacking either Kastor or Polluks partially recapitulate the phenotype of those lacking both. Cooperative function of Kastor and Polluks in regulation of VDAC3 may thus be essential for mitochondrial sheath formation in spermatozoa and for male fertility.
Mutations that affect phenotypes have been identified primarily as those that directly alter amino acid sequences or disrupt splice sites. However, some mutations not located in functionally ...important sites can also affect phenotypes, such as splice-site-creating mutations (SCMs). To investigate how frequent exon extension/shrinkage events induced by SCMs occur in normal individuals, we used personal genome sequencing data and transcriptome data of the corresponding individuals and identified 371 exon extension/shrinkage events in normal individuals. This number was about three times higher than the number of pseudo-exon activation events identified in the previous study. The average numbers of exon extension and exon shrinkage events in each sample were 3.3 and 11.2, respectively. We also evaluated the impact of exon extension/shrinkage events on the resulting transcripts and their protein products and found that 40.2% of the identified events may have possible functional impacts by either generating premature termination codons in transcripts or affecting protein domains. Our results indicated that a certain fraction of SCMs identified in this study can be pathogenic mutations by creating novel splice sites.