APOBEC3A and APOBEC3B, cytidine deaminases of the APOBEC family, are among the main factors causing mutations in human cancers. APOBEC deaminates cytosines in single-stranded DNA (ssDNA). A fraction ...of the APOBEC-induced mutations occur as clusters ("kataegis") in single-stranded DNA produced during repair of double-stranded breaks (DSBs). However, the properties of the remaining 87% of nonclustered APOBEC-induced mutations, the source and the genomic distribution of the ssDNA where they occur, are largely unknown. By analyzing genomic and exomic cancer databases, we show that >33% of dispersed APOBEC-induced mutations occur on the lagging strand during DNA replication, thus unraveling the major source of ssDNA targeted by APOBEC in cancer. Although methylated cytosine is generally more mutation-prone than nonmethylated cytosine, we report that methylation reduces the rate of APOBEC-induced mutations by a factor of roughly two. Finally, we show that in cancers with extensive APOBEC-induced mutagenesis, there is almost no increase in mutation rates in late replicating regions (contrary to other cancers). Because late-replicating regions are depleted in exons, this results in a 1.3-fold higher fraction of mutations residing within exons in such cancers. This study provides novel insight into the APOBEC-induced mutagenesis and describes the peculiarity of the mutational processes in cancers with the signature of APOBEC-induced mutations.
The study of gene expression in mammalian single cells via genomic technologies now provides the possibility to investigate the patterns of allelic gene expression. We used single-cell RNA sequencing ...to detect the allele-specific mRNA level in 203 single human primary fibroblasts over 133,633 unique heterozygous single-nucleotide variants (hetSNVs). We observed that at the snapshot of analyses, each cell contained mostly transcripts from one allele from the majority of genes; indeed, 76.4% of the hetSNVs displayed stochastic monoallelic expression in single cells. Remarkably, adjacent hetSNVs exhibited a haplotype-consistent allelic ratio; in contrast, distant sites located in two different genes were independent of the haplotype structure. Moreover, the allele-specific expression in single cells correlated with the abundance of the cellular transcript. We observed that genes expressing both alleles in the majority of the single cells at a given time point were rare and enriched with highly expressed genes. The relative abundance of each allele in a cell was controlled by some regulatory mechanisms given that we observed related single-cell allelic profiles according to genes. Overall, these results have direct implications in cellular phenotypic variability.
Basal cell carcinoma (BCC) of the skin is the most common malignant neoplasm in humans. BCC is primarily driven by the Sonic Hedgehog (Hh) pathway. However, its phenotypic variation remains ...unexplained. Our genetic profiling of 293 BCCs found the highest mutation rate in cancer (65 mutations/Mb). Eighty-five percent of the BCCs harbored mutations in Hh pathway genes (PTCH1, 73% or SMO, 20% (P = 6.6 × 10(-8)) and SUFU, 8%) and in TP53 (61%). However, 85% of the BCCs also harbored additional driver mutations in other cancer-related genes. We observed recurrent mutations in MYCN (30%), PPP6C (15%), STK19 (10%), LATS1 (8%), ERBB2 (4%), PIK3CA (2%), and NRAS, KRAS or HRAS (2%), and loss-of-function and deleterious missense mutations were present in PTPN14 (23%), RB1 (8%) and FBXW7 (5%). Consistent with the mutational profiles, N-Myc and Hippo-YAP pathway target genes were upregulated. Functional analysis of the mutations in MYCN, PTPN14 and LATS1 suggested their potential relevance in BCC tumorigenesis.
Mitochondria is a powerhouse of all eukaryotic cells that have its own circular DNA (mtDNA) encoding various RNAs and proteins. Somatic perturbations of mtDNA are accumulating with age thus it is of ...great importance to uncover the main sources of mtDNA instability. Recent analyses demonstrated that somatic mtDNA deletions depend on imperfect repeats of various nature between distant mtDNA segments. However, till now there are no comprehensive databases annotating all types of imperfect repeats in numerous species with sequenced complete mitochondrial genome as well as there are no algorithms capable to call all types of imperfect repeats in circular mtDNA.
We implemented naïve algorithm of pattern recognition by analogy to standard dot-plot construction procedures allowing us to find both perfect and imperfect repeats of four main types: direct, inverted, mirror and complementary. Our algorithm is adapted to specific characteristics of mtDNA such as circularity and an excess of short repeats - it calls imperfect repeats starting from the length of 10 b.p. We constructed interactive web available database ImtRDB depositing perfect and imperfect repeats positions in mtDNAs of more than 3500 Vertebrate species. Additional tools, such as visualization of repeats within a genome, comparison of repeat densities among different genomes and a possibility to download all results make this database useful for many biologists. Our first analyses of the database demonstrated that mtDNA imperfect repeats (i) are usually short; (ii) associated with unfolded DNA structures; (iii) four types of repeats positively correlate with each other forming two equivalent pairs: direct and mirror versus inverted and complementary, with identical nucleotide content and similar distribution between species; (iv) abundance of repeats is negatively associated with GC content; (v) dinucleotides GC versus CG are overrepresented on light chain of mtDNA covered by repeats.
ImtRDB is available at http://bioinfodbs.kantiana.ru/ImtRDB/ . It is accompanied by the software calling all types of interspersed repeats with different level of degeneracy in circular DNA. This database and software can become a very useful tool in various areas of mitochondrial and chloroplast DNA research.
The mutational spectrum of the mitochondrial DNA (mtDNA) does not resemble any of the known mutational signatures of the nuclear genome and variation in mtDNA mutational spectra between different ...organisms is still incomprehensible. Since mitochondria are responsible for aerobic respiration, it is expected that mtDNA mutational spectrum is affected by oxidative damage. Assuming that oxidative damage increases with age, we analyse mtDNA mutagenesis of different species in regards to their generation length. Analysing, (i) dozens of thousands of somatic mtDNA mutations in samples of different ages (ii) 70053 polymorphic synonymous mtDNA substitutions reconstructed in 424 mammalian species with different generation lengths and (iii) synonymous nucleotide content of 650 complete mitochondrial genomes of mammalian species we observed that the frequency of AH > GH substitutions (H: heavy strand notation) is twice bigger in species with high versus low generation length making their mtDNA more AH poor and GH rich. Considering that AH > GH substitutions are also sensitive to the time spent single-stranded (TSSS) during asynchronous mtDNA replication we demonstrated that AH > GH substitution rate is a function of both species-specific generation length and position-specific TSSS. We propose that AH > GH is a mitochondria-specific signature of oxidative damage associated with both aging and TSSS.
Large intergenic noncoding RNAs (lincRNAs) are still poorly functionally characterized. We analyzed the genetic and epigenetic regulation of human lincRNA expression in the GenCord collection by ...using three cell types from 195 unrelated European individuals. We detected a considerable number of cis expression quantitative trait loci (cis-eQTLs) and demonstrated that the genetic regulation of lincRNA expression is independent of the regulation of neighboring protein-coding genes. lincRNAs have relatively more cis-eQTLs than do equally expressed protein-coding genes with the same exon number. lincRNA cis-eQTLs are located closer to transcription start sites (TSSs) and their effect sizes are higher than cis-eQTLs found for protein-coding genes, suggesting that lincRNA expression levels are less constrained than that of protein-coding genes. Additionally, lincRNA cis-eQTLs can influence the expression level of nearby protein-coding genes and thus could be considered as QTLs for enhancer activity. Enrichment of expressed lincRNA promoters in enhancer marks provides an additional argument for the involvement of lincRNAs in the regulation of transcription in cis. By investigating the epigenetic regulation of lincRNAs, we observed both positive and negative correlations between DNA methylation and gene expression (expression quantitative trait methylation eQTMs), as expected, and found that the landscapes of passive and active roles of DNA methylation in gene regulation are similar to protein-coding genes. However, lincRNA eQTMs are located closer to TSSs than are protein-coding gene eQTMs. These similarities and differences in genetic and epigenetic regulation between lincRNAs and protein-coding genes contribute to the elucidation of potential functions of lincRNAs.
Gene expression levels can be subject to selection. We hypothesized that the age of gene origin is associated with expression constraints, given that it affects the level of gene integration into the ...functional cellular environment. By studying the genetic variation affecting gene expression levels (cis expression quantitative trait loci cis-eQTLs) and protein levels (cis protein QTLs cis-pQTLs), we determined that young, primate-specific genes are enriched in cis-eQTLs and cis-pQTLs. Compared to cis-eQTLs of old genes originating before the zebrafish divergence, cis-eQTLs of young genes have a higher effect size, are located closer to the transcription start site, are more significant, and tend to influence genes in multiple tissues and populations. These results suggest that the expression constraint of each gene increases throughout its lifespan. We also detected a positive correlation between expression constraints (approximated by cis-eQTL properties) and coding constraints (approximated by Ka/Ks) and observed that this correlation might be driven by gene age. To uncover factors associated with the increase in gene-age-related expression constraints, we demonstrated that gene connectivity, gene involvement in complex regulatory networks, gene haploinsufficiency, and the strength of posttranscriptional regulation increase with gene age. We also observed an increase in heritability of gene expression levels with age, implying a reduction of the environmental component. In summary, we show that gene age shapes key gene properties during evolution and is therefore an important component of genome function.
After the effective size of a population, Ne, declines, some slightly deleterious amino acid replacements which were initially suppressed by purifying selection become effectively neutral and can ...reach fixation. Here we investigate this phenomenon for a set of all 13 mitochondrial protein-coding genes from 110 mammalian species. By using body mass as a proxy for Ne, we show that large mammals (i.e., those with low Ne) as compared with small ones (in our sample these are, on average, 369.5 kg and 275 g, respectively) have a 43% higher rate of accumulation of nonsynonymous nucleotide substitutions relative to synonymous substitutions, and an 8-40% higher rate of accumulation of radical amino acid substitutions relative to conservative substitutions, depending on the type of amino acid classification. These higher rates result in a 6% greater amino acid dissimilarity between modern species and their most recent reconstructed ancestors in large versus small mammals. Because nonsynonymous substitutions are likely to be more harmful than synonymous substitutions, and radical amino acid substitutions are likely to be more harmful than conservative ones, our results suggest that large mammals experience less efficient purifying selection than small mammals. Furthermore, because in the course of mammalian evolution body size tends to increase and, consequently, Ne tends to decline, evolution of mammals toward large body size may involve accumulation of slightly deleterious mutations in mitochondrial protein-coding genes, which may contribute to decline or extinction of large mammals.
When man got his mtDNA deletions? Popadin, Konstantin; Safdar, Adeel; Kraytsberg, Yevgenya ...
Aging cell,
August 2014, Letnik:
13, Številka:
4
Journal Article
Recenzirano
Odprti dostop
Summary
Somatic mtDNA mutations and deletions in particular are known to clonally expand within cells, eventually reaching detrimental intracellular concentrations. The possibility that clonal ...expansion is a slow process taking a lifetime had prompted an idea that founder mutations of mutant clones that cause mitochondrial dysfunction in the aged tissue might have originated early in life. If, conversely, expansion was fast, founder mutations should predominantly originate later in life. This distinction is important: indeed, from which mutations should we protect ourselves – those of early development/childhood or those happening at old age? Recently, high‐resolution data describing the distribution of mtDNA deletions have been obtained using a novel, highly efficient method (Taylor et al., ). These data have been interpreted as supporting predominantly early origin of founder mutations. Re‐analysis of the data implies that the data actually better fit mostly late origin of founders, although more research is clearly needed to resolve the controversy.
Mitochondrial DNA (mtDNA) encodes core subunits of oxidative phosphorylation complexes and, as a result of intricate regulatory crosstalk between nuclear and mitochondrial genomes, the total number ...of mtDNA copies fits the requirements of each cell type. Deviations from the physiological number of mtDNA copies are expected to be deleterious and might cause some inherited diseases and normal ageing. We studied 46 obese patients with type 2 diabetes (T2DM) one year after a laparoscopic sleeve gastrectomy (LSG) and Roux-en-Y gastric bypass (RYGB). The results were compared with normal-weight patients without T2DM (control group 1) (body mass index (BMI) = 22.5 ± 3.01 kg/m
) and patients with obesity without T2DM (control group 2) (BMI = 36 ± 3.45 kg/m
). We detected an increase of mtDNA copy number in the cells of the buffy coat obtained from peripheral blood, sampled one year after bariatric surgery. We also found that average mtDNA copy number as well as its dynamics (before and after the surgery) are gender-specific. To the best of our knowledge, this is the first evidence for the restoration of mtDNA copy number in obese patients after LSG and RYGB.