Spontaneous canine cancers are valuable but relatively understudied and underutilized models. To enhance their usage, we reanalyze whole exome and genome sequencing data published for 684 cases of >7 ...common tumor types and >35 breeds, with rigorous quality control and breed validation. Our results indicate that canine tumor alteration landscape is tumor type-dependent, but likely breed-independent. Each tumor type harbors major pathway alterations also found in its human counterpart (e.g., PI3K in mammary tumor and p53 in osteosarcoma). Mammary tumor and glioma have lower tumor mutational burden (TMB) (median < 0.5 mutations per Mb), whereas oral melanoma, osteosarcoma and hemangiosarcoma have higher TMB (median ≥ 1 mutations per Mb). Across tumor types and breeds, TMB is associated with mutation of TP53 but not PIK3CA, the most mutated genes. Golden Retrievers harbor a TMB-associated and osteosarcoma-enriched mutation signature. Here, we provide a snapshot of canine mutations across major tumor types and breeds.
Maltase-glucoamylase (MGAM) and MGAM2 both belong to the glycoside hydrolase family 31. MGAM, a therapeutic target for type 2 diabetes, is α-1,4-glucosidase and expressed in the intestine to catalyze ...starch digestion. MGAM2, however, is largely uncharacterized. By investigating The Cancer Genome Atlas data, we found that among breast cancer subtypes, MGAM2 expression is nearly exclusive to basal-like breast cancers (BLBCs), whereas MGAM tends to express in luminal A breast cancers. Moreover, MGAM2 expression is associated with better patient survival and correlated with immune genes/signatures, unlike MGAM. Both genes have emerged in mammals, but diverged after the placental-marsupial split. In placentals, MGAM2 has likely lost its α-1,4-glucosidase activity due to mutations in key catalytic sites, and has acquired a large domain that is extracellular, threonine-rich and evolutionarily hypervariable (EHV). Guided by MGAM2 findings, our genome-wide search identified >1000 human proteins with EHV regions. These proteins are enriched in immune functions and molecules, including major histocompatibility complex proteins. Their genes are expressed higher in BLBCs and are associated with better patient survival, like MGAM2. Their EHV-coding sequences are rich in simple repeats and harbor more cancer passenger mutations. In conclusion, MGAM2 diverges from MGAM structurally and likely functionally in placentals. MGAM2 is among >1000 human proteins with EHV regions and associated with immune response. We propose that these EHV molecules may have significant implication in cancer immunotherapy and BLBC treatment.
Display omitted
Naturally occurring canine cancers have remarkable similarities to their human counterparts. To better understand these similarities, we investigated 671 client-owned dogs from 96 breeds with 23 ...common tumor types, including those whose mutation profile are unknown (anal sac carcinoma and neuroendocrine carcinoma) or understudied (thyroid carcinoma, soft tissue sarcoma and hepatocellular carcinoma). We discovered mutations in 50 well-established oncogenes and tumor suppressors, and compared them to those reported in human cancers. As in human cancer, TP53 is the most commonly mutated gene, detected in 22.5% of canine tumors overall. Canine tumors share mutational hotspots with human tumors in oncogenes including PIK3CA, KRAS, NRAS, BRAF, KIT and EGFR. Hotspot mutations with significant association to tumor type include NRAS G61R and PIK3CA H1047R in hemangiosarcoma, ERBB2 V659E in pulmonary carcinoma, and BRAF V588E (equivalent of V600E in humans) in urothelial carcinoma. Our findings better position canines as a translational model of human cancer to investigate a wide spectrum of targeted therapies.
Abstract
Background
About 20% of breast cancers in humans are basal-like, a subtype that is often triple-negative and difficult to treat. An effective translational model for basal-like breast cancer ...is currently lacking and urgently needed. To determine whether spontaneous mammary tumors in pet dogs could meet this need, we subtyped canine mammary tumors and evaluated the dog–human molecular homology at the subtype level.
Methods
We subtyped 236 canine mammary tumors from 3 studies by applying various subtyping strategies on their RNA-seq data. We then performed PAM50 classification with canine tumors alone, as well as with canine tumors combined with human breast tumors. We identified feature genes for human BLBC and luminal A subtypes via machine learning and used these genes to repeat canine-alone and cross-species tumor classifications. We investigated differential gene expression, signature gene set enrichment, expression association, mutational landscape, and other features for dog–human subtype comparison.
Results
Our independent genome-wide subtyping consistently identified two molecularly distinct subtypes among the canine tumors. One subtype is mostly basal-like and clusters with human BLBC in cross-species PAM50 and feature gene classifications, while the other subtype does not cluster with any human breast cancer subtype. Furthermore, the canine basal-like subtype recaptures key molecular features (e.g., cell cycle gene upregulation, TP53 mutation) and gene expression patterns that characterize human BLBC. It is enriched in histological subtypes that match human breast cancer, unlike the other canine subtype. However, about 33% of canine basal-like tumors are estrogen receptor negative (ER−) and progesterone receptor positive (PR+), which is rare in human breast cancer. Further analysis reveals that these ER−PR+ canine tumors harbor additional basal-like features, including upregulation of genes of interferon-
γ
response and of the Wnt-pluripotency pathway. Interestingly, we observed an association of
PGR
expression with gene silencing in all canine tumors and with the expression of T cell exhaustion markers (e.g.,
PDCD1
) in ER−PR+ canine tumors.
Conclusions
We identify a canine mammary tumor subtype that molecularly resembles human BLBC overall and thus could serve as a vital translational model of this devastating breast cancer subtype. Our study also sheds light on the dog–human difference in the mammary tumor histology and the hormonal cycle.
Here we show that bivalent domains and chromosome architecture for bivalent genes are dynamically regulated during the cell cycle in human pluripotent cells. Central to this is the transient increase ...in H3K4-trimethylation at developmental genes during G1, thereby creating a “window of opportunity” for cell-fate specification. This mechanism is controlled by CDK2-dependent phosphorylation of the MLL2 (KMT2B) histone methyl-transferase, which facilitates its recruitment to developmental genes in G1. MLL2 binding is required for changes in chromosome architecture around developmental genes and establishes promoter-enhancer looping interactions in a cell-cycle-dependent manner. These cell-cycle-regulated loops are shown to be essential for activation of bivalent genes and pluripotency exit. These findings demonstrate that bivalent domains are established to control the cell-cycle-dependent activation of developmental genes so that differentiation initiates from the G1 phase.
•Bivalent domains are unstable, dynamic, and cell-cycle regulated•CDK2 phosphorylates MLL2 and establishes bivalent domains in G1•Chromosome remodeling in G1 is required for the “poised” pluripotent state
In this report, Dalton and colleagues show that developmental genes are primed for activation in G1 phase of the cell cycle by a mechanism requiring convergence of the cell-cycle machinery with cell signaling pathways. This priming mechanism involves the establishment of bivalent epigenetic domains and dynamic changes in chromosome architecture around developmental genes.
Heterogeneity within pluripotent stem cell (PSC) populations is indicative of dynamic changes that occur when cells drift between different states. Although the role of metastability in PSCs is ...unclear, it appears to reflect heterogeneity in cell signaling. Using the Fucci cell-cycle indicator system, we show that elevated expression of developmental regulators in G1 is a major determinant of heterogeneity in human embryonic stem cells. Although signaling pathways remain active throughout the cell cycle, their contribution to heterogeneous gene expression is restricted to G1. Surprisingly, we identify dramatic changes in the levels of global 5-hydroxymethylcytosine, an unanticipated source of epigenetic heterogeneity that is tightly linked to cell-cycle progression and the expression of developmental regulators. When we evaluated gene expression in differentiating cells, we found that cell-cycle regulation of developmental regulators was maintained during lineage specification. Cell-cycle regulation of developmentally regulated transcription factors is therefore an inherent feature of the mechanisms underpinning differentiation.
•Embryonic stem cells are lineage primed in G1•Transcription of developmentally regulated genes is cell-cycle regulated•5hmC is cell-cycle regulated•Stem cells initiate differentiation from G1
Pluripotent stem cell heterogeneity has been attributed to stochastic variations in signaling pathways across the population. Using Fucci cell-cycle reporters, Dalton and colleagues show that stem cell “lineage priming” in G1 is associated with cell-cycle-dependent changes in the transcription of developmentally regulated genes. Moreover, these changes are paralleled by levels of the epigenetic mark 5-hydroxymethylcytosine. These findings identify the cell cycle as major source of heterogeneity in human pluripotent stem cells.
Spontaneous canine head and neck squamous cell carcinoma (HNSCC) represents an excellent model of human HNSCC but is greatly understudied. To better understand and utilize this valuable resource, we ...performed a pilot study that represents its first genome-wide characterization by investigating 12 canine HNSCC cases, of which 9 are oral, via high density array comparative genomic hybridization and RNA-seq. The analyses reveal that these canine cancers recapitulate many molecular features of human HNSCC. These include analogous genomic copy number abnormality landscapes and sequence mutation patterns, recurrent alteration of known HNSCC genes and pathways (e.g., cell cycle, PI3K/AKT signaling), and comparably extensive heterogeneity. Amplification or overexpression of protein kinase genes, matrix metalloproteinase genes, and epithelial-mesenchymal transition genes TWIST1 and SNAI1 are also prominent in these canine tumors. This pilot study, along with a rapidly growing body of literature on canine cancer, reemphasizes the potential value of spontaneous canine cancers in HNSCC basic and translational research.
The major histocompatibility complex class I (MHC-I) genes are highly polymorphic. MHC-I genotyping is required for determining the peptide epitopes available to an individual’s T-cell repertoire. ...Current genotyping software tools do not work for the dog, due to very limited known canine alleles. To address this, we developed a Kmer-based paired-end read (KPR) de novo assembler and genotyper, which assemble paired-end RNA-seq reads from MHC-I regions into contigs, and then genotype each contig and estimate its expression level. KPR tools outperform other popular software examined in typing new alleles. We used KPR tools to successfully genotype152 dogs from a published dataset. The study discovers 33 putative new alleles, finds dominant alleles in 4 dog breeds, and builds allele diversity and expression landscapes among the 152 dogs. Our software meets a significant need in biomedical research.
Display omitted
•KPR software genotypes DLA-I alleles and estimates expression using RNA-seq data•KPR performs de novo assembly and outperforms other tools in typing new alleles•Typing 152 dogs uncovers 33 putative new alleles and dominant alleles in 4 breeds•DLA-12 and DLA-88L alleles cluster with DLA-88 alleles
Biocomputational method; Computational bioinformatics; Genomic analysis
Using bacteria artificial chromosome (BAC) end sequences (16.9 Mb) and high-quality alignments of genomic sequences (17.4 Mb), we performed a global assessment of the divergence distributions, ...phylogenies, and consensus sequences for Alu elements in primates including lemur, marmoset, macaque, baboon, and chimpanzee as compared to human. We found that in lemurs, Alu elements show a broader and more symmetric sequence divergence distribution, suggesting a steady rate of Alu retrotransposition activity among prosimians. In contrast, Alu elements in anthropoids show a skewed distribution shifted toward more ancient elements with continual declining rates in recent Alu activity along the hominoid lineage of evolution. Using an integrated approach combining mutation profile and insertion/deletion analyses, we identified nine novel lineage-specific Alu subfamilies in lemur (seven), marmoset (one), and baboon/macaque (one) containing multiple diagnostic mutations distinct from their human counterparts---Alu J, S, and Y subfamilies, respectively. Among these primates, we show that that the lemur has the lowest density of Alu repeats (55 repeats/Mb), while marmoset has the greatest abundance (188 repeats/Mb). We estimate that approximately 70% of lemur and 16% of marmoset Alu elements belong to lineage-specific subfamilies. Our analysis has provided an evolutionary framework for further classification and refinement of the Alu repeat phylogeny. The differences in the distribution and rates of Alu activity have played an important role in subtly reshaping the structure of primate genomes. The functional consequences of these changes among the diverse primate lineages over such short periods of evolutionary time are an important area of future investigation.
We report results of a megabase-scale phylogenomic analysis of the Reptilia, the sister group of mammals. Large-scale end-sequence scanning of genomic clones of a turtle, alligator, and lizard ...reveals diverse, mammal-like landscapes of retroelements and simple sequence repeats (SSRs) not found in the chicken. Several global genomic traits, including distinctive phylogenetic lineages of CR1-like long interspersed elements (LINEs) and a paucity of A-T rich SSRs, characterize turtles and archosaur genomes, whereas higher frequencies of tandem repeats and a lower global GC content reveal mammal-like features in ANOLIS: Nonavian reptile genomes also possess a high frequency of diverse and novel 50-bp unit tandem duplications not found in chicken or mammals. The frequency distributions of almost equal to65,000 8-mer oligonucleotides suggest that rates of DNA-word frequency change are an order of magnitude slower in reptiles than in mammals. These results suggest a diverse array of interspersed and SSRs in the common ancestor of amniotes and a genomic conservatism and gradual loss of retroelements in reptiles that culminated in the minimalist chicken genome.