Pseudogenes are ideal markers of genome remodelling. In turn, the mouse is an ideal platform for studying them, particularly with the recent availability of strain-sequencing and transcriptional ...data. Here, combining both manual curation and automatic pipelines, we present a genome-wide annotation of the pseudogenes in the mouse reference genome and 18 inbred mouse strains (available via the mouse.pseudogene.org resource). We also annotate 165 unitary pseudogenes in mouse, and 303, in human. The overall pseudogene repertoire in mouse is similar to that in human in terms of size, biotype distribution, and family composition (e.g. with GAPDH and ribosomal proteins being the largest families). Notable differences arise in the pseudogene age distribution, with multiple retro-transpositional bursts in mouse evolutionary history and only one in human. Furthermore, in each strain about a fifth of all pseudogenes are unique, reflecting strain-specific evolution. Finally, we find that ~15% of the mouse pseudogenes are transcribed, and that highly transcribed parent genes tend to give rise to many processed pseudogenes.
We report full-length draft de novo genome assemblies for 16 widely used inbred mouse strains and find extensive strain-specific haplotype variation. We identify and characterize 2,567 regions on the ...current mouse reference genome exhibiting the greatest sequence diversity. These regions are enriched for genes involved in pathogen defence and immunity and exhibit enrichment of transposable elements and signatures of recent retrotransposition events. Combinations of alleles and genes unique to an individual strain are commonly observed at these loci, reflecting distinct strain phenotypes. We used these genomes to improve the mouse reference genome, resulting in the completion of 10 new gene structures. Also, 62 new coding loci were added to the reference genome annotation. These genomes identified a large, previously unannotated, gene (Efcab3-like) encoding 5,874 amino acids. Mutant Efcab3-like mice display anomalies in multiple brain regions, suggesting a possible role for this gene in the regulation of brain development.
Recently, we have shown that seven genes, namely
,
,
,
,
,
and
have prognostic power for the overall survival in ovarian cancer (OC).
We present an analysis on the association of these genes with any ...phenotypes and mutations indicative of involvement in female cancers and predict the structural and functional consequences of those SNPS using in silico tools.
These seven genes present with 976 SNPs/mutations that are associated with human cancers, out of which 284 related to female cancers. We have then analysed the mutation impact on amino acid polarity, charge and water affinity, leading to the identification of 30 mutations in gynaecological cancers where amino acid (aa) changes lead to opposite polarity, charges and water affinity. Out of these 30 mutations identified, only a missense mutation (i.e., R831C/R804C in uterine corpus endometrial carcinomas, UCEC) was suggestive of structural damage on the
protein.
We demonstrate that the R831C/R804C mutation is deleterious and the predicted ΔΔG values suggest that the mutation reduces the stability of the protein. Future in vitro studies should provide further insight into the role of this transporter protein in UCEC.
H2A.Z is a H2A‐type histone variant essential for many aspects of cell biology, ranging from gene expression to genome stability. From deuterostomes, H2A.Z evolved into two paralogues, H2A.Z.1 and ...H2A.Z.2, that differ by only three amino acids and are encoded by different genes (H2AFZ and H2AFV, respectively). Despite the importance of this histone variant in development and cellular homeostasis, very little is known about the individual functions of each paralogue in mammals. Here, we have investigated the distinct roles of the two paralogues in cell cycle regulation and unveiled non‐redundant functions for H2A.Z.1 and H2A.Z.2 in cell division. Our findings show that H2A.Z.1 regulates the expression of cell cycle genes such as Myc and Ki‐67 and its depletion leads to a G1 arrest and cellular senescence. On the contrary, H2A.Z.2, in a transcription‐independent manner, is essential for centromere integrity and sister chromatid cohesion regulation, thus playing a key role in chromosome segregation.
Synopsis
This study shows that the very similar histone variants H2A.Z.1 and H2A.Z.2 have different functions in chromatin organisation and cell cycle regulation.
H2A.Z.2 is essential for chromosome segregation fidelity.
H2A.Z.2 regulates sister chromatid cohesion, CPC localisation and kinetochores.
H2A.Z.1 is important for the G1/S transition via MYC transcription and p21/p27 suppression.
H2A.Z.1 and H2A.Z.2 have distinct role in chromatin organisation and gene expression.
This study shows that the very similar histone variants H2A.Z.1 and H2A.Z.2 have different functions in chromatin organisation and cell cycle regulation.
Understanding the mechanisms driving lineage-specific evolution in both primates and rodents has been hindered by the lack of sister clades with a similar phylogenetic structure having high-quality ...genome assemblies. Here, we have created chromosome-level assemblies of the
and
genomes. Together with the
and
genomes, this set of rodent genomes is similar in divergence times to the Hominidae (human-chimpanzee-gorilla-orangutan). By comparing the evolutionary dynamics between the Muridae and Hominidae, we identified punctate events of chromosome reshuffling that shaped the ancestral karyotype of
and
between 3 and 6 million yr ago, but that are absent in the Hominidae. Hominidae show between four- and sevenfold lower rates of nucleotide change and feature turnover in both neutral and functional sequences, suggesting an underlying coherence to the Muridae acceleration. Our system of matched, high-quality genome assemblies revealed how specific classes of repeats can play lineage-specific roles in related species. Recent LINE activity has remodeled protein-coding loci to a greater extent across the Muridae than the Hominidae, with functional consequences at the species level such as reproductive isolation. Furthermore, we charted a Muridae-specific retrotransposon expansion at unprecedented resolution, revealing how a single nucleotide mutation transformed a specific SINE element into an active CTCF binding site carrier specifically in
, which resulted in thousands of novel, species-specific CTCF binding sites. Our results show that the comparison of matched phylogenetic sets of genomes will be an increasingly powerful strategy for understanding mammalian biology.
The identification of diagnostic and prognostic biomarkers is a major objective in improving clinical outcomes in cancer, which has been facilitated by the availability of high-throughput gene ...expression data. A growing interest in non-coding genomic regions has identified dysregulation of long non-coding RNAs (lncRNAs) in several malignancies, suggesting a potential use as biomarkers. In this study, we leveraged data from large-scale sequencing projects to uncover the expression patterns of the
gene and its associated lncRNAs
-
and
-
in solid tumours. Despite many reports describing
overexpression in several cancers, limited studies exist on
-
and
-
and their potential as biomarkers. By employing clustering methods to visualise multi-gene relationships, we identified a discriminative power of the three genes in distinguishing tumour vs. normal samples in several cancers of the gastrointestinal tract and reproductive systems, as well as in discerning oesophageal and testicular cancer histological subtypes. Notably, the expressions of
and its antisenses also correlated with clinical features and endpoints, uncovering previously unreported associations. This work highlights the advantages of using combinatory expression patterns of non-coding transcripts of differentially expressed genes as clinical evaluators and identifies
,
-
, and
-
expressions as robust candidate biomarkers for clinical applications.
Significance Pseudogenes have long been considered nonfunctional elements. However, recent studies have shown they can potentially regulate the expression of protein-coding genes. Capitalizing on ...available functional-genomics data and the finished annotation of human, worm, and fly, we compared the pseudogene complements across the three phyla. We found that in contrast to protein-coding genes, pseudogenes are highly lineage specific, reflecting genome history more so than the conservation of essential biological functions. Specifically, the human pseudogene complement reflects a massive burst of retrotranspositional activity at the dawn of the primates, whereas the worm’s and fly's repertoire reflects a history of deactivated duplications. However, we also observe that pseudogenes across the three phyla have a consistent level of partial activity, with ∼15% being transcribed.
Three-dimensional (3D) cancer models are revolutionising research, allowing for the recapitulation of an in vivo-like response through the use of an in vitro system, which is more complex and ...physiologically relevant than traditional monolayer cultures. Cancers such as ovarian (OvCa) are prone to developing resistance, are often lethal, and stand to benefit greatly from the enhanced modelling emulated by 3D cultures. However, the current models often fall short of the predicted response, where reproducibility is limited owing to the lack of standardised methodology and established protocols. This meta-analysis aims to assess the current scope of 3D OvCa models and the differences in the genetic profiles presented by a vast array of 3D cultures. An analysis of the literature (Pubmed.gov) spanning 2012-2022 was used to identify studies with paired data of 3D and 2D monolayer counterparts in addition to RNA sequencing and microarray data. From the data, 19 cell lines were found to show differential regulation in their gene expression profiles depending on the bio-scaffold (i.e., agarose, collagen, or Matrigel) compared to 2D cell cultures. The top genes differentially expressed in 2D vs. 3D included C3, CXCL1, 2, and 8, IL1B, SLP1, FN1, IL6, DDIT4, PI3, LAMC2, CCL20, MMP1, IFI27, CFB, and ANGPTL4. The top enriched gene sets for 2D vs. 3D included IFN-α and IFN-γ response, TNF-α signalling, IL-6-JAK-STAT3 signalling, angiogenesis, hedgehog signalling, apoptosis, epithelial-mesenchymal transition, hypoxia, and inflammatory response. Our transversal comparison of numerous scaffolds allowed us to highlight the variability that can be induced by these scaffolds in the transcriptional landscape and identify key genes and biological processes that are hallmarks of cancer cells grown in 3D cultures. Future studies are needed to identify which is the most appropriate in vitro/preclinical model to study tumour microenvironments.
Endocrine-disrupting chemicals (EDCs) can exert multiple deleterious effects and have been implicated in carcinogenesis. The xenoestrogen Bisphenol A (BPA) that is found in various consumer products ...has been involved in the dysregulation of numerous signalling pathways. In this paper, we present the analysis of a set of 94 genes that have been shown to be dysregulated in presence of BPA in ovarian cancer cell lines since we hypothesised that these genes might be of biomarker potential. This study sought to identify biomarkers of disease and biomarkers of disease-associated exposure. In silico analyses took place using gene expression data extracted from The Cancer Genome Atlas (TCGA) and the Genotype-Tissue Expression (GTEx) databases. Differential expression was further validated at protein level using immunohistochemistry on an ovarian cancer tissue microarray. We found that 14 out of 94 genes are solely dysregulated in the presence of BPA, while the remaining 80 genes are already dysregulated (p-value < 0.05) in their expression pattern as a consequence of the disease. We also found that seven genes have prognostic power for the overall survival in OC in relation to their expression levels. Out of these seven genes, Keratin 4 (KRT4) appears to be a biomarker of exposure-associated ovarian cancer, whereas Guanylate Binding Protein 5 (GBP5), long intergenic non-protein coding RNA 707 (LINC00707) and Solute Carrier Family 4 Member 11 (SLC4A11) are biomarkers of disease. BPA can exert a plethora of effects that can be tissue- or cancer-specific. Our in silico findings generate a hypothesis around biomarkers of disease and exposure that could potentially inform regulation and policy making.
Long non-coding RNAs (lncRNAs) perform a wide functional repertoire of roles in cell biology, ranging from RNA editing to gene regulation, as well as tumour genesis and tumour progression. The lncRNA ...X-inactive specific transcript (XIST) is involved in the aetiopathogenesis of non-small cell lung cancer (NSCLC). However, its role at the molecular level is not fully elucidated. The expression of XIST and co-regulated genes TSIX, hnRNPu, Bcl-2, and BRCA1 analyses in lung cancer (LC) and controls were performed in silico. Differentially expressed genes (DEGs) were determined using RNA-seq in H1975 and A549 NSCLC cell lines following siRNA for XIST. XIST exhibited sexual dimorphism, being up-regulated in females compared to males in both control and LC patient cohorts. RNA-seq revealed 944 and 751 DEGs for A549 and H1975 cell lines, respectively. These DEGs are involved in signal transduction, cell communication, energy pathways, and nucleic acid metabolism. XIST expression associated with TSIX, hnRNPu, Bcl-2, and BRCA1 provided a strong collective feature to discriminate between controls and LC, implying a diagnostic potential. There is a much more complex role for XIST in lung cancer. Further studies should concentrate on sex-specific changes and investigate the signalling pathways of the DEGs following silencing of this lncRNA.