To gain insight into the mechanisms by which the Myb transcription factor controls normal hematopoiesis and particularly, how it contributes to leukemogenesis, we mapped the genome-wide occupancy of ...Myb by chromatin immunoprecipitation followed by massively parallel sequencing (ChIP-Seq) in ERMYB myeloid progenitor cells. By integrating the genome occupancy data with whole genome expression profiling data, we identified a Myb-regulated transcriptional program. Gene signatures for leukemia stem cells, normal hematopoietic stem/progenitor cells and myeloid development were overrepresented in 2368 Myb regulated genes. Of these, Myb bound directly near or within 793 genes. Myb directly activates some genes known critical in maintaining hematopoietic stem cells, such as Gfi1 and Cited2. Importantly, we also show that, despite being usually considered as a transactivator, Myb also functions to repress approximately half of its direct targets, including several key regulators of myeloid differentiation, such as Sfpi1 (also known as Pu.1), Runx1, Junb and Cebpb. Furthermore, our results demonstrate that interaction with p300, an established coactivator for Myb, is unexpectedly required for Myb-mediated transcriptional repression. We propose that the repression of the above mentioned key pro-differentiation factors may contribute essentially to Myb's ability to suppress differentiation and promote self-renewal, thus maintaining progenitor cells in an undifferentiated state and promoting leukemic transformation.
Detailed information regarding the number and organization of transfer RNA (tRNA) genes at the genome level is becoming readily available with the increase of DNA sequencing of whole genomes. However ...the identification of functional tRNA genes is challenging for species that have large numbers of repetitive elements containing tRNA derived sequences, such as Bos taurus. Reliable identification and annotation of entire sets of tRNA genes allows the evolution of tRNA genes to be understood on a genomic scale.
In this study, we explored the B. taurus genome using bioinformatics and comparative genomics approaches to catalogue and analyze cow tRNA genes. The initial analysis of the cow genome using tRNAscan-SE identified 31,868 putative tRNA genes and 189,183 pseudogenes, where 28,830 of the 31,868 predicted tRNA genes were classified as repetitive elements by the RepeatMasker program. We then used comparative genomics to further discriminate between functional tRNA genes and tRNA-derived sequences for the remaining set of 3,038 putative tRNA genes. For our analysis, we used the human, chimpanzee, mouse, rat, horse, dog, chicken and fugu genomes to predict that the number of active tRNA genes in cow lies in the vicinity of 439. Of this set, 150 tRNA genes were 100% identical in their sequences across all nine vertebrate genomes studied. Using clustering analyses, we identified a new tRNA-GlyCCC subfamily present in all analyzed mammalian genomes. We suggest that this subfamily originated from an ancestral tRNA-GlyGCC gene via a point mutation prior to the radiation of the mammalian lineages. Lastly, in a separate analysis we created phylogenetic profiles for each putative cow tRNA gene using a representative set of genomes to gain an overview of common evolutionary histories of tRNA genes.
The use of a combination of bioinformatics and comparative genomics approaches has allowed the confident identification of a set of cow tRNA genes that will facilitate further studies in understanding the molecular evolution of cow tRNA genes.
MicroRNAs (miRNAs) are a rapidly growing family of small regulatory RNAs modulating gene expression in plants and animals. In animals, most of the miRNAs discovered in early studies were found to be ...evolutionarily conserved across the whole kingdom. More recent studies, however, have identified many miRNAs that are specific to a particular group of organisms or even a single species. These present a question about evolution of the individual miRNAs and their role in establishing and maintaining lineage-specific functions and characteristics.
In this study, we describe a detailed analysis of the miRNA cluster (hereafter mir-379/mir-656 cluster) located within the imprinted DLK-DIO3 region on human chromosome 14. We show that orthologous miRNA clusters are present in all sequenced genomes of the placental (eutherian) mammals but not in the marsupial (metatherian), monotreme (prototherian), or any other vertebrate genomes. We provide evidence that the locus encompassing this cluster emerged in an early eutherian ancestor prior to the radiation of modern placental mammals by tandem duplication of the ancient precursor sequence. The original amplified cluster may have contained in excess of 250 miRNA precursor sequences, most of which now appear to be inactive. Examination of the eutherian genomes showed that the cluster has been maintained in evolution for approximately 100 Myr.
Analysis of genes that contain predicted evolutionarily conserved targets for miRNAs from this cluster revealed significant overrepresentation of the Gene Ontology terms associated with biological processes such as neurogenesis, embryonic development, transcriptional regulation, and RNA metabolism. Consistent with these findings, a survey of the miRNA expression data within the cluster demonstrates a strong bias toward brain and placenta samples from adult organisms and some embryonic tissues.
Our results suggest that emergence of the mir-379/mir-656 miRNA cluster was one of the factors that facilitated evolution of the placental mammals. Overrepresentation of genes involved in regulation of neurogenesis among predicted miRNAs targets indicates an important role of the mir-379/mir-656 cluster in this biological process in the placental mammals.
MicroRNAs (miRNAs) have been shown to play key regulatory roles in a range of biological processes, including cell differentiation and development. To identify miRNAs that participate in gonad ...differentiation, a fundamental and tightly regulated developmental process, we examined miRNA expression profiles at the time of sex determination and during the early fetal differentiation of mouse testes and ovaries using high-throughput sequencing. We identified several miRNAs that were expressed in a sexually dimorphic pattern, including several members of the let-7 family, miR-378, and miR-140-3p. We focused our analysis on the most highly expressed, sexually dimorphic miRNA, miR-140-3p, and found that both miR-140-3p and its more lowly expressed counterpart, the previously annotated guide strand, miR-140-5p, are testis enriched and expressed in testis cords. Analysis of the miR-140-5p/miR-140-3p-null mouse revealed a significant increase in the number of Leydig cells in the developing XY gonad, strongly suggesting an important role for miR-140-5p/miR-140-3p in testis differentiation in mouse.
Bovine herpesvirus 1 (BoHV-1) is a ubiquitous and important pathogen of cattle worldwide. This study reports the identification of 10 microRNA (miRNA) genes, Bhv1-mir-B1–Bhv1-mir-B10, encoded by the ...BoHV-1 genome that were processed into 12 detectable mature miRNAs as determined by ultra-high throughput sequencing bioinformatics analyses of small RNA libraries and expression studies. We found that four of the miRNA genes were present as two copies in the BoHV-1 genome, resulting in a total of 14 miRNA encoding loci. Unique features of the BoHV-1 miRNAs include evidence of bidirectional transcription and a close association of two miRNA genes with the origin of replication, including one miRNA that is encoded within the origin of replication. The miRNA gene Bhv1-mir-B5 was encoded on the opposite DNA strand to the latency associated transcript, potentially giving rise to antisense transcripts originating from this locus. The association of herpesvirus miRNAs with latency appears to be a common feature in the alphaherpesviruses. Analyses of the BoHV-5 genome for putative miRNA gene orthologues identified a high degree of evolutionary conservation for nine of the BoHV-1 miRNA genes. The possible roles for BoHV-1 miRNAs in the regulation of known BoHV-1 transcription units and the genetics of the BoHV-1 genotypes are also discussed.
In a recent study that identified highly evolutionary conserved sequences in three genomes of Diptera species we described an ultraconserved element found at an internal exon-intron junction of the ...Drosophila melanogaster homothorax (hth) gene that appeared to be involved in the control of hth pre-mRNA splicing. We also discussed a possible role of RNA secondary structure at this site in the regulation of hth pre-mRNA splicing. In this report we identify a shorter evolutionary conserved intronic element within the hth gene that is located downstream of the first element and has sequence complementarity to it. We demonstrate that intramolecular interactions between these two elements would give rise to alternative RNA secondary structures, which in turn may result in differential control of homothorax pre-mRNA splicing. We also provide additional comparative genomic data from several newly available insect genomes supporting our original conclusion that these conserved elements are important in the post-transcriptional regulation of homothorax gene expression in Diptera.
Multiple sclerosis (MS) is a common chronic inflammatory disease of the central nervous system. Susceptibility to the disease is affected by both environmental and genetic factors. Genetic factors ...include haplotypes in the histocompatibility complex (MHC) and over 50 non-MHC loci reported by genome-wide association studies. Amongst these, we previously reported polymorphisms in chromosome 12q13-14 with a protective effect in individuals of European descent. This locus spans 288 kb and contains 17 genes, including several candidate genes which have potentially significant pathogenic and therapeutic implications. In this study, we aimed to fine-map this locus. We have implemented a two-phase study: a variant discovery phase where we have used next-generation sequencing and two target-enrichment strategies long-range polymerase chain reaction (PCR) and Nimblegen's solution phase hybridization capture in pools of 25 samples; and a genotyping phase where we genotyped 712 variants in 3577 healthy controls and 3269 MS patients. This study confirmed the association (rs2069502, P = 9.9 × 10(-11), OR = 0.787) and narrowed down the locus of association to an 86.5 kb region. Although the study was unable to pinpoint the key-associated variant, we have identified a 42 (genotyped and imputed) single-nucleotide polymorphism haplotype block likely to harbour the causal variant. No evidence of association at previously reported low-frequency variants in CYP27B1 was observed. As part of the study we compared variant discovery performance using two target-enrichment strategies. We concluded that our pools enriched with Nimblegen's solution phase hybridization capture had better sensitivity to detect true variants than the pools enriched with long-range PCR, whilst specificity was better in the long-range PCR-enriched pools compared with solution phase hybridization capture enriched pools; this result has important implications for the design of future fine-mapping studies.
Introduction - AML is a complex group of malignancies, with heterogeneity in morphology, cytogenetics, molecular characteristics, aggressiveness and importantly, in its response to treatment and ...survival outcomes. Next generation sequencing by the Cancer Genome Atlas Research Network analysed 200 primary AML cases and identified 23 genes that display recurrent somatic mutations at varying frequency in AML (NEJM 368(22):2059-2074). Defects in DNA repair are frequently identified in treatment-related AML and inherited mutations in genes of DNA repair pathways predispose patients to myeloid malignancies. For example, biallelic mutations in FANC genes, which cause the recessive heritable bone marrow failure syndrome Fanconi Anaemia (FA) are associated with high risk of progression to AML and other cancers (Kutler et al.Blood, 101:1249-1256), suggesting a potential involvement of FANC gene mutations in AML pathogenesis.
Methods - In this study we present a two-stage approach to gene discovery in AML: initial unbiased whole genome sequence (WGS) and whole exome sequence (WES) analysis of tumour DNA from a cytogenetically normal AML case at diagnosis and relapse, and corresponding germ-line DNA (prepared from mesenchymal stromal cells). Potential oncogenic mutations and changes associated with disease progression were identified. WES of a further 96 diagnostic AML samples further defined recurrent mutations and allowed identification of affected functional groups and networks in AML.
Results – WGS and WES were performed on diagnosis, non-haematopoietic and relapse samples from an index AML patient. Somatic SNVs and indels unique to the tumour samples include a number of variants in genes previously reported as recurrently somatically mutated in AML including FLT3, WT1 and IDH2. Somatic mutations in genes not previously associated with AML were also identified including a mutation in FANCD2 (p.S1412N) present in the index AML tumour DNA at diagnosis and at relapse.
Variants in genes recurrently mutated at low frequency in AML can also be disease drivers, however separating such genes from the background level of mutation in AML requires analysis across multiple samples, and sequencing studies to determine recurrence and/or mutations in proteins involved in the same functional pathway or complex. STRING-db v9.05 (Franceschini et al. NAR, 2013(41), Database issue) was used to identify a larger network of proteins, including and associated with the FANC genes, involved in homologous recombination-mediated DNA repair. Known somatic mutations from other AML studies were mapped onto this network; as shown in Figure 1 multiple genes in this extended network are affected by somatic mutation in AML suggesting a potential role in pathogenesis. Analysis of our WES data from diagnosis samples from a further 96 Australian AML cases identified an additional two somatic mutations in genes from the extended STRING-db v9.05 FANC network. In total we identified 18 mutations in the 16 classified FANC genes and 8 variants in the BLM complex as shown in Figure 2. Two of the germline FANC gene mutations, FANCM-Q13333fs and FANCD2-R926X, are known pathogenic mutations in FA. Patients with mutations in the 8 FANC genes of the core complex form a distinct subset from those with mutations in the other 8 FANC genes. 5 of the 8 patients with mutations in the BLM complex also form a separate group while BLM complex mutations are present in 2 patients that also have FANC mutations. For the two patients with acquired changes the allele frequency for these FANC mutations is greater than 25% suggesting an early origin in disease.
Discussion. Our findings suggest that germline and somatic mutations affecting function of the FANC DNA repair pathway may be a recurrent abnormality in AML, potentially contributing to leukaemogenesis. FANC/BLM gene mutations frequently co-exist with mutations in DNMT3A and DNMT1; 46% of the patients with DNMT3A/DNMT1 mutations are also mutant for FANC or BLM complex genes representing significant over-representation (p = 0.021). Within the group of FANC and BLM patients there is also significant under-representation of FLT3-ITD mutations and mutations in N-RAS and K-RAS (p = 0.051), raising the possibility that defects in homologous DNA repair may favour cooperation with alternative signalling pathways.
Display omitted
Display omitted
No relevant conflicts of interest to declare.
MicroRNAs (miRNAs) have been shown to play key regulatory roles in a range of biological processes, including cell differentiation and development. To identify miRNAs that participate in gonad ...differentiation, a fundamental and tightly regulated developmental process, we examined miRNA expression profiles at the time of sex determination and during the early fetal differentiation of mouse testes and ovaries using high-throughput sequencing. We identified several miRNAs that were expressed in a sexually dimorphic pattern, including several members of the let-7 family, miR-378, and miR-140-3p. We focused our analysis on the most highly expressed, sexually dimorphic miRNA, miR-140-3p, and found that both miR-140-3p and its more lowly expressed counterpart, the previously annotated guide strand, miR-140-5p, are testis enriched and expressed in testis cords. Analysis of the miR-140-5p/miR-140-3p-null mouse revealed a significant increase in the number of Leydig cells in the developing XY gonad, strongly suggesting an important role for miR-140-5p/miR-140-3p in testis differentiation in mouse.