Long-read technologies are overcoming early limitations in accuracy and throughput, broadening their application domains in genomics. Dedicated analysis tools that take into account the ...characteristics of long-read data are thus required, but the fast pace of development of such tools can be overwhelming. To assist in the design and analysis of long-read sequencing projects, we review the current landscape of available tools and present an online interactive database, long-read-tools.org, to facilitate their browsing. We further focus on the principles of error correction, base modification detection, and long-read transcriptomics analysis and highlight the challenges that remain.
DNA methylation in plants is traditionally partitioned into CG, CHG and CHH contexts (with H any nucleotide but G). By investigating DNA methylation patterns in trinucleotide contexts in four ...angiosperm species, we show that such a representation hides spatial and functional partitioning of different methylation pathways and is incomplete. CG methylation (mCG) is largely context-independent whereas, at CHG motifs, there is under-representation of mCCG in pericentric regions of A. thaliana and tomato and throughout the chromosomes of maize and rice. In A. thaliana the biased representation of mCCG in heterochromatin is related to specificities of H3K9 methyltransferase SUVH family members. At CHH motifs there is an over-representation of different variant forms of mCHH that, similarly to mCCG hypomethylation, is partitioned into the pericentric regions of the two dicots but dispersed in the monocot chromosomes. The over-represented mCHH motifs in A. thaliana associate with specific types of transposon including both class I and II elements. At mCHH the contextual bias is due to the involvement of various chromomethyltransferases whereas the context-independent CHH methylation in A. thaliana and tomato is mediated by the RNA-directed DNA methylation process that is most active in the gene-rich euchromatin. This analysis therefore reveals that the sequence context of the methylome of plant genomes is informative about the mechanisms associated with maintenance of methylation and the overlying chromatin structure.
Seed germination involves progression from complete metabolic dormancy to a highly active, growing seedling. Many factors regulate germination and these interact extensively, forming a complex ...network of inputs that control the seed-to-seedling transition. Our understanding of the direct regulation of gene expression and the dynamic changes in the epigenome and small RNAs during germination is limited. The interactions between genome, transcriptome and epigenome must be revealed in order to identify the regulatory mechanisms that control seed germination.
We present an integrated analysis of high-resolution RNA sequencing, small RNA sequencing and MethylC sequencing over ten developmental time points in Arabidopsis thaliana seeds, finding extensive transcriptomic and epigenomic transformations associated with seed germination. We identify previously unannotated loci from which messenger RNAs are expressed transiently during germination and find widespread alternative splicing and divergent isoform abundance of genes involved in RNA processing and splicing. We generate the first dynamic transcription factor network model of germination, identifying known and novel regulatory factors. Expression of both microRNA and short interfering RNA loci changes significantly during germination, particularly between the seed and the post-germinative seedling. These are associated with changes in gene expression and large-scale demethylation observed towards the end of germination, as the epigenome transitions from an embryo-like to a vegetative seedling state.
This study reveals the complex dynamics and interactions of the transcriptome and epigenome during seed germination, including the extensive remodelling of the seed DNA methylome from an embryo-like to vegetative-like state during the seed-to-seedling transition. Data are available for exploration in a user-friendly browser at https://jbrowse.latrobe.edu.au/germination_epigenome .
Transposable elements in crop plants are the powerful drivers of phenotypic variation that has been selected during domestication and breeding programs. In tomato, transpositions of the LTR (long ...terminal repeat) retrotransposon family Rider have contributed to various phenotypes of agronomical interest, such as fruit shape and colour. However, the mechanisms regulating Rider activity are largely unknown. We have developed a bioinformatics pipeline for the functional annotation of retrotransposons containing LTRs and defined all full-length Rider elements in the tomato genome. Subsequently, we showed that accumulation of Rider transcripts and transposition intermediates in the form of extrachromosomal DNA is triggered by drought stress and relies on abscisic acid signalling. We provide evidence that residual activity of Rider is controlled by epigenetic mechanisms involving siRNAs and the RNA-dependent DNA methylation pathway. Finally, we demonstrate the broad distribution of Rider-like elements in other plant species, including crops. Our work identifies Rider as an environment-responsive element and a potential source of genetic and epigenetic variation in plants.
Freakish and rare or the tip of the iceberg? Both phrases have been used to refer to paramutation, an epigenetic drive that contravenes Mendel's first law of segregation. Although its underlying ...mechanisms are beginning to unravel, its understanding relies only on a few examples that may involve transgenes or artificially generated epialleles.
By using DNA methylation of introgression lines as an indication of past paramutation, we reveal that the paramutation-like properties of the H06 locus in hybrids of Solanum lycopersicum and a range of tomato relatives and cultivars depend on the timing of sRNA production and conform to an RNA-directed mechanism. In addition, by scanning the methylomes of tomato introgression lines for shared regions of differential methylation that are absent in the S. lycopersicum parent, we identify thousands of candidate regions for paramutation-like behaviour. The methylation patterns for a subset of these regions segregate with non Mendelian ratios, consistent with secondary paramutation-like interactions to variable extents depending on the locus.
Together these results demonstrate that paramutation-like epigenetic interactions are common for natural epialleles in tomato, but vary in timing and penetrance.
A key benefit of long-read nanopore sequencing technology is the ability to detect modified DNA bases, such as 5-methylcytosine. The lack of R/Bioconductor tools for the effective visualization of ...nanopore methylation profiles between samples from different experimental groups led us to develop the NanoMethViz R package. Our software can handle methylation output generated from a range of different methylation callers and manages large datasets using a compressed data format. To fully explore the methylation patterns in a dataset, NanoMethViz allows plotting of data at various resolutions. At the sample-level, we use dimensionality reduction to look at the relationships between methylation profiles in an unsupervised way. We visualize methylation profiles of classes of features such as genes or CpG islands by scaling them to relative positions and aggregating their profiles. At the finest resolution, we visualize methylation patterns across individual reads along the genome using the spaghetti plot and heatmaps, allowing users to explore particular genes or genomic regions of interest. In summary, our software makes the handling of methylation signal more convenient, expands upon the visualization options for nanopore data and works seamlessly with existing methylation analysis tools available in the Bioconductor project. Our software is available at https://bioconductor.org/packages/NanoMethViz.
Abstract
Parents transmit genetic and epigenetic information to their offspring. Maternal effect genes regulate the offspring epigenome to ensure normal development. Here we report that the ...epigenetic regulator SMCHD1 has a maternal effect on
Hox
gene expression and skeletal patterning. Maternal SMCHD1, present in the oocyte and preimplantation embryo, prevents precocious activation of
Hox
genes post-implantation. Without maternal SMCHD1, highly penetrant posterior homeotic transformations occur in the embryo.
Hox
genes are decorated with Polycomb marks H2AK119ub and H3K27me3 from the oocyte throughout early embryonic development; however, loss of maternal SMCHD1 does not deplete these marks. Therefore, we propose maternal SMCHD1 acts downstream of Polycomb marks to establish a chromatin state necessary for persistent epigenetic silencing and appropriate
Hox
gene expression later in the developing embryo. This is a striking role for maternal SMCHD1 in long-lived epigenetic effects impacting offspring phenotype.
The process of epigenetic silencing, while fundamentally important, is not yet completely understood. Here we report a replenishable female mouse embryonic stem cell (mESC) system, Xmas, that allows ...rapid assessment of X chromosome inactivation (XCI), the epigenetic silencing mechanism of one of the two X chromosomes that enables dosage compensation in female mammals. Through a targeted genetic screen in differentiating Xmas mESCs, we reveal that the BAF complex is required to create nucleosome-depleted regions at promoters on the inactive X chromosome during the earliest stages of establishment of XCI. Without this action gene silencing fails. Xmas mESCs provide a tractable model for screen-based approaches that enable the discovery of unknown facets of the female-specific process of XCI and epigenetic silencing more broadly.