Bisulfite sequencing detects 5mC and 5hmC at single-base resolution. However, bisulfite treatment damages DNA, which results in fragmentation, DNA loss, and biased sequencing data. To overcome these ...problems, enzymatic methyl-seq (EM-seq) was developed. This method detects 5mC and 5hmC using two sets of enzymatic reactions. In the first reaction, TET2 and T4-BGT convert 5mC and 5hmC into products that cannot be deaminated by APOBEC3A. In the second reaction, APOBEC3A deaminates unmodified cytosines by converting them to uracils. Therefore, these three enzymes enable the identification of 5mC and 5hmC. EM-seq libraries were compared with bisulfite-converted DNA, and each library type was ligated to Illumina adaptors before conversion. Libraries were made using NA12878 genomic DNA, cell-free DNA, and FFPE DNA over a range of DNA inputs. The 5mC and 5hmC detected in EM-seq libraries were similar to those of bisulfite libraries. However, libraries made using EM-seq outperformed bisulfite-converted libraries in all specific measures examined (coverage, duplication, sensitivity, etc.). EM-seq libraries displayed even GC distribution, better correlations across DNA inputs, increased numbers of CpGs within genomic features, and accuracy of cytosine methylation calls. EM-seq was effective using as little as 100 pg of DNA, and these libraries maintained the described advantages over bisulfite sequencing. EM-seq library construction, using challenging samples and lower DNA inputs, opens new avenues for research and clinical applications.
The predominant methodology for DNA methylation analysis relies on the chemical deamination by sodium bisulfite of unmodified cytosine to uracil to permit the differential readout of methylated ...cytosines. Bisulfite treatment damages the DNA, leading to fragmentation and loss of long-range methylation information. To overcome this limitation of bisulfite-treated DNA, we applied a new enzymatic deamination approach, termed enzymatic methyl-seq (EM-seq), to long-range sequencing technologies. Our methodology, named long-read enzymatic modification sequencing (LR-EM-seq), preserves the integrity of DNA, allowing long-range methylation profiling of 5-methylcytosine (5mC) and 5-hydroxymethylcytosine (5hmC) over multikilobase length of genomic DNA. When applied to known differentially methylated regions (DMRs), LR-EM-seq achieves phasing of >5 kb, resulting in broader and better defined DMRs compared with that previously reported. This result showed the importance of phasing methylation for biologically relevant questions and the applicability of LR-EM-seq for long-range epigenetic analysis at single-molecule and single-nucleotide resolution.
Cytosine residues in the vertebrate genome are enzymatically modified to 5-methylcytosine, which participates in transcriptional repression of genes during development and disease progression. ...5-Methylcytosine can be further enzymatically modified to 5-hydroxymethylcytosine by the TET family of methylcytosine dioxygenases. Analysis of 5-methylcytosine and 5-hydroxymethylcytosine is confounded, as these modifications are indistinguishable by traditional sequencing methods even when supplemented by bisulfite conversion. Here we demonstrate a simple enzymatic approach that involves cloning, identification, and quantification of 5-hydroxymethylcytosine in various CCGG loci within murine and human genomes. 5-Hydroxymethylcytosine was prevalent in human and murine brain and heart genomic DNAs at several regions. The cultured cell lines NIH3T3 and HeLa both displayed very low or undetectable amounts of 5-hydroxymethylcytosine at the examined loci. Interestingly, 5-hydroxymethylcytosine levels in mouse embryonic stem cell DNA first increased then slowly decreased upon differentiation to embryoid bodies, whereas 5-methylcytosine levels increased gradually over time. Finally, using a quantitative PCR approach, we established that a portion of VANGL1 and EGFR gene body methylation in human tissue DNA samples is indeed hydroxymethylation.
Shotgun metagenomic sequencing is a powerful approach to study microbiomes in an unbiased manner and of increasing relevance for identifying novel enzymatic functions. However, the potential of ...metagenomics to relate from microbiome composition to function has thus far been underutilized. Here, we introduce the Metagenomics Genome-Phenome Association (MetaGPA) study framework, which allows linking genetic information in metagenomes with a dedicated functional phenotype. We applied MetaGPA to identify enzymes associated with cytosine modifications in environmental samples. From the 2365 genes that met our significance criteria, we confirm known pathways for cytosine modifications and proposed novel cytosine-modifying mechanisms. Specifically, we characterized and identified a novel nucleic acid-modifying enzyme, 5-hydroxymethylcytosine carbamoyltransferase, that catalyzes the formation of a previously unknown cytosine modification, 5-carbamoyloxymethylcytosine, in DNA and RNA. Our work introduces MetaGPA as a novel and versatile tool for advancing functional metagenomics.
Modified DNA bases in mammalian genomes, such as 5-methylcytosine ((5m)C) and its oxidized forms, are implicated in important epigenetic regulation processes. In human or mouse, successive enzymatic ...conversion of (5m)C to its oxidized forms is carried out by the ten-eleven translocation (TET) proteins. Previously we reported the structure of a TET-like (5m)C oxygenase (NgTET1) from Naegleria gruberi, a single-celled protist evolutionarily distant from vertebrates. Here we show that NgTET1 is a 5-methylpyrimidine oxygenase, with activity on both (5m)C (major activity) and thymidine (T) (minor activity) in all DNA forms tested, and provide unprecedented evidence for the formation of 5-formyluridine ((5f)U) and 5-carboxyuridine ((5ca)U) in vitro. Mutagenesis studies reveal a delicate balance between choice of (5m)C or T as the preferred substrate. Furthermore, our results suggest substrate preference by NgTET1 to (5m)CpG and TpG dinucleotide sites in DNA. Intriguingly, NgTET1 displays higher T-oxidation activity in vitro than mammalian TET1, supporting a closer evolutionary relationship between NgTET1 and the base J-binding proteins from trypanosomes. Finally, we demonstrate that NgTET1 can be readily used as a tool in (5m)C sequencing technologies such as single molecule, real-time sequencing to map (5m)C in bacterial genomes at base resolution.
Filarial parasites (e.g., Brugia malayi, Onchocerca volvulus, and Wuchereria bancrofti) are causative agents of lymphatic filariasis and onchocerciasis, which are among the most disabling of ...neglected tropical diseases. There is an urgent need to develop macro-filaricidal drugs, as current anti-filarial chemotherapy (e.g., diethylcarbamazine DEC, ivermectin and albendazole) can interrupt transmission predominantly by killing microfilariae (mf) larvae, but is less effective on adult worms, which can live for decades in the human host. All medically relevant human filarial parasites appear to contain an obligate endosymbiotic bacterium, Wolbachia. This alpha-proteobacterial mutualist has been recognized as a potential target for filarial nematode life cycle intervention, as antibiotic treatments of filarial worms harboring Wolbachia result in the loss of worm fertility and viability upon antibiotic treatments both in vitro and in vivo. Human trials have confirmed this approach, although the length of treatments, high doses required and medical counter-indications for young children and pregnant women warrant the identification of additional anti-Wolbachia drugs.
Genome sequence analysis indicated that enzymes involved in heme biosynthesis might constitute a potential anti-Wolbachia target set. We tested different heme biosynthetic pathway inhibitors in ex vivo B. malayi viability assays and report a specific effect of N-methyl mesoporphyrin (NMMP), which targets ferrochelatase (FC, the last step). Our phylogenetic analysis indicates evolutionarily significant divergence between Wolbachia heme genes and their human homologues. We therefore undertook the cloning, overexpression and analysis of several enzymes of this pathway alongside their human homologues, and prepared proteins for drug targeting. In vitro enzyme assays revealed a approximately 600-fold difference in drug sensitivities to succinyl acetone (SA) between Wolbachia and human 5'-aminolevulinic acid dehydratase (ALAD, the second step). Similarly, Escherichia coli hemH (FC) deficient strains transformed with human and Wolbachia FC homologues showed significantly different sensitivities to NMMP. This approach enables functional complementation in E. coli heme deficient mutants as an alternative E. coli-based method for drug screening.
Our studies indicate that the heme biosynthetic genes in the Wolbachia of B. malayi (wBm) might be essential for the filarial host survival. In addition, the results suggest they are likely candidate drug targets based upon significant differences in phylogenetic distance, biochemical properties and sensitivities to heme biosynthesis inhibitors, as compared to their human homologues.
Nucleic acids in living organisms are more complex than the simple combinations of the four canonical nucleotides. Recent advances in biomedical research have led to the discovery of numerous ...naturally occurring nucleotide modifications and enzymes responsible for the synthesis of such modifications. In turn, these enzymes can be leveraged towards toolkits for DNA and RNA manipulation for epigenetic sequencing or other biotechnological applications. Here, we present the protocol to obtain purified 5-hydroxymethylcytosine carbamoyltransferase enzymes and the associated assays to convert 5-hydroxymethylcytosine to 5-carbamoyloxymethylcytosine
. We include detailed assays using DNA, RNA, and single nucleotide/deoxynucleotide as substrates. These assays can be combined with downstream applications for genetic/epigenetic regulatory mechanism studies and next-generation sequencing purposes.
Covalent modifications of genomic DNA are crucial for most organisms to survive. Amplicon-based high-throughput sequencing technologies erase all DNA modifications to retain only sequence information ...for the four canonical nucleobases, necessitating specialized technologies for ascertaining epigenetic information. To also capture base modification information, we developed Methyl-SNP-seq, a technology that takes advantage of the complementarity of the double helix to extract the methylation and original sequence information from a single DNA molecule. More specifically, Methyl-SNP-seq uses bisulfite conversion of one of the strands to identify cytosine methylation while retaining the original four-bases sequence information on the other strand. As both strands are locked together to link the dual readouts on a single paired-end read, Methyl-SNP-seq allows detecting the methylation status of any DNA even without a reference genome. Because one of the strands retains the original four nucleotide composition, Methyl-SNP-seq can also be used in conjunction with standard sequence-specific probes for targeted enrichment and amplification. We show the usefulness of this technology in a broad spectrum of applications ranging from allele-specific methylation analysis in humans to identification of methyltransferase specificity in complex bacterial communities.
The oxidation activity of the mammalian ten-eleven translocation dioxygenase (TET) on 5-methylcytosine (5mC) of DNA is usually monitored by analytical methods such as dot blotting and liquid ...chromatography-mass spectrometry (LC-MS). Herein, we describe a high throughput capillary gel electrophoresis assay for monitoring the in vitro oxidation of 5mC by TET. The method is rapid and quantitative, and can serve as a powerful tool in mechanistic studies of TET.
DNA hydroxymethylation is a long known modification of DNA, but has recently become a focus in epigenetic research. Mammalian DNA is enzymatically modified at the 5(th) carbon position of cytosine ...(C) residues to 5-mC, predominately in the context of CpG dinucleotides. 5-mC is amenable to enzymatic oxidation to 5-hmC by the Tet family of enzymes, which are believed to be involved in development and disease. Currently, the biological role of 5-hmC is not fully understood, but is generating a lot of interest due to its potential as a biomarker. This is due to several groundbreaking studies identifying 5-hydroxymethylcytosine in mouse embryonic stem (ES) and neuronal cells. Research techniques, including bisulfite sequencing methods, are unable to easily distinguish between 5-mC and 5-hmC . A few protocols exist that can measure global amounts of 5-hydroxymethylcytosine in the genome, including liquid chromatography coupled with mass spectrometry analysis or thin layer chromatography of single nucleosides digested from genomic DNA. Antibodies that target 5-hydroxymethylcytosine also exist, which can be used for dot blot analysis, immunofluorescence, or precipitation of hydroxymethylated DNA, but these antibodies do not have single base resolution.In addition, resolution depends on the size of the immunoprecipitated DNA and for microarray experiments, depends on probe design. Since it is unknown exactly where 5-hydroxymethylcytosine exists in the genome or its role in epigenetic regulation, new techniques are required that can identify locus specific hydroxymethylation. The EpiMark 5-hmC and 5-mC Analysis Kit provides a solution for distinguishing between these two modifications at specific loci. The EpiMark 5-hmC and 5-mC Analysis Kit is a simple and robust method for the identification and quantitation of 5-methylcytosine and 5-hydroxymethylcytosine within a specific DNA locus. This enzymatic approach utilizes the differential methylation sensitivity of the isoschizomers MspI and HpaII in a simple 3-step protocol. Genomic DNA of interest is treated with T4-BGT, adding a glucose moeity to 5-hydroxymethylcytosine. This reaction is sequence-independent, therefore all 5-hmC will be glucosylated; unmodified or 5-mC containing DNA will not be affected. This glucosylation is then followed by restriction endonuclease digestion. MspI and HpaII recognize the same sequence (CCGG) but are sensitive to different methylation states. HpaII cleaves only a completely unmodified site: any modification (5-mC, 5-hmC or 5-ghmC) at either cytosine blocks cleavage. MspI recognizes and cleaves 5-mC and 5-hmC, but not 5-ghmC. The third part of the protocol is interrogation of the locus by PCR. As little as 20 ng of input DNA can be used. Amplification of the experimental (glucosylated and digested) and control (mock glucosylated and digested) target DNA with primers flanking a CCGG site of interest (100-200 bp) is performed. If the CpG site contains 5-hydroxymethylcytosine, a band is detected after glucosylation and digestion, but not in the non-glucosylated control reaction. Real time PCR will give an approximation of how much hydroxymethylcytosine is in this particular site. In this experiment, we will analyze the 5-hydroxymethylcytosine amount in a mouse Babl/C brain sample by end point PCR.