We report a new subgroup of Type III Restriction-Modification systems that use m4C methylation for host protection. Recognition specificities for six such systems, each recognizing a novel motif, ...have been determined using single molecule real-time DNA sequencing. In contrast to all previously characterized Type III systems which modify adenine to m6A, protective methylation of the host genome in these new systems is achieved by the N4-methylation of a cytosine base in one strand of an asymmetric 4 to 6 base pair recognition motif. Type III systems are heterotrimeric enzyme complexes containing a single copy of an ATP-dependent restriction endonuclease-helicase (Res) and a dimeric DNA methyltransferase (Mod). The Type III Mods are beta-class amino-methyltransferases, examples of which form either N6-methyl adenine or N4-methyl cytosine in Type II RM systems. The Type III m4C Mod and Res proteins are diverged, suggesting ancient origin or that m4C modification has arisen from m6A MTases multiple times in diverged lineages. Two of the systems, from thermophilic organisms, required expression of both Mod and Res to efficiently methylate an E. coli host, unlike previous findings that Mod alone is proficient at modification, suggesting that the division of labor between protective methylation and restriction activities is atypical in these systems. Two of the characterized systems, and many homologous putative systems, appear to include a third protein; a conserved putative helicase/ATPase subunit of unknown function and located 5' of the mod gene. The function of this additional ATPase is not yet known, but close homologs co-localize with the typical Mod and Res genes in hundreds of putative Type III systems. Our findings demonstrate a rich diversity within Type III RM systems.
Diverse bacteria, including several Pseudomonas species, produce a class of redox-active metabolites called phenazines that impact different cell types in nature and disease. Phenazines can affect ...microbial communities in both positive and negative ways, where their presence is correlated with decreased species richness and diversity. However, little is known about how the concentration of phenazines is modulated in situ and what this may mean for the fitness of members of the community. Through culturing of phenazine-degrading mycobacteria, genome sequencing, comparative genomics, and molecular analysis, we identified several conserved genes that are important for the degradation of three Pseudomonas-derived phenazines: phenazine-1-carboxylic acid (PCA), phenazine-1-carboxamide (PCN), and pyocyanin (PYO). PCA can be used as the sole carbon source for growth by these organisms. Deletion of several genes in Mycobacterium fortuitum abolishes the degradation phenotype, and expression of two genes in a heterologous host confers the ability to degrade PCN and PYO. In cocultures with phenazine producers, phenazine degraders alter the abundance of different phenazine types. Not only does degradation support mycobacterial catabolism, but also it provides protection to bacteria that would otherwise be inhibited by the toxicity of PYO. Collectively, these results serve as a reminder that microbial metabolites can be actively modified and degraded and that these turnover processes must be considered when the fate and impact of such compounds in any environment are being assessed.
Phenazine production by Pseudomonas spp. can shape microbial communities in a variety of environments ranging from the cystic fibrosis lung to the rhizosphere of dryland crops. For example, in the rhizosphere, phenazines can protect plants from infection by pathogenic fungi. The redox activity of phenazines underpins their antibiotic activity, as well as providing pseudomonads with important physiological benefits. Our discovery that soil mycobacteria can catabolize phenazines and thereby protect other organisms against phenazine toxicity suggests that phenazine degradation may influence turnover in situ. The identification of genes involved in the degradation of phenazines opens the door to monitoring turnover in diverse environments, an essential process to consider when one is attempting to understand or control communities influenced by phenazines.
We exploit the optical and spatial features of subwavelength nanostructures to examine individual receptors on the plasma membrane of living cells. Receptors were sequestered in portions of the ...membrane projected into zero-mode waveguides. Using single-step photobleaching of green fluorescent protein incorporated into individual subunits, the resulting spatial isolation was used to measure subunit stoichiometry in α4β4 and α4β2 nicotinic acetylcholine and P2X2 ATP receptors. We also show that nicotine and cytisine have differential effects on α4β2 stoichiometry.
Optical nanostructures have enabled the creation of subdiffraction detection volumes for single-molecule fluorescence microscopy. Their applicability is extended by the ability to place molecules in ...the confined observation volume without interfering with their biological function. Here, we demonstrate that processive DNA synthesis thousands of bases in length was carried out by individual DNA polymerase molecules immobilized in the observation volumes of zero-mode waveguides (ZMWs) in high-density arrays. Selective immobilization of polymerase to the fused silica floor of the ZMW was achieved by passivation of the metal cladding surface using polyphosphonate chemistry, producing enzyme density contrasts of glass over aluminum in excess of 400:1. Yields of single-molecule occupancies of ≈30% were obtained for a range of ZMW diameters (70-100 nm). Results presented here support the application of immobilized single DNA polymerases in ZMW arrays for long-read-length DNA sequencing.
The Helicobacter pylori phase variable gene modH, typified by gene HP1522 in strain 26695, encodes a N
-adenosine type III DNA methyltransferase. Our previous studies identified multiple ...strain-specific modH variants (modH1 - modH19) and showed that phase variation of modH5 in H. pylori P12 influenced expression of motility-associated genes and outer membrane protein gene hopG. However, the ModH5 DNA recognition motif and the mechanism by which ModH5 controls gene expression were unknown. Here, using comparative single molecule real-time sequencing, we identify the DNA site methylated by ModH5 as 5'-G
ACC-3'. This motif is vastly underrepresented in H. pylori genomes, but overrepresented in a number of virulence genes, including motility-associated genes, and outer membrane protein genes. Motility and the number of flagella of H. pylori P12 wild-type were significantly higher than that of isogenic modH5 OFF or ΔmodH5 mutants, indicating that phase variable switching of modH5 expression plays a role in regulating H. pylori motility phenotypes. Using the flagellin A (flaA) gene as a model, we show that ModH5 modulates flaA promoter activity in a GACC methylation-dependent manner. These findings provide novel insights into the role of ModH5 in gene regulation and how it mediates epigenetic regulation of H. pylori motility.
Current generation DNA sequencing instruments are moving closer to seamlessly sequencing genomes of entire populations as a routine part of scientific investigation. However, while significant ...inroads have been made identifying small nucleotide variation and structural variations in DNA that impact phenotypes of interest, progress has not been as dramatic regarding epigenetic changes and base-level damage to DNA, largely due to technological limitations in assaying all known and unknown types of modifications at genome scale. Recently, single-molecule real time (SMRT) sequencing has been reported to identify kinetic variation (KV) events that have been demonstrated to reflect epigenetic changes of every known type, providing a path forward for detecting base modifications as a routine part of sequencing. However, to date no statistical framework has been proposed to enhance the power to detect these events while also controlling for false-positive events. By modeling enzyme kinetics in the neighborhood of an arbitrary location in a genomic region of interest as a conditional random field, we provide a statistical framework for incorporating kinetic information at a test position of interest as well as at neighboring sites that help enhance the power to detect KV events. The performance of this and related models is explored, with the best-performing model applied to plasmid DNA isolated from Escherichia coli and mitochondrial DNA isolated from human brain tissue. We highlight widespread kinetic variation events, some of which strongly associate with known modification events, while others represent putative chemically modified sites of unknown types.
DNA modifications such as methylation and DNA damage can play critical regulatory roles in biological systems. Single molecule, real time (SMRT) sequencing technology generates DNA sequences as well ...as DNA polymerase kinetic information that can be used for the direct detection of DNA modifications. We demonstrate that local sequence context has a strong impact on DNA polymerase kinetics in the neighborhood of the incorporation site during the DNA synthesis reaction, allowing for the possibility of estimating the expected kinetic rate of the enzyme at the incorporation site using kinetic rate information collected from existing SMRT sequencing data (historical data) covering the same local sequence contexts of interest. We develop an Empirical Bayesian hierarchical model for incorporating historical data. Our results show that the model could greatly increase DNA modification detection accuracy, and reduce requirement of control data coverage. For some DNA modifications that have a strong signal, a control sample is not even needed by using historical data as alternative to control. Thus, sequencing costs can be greatly reduced by using the model. We implemented the model in a R package named seqPatch, which is available at https://github.com/zhixingfeng/seqPatch.
Determining the methylation state of regions with high copy numbers is challenging for second-generation sequencing, because the read length is insufficient to map reads uniquely, especially when ...repetitive regions are long and nearly identical to each other. Single-molecule real-time (SMRT) sequencing is a promising method for observing such regions, because it is not vulnerable to GC bias, it produces long read lengths, and its kinetic information is sensitive to DNA modifications.
We propose a novel linear-time algorithm that combines the kinetic information for neighboring CpG sites and increases the confidence in identifying the methylation states of those sites. Using a practical read coverage of ∼30-fold from an inbred strain medaka (Oryzias latipes), we observed that both the sensitivity and precision of our method on individual CpG sites were ∼93.7%. We also observed a high correlation coefficient (R = 0.884) between our method and bisulfite sequencing, and for 92.0% of CpG sites, methylation levels ranging over 0,1 were in concordance within an acceptable difference 0.25. Using this method, we characterized the landscape of the methylation status of repetitive elements, such as LINEs, in the human genome, thereby revealing the strong correlation between CpG density and hypomethylation and detecting hypomethylation hot spots of LTRs and LINEs. We uncovered the methylation states for nearly identical active transposons, two novel LINE insertions of identity ∼99% and length 6050 base pairs (bp) in the human genome, and 16 Tol2 elements of identity >99.8% and length 4682 bp in the medaka genome.
AgIn (Aggregate on Intervals) is available at: https://github.com/hacone/AgIn
ysuzuki@cb.k.u-tokyo.ac.jp or moris@cb.k.u-tokyo.ac.jp
Supplementary data are available at Bioinformatics online.
We performed whole-genome analyses of DNA methylation in Shewanella oneidensis MR-1 to examine its possible role in regulating gene expression and other cellular processes. Single-molecule real-time ...(SMRT) sequencing revealed extensive methylation of adenine (N6mA) throughout the genome. These methylated bases were located in five sequence motifs, including three novel targets for type I restriction/modification enzymes. The sequence motifs targeted by putative methyltranferases were determined via SMRT sequencing of gene knockout mutants. In addition, we found that S. oneidensis MR-1 cultures grown under various culture conditions displayed different DNA methylation patterns. However, the small number of differentially methylated sites could not be directly linked to the much larger number of differentially expressed genes under these conditions, suggesting that DNA methylation is not a major regulator of gene expression in S. oneidensis MR-1. The enrichment of methylated GATC motifs in the origin of replication indicates that DNA methylation may regulate genome replication in a manner similar to that seen in Escherichia coli. Furthermore, comparative analyses suggest that many Gammaproteobacteria, including all members of the Shewanellaceae family, may also utilize DNA methylation to regulate genome replication.
Phlebotomine sand flies are the vectors of leishmaniasis, a neglected tropical disease. High-quality reference genomes are an important tool for understanding the biology and eco-evolutionary ...dynamics underpinning disease epidemiology. Previous leishmaniasis vector reference sequences were limited by sequencing technologies available at the time and inadequate for high-resolution genomic inquiry. Here, we present updated reference assemblies of two sand flies, Phlebotomus papatasi and Lutzomyia longipalpis. These chromosome-level assemblies were generated using an ultra-low input library protocol, PacBio HiFi long reads, and Hi-C technology. The new P. papatasi reference has a final assembly span of 351.6 Mb and contig and scaffold N50s of 926 kb and 111.8 Mb, respectively. The new Lu. longipalpis reference has a final assembly span of 147.8 Mb and contig and scaffold N50s of 1.09 Mb and 40.6 Mb, respectively. Benchmarking Universal Single-Copy Orthologue (BUSCO) assessments indicated 94.5% and 95.6% complete single copy insecta orthologs for P. papatasi and Lu. longipalpis. These improved assemblies will serve as an invaluable resource for future genomic work on phlebotomine sandflies.