The Transposon Registry Tansirichaiya, Supathep; Rahman, Md. Ajijur; Roberts, Adam P
Mobile DNA,
10/2019, Letnik:
10, Številka:
1
Journal Article
Recenzirano
Odprti dostop
Transposable elements in prokaryotes are found in many forms and therefore a robust nomenclature system is needed in order to allow researchers to describe and search for them in publications and ...databases. Here we provide an update on The Transposon Registry which allocates numbers to any prokaryotic transposable element. Additionally, we present the completion of registry records for all transposons assigned Tn numbers from Tn1 onwards where sequence data or publications exist.
The accelerating pace of genome sequencing throughout the tree of life is driving the need for improved unsupervised annotation of genome components such as transposable elements (TEs). Because the ...types and sequences of TEs are highly variable across species, automated TE discovery and annotation are challenging and timeconsuming tasks. A critical first step is the de novo identification and accurate compilation of sequence models representing all of the unique TE families dispersed in the genome. Here we introduce RepeatModeler2, a pipeline that greatly facilitates this process. This program brings substantial improvements over the original version of RepeatModeler, one of the most widely used tools for TE discovery. In particular, this version incorporates a module for structural discovery of complete long terminal repeat (LTR) retroelements, which are widespread in eukaryotic genomes but recalcitrant to automated identification because of their size and sequence complexity. We benchmarked RepeatModeler2 on three model species with diverse TE landscapes and high-quality, manually curated TE libraries: Drosophila melanogaster (fruit fly), Danio rerio (zebrafish), and Oryza sativa (rice). In these three species, RepeatModeler2 identified approximately 3 times more consensus sequences matching with >95% sequence identity and sequence coverage to the manually curated sequences than the original RepeatModeler. As expected, the greatest improvement is for LTR retroelements. Thus, RepeatModeler2 represents a valuable addition to the genome annotation toolkit that will enhance the identification and study of TEs in eukaryotic genome sequences. RepeatModeler2 is available as source code or a containerized package under an open license (https://github.com/Dfam-consortium/ RepeatModeler, http://www.repeatmasker.org/RepeatModeler/).
CRISPR-associated transposons (CASTs) are mobile genetic elements that co-opt CRISPR-Cas systems for RNA-guided DNA transposition. CASTs integrate large DNA cargos into the attachment (att) site ...independently of homology-directed repair and thus hold promise for eukaryotic genome engineering. However, the functional diversity and complexity of CASTs hinder an understanding of their mechanisms. Here, we present the high-resolution cryoelectron microscopy (cryo-EM) structure of the reconstituted ∼1 MDa post-transposition complex of the type V-K CAST, together with different assembly intermediates and diverse TnsC filament lengths, thus enabling the recapitulation of the integration complex formation. The results of mutagenesis experiments probing the roles of specific residues and TnsB-binding sites show that transposition activity can be enhanced and suggest that the distance between the PAM and att sites is determined by the lengths of the TnsB C terminus and the TnsC filament. This singular model of RNA-guided transposition provides a foundation for repurposing the system for genome-editing applications.
Display omitted
•Landscape of assemblies of the V-K CAST post-transposition complex by cryo-EM•The stem loop of the sgRNA is inserted between two hairpin loops of TnsC filament•TnsC hairpin loop and LE and RE TnsB-binding sites mutants enhance activity•The length of the TnsB linker and the TnsC filament affect the PAM-to-att distance
CASTs are Tn7-like transposons that utilize CRISPR-Cas systems for specific DNA integration at target loci. Tenjo-Castaño et al. dissect the assembly mechanism of the type V-K CAST using cryo-EM. The authors shed light on the integration of the DNA cargo, leading to type V-K CAST variants with increased transposition activity.
A major resistance quantitative trait locus, qRfg1, significantly enhances maize resistance to Gibberella stalk rot, a devastating disease caused by Fusarium graminearum. However, the underlying ...molecular mechanism remains unknown.
We adopted a map-based cloning approach to identify the resistance gene at qRfg1 and examined the dynamic epigenetic changes during qRfg1-mediated maize resistance to the disease.
A CCT domain-containing gene, ZmCCT, is the causal gene at the qRfg1 locus and a polymorphic CACTA-like transposable element (TE1) c. 2.4 kb upstream of ZmCCT is the genetic determinant of allelic variation. The non-TE1 ZmCCT allele is in a poised state, with predictive bivalent chromatin enriched for both repressive (H3K27me3/H3K9me3) and active (H3K4me3) histone marks. Upon pathogen challenge, this non-TE1 ZmCCT allele was promptly induced by a rapid yet transient reduction in H3K27me3/H3K9me3 and a progressive decrease in H3K4me3, leading to disease resistance. However, TE1 insertion in ZmCCT caused selective depletion of H3K4me3 and enrichment of methylated GC to suppress the pathogen-induced ZmCCT expression, resulting in disease susceptibility. Moreover, ZmCCT- mediated resistance to Gibberella stalk rot is not affected by photoperiod sensitivity.
This chromatin-based regulatory mechanism enables ZmCCT to be more precise and timely in defense against F. graminearum infection.
Piwi-interacting RNAs (piRNAs) silence transposons to safeguard genome integrity in animals. However, the functions of the many piRNAs that do not map to transposons remain unknown. Here, we show ...that piRNA targeting in
can tolerate a few mismatches but prefer perfect pairing at the seed region. The broad targeting capacity of piRNAs underlies the germline silencing of transgenes in
Transgenes engineered to avoid piRNA recognition are stably expressed. Many endogenous germline-expressed genes also contain predicted piRNA targeting sites, and periodic An/Tn clusters (PATCs) are an intrinsic signal that provides resistance to piRNA silencing. Together, our study revealed the piRNA targeting rules and highlights a distinct strategy that
uses to distinguish endogenous from foreign nucleic acids.
High-copy-number transposable elements comprise the majority of eukaryotic genomes where they are major contributors to gene and genome evolution. However, it remains unclear how a host genome can ...survive a rapid burst of hundreds or thousands of insertions because such bursts are exceedingly rare in nature and therefore difficult to observe in real time. In a previous study we reported that in a few rice strains the DNA transposon mPing was increasing its copy number by approximately 40 per plant per generation. Here we exploit the completely sequenced rice genome to determine 1,664 insertion sites using high-throughput sequencing of 24 individual rice plants and assess the impact of insertion on the expression of 710 genes by comparative microarray analysis. We find that the vast majority of transposable element insertions either upregulate or have no detectable effect on gene transcription. This modest impact reflects a surprising avoidance of exon insertions by mPing and a preference for insertion into 5' flanking sequences of genes. Furthermore, we document the generation of new regulatory networks by a subset of mPing insertions that render adjacent genes stress inducible. As such, this study provides evidence for models first proposed previously for the involvement of transposable elements and other repetitive sequences in genome restructuring and gene regulation.
The origin and mobilization of the ~2,609-bp DNA segment containing the mobile colistin resistance gene mcr-1 continue to be sources of uncertainty, but recent evidence suggests that the gene ...originated in Moraxella species. Moreover mcr-1 can be mobilized as an ISApl1-flanked composite transposon (Tn6330), but many sequences have been identified without ISApl1 or with just a single copy (single ended). To further clarify the origins and mobilization of mcr-1, we employed the Geneious R8 software suite to comprehensively analyze the genetic environment of every complete mcr-1 structure deposited in GenBank as of this writing (September 2017) both with and without associated ISApl1 (n = 273). This revealed that the 2,609-bp mcr-1 structure was likely mobilized from a close relative of a novel species of Moraxella containing a chromosomal region sharing >96% nucleotide identity with the canonical sequence. This chromosomal region is bounded by AT and CG dinucleotides, which have been described on the inside ends (IE) of all intact Tn6330 described to date and represent the ancestral 2-bp target site duplications (TSDs) generated by ISApl1 transposition. We further demonstrate that all mcr-1 structures with just one ISApl1 copy or with no ISApl1 copies were formed by deletion of ISApl1 from the ancestral Tn6330, likely by a process related to the "copy-out-paste-in" transposition mechanism. Finally, we show that only the rare examples of single-ended structures that have retained a portion of the excised downstream ISApl1 including the entire inverted right repeat might be capable of mobilization.IMPORTANCE A comprehensive analysis of all intact mcr-1 sequences in GenBank was used to identify a region on the chromosome of a novel Moraxella species with remarkable homology to the canonical mcr-1 structure and that likely represents the origin of this important gene. These data also demonstrate that all mcr-1 structures lacking one or both flanking ISApl1 were formed from ancestral composite transposons that subsequently lost the insertion sequences by a process of abortive transposition. This observation conclusively shows that mobilization of mcr-1 occurs as part of a composite transposon and that structures lacking the downstream ISApl1 are not capable of mobilization.