We report DNA- and RNA-like systems built from eight nucleotide "letters" (hence the name "hachimoji") that form four orthogonal pairs. These synthetic systems meet the structural requirements needed ...to support Darwinian evolution, including a polyelectrolyte backbone, predictable thermodynamic stability, and stereoregular building blocks that fit a Schrödinger aperiodic crystal. Measured thermodynamic parameters predict the stability of hachimoji duplexes, allowing hachimoji DNA to increase the information density of natural terran DNA. Three crystal structures show that the synthetic building blocks do not perturb the aperiodic crystal seen in the DNA double helix. Hachimoji DNA was then transcribed to give hachimoji RNA in the form of a functioning fluorescent hachimoji aptamer. These results expand the scope of molecular structures that might support life, including life throughout the cosmos.
Artificially expanded genetic information systems (AEGISs) are unnatural forms of DNA that increase the number of independently replicating nucleotide building blocks. To do this, AEGIS pairs are ...joined by different arrangements of hydrogen bond donor and acceptor groups, all while retaining their Watson–Crick geometries. We report here a unique case where AEGIS DNA has been used to execute a systematic evolution of ligands by exponential enrichment (SELEX) experiment. This AEGIS–SELEX was designed to create AEGIS oligonucleotides that bind to a line of breast cancer cells. AEGIS–SELEX delivered an AEGIS aptamer (ZAP-2012) built from six different kinds of nucleotides (the standard G, A, C, and T, and the AEGIS nonstandard P and Z nucleotides, the last having a nitro functionality not found in standard DNA). ZAP-2012 has a dissociation constant of 30 nM against these cells. The affinity is diminished or lost when Z or P (or both) is replaced by standard nucleotides and compares well with affinities of standard GACT aptamers selected against cell lines using standard SELEX. The success of AEGIS–SELEX relies on various innovations, including (i) the ability to synthesize GACTZP libraries, (ii) polymerases that PCR amplify GACTZP DNA with little loss of the AEGIS nonstandard nucleotides, and (iii) technologies to deep sequence GACTZP DNA survivors. These results take the next step toward expanding the power and utility of SELEX and offer an AEGIS–SELEX that could possibly generate receptors, ligands, and catalysts having sequence diversities nearer to that displayed by proteins.
Expanding the number of nucleotides in DNA increases the information density of functional DNA molecules, creating nanoassemblies that cannot be invaded by natural DNA/RNA in complex biological ...systems. Here, we show how six‐letter GACTZP DNA contributes this property in two parts of a nanoassembly: 1) in an aptamer evolved from a six‐letter DNA library to selectively bind liver cancer cells; and 2) in a six‐letter self‐assembling GACTZP nanotrain that carries the drug doxorubicin. The aptamer‐nanotrain assembly, charged with doxorubicin, selectively kills liver cancer cells in culture, as the selectivity of the aptamer binding directs doxorubicin into the aptamer‐targeted cells. The assembly does not kill untransformed cells that the aptamer does not bind. This architecture, built with an expanded genetic alphabet, is reminiscent of antibodies conjugated to drugs, which presumably act by this mechanism as well, but with the antibody replaced by an aptamer.
Crazy train: Folded DNA molecules were evolved or designed from a six‐letter DNA library to perform two non‐genetic functions: binding selectively to liver cancer cells and binding the toxic drug doxorubicin. Combined into an aptamer‐nanotrain conjugate (6N‐Apt‐NTr‐Dox), these selectively kill liver cancer cells. This selective drug delivery is reminiscent of antibody–drug conjugates, but without proteins.
Artificially Expanded Genetic Information Systems (AEGIS) add independently replicable unnatural nucleotide pairs to the natural G:C and A:T/U pairs found in native DNA, joining the unnatural pairs ...through alternative modes of hydrogen bonding. Whether and how AEGIS pairs are recognized and processed by multi-subunit cellular RNA polymerases (RNAPs) remains unknown. Here, we show that E. coli RNAP selectively recognizes unnatural nucleobases in a six-letter expanded genetic system. High-resolution cryo-EM structures of three RNAP elongation complexes containing template-substrate UBPs reveal the shared principles behind the recognition of AEGIS and natural base pairs. In these structures, RNAPs are captured in an active state, poised to perform the chemistry step. At this point, the unnatural base pair adopts a Watson-Crick geometry, and the trigger loop is folded into an active conformation, indicating that the mechanistic principles underlying recognition and incorporation of natural base pairs also apply to AEGIS unnatural base pairs. These data validate the design philosophy of AEGIS unnatural basepairs. Further, we provide structural evidence supporting a long-standing hypothesis that pair mismatch during transcription occurs via tautomerization. Together, our work highlights the importance of Watson-Crick complementarity underlying the design principles of AEGIS base pair recognition.
The Varkud satellite ribozyme catalyses site-specific RNA cleavage and ligation, and serves as an important model system to understand RNA catalysis. Here, we combine stereospecific phosphorothioate ...substitution, precision nucleobase mutation and linear free-energy relationship measurements with molecular dynamics, molecular solvation theory and ab initio quantum mechanical/molecular mechanical free-energy simulations to gain insight into the catalysis. Through this confluence of theory and experiment, we unify the existing body of structural and functional data to unveil the catalytic mechanism in unprecedented detail, including the degree of proton transfer in the transition state. Further, we provide evidence for a critical Mg
in the active site that interacts with the scissile phosphate and anchors the general base guanine in position for nucleophile activation. This novel role for Mg
adds to the diversity of known catalytic RNA strategies and unifies functional features observed in the Varkud satellite, hairpin and hammerhead ribozyme classes.
The 4-letter DNA alphabet (A, T, G, C) as found in Nature is an elegant, yet non-exhaustive solution to the problem of storage, transfer, and evolution of biological information. Here, we report on ...strategies for both writing and reading DNA with expanded alphabets composed of up to 12 letters (A, T, G, C, B, S, P, Z, X, K, J, V). For writing, we devise an enzymatic strategy for inserting a singular, orthogonal xenonucleic acid (XNA) base pair into standard DNA sequences using 2'-deoxy-xenonucleoside triphosphates as substrates. Integrating this strategy with combinatorial oligos generated on a chip, we construct libraries containing single XNA bases for parameterizing kmer basecalling models for commercially available nanopore sequencing. These elementary steps are combined to synthesize and sequence DNA containing 12 letters - the upper limit of what is accessible within the electroneutral, canonical base pairing framework. By introducing low-barrier synthesis and sequencing strategies, this work overcomes previous obstacles paving the way for making expanded alphabets widely accessible.
Structural biology efforts using cryogenic electron microscopy are frequently stifled by specimens adopting "preferred orientations" on grids, leading to anisotropic map resolution and impeding ...structure determination. Tilting the specimen stage during data collection is a generalizable solution but has historically led to substantial resolution attenuation. Here, we develop updated data collection and image processing workflows and demonstrate, using multiple specimens, that resolution attenuation is negligible or significantly reduced across tilt angles. Reconstructions with and without the stage tilted as high as 60° are virtually indistinguishable. These strategies allowed the reconstruction to 3 Å resolution of a bacterial RNA polymerase with preferred orientation, containing an unnatural nucleotide for studying novel base pair recognition. Furthermore, we present a quantitative framework that allows cryo-EM practitioners to define an optimal tilt angle during data acquisition. These results reinforce the utility of employing stage tilt for data collection and provide quantitative metrics to obtain isotropic maps.
Rearranging hydrogen bonding groups adds nucleobases to an artificially expanded genetic information system (AEGIS), pairing orthogonally to standard nucleotides. We report here a large-scale ...synthesis of the AEGIS nucleotide carrying 2-amino-3-nitropyridin-6-one (trivially Z) via Heck coupling and a hydroboration/oxidation sequence. RiboZ is more stable against epimerization than its 2′-deoxyribo analogue. Further, T7 RNA polymerase incorporates ZTP opposite its Watson–Crick complement, imidazo1,2-a-1,3,5-triazin-4(8H)one (trivially P), laying grounds for using this “second-generation” AEGIS Z:P pair to add amino acids encoded by mRNA.
The ability of nucleic acids to catalyze reactions (as well as store and transmit information) is important for both basic and applied science, the first in the context of molecular evolution and the ...origin of life and the second for biomedical applications. However, the catalytic power of standard nucleic acids (NAs) assembled from just four nucleotide building blocks is limited when compared with that of proteins. Here, we assess the evolutionary potential of libraries of nucleic acids with six nucleotide building blocks as reservoirs for catalysis. We compare the outcomes of in vitro selection experiments toward RNA-cleavage activity of two nucleic acid libraries: one built from the standard four independently replicable nucleotides and the other from six, with the two added nucleotides coming from an artificially expanded genetic information system (AEGIS). Results from comparative experiments suggest that DNA libraries with increased chemical diversity, higher information density, and larger searchable sequence spaces are one order of magnitude richer reservoirs of molecules that catalyze the cleavage of a phosphodiester bond in RNA than DNA libraries built from a standard four-nucleotide alphabet. Evolved AEGISzymes with nitro-carrying nucleobase Z appear to exploit a general acid-base catalytic mechanism to cleave that bond, analogous to the mechanism of the ribonuclease A family of protein enzymes and heavily modified DNAzymes. The AEGISzyme described here represents a new type of catalysts evolved from libraries built from expanded genetic alphabets.