The human zinc finger antiviral protein (ZAP) recognizes RNA by binding to CpG dinucleotides. Mammalian transcriptomes are CpG-poor, and ZAP may have evolved to exploit this feature to specifically target non-self viral RNA. Phylogenetic analyses reveal that ZAP and its paralogue PARP12 share an ancestral gene that arose prior to extensive eukaryote divergence, and the ZAP lineage diverged from the PARP12 lineage in tetrapods. Notably, the CpG content of modern eukaryote genomes varies widely, and ZAP-like genes arose subsequent to the emergence of CpG suppression in vertebrates. Human PARP12 exhibited no antiviral activity against wild-type or CpG-enriched HIV-1, but ZAP proteins from several tetrapods had antiviral activity when expressed in human cells. In some cases, ZAP antiviral activity required a TRIM25 protein from the same or a related species, suggesting functional co-evolution of these genes. Indeed, a hypervariable sequence in the N-terminal domain of ZAP contributed to species-specific TRIM25 dependence in antiviral activity assays. Crosslinking immunoprecipitation coupled with RNA sequencing revealed that ZAP proteins from human, mouse, bat and alligator exhibit a high degree of CpG specificity, while some avian ZAP proteins appear more promiscuous. Together, these data suggest that the CpG-rich RNA-directed antiviral activity of ZAP-related proteins arose in tetrapods, subsequent to the onset of CpG suppression in certain eukaryote lineages, followed by species-specific adaptation of cofactor requirements and RNA target specificity.
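CpG suppression of the kind described above is commonly quantified as an observed/expected CpG ratio: the number of CG dinucleotides, multiplied by the sequence length, divided by the product of the C and G counts. The sketch below illustrates that standard ratio only; the function name `cpg_obs_exp` is hypothetical, and nothing here is taken from the study's own analysis pipeline.

```python
def cpg_obs_exp(seq: str) -> float:
    """Observed/expected CpG ratio: (count(CG) * N) / (count(C) * count(G)).

    Illustrative helper (hypothetical name), not the authors' method.
    Accepts DNA or RNA; U is treated as T. Returns 0.0 when the
    sequence has no C or no G, since the ratio is undefined there.
    """
    s = seq.upper().replace("U", "T")
    n = len(s)
    c = s.count("C")
    g = s.count("G")
    cg = s.count("CG")  # CG cannot overlap itself, so count() is exact
    if c == 0 or g == 0:
        return 0.0
    return cg * n / (c * g)


if __name__ == "__main__":
    # A ratio near 1 means CpG occurs about as often as base composition
    # predicts; values well below 1 indicate CpG suppression.
    print(cpg_obs_exp("CCGG"))
```

Under this measure, a CpG-suppressed transcript would score well below 1, whereas a CpG-enriched viral sequence would score closer to or above 1.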
Infection of animal cells by numerous viruses is detected and countered by a variety of means, including recognition of nonself nucleic acids. The zinc finger antiviral protein (ZAP) depletes cytoplasmic RNA that is recognized as foreign in mammalian cells by virtue of its elevated CG dinucleotide content compared with endogenous mRNAs. Here, we determined a crystal structure of a protein-RNA complex containing the N-terminal, 4-zinc finger human (h) ZAP RNA-binding domain (RBD) and a CG dinucleotide-containing RNA target. The structure reveals in molecular detail how hZAP is able to bind selectively to CG-rich RNA. Specifically, the 4 zinc fingers create a basic patch on the hZAP RBD surface. The highly basic second zinc finger contains a pocket that selectively accommodates CG dinucleotide bases. Structure-guided mutagenesis, cross-linking immunoprecipitation sequencing assays, and RNA affinity assays show that the structurally defined CG-binding pocket is not required for RNA binding per se in human cells. However, the pocket is a crucial determinant of high-affinity, specific binding to CG dinucleotide-containing RNA. Moreover, variations in RNA-binding specificity among a panel of CG-binding pocket mutants quantitatively predict their selective antiviral activity against a CG-enriched HIV-1 strain. Overall, the hZAP RBD-RNA structure provides an atomic-level explanation for how ZAP selectively targets foreign, CG-rich RNA.