Peer-reviewed, Open access
  • Machine-learning approach e...
    Gussow, Ayal B; Park, Allyson E; Borges, Adair L; Shmakov, Sergey A; Makarova, Kira S; Wolf, Yuri I; Bondy-Denomy, Joseph; Koonin, Eugene V

    Nature Communications, 07/2020, Volume: 11, Issue: 1
    Journal Article

    CRISPR-Cas systems are adaptive bacterial and archaeal immunity systems that have been harnessed for the development of powerful genome editing and engineering tools. In the incessant host-parasite arms race, viruses have evolved multiple anti-defense mechanisms, including diverse anti-CRISPR proteins (Acrs) that specifically inhibit CRISPR-Cas and therefore have enormous potential as modulators of genome editing tools. Most Acrs are small, highly variable proteins, which makes their bioinformatic prediction a formidable task. We present a machine-learning approach for comprehensive Acr prediction. The model shows high predictive power when tested against an unseen test set and was employed to predict 2,500 candidate Acr families. Experimental validation of top candidates revealed two previously unknown Acrs (AcrIC9 and AcrIC10); three other top candidates were coincidentally identified and found to possess anti-CRISPR activity. These results substantially expand the repertoire of predicted Acrs and provide a resource for experimental Acr discovery.