Cas9 (CRISPR associated protein 9) is an RNA-guided DNA endonuclease enzyme associated with the CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) adaptive immunity system in Streptococcus pyogenes, among other bacteria. S. pyogenes utilizes Cas9 to memorize and later interrogate and cleave foreign DNA, such as invading bacteriophage DNA or plasmid DNA. Cas9 performs this interrogation by unwinding foreign DNA and checking for sites complementary to the 20 basepair spacer region of the guide RNA. If the DNA substrate is complementary to the guide RNA, Cas9 cleaves the invading DNA. In this sense, the CRISPR-Cas9 mechanism has a number of parallels with the RNA interference (RNAi) mechanism in eukaryotes.
Apart from its original function in bacterial immunity, the Cas9 protein has been heavily utilized as a genome engineering tool to induce site-directed double strand breaks in DNA. These breaks can lead to gene inactivation or the introduction of heterologous genes through non-homologous end joining and homologous recombination respectively in many laboratory model organisms. Alongside zinc finger nucleases and TALEN proteins, Cas9 is becoming a prominent tool in the field of genome editing.
Cas9 has gained traction in recent years because it can cleave nearly any sequence complementary to the guide RNA. Because the target specificity of Cas9 stems from the guide RNA:DNA complementarity and not modifications to the protein itself (like TALENs and Zinc-fingers), engineering Cas9 to target new DNA is straightforward. Versions of Cas9 that bind but do not cleave cognate DNA can be used to localize transcriptional activator or repressors to specific DNA sequences in order to control transcriptional activation and repression. While native Cas9 requires a guide RNA composed of two disparate RNAs that associate to make the guide – the CRISPR RNA (crRNA), and the trans-activating RNA (tracrRNA)., Cas9 targeting has been simplified through the engineering of a chimeric single guide RNA (chiRNA). Scientists have suggested that Cas9-based gene drives may be capable of editing the genomes of entire populations of organisms. In 2015, scientists used Cas9 to modify the genome of human embryos for the first time.
- 1 CRISPR-mediated immunity
- 2 Structural studies of Cas9
- 3 Applications of Cas9 to transcription tuning
- 4 Further reading
- 5 See also
- 6 References
To survive in a variety of challenging, inhospitable habitats that are filled with bacteriophage, bacteria have evolved methods to evade and fend off predatory viruses. This includes the recently appreciated CRISPR system. CRISPR loci are composed of short, palindromic repeats that occur at regular intervals composed of alternate CRISPR repeats and variable CRISPR spacers. These CRISPR loci are usually accompanied by adjacent CRISPR-associated (cas) genes. In 2005, it was discovered by three separate groups that the spacer regions were homologous to foreign DNA elements, including plasmids and viruses. These reports provided the first biological evidence that CRISPRs might function as an immune system.
Overview of CRISPR-Cas types and function
CRISPR-Cas systems divided into three major types (type I, type II, and type III) and twelve subtypes, which are based on their genetic content and structural differences. However, the core defining features of all CRISPR-Cas systems are the cas genes and their proteins: cas1 and cas2 are universal across types and subtypes, cas3, cas9, cas10 are signature genes for type I, type II, and type III respectively.
The CRISPR-Cas defense can be described in three stages:
- Adaptation-Adaptation involves recognition and integration of spacers between two adjacent repeats in the CRISPR locus. The “Protospacer” refers to the sequence on the viral genome that corresponds to the spacer. A short stretch of conserved nucleotides exists proximal to the protospacer, which is called the protospacer adjacent motif (PAM). The PAM is a recognition motif that is used to acquire the DNA fragment. In type II, Cas9 recognizes the PAM during adaptation in order to ensure the acquisition of functional spacers.
- CRISPR Processing- CRISPR expression includes the transcription of a primary transcript called a CRISPR RNA (pre-crRNA), which is transcribed from the CRISPR locus by RNA polymerase. Specific endoribonucleases then cleave the pre-crRNAs into small CRISPR RNAs (crRNAs).
- Interference (or 'Immunity')- Interference involves the crRNAs within a multi-protein complex called CASCADE, which can recognize and specifically base-pair with regions of inserting complementary foreign DNA. The crRNA-foreign nucleic acid complex is then cleaved, however if there are mismatches between the spacer and the target DNA, or if there are mutations in the PAM, then cleavage will not be initiated. In the latter scenario, the foreign DNA is not targeted for attack by the cell, thus the replication of the virus proceeds and the host is not immune to viral infection. The interference stage can be mechanistically and temporally distinct from CRISPR acquisition and expression, yet for complete function as a defense system, all three phases must be functional.
Stage 1: CRISPR spacer integration. Protospacers and protospacer-associated motifs (shown in red) are acquired at the “leader” end of a CRISPR array in the host DNA. The CRISPR array is composed of spacer sequences (shown in colored boxes) flanked by repeats (black diamonds). This process requires Cas1 and Cas2 (and Cas9 in type II), which are encoded in the cas locus, which are usually located near the CRISPR array.
Stage 2: CRISPR expression. Pre-crRNA is transcribed starting at the leader region by the host RNA polymerase and then cleaved by Cas proteins into smaller crRNAs containing a single spacer and a partial repeat (shown as hairpin structure with colored spacers).
Stage 3: CRISPR interference. crRNA with a spacer that has strong complementarity to the incoming foreign DNA begins a cleavage event (depicted with scissors), which requires Cas proteins. DNA cleavage interferes with viral replication and provides immunity to the host. The interference stage can be functionally and temporarily distinct from CRISPR acquisition and expression (depicted by white line dividing the cell).
Structural studies of Cas9
Cas9 features a bi-lobed architecture with the guide RNA nestled between the alpha-helical lobe (blue; Fig. 2) and the nuclease lobe (cyan, orange and gray). These two lobes are connected through a single bridge helix. There are two nuclease domains located in the multi-domain nuclease lobe, the RuvC (gray) which cleaves the non-target DNA strand, and the HNH nuclease domain (cyan) that cleaves the target strand of DNA. Interestingly, the RuvC domain is encoded by sequentially disparate sites that interact in the tertiary structure to form the RuvC cleavage domain (See Figure 3).
A key feature of the target DNA is that it must contain a protospacer adjacent motif (PAM) consisting of the three-nucleotide sequence- NGG. This PAM is recognized by the PAM-interacting domain (PI domain, orange) located near the C-terminal end of Cas9. Cas9 undergoes distinct conformational changes between the apo, guide RNA bound, and guide RNA:DNA bound states, which are detailed below.
Crystal structures in detail
Cas9 recognizes the stem-loop architecture inherent in the CRISPR locus, which mediates the maturation of crRNA-tracrRNA ribonucleoprotein complex. Cas9 in complex with CRISPR RNA (crRNA) and trans-activating crRNA (tracrRNA) further recognizes and degrades the target dsDNA. In the co-crystal structure shown here (Fig. 4), the crRNA-tracrRNA complex is replaced by a chimeric single-guide RNA (sgRNA, in red) which has been proved to have the same function as the natural RNA complex. The sgRNA base paired with target ssDNA is anchored by Cas9 as a T-shaped architecture. This crystal structure of the DNA-bound Cas9 enzyme reveals distinct conformational changes in the alpha-helical lobe with respect to the nuclease lobe, as well as the location go the HNH domain. protein consists of a recognition lobe (REC) and a nuclease lobe (NUC). It should be noted that all regions except the HNH form tight interactions with each other and sgRNA-ssDNA complex, while the HNH domain forms few contacts with the rest of the protein. In another conformation of Cas9 complex observed in the crystal, the HNH domain is not visible. These structures suggest the conformational flexibility of HNH domain.
So far, three crystal structures have been studied and published. One representing a conformation of Cas9 in the apo state, and two representing Cas9 in the DNA bound state.
- Jinek et al. Structures of Cas9 endonucleases reveal RNA-mediated conformational activation. Science, Feb 2014 
- Anders et al. Structural basis of PAM-dependent target DNA recognition by the Cas9 endonuclease. Nature Sept 2014 
- Nishimasu et al. Crystal structure of Cas9 in complex with guide RNA and target DNA. Cell Feb 2014 
The interactions between sgRNA and Cas9
In sgRNA-Cas9 complex, based on the crystal structure, REC1, BH and PI domains have important contacts with backbone or bases in both repeat and spacer region. Several Cas9 mutants including REC1 or REC2 domains deletion and residues mutations in BH have been tested. REC1 and BH related mutants show lower or none activity compared with wild type, which indicate these two domains are crucial for the sgRNA recognition at repeat sequence and stabilization of the whole complex. Although the interactions between spacer sequence and Cas9 as well as PI domain and repeat region need further studies, the co-crystal demonstrates clear interface between Cas9 and sgRNA.
Previous sequence analysis and biochemical studies have suggested Cas9 contain RNase H and HNH endonuclease homologous domains which are responsible for cleavages of two target DNA strands, respectively. These results are finally proved in the structure. Although the low sequence similarity, the sequence similar to RNase H has a RuvC fold (one member of RNase H family) and the HNH region folds as T4 Endo VII (one member of HNH endonuclease family). Previous works on Cas9 have demonstrated that HNH domain is responsible for complementary sequence cleavage of target DNA and RuvC is responsible for the non-complementary sequence.
Applications of Cas9 to transcription tuning
Interference of transcription by dCas9
Due to the unique ability of Cas9 to bind to essentially any complement sequence in any genome, researchers wanted to use this enzyme to repress transcription of various genomic loci. To accomplish this, the two crucial catalytic residues of the RuvC and HNH domain can be mutated to alanine abolishing all endonuclease activity of Cas9. The resulting protein coined ‘dead’ Cas9 or ‘dCas9’ for short, can still tightly bind to dsDNA. This catalytically inactive Cas9 variant has been used for both mechanistic studies into Cas9 DNA interrogative binding and as a general programmable DNA binding RNA-Protein complex.
The interaction of dCas9 with target dsDNA is so tight that high molarity urea protein denaturant can not fully dissociate the dCas9 RNA-protein complex from dsDNA target. dCas9 has been targeted with engineered single guide RNAs to transcription initiation sites of any loci where dCas9 can compete with RNA polymerase at promoters to halt transcription. Also, dCas9 can be targeted to the coding region of loci such that inhibition of RNA Polymerase occurs during the elongation phase of transcription. In Eukaryotes, silencing of gene expression can be extented by targeting dCas9 to enhancer sequences, where dCas9 can block assembly of transcription factors leading to silencing of specific gene expression. Moreover, the guide RNAs provided to dCas9 can be designed to include specific mismatches to its complementary cognate sequence that will quantitatively weaken the interaction of dCas9 for its programmed cognate sequence allowing a researcher to tune the extent of gene silencing applied to a gene of interest. This technology is similar in principle to RNAi such that gene expression is being modulated at the RNA level. However, the dCas9 approach has gained much traction as there exist less off-target effects and in general larger and more reproducible silencing effects through the use of dCas9 compared to RNAi screens. Furthermore, because the dCas9 approach to gene silencing can be quantitatively controlled, a researcher can now precisely control the extent to which a gene of interest is repressed allowing more questions about gene regulation and gene stoichiometry to be answered.
Beyond direct binding of dCas9 to transcriptionally sensitive positions of loci, dCas9 can be fused to a variety of modulatory protein domains to carry out a myriad of functions. Recently, dCas9 has been fused to chromatin remodeling proteins(HDACs/HATs) to reorganize the chromatin structure around various loci. This is an important in targeting various eukaryotic genes of interest as heterochromatin structures hinder Cas9 binding. Furthermore, because Cas9 can react to heterochromatin, it is theorized that this enzyme can be further applied to studying the chromatin structure of various loci. Additionally, dCas9 has been employed in genome wide screens of gene repression. By employing large libraries of guide RNAs capable of targeting thousands of genes, genome wide genetic screens using dCas9 have been conducted.
Another method for silencing transcription with Cas9 is to directly cleave mRNA products with the catalytically active Cas9 enzyme. This approach is made possible by hybridizing ssDNA with a PAM complement sequence to ssRNA allowing for a dsDNA-RNA PAM site for Cas9 binding. This technology makes available the ability to isolate endogenous RNA transcripts in cells without the need to induce chemical modifications to RNA or RNA tagging methods.
Transcription activation by dCas9 fusion proteins
In contrast to silencing genes, dCas9 can also be used to activate genes when fused to transcription activating factors. These factors include subunits of bacterial RNA Polymerase II and traditional transcription factors in eukaryotes. Recently, genome-wide screens of transcription activation have also been accomplished using dCas9 fusions named ‘CRISPRa’ for activation.
- Kennedy, Edward M.; Cullen, Bryan R. (2015). "Bacterial CRISPR/Cas DNA endonucleases: A revolutionary technology that could dramatically impact viral research and treatment". Virology. 479–480: 213–220. doi:10.1016/j.virol.2015.02.024. PMC . PMID 25759096.
- DCas9 activation system
- Genome editing
- Zinc finger nuclease
- Transcription activator-like effector nuclease
- Heler, R; Samai, P; Modell, J. W.; Weiner, C; Goldberg, G. W.; Bikard, D; Marraffini, L. A. (2015). "Cas9 specifies functional viral targets during CRISPR–Cas adaptation". Nature. 519 (7542): 199–202. doi:10.1038/nature14245. PMC . PMID 25707807.
- Jinek, M.; Chylinski, K.; Fonfara, I.; Hauer, M.; Doudna, J. A.; Charpentier, E. (2012). "A Programmable Dual-RNA-Guided DNA Endonuclease in Adaptive Bacterial Immunity". Science. 337 (6096): 816–821. doi:10.1126/science.1225829. PMID 22745249.
- Mali, Prashant; Esvelt, Kevin M; Church, George M (2013). "Cas9 as a versatile tool for engineering biology". Nature Methods. 10 (10): 957. doi:10.1038/nmeth.2649. PMC . PMID 24076990.
- Mali, Prashant; Aach, John; Stranges, P Benjamin; Esvelt, Kevin M; Moosburner, Mark; Kosuri, Sriram; Yang, Luhan; Church, George M (2013). "CAS9 transcriptional activators for target specificity screening and paired nickases for cooperative genome engineering". Nature Biotechnology. 31 (9): 833. doi:10.1038/nbt.2675. PMC . PMID 23907171.
- Gilbert, Luke A.; Larson, Matthew H.; Morsut, Leonardo; Liu, Zairan; Brar, Gloria A.; Torres, Sandra E.; Stern-Ginossar, Noam; Brandman, Onn; Whitehead, Evan H.; Doudna, Jennifer A.; Lim, Wendell A.; Weissman, Jonathan S.; Qi, Lei S. (2013). "CRISPR-Mediated Modular RNA-Guided Regulation of Transcription in Eukaryotes". Cell. 154 (2): 442. doi:10.1016/j.cell.2013.06.044. PMC . PMID 23849981.
- Esvelt, Kevin M; Smidler, Andrea L; Catteruccia, Flaminia; Church, George M (2014). "Concerning RNA-guided gene drives for the alteration of wild populations". eLife. 3. doi:10.7554/eLife.03401.
- Cyranoski, David; Reardon, Sara (22 April 2015). "Chinese scientists genetically modify human embryos". Nature. doi:10.1038/nature.2015.17378.
- Garneau, Josiane E.; Marie-Ève Dupuis, Manuela Villion, Dennis A. Romero, Rodolphe Barrangou, Patrick Boyaval, Christophe Fremaux, Philippe Horvath, Alfonso H. Magadán, Sylvain Moineau (2010). "The CRISPR/Cas bacterial immune system cleaves bacteriophage and plasmid DNA". Nature. 468 (7320): 67–71. doi:10.1038/nature09523. ISSN 0028-0836. PMID 21048762.
- Horvath, P.; R. Barrangou (2010). "CRISPR/Cas, the Immune System of Bacteria and Archaea". Science. 327 (5962): 167–170. doi:10.1126/science.1179555. ISSN 0036-8075. PMID 20056882.
- Karginov, Fedor V.; Gregory J. Hannon (2010). "The CRISPR System: Small RNA-Guided Defense in Bacteria and Archaea". Molecular Cell. 37 (1): 7–19. doi:10.1016/j.molcel.2009.12.033. ISSN 1097-2765. PMC . PMID 20129051.
- Wiedenheft, Blake; Sternberg, Samuel H.; Doudna, Jennifer A. (2012). "RNA-guided genetic silencing systems in bacteria and archaea". Nature. 482 (7385): 331. doi:10.1038/nature10886. PMID 22337052.
- Ran, F Ann; Hsu, Patrick D; Wright, Jason; Agarwala, Vineeta; Scott, David A; Zhang, Feng (2013). "Genome engineering using the CRISPR-Cas9 system". Nature Protocols. 8 (11): 2281–2308. doi:10.1038/nprot.2013.143. PMC . PMID 24157548.
- Jinek, M; Jiang, F; Taylor, D. W.; Sternberg, S. H.; Kaya, E; Ma, E; Anders, C; Hauer, M; Zhou, K; Lin, S; Kaplan, M; Iavarone, A. T.; Charpentier, E; Nogales, E; Doudna, J. A. (2014). "Structures of Cas9 Endonucleases Reveal RNA-Mediated Conformational Activation". Science. 343 (6176): 1247997. doi:10.1126/science.1247997. PMC . PMID 24505130.
- Anders, C; Niewoehner, O; Duerst, A; Jinek, M (2014). "Structural basis of PAM-dependent target DNA recognition by the Cas9 endonuclease". Nature. 513 (7519): 569–573. doi:10.1038/nature13579. PMC . PMID 25079318.
- Nishimasu, Hiroshi; Ran, F. Ann; Hsu, Patrick D.; Konermann, Silvana; Shehata, Soraya I.; Dohmae, Naoshi; Ishitani, Ryuichiro; Zhang, Feng; Nureki, Osamu (2014). "Crystal Structure of Cas9 in Complex with Guide RNA and Target DNA". Cell. 156 (5): 935. doi:10.1016/j.cell.2014.02.001. PMC . PMID 24529477.
- Westra, et al. 2012; Wiedenheft, et al. 2014
- Sternberg, Samuel H.; Sy Redding, Martin Jinek, Eric C. Greene, Jennifer A. Doudna (2014). "DNA interrogation by the CRISPR RNA-guided endonuclease Cas9". Nature. 507 (7490): 62–67. doi:10.1038/nature13011. ISSN 0028-0836. PMC . PMID 24476820.
- Bikard, D; Jiang, W; Samai, P; Hochschild, A; Zhang, F; Marraffini, L. A. (2013). "Programmable repression and activation of bacterial gene expression using an engineered CRISPR-Cas system". Nucleic Acids Research. 41 (15): 7429–37. doi:10.1093/nar/gkt520. PMC . PMID 23761437.
- Heintze, J; Luft, C; Ketteler, R (2013). "A CRISPR CASe for high-throughput silencing". Frontiers in Genetics. 4: 193. doi:10.3389/fgene.2013.00193. PMC . PMID 24109485.
- Gilbert, L. A.; Horlbeck, M. A.; Adamson, B; Villalta, J. E.; Chen, Y; Whitehead, E. H.; Guimaraes, C; Panning, B; Ploegh, H. L.; Bassik, M. C.; Qi, L. S.; Kampmann, M; Weissman, J. S. (2014). "Genome-Scale CRISPR-Mediated Control of Gene Repression and Activation". Cell. 159 (3): 647–61. doi:10.1016/j.cell.2014.09.029. PMC . PMID 25307932.
- O'Connell, M. R.; Oakes, B. L.; Sternberg, S. H.; East-Seletsky, A; Kaplan, M; Doudna, J. A. (2014). "Programmable RNA recognition and cleavage by CRISPR/Cas9". Nature. 516 (7530): 263–6. doi:10.1038/nature13769. PMC . PMID 25274302.