Abstract
Among CRISPR-Cas genome editing systems, Streptococcus pyogenes Cas9 (SpCas9), sourced from a human pathogen, is the most widely used. Here, through in silico data mining, we have established an efficient plant genome engineering system using CRISPR-Cas9 from probiotic Lactobacillus rhamnosus. We have confirmed the predicted 5’-NGAAA-3’ PAM via a bacterial PAM depletion assay and showcased its exceptional editing efficiency in rice, wheat, tomato, and Larix cells, surpassing LbCas12a, SpCas9-NG, and SpRY when targeting the identical sequences. In stable rice lines, LrCas9 facilitates multiplexed gene knockout through coding sequence editing and achieves gene knockdown via targeted promoter deletion, demonstrating high specificity. We have also developed LrCas9-derived cytosine and adenine base editors, expanding base editing capabilities. Finally, by harnessing LrCas9’s A/T-rich PAM targeting preference, we have created efficient CRISPR interference and activation systems in plants. Together, our work establishes CRISPR-LrCas9 as an efficient and user-friendly genome engineering tool for diverse applications in crops and beyond.
Similar content being viewed by others
Introduction
The CRISPR-Cas system, an innate immune system in many bacteria and archaea, has been harnessed to develop genome engineering tools. Since the first demonstration of CRISPR-Cas9 for inducing targeted DNA double-strand breaks (DSBs) in prokaryote and eukaryote cells1,2, this powerful tool has revolutionized genome engineering including in plants3,4,5,6,7. Streptococcus pyogenes Cas9 (SpCas9), sourced from a human pathogen, has become the most widely used Cas9. And it has been adopted to achieve versatile plant genome engineering including base editing8,9, prime editing10,11, gene regulation12,13,14, and epigenetic editing15,16.
SpCas9 mainly recognizes a 5’-NGG-3’ PAM, which limits its targeting scope in the genomes. Two approaches have been adopted to overcome this problem. In the first approach, researchers focused on the discovery of functionally useful Cas9 orthologs with alternative PAM requirements such as SaCas917, St1Cas918, NmeCas919, Nme2Cas920, CjCas921, and BlatCas922. In general, these Cas9 orthologs recognize more complex PAM sequences with multiple nucleotides (6–8 bp) and possess limited editing activity. Recently, Cas9 orthologs with shorter PAMs have been reported such as ScCas9 which recognizes 5’-NNG-3’PAM23, and FrCas9 which recognizes 5’-NNTA-3’PAM24. ScCas9 was demonstrated for genome editing in rice, albeit with low efficiency25. The second approach applies rational design or direct protein evolution to engineer SpCas9 variants with altered PAM requirements, such as xCas9 for a 5’-NG-3’ PAM26, SpCas9-NG for a 5’-NG-3’ PAM27, and SpRY for 5’-NR/YN-3’ PAMs28. In plants, xCas9 was proved to mainly recognize the canonical 5’-NGG-3’ PAM29,30. SpCas9-NG and SpRY indeed expand targeting scope by recognizing relaxed PAMs but the editing efficiency is low29,31,32. Such compromised editing efficiency could be partly attributed to their self-cleavage effects9,31,33.
As an effective complement of the CRISPR-Cas9 system, the CRISPR-Cas12a system confers genome editing at canonical 5’-TTTV-3’ PAM sites, with LbCas12a being most widely used in plants34,35,36. While CRISPR-Cas12a is advantageous for multiplexed genome editing37,38 and promoter editing39, this system is sensitive to temperature40. Also, the lack of Cas12a nickases has hindered the development of more sophisticated precise genome editing tools such as base editors and prime editors. Only recently, efficient base editors using dCas12a proteins were reported for plant genome editing41,42. Although Cas12a-based CRISPRi systems were reported34, potent Cas12a-based CRISPRa systems are yet to be developed. So, despite great progress, CRISPR-Cas12a has not matched the versatility of CRISPR-Cas9 as a genome engineering platform yet.
In this study, we report a type II-A Cas9 from probiotic Lactobacillus rhamnosus GG named LrCas9. LrCas9 recognizes a 5’-NGAAA-3’ PAM, which overlaps (as a reverse complementary strand) the 5’-TTTV-3’ PAM of Cas12a. LrCas9 is efficient for genome editing in multiple plant species such as rice, wheat, tomato, and larix with high-fidelity. Moreover, LrCas9 shows higher editing efficiency than SpCas9-NG, SpRY, and LbCas12a when targeting the same DNA sequences. Further, LrCas9-derived cytosine base editor (CBE) and adenine base editor (ABE) are developed and demonstrated in rice and wheat. Finally, we engineer CRISPRi and CRISPRa systems using LrCas9, and these systems enable potent transcriptional repression and activation in plant cells, respectively.
Results
In-silico mining of CRISPR-Cas9 systems de novo
To mine potentially useful CRISPR-Cas9 systems for genome engineering in plants, we took three steps that include Cas protein identification, CRISPR array identification, and PAM prediction (Fig. 1a). For Cas protein identification, we analyzed a total number of 33,825 proteomes, including 32,023 bacterium and 1832 archaea, and identified 30,495 CRISPR clusters: 29,586 from bacteria and 909 from archaea (Supplementary Fig. 1a). Among them, different types of CRISPR-associated protein were identified (Supplementary Fig. 1b). For CRISPR array identification, 4963 clusters that contain Type II Cas9 proteins were identified by using the CRISPRFinder43. We first identified the potential crRNA sequences for each cluster, and then the tracrRNA was each identified by crRNA sequence alignment. These analyses revealed the CRISPR locus lengths of 5000–10,000 bp, and predominantly ~7000 bp (Supplementary Fig. 1c). The Cas9 protein lengths span from 1000aa to 1400aa, but mainly ~1400aa (Supplementary Fig. 1d). The tracrRNA lengths range from 100 nt to 500 nt (Supplementary Fig. 1e) while the crRNA lengths are around 36 nt (Supplementary Fig. 1f). There were 127 CRISPR arrays that could align to phage genomes among 739 candidates (Supplementary Data 1). By identifying the anti-protospacers (anti-spacers) in virus/phage genomes using BLAST44, we extracted the flanking sequences of anti-spacers and drew the PAMs for these 739 candidates using the WebLogo tool45. After removal of the redundant candidates and selection of CRISPR systems with high PAM scores, we narrowed down to 42 CRISPR-Cas9 candidates including many Type II-A systems and some Type II-C systems with different evolutionary lineages (Fig. 1b). There is structure difference between these two CRISPR-Cas9 types: Type II-A CRISPR-Cas9 loci consist of Cas9, Cas1, Cas2 and Csn2, while Type II-C CRISPR-Cas9 loci lack of Csn2 (Fig. 1b and Supplementary Figs. 2 and 3). A diverse panel of PAMs was predicted for the 42 candidate Cas9 nucleases (Supplementary Figs. 4 and 5). The crRNA of 42 candidates were aligned (Supplementary Fig. 6a), which revealed a conserved 5’-GUUUU-3’ motif at the 5’ end and 5’-AAAAC-3’ motif at the 3’ end (Supplementary Fig. 6b).
For experimental testing, we selected 2 Cas9 (BAI42646.1 from Lactobacillus rhamnosus GG and CAL43592.1 from Flavobacterium psychrophilum JIP02) with the potential to broaden the PAM recognition, and 2 Cas9 (EPW83356.1 from Streptococcus agalactiae STIR-CD-14 and OFH73969.1 from Listeria monocytogenes) that recognize the same PAM with SpCas9 to test whether any of them possesses higher nuclease activity. All four Cas9 proteins have a similar size to SpCas9 and contain all functional domains (Fig. 1c and Supplementary Fig. 7a–c). For example, The BAI42646.1 (named LrCas9) has the catalytic centers of RuvCI (D13) and HNH (H858) corresponding to D10 and H840 of the two domains in SpCas9 (Fig. 1c). The protein structures of SpCas9 and four Cas9 proteins mined from our dataset were predicted using SWISS-MODEL46 (Fig. 1d and Supplementary Fig. 7d–f). Interestingly, the HNH domain of LrCas9 is located in the center of the protein, while the SpCas9’s HNH domain faces backward from the center (Fig. 1d and Supplementary Fig. 8). This suggests different levels of conformation changes upon both Cas9 proteins’ binding to tracrRNA-crRNA and target DNA. Based on PAM prediction, LrCas9 recognizes a 5’-NGAAA-3’ PAM (Fig. 1e), CAL4359.2 recognizes a 5’-NAATAT-3’ PAM, while EPW83356.1 and OFH73969.1 recognize a 5’-NGG-3’ PAM (Supplementary Fig. 7g–i). We selected rice endogenous sites to test these Cas9s’ editing ability via co-transformation of protoplasts with all-in-one vectors expression crRNA, tracrRNA, and Cas9 (Fig. 1f). LrCas9 showed efficient genome editing with mutation rates from 4.3% to 64.2% at 3 selected sites (Fig. 1g), while other three Cas9 system showed no editing efficiency (Supplementary Fig. 7j–l). We thus focused on LrCas9 for further testing, especially because Lactobacillus rhamnosus GG is a probiotic strain.
Characterization and optimization of CRISPR-LrCas9 for genome editing
Phylogenetic analysis showed that LrCas9 is closely related to SpCas9, ScCas9, and FrCas9, all belonging to the Type II-A Cas9 group (Fig. 2a). Based on sequence analysis, we confirmed the CRISPR locus in the genome of Lactobacillus rhamnosus GG (Fig. 2b). The coding sequence of this locus contains Cas9, Cas1, Cas2, and Csn2. The non-coding sequence contains a 138 bp tracrRNA sequence between Cas9 and Cas1, and a CRISPR array composed of 23 spacer-crRNA units (Fig. 2b and Supplementary Data 2). We further validated the expression of tracrRNA and crRNA from small RNA-seq in E. coli that was transformed with this LrCas9 locus (Fig. 2b). To assess the PAM preference, we employed a depletion assay based on a double-antibiotics selection system in E. coli (Fig. 2c). The depletion assay showed the LrCas9 favored a 5’-NGAAA-3’ PAM (N = A, C, G, T) (Fig. 2d), which is consistent with the PAM prediction (Fig. 1e) and our initial genome editing results (Fig. 1g).
Our initial testing of the CRISPR-LrCas9 system used separate crRNA and tracrRNA (Fig. 1f). To combine both into a single guide RNA (sgRNA), we started with a long tetraloop (sgRNA V3.0) and made two truncated versions (sgRNA V2.0 and sgRNA V1.0) (Fig. 2e). We tested these three sgRNA scaffolds with 20 bp spacers at two independent 5’-NGAAA-3’ PAM sites (Os-CG01 and Os-TG01) in rice protoplasts. Both sgRNA V2.0 and V3.0 worked equally well (with ~20% to 25% editing efficiency measured by next-generation sequencing (NGS)), while sgRNA V1.0 failed to work (Fig. 2f). Since sgRNA V2.0 is shorter than sgRNA V3.0, we selected the compact sgRNA V2.0 for further applications. We next assessed different spacer lengths (14–22 bp) by targeting the Os-AG04 site with sgRNA V2.0 and identified 20 bp as the optimal spacer length (Fig. 2g).
Having optimized CRISPR-LrCas9 with its PAM requirement, sgRNA scaffold, and spacer length, we wanted to evaluate the reliability of this genome editing system. To this end, we targeted 19 endogenous sites in the rice genome and assessed editing efficiency using the protoplast system and NGS. Remarkably, LrCas9 showed detectable editing activity at all target sites, with efficiency ranging from 8.6% to 57.3% and averaged at 26.6% (Fig. 2h). Collective analysis of the editing data further confirmed that LrCas9 can edit all 5’-NGAAA-3’ PAM sites and the editing efficiencies at different 5’-NGAAA-3’ PAM groups showed no significant difference (Fig. 2i). By analyzing the genome editing outcomes among all these sites, it appears that LrCas9 cut 3 bp upstream of the PAM (Fig. 2j), resulting in predominantly 1 bp deletions (Fig. 2k). Furthermore, the insertion and deletion proportions were also calculated, and LrCas9 caused more insertions than deletions at most target sites (Supplementary Fig. 9).
Previously, we found SpCas9 showed an optimal editing efficiency at 32 °C, where editing efficiency at 22 and 28 °C was reduced in rice protoplasts40. To assess the temperature sensitivity of LrCas9, we conducted a similar protoplast assay by editing the Os-GG02 site. Based on the result, LrCas9 showed comparable genome editing efficiency at 28 and 32 °C, whereas the editing efficiency was reduced at 22 °C (Supplementary Fig. 10). This data suggests that LrCas9 may be less temperature sensitive than SpCas9, making the CRISPR-LrCas9 a potentially robust genome editing system in plants.
Benchmarking LrCas9 with other Cas nucleases and in diverse plant species
LrCas9 recognizes a 5’-NGAAA-3’ PAM, and its reverse complementary sequence forms a PAM for Cas12a nucleases such as the widely used LbCas12a (with a 5’-TTTV-3’ PAM) (Fig. 3a). In addition, SpCas9-NG and SpRY could recognize 5’-NGA-3’ PAM (Fig. 3b). These features allow for a close comparison between LrCas9 and these well-established Cas nucleases. We first compared the editing efficiency of LrCas9 and LbCas12a and its high-activity variants ttLbCas12a47 and LbCas12a-RRV48 at four independent loci in the rice genome49,50 (Fig. 3a). Consistent with our recent report48, the rice protoplast results showed that LbCas12a-RRV had the highest editing efficiency among Cas12a nucleases tested (Fig. 3c). Remarkably, LrCas9 had similar editing efficiency with LbCas12a at one site, but outperformed LbCas12a at the three other sites (Fig. 3c). Overall, LrCas9 is a significantly more potent nuclease than LbCas12a, with comparable editing efficiency to the ttLbCas12a variant (Fig. 3c). We also compared LrCas9 with SpCas9-NG and SpRY at editing seven independent sites with a 5’-NGA-3’ PAM (Fig. 3b). The protoplast assay showed that LrCas9 had significantly higher editing efficiency than SpRY and SpCas9-NG at five out of seven target sites, whereas all three nucleases failed to show detectable editing activity at two other sites (Fig. 3d). Together, these results demonstrate that LrCas9 is a more potent nuclease than LbCas12a, SpCas9-NG, and SpRY when targeting the same sequences with compatible PAM requirements.
LrCas9’s tolerance to lower temperatures and higher nuclease activity discovered in rice made it a promising tool for genome editing in other plant species. To this end, we tested LrCas9 for editing seven target sites in wheat (a polyploid monocot) (Fig. 3e), seven target sites in tomato (a dicot) (Fig. 3f), and six target sites in Dahurian larch (also known as larix gmelinii, a coniferous tree) (Fig. 3g). Using protoplast assays, we found LrCas9 displayed editing activity at all target sites, with an average mutation rate of ~5% across these three species (Fig. 3e–g). Such editing efficiencies are comparable to those previously reported for SpCas9 in wheat protoplasts51 and in Dahurian larch protoplasts52. Hence, LrCas9 appears to be a robust nuclease among many plant species.
Singular and multiplexed genome editing in stable rice plants
To test LrCas9 for genome editing in stably transgenic plants, we used six singular sgRNAs that target OsPDS, OsDEP1, OsBADH2, Os03g0568400, and Os03g0603100 in rice (Supplementary Table 1). All six target sites were edited in T0 lines, with editing efficiency ranging from 16.7% to 85% and biallelic editing efficiency ranging from 10% to 40% (Supplementary Table 1). The resulting mutations were predominantly 1 bp insertion and a few bp deletions (Supplementary Fig. 11), consistent with the editing profile in rice protoplasts (Fig. 2k). These data support LrCas9 as an efficient genome editing tool for generating edited plants.
We next developed a multiplexed CRISPR-LrCas9 system using the tRNA-based sgRNA processing system53. We first multiplexed two sgRNAs to simultaneously edit OsPDS and OsDEP1. Testing the construct in rice protoplasts showed high editing efficiency (>20%) at both genes (Supplementary Fig. 12). Twenty-three T0 lines were generated for analysis. For OsPDS, editing efficiency and biallelic editing efficiency were 30.4% and 13%, respectively. For OsDEP1, editing efficiency and biallelic editing efficiency were 52.2% and 47.8%, respectively (Table 1). Moreover, for all seven T0 lines that OsPDS was edited, OsDEP1 was also edited (Fig. 4a). Simultaneous editing could lead to chromosomal deletions. To test this, we multiplexed two sgRNAs (Os-TG01 and Os-TG02) for targeting OsPDS and generated 27 T0 lines. At Os-TG01 site, editing efficiency and biallelic editing efficiency were 85.2% and 81.5%, respectively. At the Os-TG02 site, editing efficiency and biallelic editing efficiency were 70.4% and 29.6%, respectively. For the lines that carry edits at Os-TG02, they all carry edits at Os-TG01 (Fig. 4b and Supplementary Fig. 13), which again suggests high-frequency simultaneous mutagenesis. Indeed, we were able to recover 4 T0 lines (14.8%) that contain large deletions due to simultaneous DSBs (Table 1). As expected, biallelic mutants of OsPDS showed an albino phenotype (Fig. 4c).
High-efficiency chromosomal deletion by LrCas9 further encouraged us to explore it for engineering quantitative traits via promoter editing as previously demonstrated in tomato54, maize55, and rice56. Recently, we developed a Cas12a promoter editing (CAPE) system that relies on target region prediction and efficient editing by Cas12a39. Since LrCas9 is a potent nuclease that is also very suitable for targeting A/T-rich promoters, we reasoned that LrCas9 would be a great tool for promoter editing. Previously, the promoter of OsWx (also known as OsGBSS1) has been edited for generating rice seeds with reduced amylose content. As a proof-of-concept, we designed a multiplexed LrCas9 construct for targeted deletion of ~2.1 kb in the OsWx promoter, which contains key regions A–F (Fig. 4d). Based on our prediction, deletion of this region could significantly reduce OsWx expression. Screen of T0 lines recovered multiple lines with such chromosomal deletions, and one line (no. 232-6) appears to be a homozygous deletion line (Fig. 4e and f). Transgene-free T1 homozygous deletion lines were obtained from this T0 line for further analysis. RNA expression of OsWx was reduced to ~40% of wild type (Fig. 4g). Accordingly, there was a 39.3% of reduction of amylose (Fig. 4h) and only an 8.1% reduction of total starch (Fig. 4i). Significant reduction of amylose content resulted in visible phenotypes of seeds before and after iodine staining (Fig. 4j). Under scanning electron microscope, the partially amylose-depleted mutant produced more small-sized starch granules (Fig. 4k and l), and some holes on the starch were also observed (Fig. 4k), which were characteristics of amylose-reduced rice grains57.
Comprehensive off-target assessment of LrCas9 in rice
Having demonstrated highly efficient genome editing by LrCas9 in rice, we sought to evaluate its potential off-target effects. First, we assessed LrCas9’s tolerance of mismatches in spacers. We systematically introduced 1-bp mismatches at every position of one chosen 20 bp spacer and assessed the ability of genome editing by these mutated spacers. Interestingly, we found 1-bp mismatches, including those at the 5’ end, largely reduced and destroyed LrCas9’s genome editing capability (Fig. 5a). Next, we use GUIDE-seq (Genome-wide, Unbiased Identification of DSBs Enabled by Sequencing), a highly sensitive tool to detect off-target sites58. We set up a rice protoplast-based GUIDE-seq system, where NGS was used to map and quantify the integration of double-strand oligonucleotide (dsODN) tags, known as GUIDE, at DSB sites (Fig. 5b). Our data showed that 10 pmol GUIDE concentration was sufficient to yield detectable (~3.5%) GUIDE insertion at the OsGG02 target site (Fig. 5c). Interestingly, we discovered asymmetric GUIDE insertion in this specific case (Fig. 5d). We tested off-target effects of four independent spacers with GUIDE-seq. In all cases, the vast majority of GUIDE insertions (87.8% and 98.8%) were found to be at the target sites (Fig. 5e and Supplementary Data 3). In addition, the captured off-target sites all contained six or more mismatches to the spacers and PAMs (Fig. 5e), making them unlikely to be true off-target sites9,36. To further confirm, we directly genotyped T0 lines edited at Os-AG04 and Os-TG02 sites. The top off-target sites nominated by GUIDE-seq and by CRISPR-GE59 were checked by Sanger sequencing. At all these sites tested, no off-target mutations were identified (Table 2, Supplementary Table 2, and Supplementary Fig. 14). Hence, these three independent assays collectively demonstrated that LrCas9 is a highly specific nuclease in genome editing.
Expanding base editing scope with LrCas9 base editors
Base editors such as CBEs and ABEs have greatly enriched plant genome editing toolbox41. Base editing scope, which dictates the usefulness of these tools, can be expanded by using Cas9 proteins that recognize different PAMs. Traditionally, this has been achieved in plants with engineered variants of SpCas9, and only occasionally with SaCas9 and its engineered variant41. We reasoned that LrCas9 could be a useful Cas protein to develop base editing tools in plants. To make a CBE, LrCas9-D13A nickase was fused with PmCDA160,61, as we recently showed that PmCDA1-UGI-based CBE had undetectable off-target effects in rice62. Two LrCas9 CBE expression systems were constructed for their applications in rice and wheat, respectively (Fig. 6a). Testing at eight target sites in rice protoplasts showed that up to 35% C-to-T conversion could be achieved (Fig. 6b), whereas very minimal levels of indel byproducts were found (Supplementary Fig. 15). In wheat protoplasts, up to 10% C-to-T editing was detected (Fig. 6c). In both plant species, this LrCas9 CBE favored editing at the 5’ end of the spacers (Fig. 6d and e), which is a characteristic of PmCDA160,62. Further testing in stable transgenic rice plants at two target sites showed 90.5% and 57.1% editing efficiency, respectively, with biallelic base editing events readily obtained (Fig. 6f and Table 3).
We also generated two versions of LrCas9 ABEs by fusing ecTadA-7.1063 and ecTadA-8e9,64 to the N-terminus of LrCas9 nickase (Fig. 6g). Testing at OsCG-01 site in OsDEP1 showed that LrCas9-based ABE8e based on ecTadA-8e (ABE V2.0), not the earlier ecTadA-7.10 (ABE V1.0), generated A-to-G conversion events in stable T0 plants (Fig. 6h and Table 3). Thus, efficient ABEs can be engineered when combining LrCas9-D13A and an efficient adenosine deaminase like ecTadA-8e.
CRISPR-LrCas9-based tools for highly efficient transcriptional repression and activation
The A-rich PAM requirement of CRISPR-LrCas9 also makes it an ideal system for developing transcriptional repression (CRISPRi) and activation (CRISPRa) tools. To engineer a CRISPRi system, we fused to LrCas9 (D13A, H858A), a deactivated LrCas9 (dLrCas9), with repressor domains KRAB65 and SRDX66, resulting in dLrCas9-KS (Fig. 7a). To test this CRISPRi system, we targeted three independent genes in rice with sgRNAs targeting a variety of positions, including close to or overlapping the transcription start site (TSS) (e.g., OsALT2-gR01 and OsGW7-gR01), downstream of the TSS (e.g., OsGW7-gR02), right upstream of the TSS (e.g., OsALT2-gR02; 273 bp from TSS), or far upstream of the TSS (e.g., OsWx-gR03; 1229 bp from TSS) (Fig. 7b). Strikingly, they all showed very potent transcriptional repression in rice protoplasts: the expression levels of all three target genes could be repressed to ~20% or lower (Fig. 7c). To our knowledge, this might represent the most potent CRISPRi system ever developed in plants.
To develop a CRISPRa system from LrCas9, we fused the potent transcriptional activator fusion, TV12, to the C-terminus of dLrCas9 (Fig. 7d). To test this CRISPRa system, we used six sgRNAs to target four independent genes (1–2 sgRNAs per gene) (Fig. 7e). In all cases, moderate transcriptional activation (2–3-fold) was observed in rice protoplasts (Fig. 7f). Similarly, we also found that when nuclease-active LrCas9-TV (Fig. 7g) was paired with short (14-bp) spacers, genome editing activity was abolished (Fig. 7h). We compared this nuclease-active CRISPRa system (LrCas9-TV) with the dLrCas9 based CRISPRa system (dLrCas9-TV) in rice protoplasts for targeted activation of OsmiR528 and OsWx. Remarkably, 19-fold and 150-fold activation was achieved with the nuclease-active LrCas9-TV system (Fig. 7i), representing 11x and 60x improvement over the dLrCas9-TV system. Hence, we developed a highly potent CRISPRa system based on nuclease-active LrCas9 and truncated spacers.
Discussion
CRISPR-Cas9 and Cas12a are the two main nuclease systems used in plant genome editing. Compared to Cas9, Cas12a lacks nickase versions and its crRNA restricts further modifications. For these reasons, CRISPR-Cas9 is a more versatile genome engineering system that goes far beyond targeted mutagenesis. Currently, base editors, prime editors, CRISPRi, and CRISPRa used in plants are predominantly based on CRISPR-Cas935,41. SpCas9 is the most widely used Cas9, partly due to its simple 5’-NGG-3’ PAM and high nuclease activity. While this PAM requirement is sufficient for targeted mutagenesis to knock out genes, it often falls short of directing precise modifications at well-defined sites by base editors and prime editors. Consequently, many SpCas9 variants, such as SpRY, Cas9-NG, and SpG, have been engineered to target alternative PAMs7,29,31,35. However, these SpCas9 variants appear to have reduced editing activity and hence are not as reliable as SpCas9. Limited number of Cas9 orthologs have been used for plant genome editing, such as SaCas9 (recognizing 5’-NNGRRT-3’ PAMs) and ScCas9 (recognizing a 5’-NNG’ PAM)35,41. Thus, it is desirable to develop more Cas9 orthologs for plant genome engineering with alternative PAM requirements.
Previously, nearly all these CRISPR systems were first developed for genome editing in human cells and later adopted for use in plants. However, genome editing in plants differs from that in mammalian cells in multiple aspects. First, many of the Cas proteins have optimal working temperatures higher than room temperature (like 25 °C). Hence, a Cas nuclease that works well in human cells may not always work in plants. For example, SpCas9 possesses much higher activity with high-temperature treatment regimes in plants67. In another example, AsCas12a works as efficiently as LbCas12a for genome editing in human cells68, yet it barely works in many plant species when genome editing experiments were carried out at lower temperatures35. Second, genome editing in humans has a great focus on the correction of single nucleotide polymorphisms (SNPs) related to diseases. In contrast, genome editing in plants is often presented as a powerful breeding tool for crop improvement. One promising application for crop improvement is to introduce quantitative traits either through Cas954 or Cas12a39. Considering the unique needs of plant genome engineering, it will be useful to develop a specific CRISPR-Cas9 system that can target A/T-rich promoter regions and can work well at a relatively low temperature. Third, SpCas9, despite its great success, is sourced from a human pathogen. It may taint public perception when SpCas9-engineered foods enter global markets.
It is due to all these aspects and considerations that we strive to conduct a de novo discovery approach to identify an effective CRISPR-Cas9 system that can enrich the plant genome engineering toolbox. Our bioinformatic and empirical analyses identified Cas from Lactobacillus rhamnosus GG (LrCas9), which recognizes a unique 5’-NGAAA-3’ PAM that is distinct from all known Cas9 proteins used in genome editing. We showed that LrCas9 is relatively tolerant to lower temperatures and confers efficient genome editing in a variety of plant species, including rice, wheat, tomato, and larix. Remarkably, when targeting the same sequences, we found that LrCas9 conferred higher editing efficiency than LbCas12a, SpCas9-NG, and SpRY, all of which are routinely used in genome editing.
Given the relatively complex PAM, one may wonder about the usefulness of LrCas9 as a genome editing tool. We conducted an in-silico analysis to compare LrCas9 and SpCas9 on targetability in six major crops (rice, maize, tomato, soybean, tobacco, and wheat). We found that LrCas9 can target as many genes as SpCas9, and it was when the number of target sites per gene increased, that we started to see differentiation among these two Cas9 systems (Supplementary Fig. 16). Since the targetable sites by LrCas9 and SpCas9 barely overlap, it is beneficial to have LrCas9 as a useful addition to the CRISPR-Cas9 toolbox. It is essential to enrich the base editing toolbox with base editors that can recognize alternative PAMs. In this regard, we successfully demonstrated LrCas9-derived CBE and ABE in rice and wheat. Prime editing69 is useful for molecular breeding in plants. We made a LrCas9 prime editor 2 (PE2) by linking LrCas9 nickase (H858A) with an M-MLV reverse transcriptase and using an epegRNA scaffold70 (Supplementary Fig. 17a). We successfully detected the desired C to A conversion with 0.1% editing efficiency in rice protoplasts (Supplementary Fig. 17b and c). The low prime editing efficiency could be due to the relatively long LrCas9’s sgRNA scaffold of 139nt, as compared to SpCas9’s sgRNA2.1 scaffold of 92nt. The longer scaffold may form a more complex secondary structure that may strongly influence the prime editing efficiency71. Further work to enhance the LrCas9-based prime editor is warranted.
LrCas9 recognizes an A/T-rich PAM, making it an alternative system for targeting promoter sequences in plants. Indeed, after developing a multiplexed CRISPR-LrCas9 system, we used it to create a large chromosomal deletion in the promoter of OsWx, which significantly reduced the OsWx expression and generated rice grains with less amylose content. We further explored this CRISPR-LrCas9 platform to develop efficient CRISPRi and CRISPRa systems for efficient transcriptional repression and activation, respectively. We found CRISPRi based on dLrCas9-KS is highly efficient, generating ~80% or more reduction in transcripts for target genes based on assays in rice protoplasts. Interestingly, our CRISPRa system based on nuclease-active LrCas9-TV and truncated spacers of 14 bp showed much higher transcriptional activation potency than the dLrCas9-TV system. This puts LrCas9 in a great position to be further developed into a CRISPR-Cas-based regulation system that would enable simultaneous genome editing and gene activation7. Interestingly, we also found that LrCas9 displayed variable genome editing efficacies at non-canonical and shorter 5’-NGAA-3’ PAMs in rice cells (Supplementary Fig. 18). Hence, as with SpCas9, we believe LrCas9 can be further improved with RNA engineering and protein engineering to recognize more diverse PAMs for promoter engineering and gene regulation.
Since CRISPR-LrCas9 is a currently developed genome engineering system, it is critical to investigate its potential off-target effects9,36. We employed three independent approaches to assess this. First, we introduced mismatched 1-bp mutations at every position of the 20-bp spacer. To our surprise, LrCas9 is less tolerant to 1-bp mutations, and this result is even better than the high-fidelity xCas9 evaluated in a similar assay29. This high specificity was further validated with the unbiased GUIDE-seq analysis, where not a single convincing off-target site was identified for all four independent spacers. Finally, we evaluated LrCas9-edited rice plants and could not find any off-target mutations on potential off-target sites nominated by either GUIDE-seq or CRISPR-GE. Hence, CRISPR-LrCas9 is a highly precise genome engineering system. Its unique PAM and spacer’s less tolerance to mismatch mutations could collectively contribute to its high targeting specificity.
The widely used SpCas9 is sourced from Streptococcus pyogenes1, which is a major human pathogen. Indeed, humans have developed adaptive immunity against SpCas9 protein72. By contrast, Lactobacillus rhamnosus GG, where our LrCas9 is sourced from, is a common probiotic that is widely consumed by humans due to its health benefits73. CRISPR-Cas systems hold great promise to boost crop breeding and enhance nutrition in our food. This bright future however will not be easily realized without the de-regulation of these genome engineering tools and the resulting products. We hope the use of this CRISPR-LrCas9 genome engineering system from a beneficial probiotic bacterial strain may help promote a favorable public perception of genome-edited crops.
Methods
Data mining
The Hidden Markov Models (HMMs)74 were used to analyze the CRISPR-Cas sequences. The models were trained and constructed using the seed sequences by TIGRFAMs75 (total number of 101) and Pfam76 (total number of 38) database of CRISPR-Cas for known Cas proteins. By using HMMs with the default parameters, the proteomes of each species obtained from GenBank (https://ftp.ncbi.nih.gov/genomes/genbank/bacteria and https://ftp.ncbi.nih.gov/genomes/genbank/archaea/) were analyzed, which resulted in the identification of 257,745 putative Cas proteins, and 30,495 CRISPR clusters. Further, CRISPRFinder77 was used for a genome-wide screen to identify potential CRISPR arrays adjacent to these clusters78. The PAM features were obtained by searching the anti-spacers in virus/phage genomes using BLAST tools for each putative CRISPR-Cas system and the PAM sequences were visualized using the WebLogo tools45. The species were prioritized with a high degree of certainty regarding the PAM sequences and very similar species were excluded from our analysis. After applying these criteria, 42 putative CRISPR-Cas systems were selected as candidates. Jalview79 was used to analyze the CRISPR RNA from 42 putative CRISPR-Cas systems and the feature sequence was drawn by WebLogo. The resulting phylogenetic tree for the 42 Cas9 proteins was displayed and annotated using the Interactive Tree Of Life (iTOL) tool80. The modeling of five Cas9 protein structures (CAL43592.1, BAI42646.1, OFH73969.1, EPW83356.1, AAK33936.1) was performed with SWISS-MODEL46.
Vector construction
The vectors were constructed based on the backbone of pGEL062 (Addgene#124894). The DNA sequences of chosen Cas9 genes were rice codon optimized and synthesized. The dCas9-TV12 was requested from Jianfeng Li’s lab. All the DNA fragments were constructed into linearized pGEL062 backbone by Gibson Assembly. For T-DNA vector construction, the spacer oligos were synthesized by Sangon Biotech (Shanghai, China) and annealed for standard Golden Gate reaction81 followed by DH5a E.coli transformation. The fragment for multiplex editing, flanked with BsaI restriction enzyme sites, was synthesized, and the final T-DNA vectors were constructed with Golden Gate reaction82. For the editing activity test in rice protoplasts of four Cas9 candidates, the crRNAs and tracrRNAs were driven by an OsU6 promoter and further processed by tRNA, while the Cas9 genes were driven by the Ubiquitin promoter of Zea mays83. For PAM depletion assay in E. coli, the crRNAs and tracrRNAs were driven by the J23119 promoter separately, and the LrCas9 genes were driven by the J23100 promoter. All the vectors were confirmed by Sanger sequencing.
PAM depletion assay
The LrCas9 expression vector and empty control vector (no LrCas9) were first transformed into E. coli DH5a to make the competent cell with Kanamycin selection. Then, 10 ng target plasmid that contained the crRNA and tracrRNA expression cassette and the target sequence were transformed into the competent cell by the heat shock method. After 1 h incubation at 37 °C, the mixture was plated on the Kanamycin and Ampicillin double selection LB solid medium. The plates were kept at 37 °C. After 16 h of growth, the colony numbers were counted by ImageJ (https://imagej.net/) software.
Protoplast transformation
In this study, we utilized various plant species, including the rice variety Nipponbare, wheat variety Chinese Spring, tomato variety Alisa Craig, and Larix species. The protocols for protoplast transformation were in accordance with previously published methods31,84. For rice and wheat, the process involved germinating sterilized seeds on 1/2 MS solid medium for 10 days in a dark chamber at 28 °C. Subsequently, healthy leaves were finely sliced into strips measuring 0.5–1.0 mm, followed by immersion in an enzyme solution and vacuum infiltration for 30 min. In the case of tomatoes, seeds were cultivated in the soil at 26 °C under an 8-h dark and 16-h light cycle for a duration of 30 days. Healthy leaves were then meticulously cut into strips of 0.5–1.0 mm, immersed in an enzyme solution, and incubated at 40 rpm for 8 h at 28 °C in darkness. To facilitate Larix protoplast transformation, Larix seeds were subjected to 14 days of darkness at 26 °C to induce callus formation. Larix callus tissues were subsequently transferred into the enzyme solution and hydrolyzed for 6 h. The resulting enzyme/protoplast solution was filtered through a 70 μm nylon mesh. During the protoplast transformation process, 30 μg of plasmid DNA in 30 μL was employed to transform 200 μL of protoplasts (2 × 105 protoplasts). These were gently mixed with 230 μL of a 40% PEG–CaCl2 transformation buffer. After incubation for 30 min in darkness, the reactions were quenched by adding 900 μL of W5 washing buffer. The protoplasts were subsequently centrifuged and transferred into a 12-well culture plate, followed by incubation at 32 or 28 °C in darkness for a period of 2 days. The protoplast transformation efficiencies were ~90% for rice and tomato, and ~40% for wheat and larix. After 48 h post-transformation, the protoplasts were harvested for DNA extraction by the CTAB method85.
Rice stable transformation
The stable transformation of rice was conducted following previously published protocols86,87. In brief, dehulled rice seeds were subjected to sterilization and subsequently cultured on an N6D solid medium. Rice calli, which had been pre-cultured, were transformed through inoculation with Agrobacterium EHA105 carrying the recombinant vector. Following a co-incubation period of 3 days between the rice calli and Agrobacterium, the calli were rinsed with sterilized water and transferred to N6-S medium for a 2-week selection. The newly grown calli were then transferred to RE-III medium for a 2-week cultivation. Resistant calli were subsequently moved to fresh RE-III medium every 2 weeks until regenerated plants were successfully obtained.
RNA sequencing
The CRISPR array and tracrRNA region of Lactobacillus rhamnosus GG was PCR amplified by ApexHF HS DNA Polymerase (Accurate Biology, China) and TA-cloned into pEASY-T1 cloning kit (TransGen, China), following the manufacturer’s instruction. The colonies confirmed by Sanger sequencing were cultured in the flask for 16 h. The cultured bacteria were collected by centrifuge and remove the supernatant. After fast freezing by liquid nitrogen, the samples were sent to Novogene (China, Tianjin) for RNA sequencing using the Novaseq6000 platform. The histogram was exported by the Integrative Genomics Viewer (IGV)88 software.
Mutagenesis analysis
The mutation frequency in protoplasts was analyzed by amplicon deep-sequencing61. The amplicons of the editing regions were amplified by ApexHF HS DNA Polymerase (Accurate Biology, China), and the barcodes were added through PCR primers. Amplicons were sent to Novogene (China, Tianjin) for deep-sequencing by the Novaseq6000 platform which produced 150 bp paired-end reads. The editing frequency and profile were analyzed by CRISPRMatch software89. The mutation frequencies in wheat and larix were normalized to the protoplast transformation efficiencies in both species. For stable rice T0 lines, the editing outcomes were identified by single-strand conformation polymorphism assay82 and direct PCR product Sanger sequencing provided by Sangon Biotech (Shanghai, China) and analyzed by CRISPR-GE DSDecodeM software59.
GUIDE-seq
The GUIDE-seq experiment was conducted by following the previous research protocol58 with slight modifications. Briefly, the different concentration GUIDE and 30 ug LrCas9 plasmids were co-transformed into rice protoplasts. The DNA was extracted after 48 h incubation at 32 °C. The libraries were constructed by using TruePrep DNA Library Prep Kit (Vazyme, China) following the manufacturer’s instructions. Constructed libraries were sent to Novogene (China, Tianjin) for deep sequencing. The GUIDE-seq results were obtained by analyzing the raw sequencing reads using Python scripts from Tsai’s Lab (https://github.com/tsailabSJ/guideseq).
Real-time quantitative PCR
The total RNA was extracted by using the SteadyPure plant RNA isolation kit (Accurate Biology, China), and the first strand cDNA was synthesized by using the HiScript III 1st Strand cDNA Synthesis Kit (Vazyme, China). The real-time qPCR was performed using ChamQ Universal SYBR qPCR Master Mix (Vazyme, China), and the relative expression levels were calculated by the 2−ΔΔCT method.
Data analysis
The data were analyzed using GraphPad Prism 8.0 software, and the figures were further processed using Adobe Photoshop and Adobe Illustrator software.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
The vectors are available from Addgene, with the Addgene numbers listed in Supplementary Fig 19. The high-throughput sequencing data sets generated from this study are available at NCBI Sequence Read Archive under Bioproject PRJNA1017971 or China National GeneBank DataBase under CGNB Project CNP0004270. Source data are provided with this paper.
References
Jinek, M. et al. A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity. Science 337, 816–821 (2012).
Cong, L. et al. Multiplex genome engineering using CRISPR/Cas systems. Science 339, 819–823 (2013).
Li, J. F. et al. Multiplex and homologous recombination-mediated genome editing in Arabidopsis and Nicotiana benthamiana using guide RNA and Cas9. Nat. Biotechnol. 31, 688–691 (2013).
Li, T. et al. Domestication of wild tomato is accelerated by genome editing. Nat. Biotechnol. 36, 1160–1163 (2018).
Hickey, L. T. et al. Breeding crops to feed 10 billion. Nat. Biotechnol. 37, 744–754 (2019).
Yu, H. et al. A route to de novo domestication of wild allotetraploid rice. Cell 184, 1156–1170 e1114 (2021).
Tang, X. & Zhang, Y. Beyond knockouts: fine-tuning regulation of gene expression in plants with CRISPR-Cas-based promoter editing. N. Phytol. 239, 868–874 (2023).
Zong, Y. et al. Precise base editing in rice, wheat and maize with a Cas9-cytidine deaminase fusion. Nat. Biotechnol. 35, 438–440 (2017).
Wu, Y. et al. Genome-wide analyses of PAM-relaxed Cas9 genome editors reveal substantial off-target effects by ABE8e in rice. Plant Biotechnol. J. 20, 1670–1682 (2022).
Lin, Q. et al. Prime genome editing in rice and wheat. Nat. Biotechnol. 38, 582–585 (2020).
Tang, X. et al. Plant prime editors enable precise gene editing in rice cells. Mol. Plant 13, 667–670 (2020).
Li, Z. et al. A potent Cas9-derived gene activator for plant and mammalian cells. Nat. Plants 3, 930–936 (2017).
Lowder, L. G. et al. Robust transcriptional activation in plants using multiplexed CRISPR-Act2.0 and mTALE-Act systems. Mol. Plant 11, 245–256 (2018).
Tang, X. et al. A CRISPR-Cpf1 system for efficient genome editing and transcriptional repression in plants. Nat. Plants 3, 17018 (2017).
Gallego-Bartolome, J. et al. Targeted DNA demethylation of the Arabidopsis genome using the human TET1 catalytic domain. Proc. Natl Acad. Sci. USA 115, E2125–E2134 (2018).
Liu, S. et al. Hypercompact CRISPR-Cas12j2 (CasPhi) enables genome editing, gene activation, and epigenome editing in plants. Plant Commun. 3, 100453 (2022).
Ran, F. A. et al. In vivo genome editing using Staphylococcus aureus Cas9. Nature 520, 186–191 (2015).
Kleinstiver, B. P. et al. Engineered CRISPR-Cas9 nucleases with altered PAM specificities. Nature 523, 481–485 (2015).
Hou, Z. et al. Efficient genome engineering in human pluripotent stem cells using Cas9 from Neisseria meningitidis. Proc. Natl Acad. Sci. USA 110, 15644–15649 (2013).
Edraki, A. et al. A compact, high-accuracy Cas9 with a dinucleotide PAM for in vivo genome editing. Mol. Cell 73, 714–726 e714 (2019).
Kim, E. et al. In vivo genome editing with a small Cas9 orthologue derived from Campylobacter jejuni. Nat. Commun. 8, 14500 (2017).
Gao, N. et al. Characterization of Brevibacillus laterosporus Cas9 (BlatCas9) for Mammalian Genome Editing. Front. Cell Dev. Biol. 8, 583164 (2020).
Chatterjee, P., Jakimo, N. & Jacobson, J. M. Minimal PAM specificity of a highly similar SpCas9 ortholog. Sci. Adv. 4, eaau0766 (2018).
Cui, Z. et al. FrCas9 is a CRISPR/Cas9 system with high editing efficiency and fidelity. Nat. Commun. 13, 1425 (2022).
Wang, M. et al. Targeted base editing in rice with CRISPR/ScCas9 system. Plant Biotechnol. J. 18, 1645–1647 (2020).
Hu, J. H. et al. Evolved Cas9 variants with broad PAM compatibility and high DNA specificity. Nature 556, 57–63 (2018).
Nishimasu, H. et al. Engineered CRISPR-Cas9 nuclease with expanded targeting space. Science 361, 1259–1262 (2018).
Walton, R. T., Christie, K. A., Whittaker, M. N. & Kleinstiver, B. P. Unconstrained genome targeting with near-PAMless engineered CRISPR-Cas9 variants. Science 368, 290–296 (2020).
Zhong, Z. et al. Improving plant genome editing with high-fidelity xCas9 and non-canonical PAM-targeting Cas9-NG. Mol. Plant 12, 1027–1036 (2019).
Wang, J. et al. xCas9 expands the scope of genome editing with reduced efficiency in rice. Plant Biotechnol. J. 17, 709–711 (2019).
Ren, Q. et al. PAM-less plant genome editing using a CRISPR-SpRY toolbox. Nat. Plants 7, 25–33 (2021).
Ren, B. et al. Cas9-NG greatly expands the targeting scope of the genome-editing toolkit by recognizing NG and other atypical PAMs in rice. Mol. Plant 12, 1015–1026 (2019).
Qin, R. et al. SpCas9-NG self-targets the sgRNA sequence in plant genome editing. Nat. Plants 6, 197–201 (2020).
Tang, X. et al. A CRISPR-Cpf1 system for efficient genome editing and transcriptional repression in plants. Nat. Plants 3, 17103 (2017).
Zhang, Y., Malzahn, A. A., Sretenovic, S. & Qi, Y. The emerging and uncultivated potential of CRISPR technology in plant science. Nat. Plants 5, 778–794 (2019).
Tang, X. et al. A large-scale whole-genome sequencing analysis reveals highly specific genome editing by both Cas9 and Cpf1 (Cas12a) nucleases in rice. Genome Biol. 19, 84 (2018).
Tang, X. et al. Single transcript unit CRISPR 2.0 systems for robust Cas9 and Cas12a mediated plant genome editing. Plant Biotechnol. J. 17, 1431–1445 (2019).
Zhang, Y. et al. Expanding the scope of plant genome engineering with Cas12a orthologs and highly multiplexable editing systems. Nat. Commun. 12, 1944 (2021).
Zhou, J. et al. An efficient CRISPR-Cas12a promoter editing system for crop improvement. Nat. Plants 9, 588–604 (2023).
Malzahn, A. A. et al. Application of CRISPR-Cas12a temperature sensitivity for improved genome editing in rice, maize, and Arabidopsis. BMC Biol. 17, 9 (2019).
Molla, K. A., Sretenovic, S., Bansal, K. C. & Qi, Y. Precise plant genome editing using base editors and prime editors. Nat. Plants 7, 1166–1187 (2021).
Gaillochet, C. et al. Systematic optimization of Cas12a base editors in wheat and maize using the ITER platform. Genome Biol. 24, 6 (2023).
Grissa, I., Vergnaud, G. & Pourcel, C. CRISPRFinder: a web tool to identify clustered regularly interspaced short palindromic repeats. Nucleic Acids Res. 35, W52–W57 (2007).
Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. Basic local alignment search tool. J. Mol. Biol. 215, 403–410 (1990).
Crooks, G. E., Hon, G., Chandonia, J. M. & Brenner, S. E. WebLogo: a sequence logo generator. Genome Res. 14, 1188–1190 (2004).
Waterhouse, A. et al. SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Res. 46, W296–W303 (2018).
Merker, L., Schindele, P., Huang, T. K., Wolter, F. & Puchta, H. Enhancing in planta gene targeting efficiencies in Arabidopsis using temperature-tolerant CRISPR/LbCas12a. Plant Biotechnol. J. 18, 2382–2384 (2020).
Zhang, L. et al. Boosting genome editing efficiency in human cells and plants with novel LbCas12a variants. Genome Biol. 24, 102 (2023).
Zhou, J. et al. Efficient deletion of multiple circle RNA loci by CRISPR-Cas9 reveals Os06circ02797 as a putative sponge for OsMIR408 in rice. Plant Biotechnol. J. 19, 1240–1252 (2021).
Zhou, J. et al. CRISPR-Cas9 based genome editing reveals new insights into MicroRNA function and regulation in rice. Front. Plant Sci. 8, 1598 (2017).
Kumar, R., Kaur, A., Pandey, A., Mamrutha, H. M. & Singh, G. P. CRISPR-based genome editing in wheat: a comprehensive review and future prospects. Mol. Biol. Rep. 46, 3557–3569 (2019).
Huang, L. Construction and Activity Evaluation of STU-Cas9 Genome Editing System for Larch (University of Electronic Science and Technology of China,2021).
Xie, K., Minkenberg, B. & Yang, Y. Boosting CRISPR/Cas9 multiplex editing capability with the endogenous tRNA-processing system. Proc. Natl Acad. Sci. USA 112, 3570–3575 (2015).
Rodriguez-Leal, D., Lemmon, Z. H., Man, J., Bartlett, M. E. & Lippman, Z. B. Engineering quantitative trait variation for crop improvement by genome editing. Cell 171, 470–480 e478 (2017).
Liu, L. et al. Enhancing grain-yield-related traits by CRISPR-Cas9 promoter editing of maize CLE genes. Nat. Plants 7, 287–294 (2021).
Song, X. et al. Targeting a gene regulatory element enhances rice grain yield by decoupling panicle number and size. Nat. Biotechnol. 40, 1403–1411 (2022).
Zhang, C. et al. Molecular structure and physicochemical properties of starches from rice with different amylose contents resulting from modification of OsGBSSI activity. J. Agric. Food Chem. 65, 2222–2232 (2017).
Tsai, S. Q. et al. GUIDE-seq enables genome-wide profiling of off-target cleavage by CRISPR-Cas nucleases. Nat. Biotechnol. 33, 187–197 (2015).
Xie, X. et al. CRISPR-GE: a convenient software toolkit for CRISPR-based genome editing. Mol. Plant 10, 1246–1249 (2017).
Nishida, K. et al. Targeted nucleotide editing using hybrid prokaryotic and vertebrate adaptive immune systems. Science 353, aaf8729–aaf8729 (2016).
Wu, Y. et al. CRISPR-BETS: a base-editing design tool for generating stop codons. Plant Biotechnol. J. 20, 499–510 (2022).
Ren, Q. et al. Improved plant cytosine base editors with high editing activity, purity, and specificity. Plant Biotechnol. J. 19, 2052–2068 (2021).
Gaudelli, N. M. et al. Programmable base editing of A*T to G*C in genomic DNA without DNA cleavage. Nature 551, 464–471 (2017).
Richter, M. F. et al. Phage-assisted evolution of an adenine base editor with improved Cas domain compatibility and activity. Nat. Biotechnol. 38, 883–891 (2020).
O’Geen, H. et al. dCas9-based epigenome editing suggests acquisition of histone methylation is not sufficient for target gene repression. Nucleic Acids Res. 45, 9901–9916 (2017).
Matsui, K. & Ohme-Takagi, M. Detection of protein-protein interactions in plants using the transrepressive activity of the EAR motif repression domain. Plant J. 61, 570–578 (2010).
LeBlanc, C. et al. Increased efficiency of targeted mutagenesis by CRISPR/Cas9 in plants using heat stress. Plant J. 93, 377–386 (2018).
Zetsche, B. et al. Cpf1 is a single RNA-guided endonuclease of a class 2 CRISPR-Cas system. Cell 163, 759–771 (2015).
Anzalone, A. V. et al. Search-and-replace genome editing without double-strand breaks or donor DNA. Nature 576, 149–157 (2019).
Nelson, J. W. et al. Engineered pegRNAs improve prime editing efficiency. Nat. Biotechnol. 40, 402–410 (2022).
Ponnienselvan, K. et al. Reducing the inherent auto-inhibitory interaction within the pegRNA enhances prime editing efficiency. Nucleic Acids Res. 51, 6966–6980 (2023).
Charlesworth, C. T. et al. Identification of preexisting adaptive immunity to Cas9 proteins in humans. Nat. Med. 25, 249–254 (2019).
Capurso, L. Thirty years of Lactobacillus rhamnosus GG: a review. J. Clin. Gastroenterol. 53(Suppl. 1), S1–S41 (2019).
Finn, R. D., Clements, J. & Eddy, S. R. HMMER web server: interactive sequence similarity searching. Nucleic Acids Res. 39, W29–W37 (2011).
Haft, D. H. et al. TIGRFAMs and genome properties in 2013. Nucleic Acids Res. 41, D387–D395 (2013).
Mistry, J. et al. Pfam: The protein families database in 2021. Nucleic Acids Res. 49, D412–D419 (2021).
Couvin, D. et al. CRISPRCasFinder, an update of CRISRFinder, includes a portable version, enhanced performance and integrates search for Cas proteins. Nucleic Acids Res. 46, W246–W251 (2018).
Tang, Z. et al. CasPDB: an integrated and annotated database for Cas proteins from bacteria and archaea. Database. 2019, baz093 (2019).
Waterhouse, A. M., Procter, J. B., Martin, D. M., Clamp, M. & Barton, G. J. Jalview Version 2-a multiple sequence alignment editor and analysis workbench. Bioinformatics 25, 1189–1191 (2009).
Letunic, I. & Bork, P. Interactive Tree Of Life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 49, W293–W296 (2021).
Engler, C., Gruetzner, R., Kandzia, R. & Marillonnet, S. Golden gate shuffling: a one-pot DNA shuffling method based on type IIs restriction enzymes. PLoS ONE 4, e5553 (2009).
Zheng, X. et al. Effective screen of CRISPR/Cas9-induced mutants in rice by single-strand conformation polymorphism. Plant Cell Rep. 35, 1545–1554 (2016).
Zhou, J. et al. CRISPR-Cas9 mediated OsMIR168a knockout reveals its pleiotropy in rice. Plant Biotechnol. J. 20, 310–322 (2022).
Zhang, Y. et al. Transcription activator-like effector nucleases enable efficient plant genome engineering. Plant Physiol. 161, 20–27 (2013).
Murray, M. G. & Thompson, W. F. Rapid isolation of high molecular weight plant DNA. Nucleic Acids Res 8, 4321–4325 (1980).
Hiei, Y., Ohta, S., Komari, T. & Kumashiro, T. Efficient transformation of rice (Oryza sativa L.) mediated by Agrobacterium and sequence analysis of the boundaries of the T-DNA. Plant J. 6, 271–282 (1994).
Zhou, J. et al. Multiplex QTL editing of grain-related genes improves yield in elite rice varieties. Plant Cell Rep. 38, 475–485 (2019).
Robinson, J. T. et al. Integrative genomics viewer. Nat. Biotechnol. 29, 24–26 (2011).
You, Q. et al. CRISPRMatch: an automatic calculation and visualization tool for high-throughput CRISPR genome-editing data analysis. Int. J. Biol. Sci. 14, 858–862 (2018).
Acknowledgements
This research was supported by the National Natural Science Foundation of China (award nos. 32270433, 32072045, and 31960423) to Y.Z. and X.Z., the China Postdoctoral Science Foundation (award no. 2022M720663) and the Sichuan Postdoctoral Special Foundation (award no. TB2022015) to Z.Z., the Technology Innovation and Application Development Program of Chongqing (award no. CSTC2021JSCX-CYLHX0001) to Y.Z., the Incubation Program for Innovative Science and Technology in UESTC (award no. Y03023206100209) to J.H., the Open Foundation of Jiangsu Key Laboratory of Crop Genetics and Physiology (award no. YCSL202009) to Y.Z and T.Z. It is also supported by the NSF Plant Genome Research Program (IOS-2029889 and IOS-2132693) and USDA-NIFA (2021-67013-34554) to Y.Q.
Author information
Authors and Affiliations
Contributions
Y.Z. proposed the project and designed the experiments. Z.T., Z.Z., J.H. and Y.Z. predicted and analyzed the CRISPR loci. Z.Z. and S.X. constructed all the plasmids. Z.Z. and Y.H. performed the rice and wheat protoplast transformation. L.Y. and Z.Z. performed tomato protoplast transformation. L.H. performed the larix protoplast transformation. G.L. and T.Z. analyzed the mutation frequencies in protoplasts, the GUIDE-seq results, and the targeting scope in crop genomes. Z.Z., S.X., Y.H., S.L., T.F., X.Z. performed the rice stable transformation. Z.Z. and S.X. analyzed the genotypes of rice T0 lines. Y.Z., Y.Q., Z.Z. and J.H. analyzed the data and wrote the manuscript with input from other authors. All authors read and approved the final version of the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Source data
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Zhong, Z., Liu, G., Tang, Z. et al. Efficient plant genome engineering using a probiotic sourced CRISPR-Cas9 system. Nat Commun 14, 6102 (2023). https://doi.org/10.1038/s41467-023-41802-9
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-023-41802-9
This article is cited by
-
High performance TadA-8e derived cytosine and dual base editors with undetectable off-target effects in plants
Nature Communications (2024)
-
Developing a CRISPR/FrCas9 system for core promoter editing in rice
aBIOTECH (2024)
-
Integrating machine learning and genome editing for crop improvement
aBIOTECH (2024)
-
Versatile plant genome engineering using anti-CRISPR-Cas12a systems
Science China Life Sciences (2024)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.