Complete biosynthesis of QS-21 in engineered yeast

Liu, Yuzhong; Zhao, Xixi; Gan, Fei; Chen, Xiaoyue; Deng, Kai; Crowe, Samantha A.; Hudson, Graham A.; Belcher, Michael S.; Schmidt, Matthias; Astolfi, Maria C. T.; Kosina, Suzanne M.; Pang, Bo; Shao, Minglong; Yin, Jing; Sirirungruang, Sasilada; Iavarone, Anthony T.; Reed, James; Martin, Laetitia B. B.; El-Demerdash, Amr; Kikuchi, Shingo; Misra, Rajesh Chandra; Liang, Xiaomeng; Cronce, Michael J.; Chen, Xiulai; Zhan, Chunjun; Kakumanu, Ramu; Baidoo, Edward E. K.; Chen, Yan; Petzold, Christopher J.; Northen, Trent R.; Osbourn, Anne; Scheller, Henrik; Keasling, Jay D.

doi:10.1038/s41586-024-07345-9

Download PDF

Article
Open access
Published: 08 May 2024

Complete biosynthesis of QS-21 in engineered yeast

Nature (2024)Cite this article

11k Accesses
152 Altmetric
Metrics details

Subjects

Abstract

QS-21 is a potent vaccine adjuvant and remains the only saponin-based adjuvant that has been clinically approved for use in humans^1,2. However, owing to the complex structure of QS-21, its availability is limited. Today, the supply depends on laborious extraction from the Chilean soapbark tree or on low-yielding total chemical synthesis^3,4. Here we demonstrate the complete biosynthesis of QS-21 and its precursors, as well as structural derivatives, in engineered yeast strains. The successful biosynthesis in yeast requires fine-tuning of the host’s native pathway fluxes, as well as the functional and balanced expression of 38 heterologous enzymes. The required biosynthetic pathway spans seven enzyme families—a terpene synthase, P450s, nucleotide sugar synthases, glycosyltransferases, a coenzyme A ligase, acyl transferases and polyketide synthases—from six organisms, and mimics in yeast the subcellular compartmentalization of plants from the endoplasmic reticulum membrane to the cytosol. Finally, by taking advantage of the promiscuity of certain pathway enzymes, we produced structural analogues of QS-21 using this biosynthetic platform. This microbial production scheme will allow for the future establishment of a structure–activity relationship, and will thus enable the rational design of potent vaccine adjuvants.

Complete biosynthesis of the potent vaccine adjuvant QS-21

Article Open access 26 January 2024

Synthetic Biology-derived triterpenes as efficacious immunomodulating adjuvants

Article Open access 13 October 2020

Deciphering triterpenoid saponin biosynthesis by leveraging transcriptome response to methyl jasmonate elicitation in Saponaria vaccaria

Article Open access 04 November 2023

Main

Adjuvants increase the efficacy of vaccines by stimulating or augmenting the human immune response to pathogens or disease-specific antigens⁵. Ever since its discovery in the 1920s, alum (aluminium hydroxide) has been the most widely used, clinically approved vaccine adjuvant⁶. More recently, QS-21 has been shown to exhibit potent immunoactivity^3,4,7, and it is an active ingredient in the Adjuvant System AS01 and Matrix M (refs. ^1,2). These formulations have been approved for GSK’s malaria (Mosquirix) and shingles (Shingrix) vaccines, as well as for Novavax’s COVID-19 vaccines. Motivated by the potent immune response and favourable safety profiles, researchers have since tested QS-21 in more than 120 clinical trials.

Despite major commercial interest, the availability of QS-21 remains limited, owing mainly to its structural complexity⁸. QS-21 consists of four distinct structural domains: (i) a lipophilic triterpenoid core, quillaic acid (QA, Fig. 1), flanked by (ii) a branched trisaccharide moiety on the C3 position, (iii) a linear tetrasaccharide chain on the C28 position and (iv) an unusual pseudodimeric acyl chain capped by an arabinofuranose (Fig. 2). Owing to the structural similarity of two isoforms, QS-21 exists as a heterogeneous mixture of QS-21-Api and QS-21-Xyl in a 65:35 ratio, with the sole difference being the C28 terminal sugar. Traditionally, QS-21 is extracted from the tree bark of the soapbark tree Quillaja saponaria, which is native to Chile. Isolation is complicated because the plant extract contains a multitude of different structurally related Quillaja saponins, rendering the purification process highly laborious and low yielding³. Using an intermediate saponin as the starting material, the total chemical synthesis of both the Xyl and the Api forms of QS-21 has been achieved^4,5. However, the synthetic route requires 76 steps, and the overall yield is negligible. Thus, developing alternative production processes that are more sustainable and scalable would help to meet the ever-increasing demand for potent vaccine adjuvants, and to address existing or emerging medical needs.

**Fig. 1: Complete biosynthetic pathway for the de novo production of QS-21 in yeast from simple sugars.**

**Fig. 2: Functionalization of QA (6) to yield QS-21-Xyl and QS-21-Api.**

The genes and enzymes for the QS-21 biosynthetic pathway have only been characterized from Q. saponaria recently^9,10. Here we report the complete biosynthesis of QS-21-Api and QS-21-Xyl, as well as their structural derivatives in Saccharomyces cerevisiae, starting from only simple sugars (glucose and galactose). To accomplish this, we first upregulated the yeast native mevalonate pathway to provide a high carbon flux towards 2,3-oxidosqualene, which is then cyclized by a heterologous β-amyrin synthase and site-selectively oxidized by plant cytochrome P450s to yield QA, the aglycone of QS-21. We further introduced plant nucleotide sugar synthetic pathways to make seven non-native uridine diphosphate sugars (UDP-sugars), which are used to add sugars onto the C3 hydroxy and C28 carboxy functional groups of QA through the co-expression of QS-21 pathway glycosyltransferases (GTs)⁹. Furthermore, an engineered type I polyketide synthase (PKS), two type III PKSs and two stand-alone ketoreductases (KRs) were expressed in yeast to form the dimeric acyl unit that constitutes the last step before the terminal arabinofuranose addition to yield QS-21 (ref. ¹⁰) (Fig. 1a and Fig. 2b). Pathway enzymes, as well as their functional homologues from various plants that produce structurally similar saponins (for example, Saponaria vaccaria), fungi (LovF from Aspergillus terreus) and bacteria, were functionally expressed in the engineered yeast. This combinatorial approach allowed us to select the activities that function optimally together in a yeast cell, thereby enabling the production of QS-21. Owing to the promiscuity of several enzymes, structural analogues of QS-21 were produced using the biosynthetic platform described here; this will enable a structure–bioactivity relationship to be established in the future, and will allow the rational design of potent vaccine adjuvants.

Biosynthesis of quillaic acid

The Saccharomyces cerevisiae strain JWy601 was chosen as the base strain to the triterpene core, β-amyrin, of QS-21. The mevalonate-based isoprenoid biosynthetic pathway in this strain had previously been upregulated to produce sesquiterpenes¹¹. In JWy601, all genes encoding enzymes that convert acetyl-CoA to farnesyl pyrophosphate (FPP) were placed under the control of galactose-inducible promoters for controlled overexpression. The culture was grown at first in a glucose-containing rich medium, YPD, for 48 h to maximize the cell mass before a 72-h production phase was initiated by the addition of galactose. β-Amyrin synthases (BASs) of various plant origins (Artemisia annua, Arabidopsis thaliana, Glycyrrhiza glabra and Saponaria vaccaria) were integrated into JWY601 to quantify the production of β-amyrin from squalene by gas chromatography–mass spectrometry (GC–MS). Among these candidates, the BAS homologue from S. vaccaria (SvBAS) yielded the highest titre of β-amyrin (1), the production of which was further confirmed by efficient consumption of squalene compared to the parent strain, JWy601 (Extended Data Fig. 1a,b). Further upregulation of mevalonate pathway genes encoding ERG20 and ERG1 (Fig. 1a) ultimately resulted in a β-amyrin titre of 899.0 mg l⁻¹ over a production period of three days (Extended Data Fig. 1c).

Expression cassettes containing cytochrome P450s identified in Q. saponaria⁹ as well as the redox partner, cytochrome P450 reductase (CPR, AtATR1 from A. thaliana), were integrated sequentially into the yeast genome to achieve the biosynthesis of the triterpenoid core, QA (Fig. 3a). Extraction of the culture medium and analysis by liquid chromatography–mass spectrometry (LC–MS) showed that the CPR was sufficient as a redox partner to facilitate the three-step oxidation at the C28 position to the carboxylic acid carried out by CYP716A224, reaching a titre of 263.4 mg l⁻¹ of oleanolic acid (3) by strain YL-1 (Extended Data Fig. 3c). By contrast, C23 oxidation required a Quillaja native cytochrome b₅ (Qsb₅) reductase for the hydroxy functional group to be oxidized to an aldehyde to yield gypsogenin (5, strain YL-3; Supplementary Fig. 1). Cytochromes b₅ have long been known to increase the activities of cytochrome P450 through (i) direct electron transfer from NADH-cytb₅ to P450s in a pathway independent of NADPH-CPR and (ii) potentially faster transfer of the second electron as compared with CPR^12,13. Indeed, co-expression of cytochrome b₅, cytochrome P450s and CPRs has yielded higher oxidation efficiencies, leading to higher titres of the oxidized products in heterologous hosts^14,15,16. For the C16 oxidation, despite a predicted transmembrane domain at the N terminus of the C16 oxidase, subcellular localization studies revealed that the yeast codon-optimized CYP716A297 (C-terminal mCherry fusion) was cytosolic, and no oxidized product was detected. A different localization pattern with more protein aggregation was observed for the same protein expressed from a gene with the native plant sequence, possibly owing to differences in yeast and plant codon frequencies and concomitant changes in protein translation efficiency (Fig. 3b). To localize P450 to the endoplasmic reticulum (ER) membrane, the predicted 22-amino-acid transmembrane domain (TMD) of the C28 oxidase was fused to the N terminus of the C16 oxidase, thereby creating the fusion protein TMD_C28–C16 and resulting in the production of QA (6) at 1.1 mg l⁻¹ (strain YL-4) (Fig. 3c).

**Fig. 3: Functional expression of cytochrome P450s and pathway engineering for QA.**

To optimize the P450 oxidation efficiency, we opted for the introduction of a membrane steroid-binding protein (MSBP) to act as a scaffold for co-localization of the P450s. Despite their spatial proximity on the ER membrane, cytochrome P450s do not directly interact with each other. In plants, MSBPs serve an important physiological role in regulating lignin biosynthesis in A. thaliana by establishing physical interaction with and organizing the pathway P450s¹⁷. Indeed, production of the final oxidation product QA increased by fourfold after the expression of a newly identified MSBP candidate from S. vaccaria (Extended Data Fig. 2a,b, Supplementary Methods, Supplementary Fig. 2 and Supplementary Table 5). Subcellular localization studies revealed that SvMSBP1 co-localizes with both C28 and C23 oxidases on the ER membrane (Extended Data Fig. 2c,d). Such spatial proximity further corroborates its scaffolding function for non-lignin-related P450s and thus constitutes an efficient and potentially universal strategy to improve the activities of P450 in heterologous hosts. To identify the bottleneck among the six oxidation steps with three P450 monooxygenases, C28, C23 and TMD_C28–C16 oxidases were overexpressed individually in a strain that contained a single copy of each P450 and ATR1 integrated into the chromosome (YL-8 to YL-10), leading to a fourfold, twofold and twofold increase, respectively, in QA production (Extended Data Fig. 3). In addition, overexpressing a second copy of the CPR in the C28-overexpressing strain (YL-11) increased the titre of 6 by eightfold, suggesting that the activities of all three P450s and their redox partners are suboptimal. To optimize the production of 6, two copies of the P450s, redox partners and MSBP were integrated into the strain YL-15 to yield 65.2 mg l⁻¹ of 6 in shake flask cultures (Fig. 3d and Extended Data Fig. 3).

C3 and C28 O-glycosylation

The final product QS-21 is a water-soluble triterpene glycoside with an amphiphilic character—a prerequisite for homogenous mixtures with soluble antigen in the vaccine formulation¹⁸. It is the sugar decorations on the C3 hydroxy and C28 carboxylic acid groups that render the non-polar triterpene core 6 hydrophilic. The complete glycosylation of QS-21 requires eight glycosylating steps, involving seven different UDP-sugars (that is, UDP-d-glucuronic acid (UDP-GlcA), UDP-d-galactose (UDP-Gal), UDP-d-xylose (UDP-Xyl), UDP-d-fucose (UDP-Fuc), UDP-l-rhamnose (UDP-Rha), UDP-d-apiose (UDP-Api) and UDP-l-arabinofuranose (UDP-Araf)). Among these, UDP-Gal is the only UDP-sugar that is native to yeast and can be obtained through galactose metabolism or UDP-glucose isomerization (Fig. 1a). As such, heterologous nucleotide sugar synthases were introduced into the yeast host¹⁹ along with their corresponding GTs in a stepwise manner. The detection of each glycosylated product confirmed the functional expression of both the sugar synthases and the transferases.

Two Q. saponaria GTs belonging to the cellulose synthase-like family of enzymes have been identified that add glucuronic acid to QA to give 3-O-{β-d-glycopyranosiduronic acid}-QA (CSLM1 and CSLM2)⁹. After co-expression of a UDP-glucose dehydrogenase from A. thaliana (AtUGD1) with CSLM1 (YL-16) or CSLM2 (YL-17), a new LC–MS peak that corresponds to the exact mass of 7 was detected. We observed that CSLM1 is more specific to the glucuronidation of 6, whereas CSLM2 can also glucuronidate less oxidized intermediates such as 3, 4 and 5, but is threefold more active towards 6 (Extended Data Fig. 4a and Supplementary Fig. 3). Therefore, CSLM2 was chosen for further pathway engineering. The first glycosylation step takes place in the ER membrane, along with the previous oxidation steps and the formation of the triterpenoid substrates (Extended Data Figs. 2 and 4). Both CSLMs were predicted to have seven transmembrane domains and confocal microscopy studies in both yeast and tobacco further confirmed the localization in the ER. It was also observed, when preparing the standards, that the glucuronidation of 6 substantially increases its water solubility. We speculate that 7 migrates to the cytoplasm, where the GTs are localized, to carry out the subsequent six C3 and C28 glycosylation steps (Extended Data Fig. 5).

The second glycosylation step of the C3 position is carried out by the cytosolic enzyme UGT73CU3 (C3-GalT), which efficiently galactosylates 7 by 1,2-glycosidic bond formation to yield 8 (Fig. 4, Extended Data Fig. 4b and Supplementary Fig. 4). When expressed alone, CSLMs cannot exhaust the pool of 6. However, expression of the downstream C3-GalT increased the conversion of 6 by pushing the equilibrium through the consumption of 7, thus leading to the production of 8 at 24.3 mg l⁻¹ (strain YL-18; Extended Data Fig. 4b). When UDP-xylose synthase (AtUXS3) was expressed in conjunction with UGT73CX1 (XylT) and the unmodified AtUGD1, no glycosylated product (that is, 7, 8 or 9) was observed (Supplementary Fig. 5a). This can be rationalized by the common feedback mechanism in which UDP-Xyl strongly inhibits UGDs to maintain the homeostasis of the intracellular UDP-Glc pool^20,21. To alleviate this inhibitory effect, an A101L mutation²² was introduced into AtUGD1 (strain YL-20) to reduce feedback inhibition by UDP-Xyl, thus allowing the xylosylation of 8 to yield 9 (Fig. 4 and Supplementary Fig. 5b).

**Fig. 4: Reconstitution of the glycosylation pathway by the functional expression of nucleotide sugar synthases and corresponding GTs.**

The C28 linear tetrasaccharide assembly follows a sequential order of d-fucose, l-rhamnose and d-xylose, as well as d-xylose or d-apiose as the terminal sugar. The d-fucose is linked to the C28 carboxylic acid functional group of QA by an esterification facilitated by a GT belonging to the GT1 family (UGT74BX1, C28FucT), with a UDP-sugar as the substrate. The biosynthetic pathway of UDP-d-Fuc, in which UDP-glucose is converted to UDP-4-dehydro-6-deoxy-d-glucose through UDP-glucose 4,6-dehydratase (SvUG46DH), has been reported only recently²³. C28FucT adds the UDP-deoxy-sugar, which is then reduced once it has been added onto the 9 backbone by FucSyn⁹, leading to the C28-fucosylated product 10 after the expression of all three enzymes (SvUG46DH, FucSyn and C28FucT in strain YL-25). Although residual 9 was observed in the presence of UGT74BX1 alone, the expression of a UDP-l-rhamnose synthase (AtRHM2) and the downstream UGT91AR1 (C28RhaT) helped to fully convert 10 to 11 (strain YL-27), thus efficiently pulling the equilibrium of C28FucT and increasing pathway flux (Fig. 4b). Pathway intermediates, in particular 8 and 11, started to accumulate after the expression of UGT91AQ1 (C28XylT3, strain YL-29). The fact that they are substrates for two xylosyltransferases (C3XylT and C28XylT3) indicated that UDP-Xyl was the limiting factor (Fig. 4b and Supplementary Fig. 6). As such, an additional copy of AtUXS was integrated and expressed (strain YL-30), which effectively relieved the accumulation of C3-glycosylated products and enabled the production of 12 as the major product. The last glycosylation step on the C28 position suffers from the tendency of both UGT73CY3 (C28XylT4) and UGT73CY2 (C28ApiT4) to misfold in yeast, leading to only trace amounts of the fully glycosylated products 13 and 14 in strains YL-33 and YL-34, respectively (Supplementary Fig. 7). These two enzymes have high protein sequence homology (94.89%) and thus, might display similar stability when expressed in a heterologous host. Subcellular localization studies of C-terminally GFP-tagged UGT73CY2 and UGT73CY3 revealed that although they are correctly localized to the cytoplasm at early stages of expression, aggregated forms adjacent to the vacuole become the dominant species with culture time (Supplementary Fig. 8c). However, when fresh carbon and nitrogen resources are provided (that is, fresh YP galactose), the expression of protein under galactose-inducible promoters is switched on when an additional inducer (galactose) is added to the medium, leading to higher cytosolic expression of UGT73CY3.

Biosynthesis and addition of the acyl unit

The specific immunological role of the acyl group in QS-21 remains unclear, but structure–activity relationship studies have shown that it is crucial to the potent activity of QS-21 in stimulating and soliciting cytokine responses mediated by T helper 1 cells^24,25. The biosynthesis of each of the dimeric C9 acyl chains requires two consecutive decarboxylative Claisen condensation reactions of malonyl-CoA with (S)-2-methylbutyryl-CoA (2MB-CoA; Fig. 5). This is catalysed by two type III polyketide synthases, PKS4 and PKS5, with the keto intermediate being reduced by two stand-alone ketoreductases, KR1 and KR2, to form the 3,5-dihydroxy moiety in C9-CoA (ref. ¹⁰) (Fig. 1b). No native metabolic pathway in yeast involves 2MB-CoA, and free 2MB acid was therefore first added exogenously to the culture medium at 50 mg l⁻¹, with the heterologous expression of a Quillaja short chain fatty acid CoA ligase (QsCCL1), to yield 2MB-CoA in YL-QsCCL (Fig. 5b).

**Fig. 5: Acylation and terminal glycosylation towards the complete biosynthesis of QS-21.**

The acyl biosynthetic cassette (PKS4, PKS5, KR1 and KR2) was first tested in YL-QsCCL, which can make 2MB-CoA intracellularly, but no production of C9-CoA could be detected directly by LC–MS, owing possibly to its chemical instability and potential cyclization into the lactone. Although ACT2 has been reported to efficiently convert the hepta-glycosylated 13 to 15 (Fig. 5a), it is also active on the hexa-glycosylated 12 (ref. ¹⁰) (Fig. 1). Therefore, the acyl biosynthetic cassette and the first acyl transferase ACT2 were first integrated into the 12-producing yeast strain (YL-30). In the presence of 2MB acid supplementation to the culture medium, the mono-acylated product 19 was detected by LC–MS, which was confirmed by its co-elution with a tobacco extract standard¹⁰ (strain YL-42; Extended Data Figs. 6 and 7). Because residual substrate 12 was still detected after the acylation, an increased concentration of 2MB acid was added to the culture medium up to 500 mg l⁻¹, which resulted in a much-improved acylation conversion (Extended Data Fig. 6). After the expression of C28XylT4 (strain YL-43), although the mono-acylated hepta-glycosylated product 15 was observed using the culture scheme developed above, residual 12 and 19 were still present in the medium extract, indicating that the terminal xylosylation still requires improvement (Extended Data Fig. 7). The second acyl transferase ACT3 fully acylates both mono-acylated products (15 and 19) with an additional C9 unit, resulting in 17 and 20, respectively (strain YL-45). This indicates that the MB acid supplement and the yeast endogenous malonyl-CoA pool provide sufficient C9-CoA for the two-step acyl biosynthesis and additions.

Plant UDP-l-Araf biosynthesis is closely associated with the Golgi apparatus because l-Araf is a key component in the plant cell wall²⁶. The biosynthesis of UDP-arabinopyranose (UDP-Arap) occurs mainly through the epimerization of UDP-Xyl in the Golgi lumen; UDP-Arap is then interconverted into UDP-Araf by UDP-Ara mutase, which is located outside on the cytosolic surface of the Golgi. The resultant UDP-Araf is transported back to the Golgi lumen for its later glycosylation applications²⁷. Owing to the lack of yeast native sugar transporters in the Golgi membrane, cytosolic homologues of these nucleotide sugar synthases were chosen. UDP-glucose epimerase 1 from A. thaliana (AtUGE1), a bifunctional enzyme that epimerizes UDP-Glc and UDP-Gal, as well as UDP-Xyl and UDP-Arap, and reversibly glycosylated polypeptide 1 (AtRGP1), which converts UDP-Arap to UDP-Araf, were expressed to produce UDP-Araf in vivo. Integration and expression in the presence of UGT73CZ2 (ArafT) led to a new LC–MS peak that corresponds to the exact mass of 1987.9164 and co-elutes with the QS-21 standard (Fig. 5c); this was further corroborated and confirmed by the identical isotopic fingerprint patterns of the extracted sample and QS-21. Note that two mass peaks were observed in the extracted LC–MS spectrum. When spiked with QS-21 standard, or 18+Xyl (QS-21 with an acyl terminal Xyl instead of Araf) individually¹⁰, the two LC–MS peaks were successfully identified as Xyl- or Araf-capped 17, respectively, with the latter being QS-21-Xyl (Fig. 5c) produced at 94.6 ± 8.3 μg l⁻¹ in YL-46. To further confirm the biosynthesis of QS-21-Xyl in YL-46, the production was scaled up to allow sufficient materials to be purified and characterized by high-resolution tandem mass spectrometry (HRMS2; Supplementary Methods and Supplementary Table 6) and ¹H nuclear magnetic resonance (NMR). The identical fragments observed in the MS2 spectra of purified QS-21-Xyl from YL-46 and in those of the standard, along with the mirroring corresponding ion intensities, provide evidence that they have the same structural composition (Extended Data Fig. 8 and Supplementary Table 7). In addition, the well-matched overall spectrum and, in particular, anomeric proton peaks confirm the correct structure and connectivity between sugar moieties (Extended Data Fig. 9). A similar strategy was used to engineer YL-47 to produce QS-21-Api—the same gene cassettes of C9-CoA and UDP-Araf biosynthesis and addition were integrated into the 14-producing strain to yield QS-21-Api at 31.1 ± 0.5 μg l⁻¹ in YL-47 (Supplementary Fig. 9).

To realize the complete biosynthesis of QS-21 from the simple sugar galactose without exogenous supplementation of 2MB acid, we first sought to express the branched-chain α-keto acid dehydrogenase complex with a transaminase from Bacillus subtilis, which readily converts isoleucine to 2MB-CoA during amino acid metabolism. However, no 2MB-CoA was detected in yeast, which is probably due to the fact that yeast lacks the necessary post-translational modification mechanism of subunit E2 of the cluster²⁸. Alternatively, a 7.6-kb gene encoding the type I PKS protein F (LovF) from the Lovastatin biosynthesis cluster from Aspergillus terreus was used to source 2MB-CoA intracellularly. This megasynthase converts two units of malonyl-CoA to 2MB covalently attached to the acyl carrier protein (ACP) domain, which would be directly transferred onto the lovastatin acid precursor monacolin in the native lovastatin pathway²⁹. We engineered LovF by truncating it after ACP and fusing it to the promiscuous erythromycin PKS (EryPKS) M6 thioesterase (TE)³⁰ through an interdomain linker. As a result, methylbutyryl-S-ACP was hydrolysed to release free 2MB. In the yeast that contained a chromosomal copy of the phosphopantetheinyl transferase (npgA)²⁹, detectable amounts of 2MB-CoA were observed in the LC–MS traces when LovF-TE and QsCCL were co-expressed (Fig. 5b), thus demonstrating the successful engineering of a PKS that catalyses the release of free 2MB acid from the LovF ACP domain and its subsequent CoA activation. The 2MB-CoA cassette was integrated into YL-46 and YL-47, leading to the production of QS-21-Xyl and QS-21-Api biosynthesized from only simple sugars (strains YL-50 and YL-51; Supplementary Fig. 10).

Discussion

In addition to the upregulated yeast native mevalonate pathway, our final strain contains 38 heterologous enzymes sourced from six species, spanning several enzyme families: a terpene synthase, P450s, nucleotide sugar synthases, GTs and acyl transferases, as well as type I and type III PKSs. To achieve the complete biosynthesis of QS-21, we mimicked in yeast the subcellular compartmentalization of plants from the ER membrane to the cytosol. QS-21-Xyl and QS-21-Api—two isomers of QS-21 with high structural similarity—can therefore be produced in separate yeast strains, and this enables them to be purified, and their immunoactivity to be characterized, in an independent manner.

Moreover, the yeast biosynthetic platform provides vast opportunities to produce structural variants of QS-21 by expressing alternative pathway enzymes or by making fragments of QS-21, exploiting the promiscuity of the enzyme in the pursuit of new leads for vaccine adjuvants. For example, the xylose in the C3 trisaccharide cluster can be replaced by rhamnose, with an additional methyl group, by expressing a rhamnose transferase (UGT73CX2, C3RhaT) instead of the xylose transferase described above (Extended Data Fig. 10). The rhamnose-containing trisaccharide 22 is also a substrate for downstream pathway enzymes and can easily yield a methylated QS-21 derivative. When investigating the glycan functions of QS-21, GTs can be intentionally left out to yield truncated oligosaccharides, highlighted here by the successful biosynthesis of 21 (Extended Data Fig. 7).

The traditional method of extraction and purification of QS-21 from the soapbark tree destroys the bark of the tree, and has prompted increased governmental regulations around its deforestation. Our demonstration of the total biosynthesis of QS-21 in an engineered yeast strain highlights the possibility of replacing the plantation-based supply of saponins with industrial fermentation at scale, which could markedly increase the availability of QS-21 to meet the rising demand for potent vaccine adjuvants. At present, strain YL-46 produces approximately 0.0012% w/w QS-21 per dry cell weight, which is less than the w/w yield from the tree (0.0032%; Supplementary Methods), but it does so over a period of days. As a result, the production of QS-21 in yeast is still considerably faster (by approximately 1,000 times) than it is in the native Q. saponaria, which produces QS-21 in trees only once they reach an age of 30–50 years³¹. Although key developments in strain engineering, production and fermentation schemes, as well as in the downstream extraction and purification processes, will still be necessary to produce yeast-derived QS-21 at scale, landmark successes in this arena, such as the industrial-scale production of the anti-malarial precursor artemisinic acid¹⁴, have paved the way for new opportunities in microbial biomanufacturing.

Methods

Chemicals

Numbers, trivial names and International Union of Pure and Applied Chemistry (IUPAC) names, as well as chemical structures of pathway metabolites, are listed in Supplementary Table 3. All chemical standards used in this study are analytical grade and are listed in Supplementary Table 4.

Plasmid construction

All plasmids were constructed by Gibson assembly (New England Biolabs, HiFi DNA Assembly Master Mix), followed by heat shock transformation into Escherichia coli DH5α competent cells, which were plated on Luria–Bertani (LB) agar containing 100 μg ml⁻¹ carbenicillin or kanamycin and grown at 37 °C overnight. E. coli transformants were grown in 5 ml LB medium containing 100 μg ml⁻¹ carbenicillin or kanamycin at 37 °C overnight, followed by miniprep plasmid extraction (Qiagen), and were validated by Sanger sequencing. All biosynthetic genes^9,10 with the exception of LovF-TE were codon-optimized for yeast expression and synthesized by Integrated DNA Technologies. The QS-21 biosynthetic pathway genes were directly inserted into the plasmid backbone for subcellular localization studies in Nicotiana benthamiana. All genes were assembled as expression cassettes in pESC plasmids or the plant binary expression vector pCaBGi for yeast and plant expression, respectively. All enzymes used in this study are listed in Supplementary Table 1.

Strain construction

DNA integrating sequences were constructed using a previously described method³². Manufacturer protocols and standard recombinant DNA procedures were followed for DNA purification (Qiagen), DNA amplification (New England Biolabs, Q5 HighFidelity 2X Master Mix). All primers were designed using CASdesigner. In brief, DNA fragments to be integrated were PCR-amplified then co-transformed with a Cas9-based plasmid facilitating integration at the targeted locus. Alternatively, selection markers were integrated using homologous recombination. For transformations, a fresh overnight culture of parent yeast was inoculated into 25 ml 2×YPD in a 250-ml shake flask at an optical density at 600 nm (OD_600 nm) of 0.2, and was incubated at 30 °C and 200 rpm until the OD_600 nm reached 1.0. Then, 5 OD of cells were collected by centrifuging for 2 min at 3,000g, and were washed with a half volume of H₂O. The pellet was then resuspended with DNA fragments for integration (2 µg) and pCUT plasmid (0.25 µg), which was then mixed with transformation mix (260 µl of 50% PEG3350, 36 µl of 1 M LiOAc and 10 µl of ssDNA)³³. The mixture was incubated at 42 °C for 30 min and the pellet was collected by centrifuging for 2 min at 3,000g. The pellets were then resuspended with 100 µl H₂O and this was plated onto selective agar plates. The integration was validated by colony PCR and sequencing; the correct colonies were used for further engineering after pCUT plasmid curing. Oligonucleotides and codon-optimized gBlock gene fragments were obtained from Integrated DNA Technologies. Yeast culture media were purchased from BD, and all agar plates were obtained from Teknova. All strains constructed in this study are listed in Supplementary Table 2.

In vivo production, extraction and analysis of QS-21 and its precursors

Strains were grown in 2 ml of yeast extract peptone dextrose (YPD, 4% D) medium for 48 h to reach OD_600 nm = 10–15, before being resuspended in 2 ml yeast extract peptone galactose (YPG, 4% G). All strains were incubated for 72 h in 24-deep-well plates at 30 °C and 200 rpm. YL-43 to YL-51 were supplemented with fresh YPG every 24 h. The medium was supplemented with 50–500 mg l⁻¹ (S)-2-methylbutyric acid when culturing YL-42 to YL-47.

β-amyrin production and GC–MS analysis

A single method was used to extract and quantify squalene and β-amyrin. Five hundred microlitres of culture medium in a microfuge tube was first treated with Zymolyase 100T (Arthrobacter luteus, AMSBIO) for 2 h at 37 °C before it was extracted with 500 µl ethyl acetate with bead-beating (3,800 rpm, 1 min × 2). Cholesterol was used as an internal standard. Organic and inorganic layers were separated by centrifugation at 12,000g for 1 min, and samples were extracted twice using cholesterol as an internal standard. Two hundred microlitres of the combined organic layer is derived by treatment with 200 µl of pyridine and 200 µl of BSTFA (Sigma-Aldrich) at 55 °C for 1 h. The derived sample was diluted in ethyl acetate before it was subjected to GC–MS (GC model 6890, MS model 5973 inert, Agilent). An aliquot of the sample (1 µl) was injected into a DB-WAX column (Agilent) operating at a helium flow rate of 1 ml min⁻¹. The oven temperature was held at 80 °C for 4 min after injection and was then ramped to 280 °C at 20 °C min⁻¹, held at 280 °C for 25 min, ramped to 300 °C at 20 °C min⁻¹ and finally held at 300 °C for 5 min (total method of 45 min). The MS ion source was held at 300 °C throughout, with the quadrupole at 200 °C and the GC–MS transfer line at 280 °C. Full mass spectra were generated for metabolite identification by scanning within the m/z range of 40–440. Standard curves for target molecules were routinely run at the start and end of each batch of samples.

Triterpenoid production and LC–MS analysis

The extraction and detection of erythrydiol (2) follow the procedure described for β-amyrin. For the rest of the triterpenoids, 200 µl of culture was collected in a microfuge tube before it was directly extracted with 800 µl methanol with bead-beating (3,800 rpm, 1 min × 2). The mixture was centrifuged at 12,000g for 1 min to separate the pellet. Two hundred microlitres of the supernatant was transferred into an Eppendorf tube, which was then evaporated in a vacuum concentrator at room temperature and the remainders were resuspended in 200 µl methanol. Finally, samples were filtered with Amicon Ultra 0.5-ml 3-kDa filter tubes or centrifuged at 15,000g for 5 min. Products were analysed using LC–MS (1260 Infinity II LC-MSD iQ, Agilent) equipped with a reverse phase C18 column (Kinetex 2.6 µm, 250 × 4.6 mm, XB-C18, Phenomenex). A 50-min isocratic method was performed with 10:90 of water (solvent A) and acetonitrile (solvent B) using a flow rate of 0.3 ml min⁻¹. Full mass spectra were generated for metabolite identification by scanning within the m/z range of 300–600 in negative-ion mode. Data acquisition and analysis were performed using OpenLab CDS version 2.4 (Agilent).

Production of glycosylated QA and LC–MS analysis

A similar extraction procedure was followed, by collecting 500 µl of culture and mixing with 500 µl methanol with bead-beating (3,800 rpm, 1 min × 2). Two hundred microlitres of the supernatant was evaporated and was resuspended in 200 µl of methanol before C28 glycosylation; otherwise, 800 µl of the supernatant was resuspended in 160 µl of methanol. Detection of glycosylated triterpenoids was performed using an LC-MSD iQ equipped with a Kinetex column 2.6 μm XB-C18 100 Å, 50 × 2.1 mm (Phenomenex) using the following parameters⁹: MS (ESI ionization, desolvation line temperature = 250 °C, nebulizing gas flow = 15 l min⁻¹, heat block temperature = 400 °C, spray voltage positive 4.5 kV, negative −3.5 kV). Method: solvent A: (H₂O + 0.1% formic acid); solvent B: (acetonitrile (CH₃CN) + 0.1% formic acid). Injection volume: 10 µl. Gradient: 15% B from 0 to 1.5 min, 15% to 60% B from 1.5 to 26 min, 60% to 100% B from 26 to 26.5 min, 100% B from 26.5 to 28.5 min, 100% to 15% B from 28.5 to 29 min, 35% B from 29 to 30 min. The method was performed using a flow rate of 0.3 ml min⁻¹. Full mass spectra were generated for metabolite identification by scanning within the m/z range of 400–1,350 in negative-ion mode. Data acquisition and analysis were performed using OpenLab CDS v.2.4 (Agilent). The production of target molecules was confirmed by co-elution with the purified standards previously reported⁹.

Production of acylated molecules and QS-21, and LC–QTOF–MS analysis

A similar extraction procedure was followed, by collecting 500 µl of culture and mixing with 500 µl methanol with bead-beating (3,800 rpm, 1 min × 2). Eight hundred microlitres of the supernatant was evaporated and was resuspended in 40 µl methanol, which was then filtered with Amicon Ultra 0.5-ml 3-kDa filter tubes or centrifuged at 15,000g for 5 min. Detection of the acylated molecules and QS-21 was performed by LC–MS (Agilent 6545 for quadrupole time-of-flight (QTOF), Agilent) using the following parameters¹⁰: MS (ESI ionization, desolvation line temperature = 250 °C, nebulizing gas flow = 15 l min⁻¹, heat block temperature = 400 °C, spray voltage positive 4.5 kV, negative −3.5 kV). Method: solvent A: (H₂O + 0.1% formic acid); solvent B: (acetonitrile (CH₃CN) + 0.1% formic acid). Injection volume: 10 µl. Gradient: 15% B from 0 to 0.75 min, 15% to 60% B from 0.75 to 13 min, 60% to 100% B from 13 to 13.25 min, 100% to 15% B from 13.25 to 14.5 min, 15% B from 14.5 to 16.5 min. The method was performed using a flow rate of 0.6 ml min⁻¹ and a Kinetex column 2.6 μm XB-C18 100 Å, 50 × 2.1 mm (Phenomenex). Full mass spectra were generated for metabolite identification by scanning within the m/z range of 400–2,500 in negative-ion mode¹⁹. Analysis was performed using MassHunter Qualitative Analysis v.B.06.00 (Agilent). Note that the standard used to spike in the QS-21 sample was 18-Xyl, which was generated in vitro using 18 with a terminal apiose on the C28 sugar chain. Because the molecules with a C28 terminal apiose or xylose co-elute, 18-Xyl (C28 terminal apiose) was used to determine the elution time of 17-Xyl (C28-terminal-xylose).

Extraction of CoA from engineered yeast and LC–MS analysis

The extraction procedure was adapted from previous reports^28,34. Specifically, 5 OD of cells were pelleted by centrifugation for 2 min at 4 °C at 3,000g and the supernatant was discarded. Cells were quenched and extracted by 100 μl of methanol: acetonitrile: 0.1% glacial acetic acid at a 45:45:10 ratio prechilled at −20 °C. The resuspended extracts were incubated on ice with intermittent vortexing for 15 min, followed by a 3-min centrifugation at 12,000g and 4 °C. The supernatant (10 µl) was injected for LC–MS analysis. Detection of CoA was performed using an LC-MSD iQ equipped with a Hypercarb column 5 μm, 250 Å, 150 × 1 mm (Thermo Fisher Scientific) using the following parameters: MS (ESI ionization, desolvation line temperature = 350 °C, nebulizing gas flow = 13 l min⁻¹, spray voltage positive 4.5 kV, negative −6.0 kV). Method: solvent A: (H₂O + 0.1% formic acid); solvent B: (acetonitrile (CH₃CN) + 0.1% formic acid). Injection volume: 10 µl. Gradient: 2% B from 0 to 15 min, 2% to 90% B from 15 to 17 min, 90% to 20% B from 17 to 18 min, 2% B from 18 to 35 min. The method was performed using a flow rate of 0.1 ml min⁻¹. Full mass spectra were generated for metabolite identification by scanning within the m/z range of 300–1,300 in negative-ion mode. Data acquisition and analysis were performed using OpenLab CDS v.2.4 (Agilent). The 2MB-CoA standard was synthesized according to a reported procedure²⁹.

Transient expression of fluorescent fusion proteins in tobacco plants

Leaves of four-week-old N. benthamiana plants were infiltrated following a procedure adapted from a previous study³⁵. In brief, constructs assembled into binary vectors were transformed into the Agrobacterium tumefaciens strain GV3101. Transformed Agrobacterium strains were grown in LB with appropriate antibiotics at 30 °C, shaking at 200 rpm, to an OD_600 nm of 0.8–1.2. Agrobacterium cells were collected by centrifugation at 4,000g for 10 min at room temperature and resuspended in infiltration buffer (10 mM MES, 10 mM MgCl₂ and 500 µM acetosyringone) to final OD_600 nm = 0.5. Cells were incubated in the infiltration buffer for 1 h with gentle shaking. N. benthamiana leaves were infiltrated with a 1-ml syringe with no needle attached by gently pressing the syringe to the abaxial side of the leaf while applying gentle pressure to the adaxial side. N. benthamiana plants were grown and maintained in a plant growth room at 25 °C in 16-h–8-h light–dark cycles with 50% humidity. Leaves were collected three days after infiltration.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

Strains and plasmids developed for this study (Supplementary Table 2), along with annotated sequences, have been deposited in the JBEI Registry (https://registry.jbei.org) and are physically available from the authors upon reasonable request. Contractual obligations from commercial partnerships prohibit us from distributing (by ourselves or through a third party) strains described in our manuscript to for-profit commercial entities. However, we provide extensive genotypic descriptions of our strains, fully annotated DNA sequences and detailed methods that will enable others to build on our work. Strains will be provided to nonprofit, government or academic laboratories and institutions. Source data are provided with this paper.

References

Didierlaurent, A. M. et al. Adjuvant system AS01: helping to overcome the challenges of modern vaccines. Expert Rev. Vaccines 16, 55–63 (2017).
Article CAS PubMed Google Scholar
Lövgren Bengtsson, K., Morein, B. & Osterhaus, A. D. ISCOM technology-based Matrix M^TM adjuvant: Success in future vaccines relies on formulation. Expert Rev. Vaccines 10, 401–403 (2011).
Article PubMed Google Scholar
Kensil, C. R., Patel, U., Lennick, M. & Marciani, D. Separation and characterization of saponins with adjuvant activity from Quillaja saponaria Molina cortex. J. Immunol. 146, 431–437 (1991).
Article CAS PubMed Google Scholar
Pifferi, C., Fuentes, R. & Fernández-Tejada, A. Natural and synthetic carbohydrate-based vaccine adjuvants and their mechanisms of action. Nat. Rev. Chem. 5, 197–216 (2021).
Article CAS PubMed PubMed Central Google Scholar
Coffman, R. L., Sher, A. & Seder, R. A. Vaccine adjuvants: putting innate immunity to work. Immunity 33, 492–503 (2010).
Article CAS PubMed PubMed Central Google Scholar
Pasquale, A. D., Preiss, S., Silva, F. T. D. & Garçon, N. Vaccine adjuvants: from 1920 to 2015 and beyond. Vaccines 3, 320–343 (2015).
Article PubMed PubMed Central Google Scholar
Pulendran, B., S Arunachalam, P. & O’Hagan, D. T. Emerging concepts in the science of vaccine adjuvants. Nat. Rev. Drug Discov. 20, 454–475 (2021).
Article CAS PubMed PubMed Central Google Scholar
Barr, I. G., Sjölander, A. & Cox, J. C. ISCOMs and other saponin based adjuvants. Adv. Drug Deliv. Rev. 32, 247–271 (1998).
Article CAS PubMed Google Scholar
Reed, J. et al. Elucidation of the pathway for biosynthesis of saponin adjuvants from the soapbark tree. Science 379, 1252–1264 (2023).
Article ADS CAS PubMed Google Scholar
Martin, L. B. B. et al. Complete biosynthesis of the potent vaccine adjuvant QS-21. Nat. Chem. Biol. 20, 493–502 (2024).
Article CAS PubMed PubMed Central Google Scholar
Wong, J., d’Espaux, L., Dev, I., van der Horst, C. & Keasling, J. De novo synthesis of the sedative valerenic acid in Saccharomyces cerevisiae. Metab. Eng. 47, 94–101 (2018).
Article CAS PubMed Google Scholar
Bridges, A. et al. Identification of the binding site on cytochrome P450 2B4 for cytochrome b₅ and cytochrome P450 reductase. J. Biol. Chem. 273, 17036–17049 (1998).
Article CAS PubMed Google Scholar
Porter, T. D. The roles of cytochrome b₅ in cytochrome P450 reactions. J. Biochem. Mol. Toxicol. 16, 311–316 (2002).
Article CAS PubMed Google Scholar
Paddon, C. J. et al. High-level semi-synthetic production of the potent antimalarial artemisinin. Nature 496, 528–532 (2013).
Article ADS CAS PubMed Google Scholar
Brown, S., Clastre, M., Courdavault, V. & O’Connor, S. E. De novo production of the plant-derived alkaloid strictosidine in yeast. Proc. Natl Acad. Sci. USA 112, 3205–3210 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Ignea, C. et al. Overcoming the plasticity of plant specialized metabolism for selective diterpene production in yeast. Sci. Rep. 7, 8855 (2017).
Article ADS PubMed PubMed Central Google Scholar
Gou, M., Ran, X., Martin, D. W. & Liu, C.-J. The scaffold proteins of lignin biosynthetic cytochrome P450 enzymes. Nat. Plants 4, 299–310 (2018).
Article CAS PubMed Google Scholar
Kensil, C. R. & Kammer, R. QS-21: a water-soluble triterpene glycoside adjuvant. Expert Opin. Investig. Drugs 7, 1475–1482 (1998).
Article CAS PubMed Google Scholar
Crowe, S. A. et al. Engineered Saccharomyces cerevisiae as a biosynthetic platform of nucleotide sugars. ACS Synth. Biol. https://doi.org/10.1021/acssynbio.3c00666 (2024).
Article PubMed Google Scholar
Oka, T. & Jigami, Y. Reconstruction of de novo pathway for synthesis of UDP-glucuronic acid and UDP-xylose from intrinsic UDP-glucose in Saccharomyces cerevisiae. FEBS J. 273, 2645–2657 (2006).
Article CAS PubMed Google Scholar
Turner, W. & Botha, F. C. Purification and kinetic properties of UDP-glucose dehydrogenase from sugarcane. Arch. Biochem. Biophys. 407, 209–216 (2002).
Article CAS PubMed Google Scholar
Beattie, N. R., Pioso, B. J., Sidlo, A. M., Keul, N. D. & Wood, Z. A. Hysteresis and allostery in human UDP-glucose dehydrogenase require a flexible protein core. Biochemistry 57, 6848–6859 (2018).
Article CAS PubMed Google Scholar
Chen, X. et al. Deciphering triterpenoid saponin biosynthesis by leveraging transcriptome response to methyl jasmonate elicitation in Saponaria vaccaria. Nat. Commun. 14, 7101 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Fernández-Tejada, A., Tan, D. S. & Gin, D. Y. Development of improved vaccine adjuvants based on the saponin natural product QS-21 through chemical synthesis. Acc. Chem. Res. 49, 1741–1756 (2016).
Article PubMed PubMed Central Google Scholar
Lacaille-Dubois, M.-A. Updated insights into the mechanism of action and clinical profile of the immunoadjuvant QS-21. Phytomedicine 60, 152905 (2019).
Article CAS PubMed PubMed Central Google Scholar
Rautengarten, C. et al. The interconversion of UDP-arabinopyranose and UDP-arabinofuranose is indispensable for plant development in Arabidopsis. Plant Cell 23, 1373–1390 (2011).
Article CAS PubMed PubMed Central Google Scholar
Rautengarten, C. et al. The elaborate route for UDP-arabinose delivery into the Golgi of plants. Proc. Natl Acad. Sci. USA 114, 4261–4266 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Krink-Koutsoubelis, N. et al. Engineered production of short-chain acyl-coenzyme A esters in Saccharomyces cerevisiae. ACS Synth. Biol. 7, 1105–1115 (2018).
Article CAS PubMed Google Scholar
Xie, X., Meehan, M. J., Xu, W., Dorrestein, P. C. & Tang, Y. Acyltransferase mediated polyketide release from a fungal megasynthase. J. Am. Chem. Soc. 131, 8388–8389 (2009).
Article CAS PubMed PubMed Central Google Scholar
Yuzawa, S. et al. Comprehensive in vitro analysis of acyltransferase domain exchanges in modular polyketide synthases and its application for short-chain ketone production. ACS Synth. Biol. 6, 139–147 (2017).
Article CAS PubMed Google Scholar
Martin, S. R. & Briones, R. Industrial uses and sustainable supply of Quillaja saponaria (Rosaceae) saponins. Econ. Bot. 53, 302–311 (1999).
Article Google Scholar
Reider Apel, A. et al. A Cas9-based toolkit to program gene expression in Saccharomyces cerevisiae. Nucleic Acids Res. 45, 496–508 (2017).
Article PubMed Google Scholar
Luo, X. et al. Complete biosynthesis of cannabinoids and their unnatural analogues in yeast. Nature 567, 123–126 (2019).
Article ADS CAS PubMed Google Scholar
Lu, C., Zhang, X., Jiang, M. & Bai, L. Enhanced salinomycin production by adjusting the supply of polyketide extender units in Streptomyces albus. Metab. Eng. 35, 129–137 (2016).
Article CAS PubMed Google Scholar
Sparkes, I. A., Runions, J., Kearns, A. & Hawes, C. Rapid, transient expression of fluorescent fusion proteins in tobacco plants and generation of stably transformed plants. Nat. Protoc. 1, 2019–2025 (2006).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

We thank U. N. Andersen for high-resolution analytical measurement; R. San Martin for his knowledge on QS saponins; M. Wehrs, C. S. Diercks, N. Lee, J. Huang, Q. Dan, Z. Wang, M. G. Thompson and A. Zargar for discussions, M. Rejzek for synthesizing and providing UDP-d-fucose; and C. Owen for advice on bioinformatics. This work was supported by an industry grant to J.D.K. and under the financial assistance award 70NANB22H017 from the US Department of Commerce, National Institute of Standards and Technology, made possible, in part, with the support of the Bioindustrial Manufacturing and Design Ecosystem (BioMADE, the content expressed herein is that of the authors and does not necessarily reflect the views of BioMADE) and by National Institutes of Health grant R01 AT010593. The QB3/Chemistry Mass Spectrometry Facility received National Institutes of Health support (grant number 1S10OD020062-01). Mass spectrometry analysis by T.R.N. and S.M.K. was supported by the m-CAFEs Microbial Community Analysis and Functional Evaluation in Soils program, a Science Focus Area at Lawrence Berkeley National Laboratory funded by the US Department of Energy, Office of Science, Office of Biological & Environmental Research DE-AC02-05CH11231. This work was also supported by a Biotechnological and Biological Sciences Research Council (BBSRC) Super Follow-on-Fund award BB/R005508/1 (L.B.B.M., R.C.M., S.K. and A.E.-D.), industrial funding (J.R., R.C.M., S.K., A.E.-D. and A.O.), the John Innes Foundation (A.O.), and the BBSRC Institute Strategic Programme Grant ‘Harnessing Biosynthesis for Sustainable Food and Health’ (BB/XO10-97X/1) (A.O.).

Author information

Authors and Affiliations

California Institute of Quantitative Biosciences (QB3), University of California, Berkeley, Berkeley, CA, USA
Yuzhong Liu, Xixi Zhao, Fei Gan, Samantha A. Crowe, Graham A. Hudson, Bo Pang, Anthony T. Iavarone & Jay D. Keasling
Joint BioEnergy Institute, Emeryville, CA, USA
Yuzhong Liu, Xixi Zhao, Fei Gan, Xiaoyue Chen, Kai Deng, Samantha A. Crowe, Graham A. Hudson, Michael S. Belcher, Matthias Schmidt, Maria C. T. Astolfi, Bo Pang, Minglong Shao, Jing Yin, Sasilada Sirirungruang, Xiaomeng Liang, Michael J. Cronce, Xiulai Chen, Chunjun Zhan, Ramu Kakumanu, Edward E. K. Baidoo, Yan Chen, Christopher J. Petzold, Trent R. Northen, Henrik Scheller & Jay D. Keasling
Department of Plant and Microbial Biology, University of California, Berkeley, Berkeley, CA, USA
Xiaoyue Chen, Michael S. Belcher, Sasilada Sirirungruang & Henrik Scheller
Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Xiaoyue Chen, Michael S. Belcher, Suzanne M. Kosina, Sasilada Sirirungruang, Trent R. Northen & Henrik Scheller
Department of Biomaterials and Biomanufacturing, Sandia National Laboratories, Livermore, CA, USA
Kai Deng
Department of Chemical and Biomolecular Engineering, University of California, Berkeley, Berkeley, CA, USA
Samantha A. Crowe & Jay D. Keasling
Division of Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Matthias Schmidt, Minglong Shao, Jing Yin, Xiaomeng Liang, Xiulai Chen, Chunjun Zhan, Ramu Kakumanu, Edward E. K. Baidoo, Yan Chen, Christopher J. Petzold & Jay D. Keasling
Institute of Applied Microbiology, Aachen Biology and Biotechnology, RWTH Aachen University, Aachen, Germany
Matthias Schmidt
Department of Bioengineering, University of California, Berkeley, Berkeley, CA, USA
Maria C. T. Astolfi, Michael J. Cronce & Jay D. Keasling
Center for Biomolecular Structure, Function and Application, Suranaree University of Technology, Nakhon Ratchasima, Thailand
Sasilada Sirirungruang
John Innes Centre, Norwich Research Park, Norwich, UK
James Reed, Laetitia B. B. Martin, Amr El-Demerdash, Shingo Kikuchi, Rajesh Chandra Misra & Anne Osbourn
Department of Chemistry, Faculty of Sciences, Mansoura University, Mansoura, Egypt
Amr El-Demerdash
Center for Biosustainability, Danish Technical University, Lyngby, Denmark
Jay D. Keasling

Authors

Yuzhong Liu
View author publications
You can also search for this author in PubMed Google Scholar
Xixi Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Fei Gan
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoyue Chen
View author publications
You can also search for this author in PubMed Google Scholar
Kai Deng
View author publications
You can also search for this author in PubMed Google Scholar
Samantha A. Crowe
View author publications
You can also search for this author in PubMed Google Scholar
Graham A. Hudson
View author publications
You can also search for this author in PubMed Google Scholar
Michael S. Belcher
View author publications
You can also search for this author in PubMed Google Scholar
Matthias Schmidt
View author publications
You can also search for this author in PubMed Google Scholar
Maria C. T. Astolfi
View author publications
You can also search for this author in PubMed Google Scholar
Suzanne M. Kosina
View author publications
You can also search for this author in PubMed Google Scholar
Bo Pang
View author publications
You can also search for this author in PubMed Google Scholar
Minglong Shao
View author publications
You can also search for this author in PubMed Google Scholar
Jing Yin
View author publications
You can also search for this author in PubMed Google Scholar
Sasilada Sirirungruang
View author publications
You can also search for this author in PubMed Google Scholar
Anthony T. Iavarone
View author publications
You can also search for this author in PubMed Google Scholar
James Reed
View author publications
You can also search for this author in PubMed Google Scholar
Laetitia B. B. Martin
View author publications
You can also search for this author in PubMed Google Scholar
Amr El-Demerdash
View author publications
You can also search for this author in PubMed Google Scholar
Shingo Kikuchi
View author publications
You can also search for this author in PubMed Google Scholar
Rajesh Chandra Misra
View author publications
You can also search for this author in PubMed Google Scholar
Xiaomeng Liang
View author publications
You can also search for this author in PubMed Google Scholar
Michael J. Cronce
View author publications
You can also search for this author in PubMed Google Scholar
Xiulai Chen
View author publications
You can also search for this author in PubMed Google Scholar
Chunjun Zhan
View author publications
You can also search for this author in PubMed Google Scholar
Ramu Kakumanu
View author publications
You can also search for this author in PubMed Google Scholar
Edward E. K. Baidoo
View author publications
You can also search for this author in PubMed Google Scholar
Yan Chen
View author publications
You can also search for this author in PubMed Google Scholar
Christopher J. Petzold
View author publications
You can also search for this author in PubMed Google Scholar
Trent R. Northen
View author publications
You can also search for this author in PubMed Google Scholar
Anne Osbourn
View author publications
You can also search for this author in PubMed Google Scholar
Henrik Scheller
View author publications
You can also search for this author in PubMed Google Scholar
Jay D. Keasling
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Y.L., F.G., Xiaoyue Chen, H.V.S. and J.D.K. conceived the study. Y.L., X.Z., F.G., Xiaoyue Chen, S.A.C., G.A.H., M.S.B., M. Schmidt, M.C.T.A., B.P., M. Shao, J.Y., M.J.C., Xiulai Chen and C.Z. constructed plasmids and yeast strains and performed microbiological manipulations and extractions. K.D. contributed to the purification and NMR studies of QS-21 from the yeast culture. Y.L., R.K., E.E.K.B., Y.C., C.J.P., A.T.I., X.L., S.M.K. and T.R.N. performed mass spectrometry. Xiaoyue Chen and S.S. performed plant infiltration and transient expression of target proteins. Y.L., Xiaoyue Chen and M.S.B. performed subcellular localization studies using confocal microscopy. Xiaoyue Chen performed bioinformatic analysis. J.R., L.B.B.M., A.E.-D., S. K., R.C.M. and A.O. provided genes, sequences, purified QS-21 and pathway intermediates, methods and advice. All authors contributed to the manuscript.

Corresponding author

Correspondence to Jay D. Keasling.

Ethics declarations

Competing interests

J.D.K. has a financial interest in Amyris, Demetrix, Maple Bio, Lygos, Napigen, Berkeley Yeast, Zero Acre Farms, Ansa Biotechnologies, Apertor Pharmaceuticals, ResVit Bio, and Cyklos Materials. J.R., L.B.B.M. and A.O. are inventors of patents arising from work on QS-21 pathway characterization.

Peer review

Peer review information

Nature thanks Ryan Nett, Kristala Prather and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data figures and tables

Extended Data Fig. 1 Biosynthesis of β-amyrin in engineered yeast and culture condition optimization.

Full-length β-amyrin synthase (BAS) sourced from Saponaria vaccaria, SvBAS, was integrated into JWY601, a strain with upregulated mevalonate pathway. Production of β-amyrin was sampled over the course of three days after induction. MLY-01 harbouring overexpressed copies of ERG20 and ERG1 led to 899 mg l⁻¹ production of β-amyrin. Data are mean ± s.d.; n = 3 biologically independent samples. *P value < 0.05; ** P < 0.01. Student’s two-tailed t-test. More statistical analysis is available in the source data file.

Source Data

Extended Data Fig. 2 Optimization of P450 oxidation efficiency through the expression of a scaffolding MSBP.

a, Phylogenetic tree of Saponaria vaccaria MSBP homologues and related MSBPs (see details in the Supplementary Methods). b, Expression of various MSBP homologues, including 2 newly identified SvMSBPs, led to 2- to 4-fold increase of QA production. Increased of QA production can be attributed to the improved oxidation efficiency. Data are mean ± s.d.; n = 3 biologically independent samples. *P value < 0.05; ** P < 0.01. Student’s two-tailed t-test. More statistical analysis is available in the source data file. Subcellular localization studies show that SvMSBP1 co-localizes with both c, C28 and d, C23 oxidases on the ER membrane of yeast. Images were acquired using a Zeiss LSM 710 confocal microscope (scale bar represents 10 μm; at least three independent experiments were conducted).

Source Data

Extended Data Fig. 3 Functional expression of cytochrome P450s and pathway engineering for QA.

a, Introduction of additional copies of pathway enzymes revealed reaction bottlenecks and the increased production of QA. b, Distribution of the terpenoid products in the engineered yeast strains after overexpression of the cytochrome P450s, as well as their redox partners. c, Calculated titres of target products in these yeast strains with a 65-fold increase of QA production in YL-15 compared to YL-4. Data are mean ± s.d.; n = 3 biologically independent samples.

Source Data

Extended Data Fig. 4 C3 glycosylation studies.

a, Activity of the two CSL homologues CSLM1 and CSLM2 was investigated by comparing the LC–MS ion intensity of the oxidized intermediates oleanolic acid (3), hederagenin (4), gypsogenin (5), QA (6), as well as their glucuronidated counterparts in the presence of CSLM1 or CSLM2 (shown in grey and orange, respectively). CSLM1 was more specific towards the glucuronidation of 6. Although CSLM2 can also glucuronidate less oxidized intermediates such as 3, 4, and 5, it was 3-fold more active towards 6. *P value < 0.05; ** P < 0.01. Student’s two-tailed t-test. More statistical analysis is available in the source data file. b, Distribution of the glycosylated products and their precursors in the engineered yeast strains. Expression of the downstream C3-GalT effectively increased the conversion of 6 by pushing the equilibrium through the consumption of 7, thus leading to the production of 8. Data are mean ± s.d.; n = 3 biologically independent samples.

Source Data

Extended Data Fig. 5 Subcellular localization studies of QS-21 pathway proteins in yeast and tobacco.

C-terminal fluorescent protein fusions of the pathway enzymes are visualized. The first GT CSLM1 was localized in the ER membrane and the downstream enzymes were expressed in the cytosol except for KR2. Images were acquired using a Zeiss LSM 710 confocal microscope (scale bar represents 10 μm).

Extended Data Fig. 6 The C9 acylation of 12 in the presence of exogenously supplemented 2MB acid.

The conversion of 12 to 19 was limited possibly due to the limited availability of intracellular 2MB acid. Increased concentration of 2MB added to the culture medium effectively improved the conversion and the concentration of 2MB supplementation is thus determined to be 500 mg l⁻¹. Data are mean ± s.d.; n = 3 biologically independent samples. *P value < 0.05. Student’s two-tailed t-test. More statistical analysis is available in the source data file.

Source Data

Extended Data Fig. 7 Acylated and glycosylated intermediates towards the complete biosynthesis of QS-21.

The C9-CoA unit can be added to both the C28 trisaccharide 12 and tetrasaccharide 13 with the same acyl transferase ACT2. ACT3, which catalyses the second acylation to form a repeating dimeric C18 structural motif, used both 19 and 15 as substrates, before the terminal arabinofuranose was added to the 5-OH of the C18 group. Bottom, ion-extracted chromatograms of the extracted yeast samples showed the efficient addition of the two C9 units onto the glycosylated molecule substrates (12 and 13) to yield fully acylated 12-C18 (20), and 13-C18 (17). The arabinofuranosylation of 20 and 17 led to the biosynthesis of 21 and QS-21-Xyl. 21, a structural derivative of QS-21 accumulated in yeast due to the competitive pathways between glycosylation and acylation.

Extended Data Fig. 8 Characterization of purified QS-21-Xyl from engineered yeast YL-46 compared to that of the QS-21 standard.

QS-21-Xyl production was scaled up according to the protocol in the Supplementary Information to afford purified QS-21-Xyl. a, EIC spectra of QS-21-Xyl extracted from YL-46 cultures show the co-elution with the standard. b, Isotopic fingerprint patterns of the QS-21-Xyl extracted from YL-46 cultures and purified (top) and the QS-21 standard (bottom). c, Structure of QS-21-Xyl with the characteristic mass fragments are indicated. d, Mirror plot comparison of MS2 spectra of QS-21 extracted from YL-46 cultures (top) and of the standard (bottom). A list of the MS2 fragments and their corresponding intensities can be found in Supplementary Table 7. The fragments observed are consistent with those reported in literature¹⁰. All LC–MS chromatograms were extracted with the theoretical m/z values of the respective compounds of interest.

Extended Data Fig. 9 ¹H NMR spectra of purified QS-21-Xyl from engineered yeast YL-46 compared to that of the QS-21 standard.

a, Full spectral comparison between the QS-21 extracted and purified from engineered yeast YL-46 (top, red) and a QS-21 standard (bottom, black) shows an overall match of all proton peaks. In particular, the peak at chemical shift 9.33 demonstrates the presence of the C23 aldehyde in the QS-21 made in engineered yeast. b, Expanded spectral comparison between the purified QS-21 from YL-46 (top, red) and a QS-21 standard (bottom, black). The well-matched anomeric proton peaks confirm the correct connectivity between the sugar moieties. The absence of apiose proton peaks, highlighted in orange, in the yeast sample further confirms the production of QS-21-Xyl exclusively, owing to the specificity of the GT. Both spectra were recorded in acetonitrile-d₃:D₂O 1:1, 500 MHz.

Extended Data Fig. 10 Reconstitution of the glycosylation pathway of QS-21 with a C3 terminal rhamnose structural analogue.

a, The sequential addition of the C3 branched trisaccharide (GlcA-Gal-Rha) before a linear tetrasaccharide is added stepwise to the C28 carboxylic acid (Fuc-Rha-Xyl) by the functional expression of nucleotide sugar synthases and corresponding GT. The C28 GTs are promiscuous and can also use the Rha trisaccharide derivative. b, The LC–MS peak area of corresponding products produced in yeast after the expression of the indicated enzymes and the necessary nucleotide sugar synthases. Data are mean ± s.d.; n = 3 biologically independent samples.

Source Data

Supplementary information

Supplementary Information

Reporting Summary

Source data

Source Data Figs. 2 and 3 and Source Data Extended Data Figs. 1–4, 6 and 10

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Liu, Y., Zhao, X., Gan, F. et al. Complete biosynthesis of QS-21 in engineered yeast. Nature (2024). https://doi.org/10.1038/s41586-024-07345-9

Download citation

Received: 23 March 2023
Accepted: 22 March 2024
Published: 08 May 2024
DOI: https://doi.org/10.1038/s41586-024-07345-9

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Main

Biosynthesis of quillaic acid

C3 and C28 O-glycosylation

Biosynthesis and addition of the acyl unit

Discussion

Methods

Chemicals

Plasmid construction

Strain construction

In vivo production, extraction and analysis of QS-21 and its precursors

β-amyrin production and GC–MS analysis

Triterpenoid production and LC–MS analysis

Production of glycosylated QA and LC–MS analysis

Production of acylated molecules and QS-21, and LC–QTOF–MS analysis

Extraction of CoA from engineered yeast and LC–MS analysis

Transient expression of fluorescent fusion proteins in tobacco plants

Reporting summary

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Extended data figures and tables

Supplementary information

Source data

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links