Introduction

Rubisco (D-ribulose-1,5-bisphosphate carboxylase/ oxygenase) catalyses net CO2-fixation in all photosynthetic organisms but also reacts with oxygen, leading to a wasteful loss of previously fixed CO2, nitrogen and energy1,2,3. In plants with the C3 photosynthetic pathway, which includes most major staple crops (e.g. rice, wheat and soybean), Rubisco has a relatively slow turnover rate and operates under sub-optimal CO2 concentrations of less than half that required for maximal rates of carboxylation4,5,6. Improving the operating efficiency of Rubisco remains a key strategy to increase productivity in C3 crops7. This is an important goal given the steep trajectory of crop yield improvements required to meet the demands of our rising global population8.

Rubisco is typically evenly distributed throughout the chloroplast stroma in plants. However, most unicellular eukaryotic photosynthetic organisms and some non-vascular land plants have evolved highly efficient CO2-concentrating mechanisms (CCMs) that condense Rubisco into a microcompartment within the chloroplast called the pyrenoid9,10. The CCM functions to enrich the pyrenoid with high concentrations of CO2 to enhance the performance of Rubisco carboxylation and suppress oxygenation11. Introduction of such CCMs into C3 plants is predicted to lead to significant increases in CO2-fixation efficiency and biomass yield8,12,13. Recent work in the model green alga Chlamydomonas reinhardtii (hereafter Chlamydomonas) has shown that the pyrenoid is a liquid-like condensate that undergoes liquid−liquid phase separation when the CCM is active14. Phase separation is facilitated by multivalent interactions between the intrinsically disordered Rubisco-binding protein EPYC1 and the small subunit of Rubisco (SSU)15,16,17,18,19,20, of which residues on the α-helices of the native SSU isoforms are critical for binding to EPYC1. Previously we have shown that Arabidopsis can assemble a functional plant-algal hybrid Rubisco complex, with the native large subunit of Rubisco (LSU) and an SSU from Chlamydomonas, which has similar kinetic properties to wild-type (WT) Arabidopsis Rubisco21. However, expression of the full coding sequence of EPYC1 in Arabidopsis with plant-algal hybrid Rubisco did not result in Rubisco condensation18. Evidence from Chlamydomonas and in vitro studies suggested that an appropriate stoichiometric balance between EPYC1 and Rubisco, and possibly additional regulatory components, might be required for efficient phase separation9,14,17,22.

Here we show that expression of mature EPYC1 and a plant-algal hybrid Rubisco in the model C3 plant Arabidopsis leads to spontaneous condensation of Rubisco into a single phase-separated compartment in the chloroplast. The condensate has liquid-like properties similar to a pyrenoid matrix. Furthermore, the condensate displaces the thylakoid membranes and is enriched in hybrid Rubisco containing the algal Rubisco small subunit required for phase separation. Promisingly, photosynthetic CO2 fixation and growth is not impaired in stable transformants compared to azygous segregants.

Results

Expression of maturated EPYC1 in Arabidopsis

We initially sought to test whether high levels of expression of a mature form of EPYC1 could lead to phase separation in a higher plant chloroplast. EPYC1 was truncated according to the predicted transit peptide cleavage site between residues 26 (V) and 27 (A)18. A dual GFP expression system was developed to achieve high levels of EPYC1 expression and a favourable stoichiometry with Rubisco. This consisted of a binary vector containing two gene expression cassettes, each encoding mature EPYC1 with an Arabidopsis chloroplastic signal peptide and fused to a different version of GFP (turboGFP (tGFP) or enhanced GFP (eGFP)) to avoid possible recombination events (Fig. 1a and Supplementary Fig. 1). The dual GFP construct (EPYC1-dGFP) was transformed into WT plants or the Arabidopsis 1a3b Rubisco mutant complemented with an SSU from Chlamydomonas (S2Cr)21. The resulting transgenic plants (Ep) expressed both EPYC1::eGFP and EPYC1::tGFP, of which the latter was generally more abundant at the protein level (Fig. 1b and Supplementary Table 2). Previously, immunoblots against full-length EPYC1 expressed in S2Cr or WT plants showed additional lower molecular weight bands indicative of proteolytic degradation18. Here, degradation products of the mature EPYC1 were relatively reduced when expressed in S2Cr compared to WT plants.

Fig. 1: Expression of EPYC1 in the Arabidopsis line S2Cr results in condensate formation.
figure 1

a Schematic representation of the dual GFP expression system (EPYC1-dGFP) for EPYC1 truncated at amino acid residue 27 (as indicated by yellow triangles) and fused at the N-terminus to the chloroplastic transit peptide (TP) sequence of the Arabidopsis Rubisco small subunit RbcS1A. EPYC1 was further fused at the C-terminus to either enhanced GFP (eGFP) or turboGFP (tGFP), and driven by the 35S CaMV promoter (35S prom) or CsVMV promoter (CsVMV prom), respectively. For the latter expression cassette, a dual terminator system was used to increase expression53 (Supplementary Fig. 1). b EPYC1 protein levels in Arabidopsis plants as assessed by immunoblot analysis with anti-EPYC1 antibodies. Shown are three T2 S2Cr transgenic plants expressing EPYC1-dGFP (Ep1−3) and azygous segregants (Az1−3), and S2Cr plants transformed with only EPYC1::tGFP (55.4 kDa) or EPYC1::eGFP (63.9 kDa). Also displayed are a T2 EPYC1-dGFP WT transformant (EpWT) and azygous segregant (AzWT). Anti-actin is shown as a loading control underneath. c Expression of EPYC1-dGFP in WT, S2Cr and 1AAtMOD backgrounds. Green and purple signals are GFP and chlorophyll autofluorescence, respectively. Overlapping signals are white. Scale bar = 10 µm for all images. d TEM images of chloroplasts from S2Cr plants with (right) and without (left) expression of EPYC1. A white arrowhead indicates the dense dark grey area of the EPYC1 condensate. The large white structures are starch granules. Scale bars = 0.5 µm. A representative chloroplast from a wild-type plant expressing EPYC1-dGFP is shown for comparison in Extended Data Fig. 3. e Chlorophyll autofluorescence is reduced at the site of EPYC1-dGFP accumulation (white arrow). Scale bar = 5 µm. f SIM microscopy showing EPYC1-dGFP condensates inside the chloroplast (see also Supplementary Movie 1). The magenta puncta show the position of grana stacks. Light magenta puncta indicate grana stacks behind the condensate. Scale bar = 2 µm. g Example comparison of condensate size (left, 2 µm) with that of a pyrenoid in Chlamydomonas (right, representative TEM image highlighting the pyrenoid in green and chloroplast in purple). Scale bar for TEM image = 0.5 µm. Images of EPYC1-dGFP condensates in the S2Cr background are from line Ep3 (c, e, f, g). Confocal images are representative of multiple imaging sessions (c, e). SIM microscopy (f, g) imaging was performed once. The immunoblots shown were derived from the same experiment and gels/blots were processed in parallel. Immunoblots results were representative of six gels/blots (a), TEM images are representative of 26 and 12 images from S2Cr plants with and without EPYC1, respectively (d) and 55 images from Chlamydomonas (g). Source data are provided as a Source Data file.

The fluorescence signal for EPYC1-dGFP in WT plants was distributed evenly throughout the chloroplast (Figs. 1c and 2a). In contrast, EPYC1-dGFP in the hybrid S2Cr plants showed only a single dense chloroplastic signal. Transmission electron microscopy confirmed the presence of a single prominent condensed complex in the chloroplast stroma (Fig. 1d). The condensates were spherical in shape and displaced native chlorophyll autofluorescence (Fig. 1e, f and Supplementary Movie 1), indicating that the thylakoid membrane matrix was excluded from the condensate. In protoplasts of leaf mesophyll cells, a condensate was visible in each chloroplast (Supplementary Fig. 2a), and the average size of the condensates was related to the expression level of EPYC1-dGFP (Fig. 1b and Supplementary Fig. 2b, c). We observed that the average diameter of the condensates was 1.6 ± 0.1 µm (n = 126; 42 from three individual S2Cr transgenic lines), which was comparable with that of the size range measured for the Chlamydomonas pyrenoid (1.4 ± 0.1 µm; n = 55)23 (Fig. 1g and Supplementary Fig. 2c). The estimated volume of the condensates was 2.7 ± 0.2 µm3 (ca. 5% of the chloroplast volume). Variations in condensate volume within individual S2Cr transgenic Ep lines were not correlated with chloroplast volume, suggesting that regulation of condensate formation and size was largely independent of chloroplast morphology, and likely limited by the influence of the surrounding network stiffness (i.e. the stroma, which is densely packed with thylakoids) on the dynamics of droplet ripening24.

Fig. 2: In planta condensates behave like liquid−liquid phase-separated microcompartments.
figure 2

a Fluorescence distribution plots of EPYC1-dGFP across the chloroplast. The intensity of the GFP fluorescence signal over the cross-section of a chloroplast is shown in WT (n = 28), S2Cr (line Ep3, n = 17) and 1AAtMOD (n = 22) backgrounds. Both GFP fluorescence and cross-section values have been normalised to 1 (as indicated by the dashed line), with the highest value in the centre. b Fluorescence recovery after photobleaching (FRAP) assays. Condensates are shown from live (top) and fixed (bottom) leaf tissue from S2Cr transgenic line Ep3 expressing EPYC1-dGFP. Scale bar = 1 µm for both images. c Fluorescence recovery of the bleached area in relation to the non-bleached area of condensates. The mean ± SEM are shown for live (n = 13) and fixed (n = 16) chloroplasts. Source data are provided as a Source Data file.

Condensates were also observed when EPYC1-dGFP was expressed in the Arabidopsis 1a3b Rubisco mutant complemented with a native Arabidopsis SSU modified to contain the two α-helices necessary for pyrenoid formation from the Chlamydomonas SSU (1AAtMOD)15,21 (Fig. 1c). However, condensates in the 1AAtMOD background were less punctate (Fig. 2a), which is consistent with the lower affinity of the modified native SSU for EPYC1 observed previously in yeast two-hybrid experiments18. Furthermore, we confirmed that condensates were formed when either EPYC1::tGFP or EPYC1::eGFP expression cassettes were transformed into S2Cr individually (Supplementary Fig. 3a). Expression of a full-length (i.e. non-truncated) variant of EPYC1-dGFP in Arabidopsis chloroplasts previously did not result in phase separation18, which was attributed to low levels of expression and an incompatible stoichiometry between EPYC1 and Rubisco, and possible proteolytic degradation. The results here indicate that the formation of a condensate is primarily due to the expression of the mature variant of EPYC1 rather than the level of expression per se, and that the required stoichiometry with Rubisco is achievable in planta. Furthermore, the observed reduction in proteolytic degradation may be accounted for by sequestration of EPYC1 within a phase-separated compartment (Fig. 1b), which would be less accessible to large protease complexes25.

Characterisation of the condensate

We next tested if the condensates exhibited internal mixing characteristics consistent with the liquid-like behaviour of pyrenoids (Fig. 2b, c). Fluorescence recovery after photobleaching (FRAP) assays showed that condensates in live leaf cells had similar or increased rates of re-homogenisation of EPYC1-dGFP (i.e. full recovery after 20−40 s) compared to previous in vitro17 and in alga14 reports. The more rapid interchange in condensates compared to Chlamydomonas pyrenoids may be due to the relatively reduced availability of EPYC1 binding sites on Rubisco in the S2Cr plant-algal hybrid Rubisco background19. In contrast, condensates in leaf tissue chemically cross-linked with formaldehyde showed no recovery after photobleaching, which is consistent with that observed for Chlamydomonas pyrenoids14.

To test for the presence of Rubisco, the condensates were extracted from leaf tissue and examined by immunoblot. The condensates could be pelleted by gentle centrifugation16, and isolated condensates were shown to be enriched in EPYC1-dGFP and Rubisco (Fig. 3a). Protein samples were probed with either a polyclonal Rubisco antibody with a greater specificity for higher plant LSU and SSUs, or an antibody raised against the RbcS2 SSU isoform of Chlamydomonas (CrRbcS2). Comparison of the resulting immunoblot data showed enrichment of the Chlamydomonas SSU in the pelleted condensate compared to native SSUs. Subsequent Coomassie staining of denatured, gel-separated extracts revealed that nearly half (49%) of Rubisco in the initial extract contained Chlamydomonas SSU, while 82% of Rubisco in the pelleted condensate contained Chlamydomonas SSU (Fig. 3b). Pelleted condensates coalesced into larger droplets when resuspended, as expected for the liquid-like behaviour of EPYC1-Rubisco interactions (Fig. 3c)17. Immunogold analysis of chloroplast TEM images from S2Cr plants expressing EPYC1-dGFP showed approximately half (54%) of all Rubisco was contained within the condensate (Figs. 3d, e and Supplementary Fig. 3b). Consistent with Coomasie staining, 81% of Rubisco containing Chlamydomonas SSU was located in the condensate. Thus, condensation of Rubisco is strongly associated with Rubisco complexes bearing the Chlamydomonas SSU, which constituted approximately 50% of the Rubisco pool. The latter is consistent with the expected expression levels of plant-algal hybrid Rubisco in S2Cr21. It is currently unclear if Rubisco can form a heterogenous L8S8 complex with different SSU isoforms, or if only a single SSU isoform is favoured during assembly26. Thus, it remains unclear whether the Rubisco pool within the condensate was comprised of a mixture of homogeneous Rubisco complexes, or those containing both Arabidopsis and Chlamydomonas SSUs.

Fig. 3: Condensates contain EPYC1 and plant-algal hybrid Rubisco.
figure 3

a EPYC1 and Rubisco protein levels in whole leaf tissue (input), the supernatant following condensate extraction and centrifugation (supernatant) and the pellet (pellet) as assessed by immunoblot analyses with anti-EPYC1, anti-Rubisco (LSU and SSU shown) or anti-CrRbcS2 antibodies. Samples are shown for WT plants (WT), and S2Cr mutants not expressing EPYC1 (S2Cr) and expressing EPYC1-dGFP (S2Cr_EPYC1). The pellet is 40× more concentrated than the input and supernatant. Molecular weights: LSU, 55 kDa; RbcS1B, RbcS2B and RbcS3B, 14.8 kDa; AtRbcS1A, 14.7 kDa; CrRbcS2 15.5 kDa. The immunoblots shown were derived from the same experiment and gels/blots were processed in parallel. Immunoblots results were representative of four gels/blots for EPYC1, two for Rubisco, and one for CrRbcS2. b Coomassie-stained SDS-PAGE gel showing the composition of the pelleted condensate. Images are representative of three gels. c The condensates in the S2Cr_EPYC1 pellet coalescence to form large liquid droplets. Scale bars = 25 µm. d Representative immunogold labelling of Rubisco in chloroplasts of an S2Cr transgenic line Ep3 expressing EPYC1-dGFP probed with polyclonal anti-Rubisco (left) or anti-CrRbcS2 (right) antibodies (dots are highlighted for the latter). The condensates are marked by a white arrowhead. Large white structures are starch granules. Scale bar = 0.5 µm for both images. e Proportion of gold nanoparticles inside the condensate compared to the chloroplast for each antibody. The mean ± SEM are shown for n = 37 individual chloroplast images. Source data are provided as a Source Data file.

Growth phenotype of EPYC1 transformants

We compared the growth performance of three separate T2 EPYC1-dGFP S2Cr transgenic lines (Ep1−3) screened for the presence of condensates with their matching T2 azygous segregant S2Cr lines where the EPYC1-dGFP transgene was absent (Az1−3). Plants were cultivated under light levels typical for Arabidopsis growth (200 μmol photons m−2 s−1) or under higher light levels (900 μmol photons m−2 s−1) where growth rates are more limited by Rubisco activity (Figs. 4a, b and Supplementary Fig. 4)27. Regardless of the growth conditions, rosette expansion rates or biomass accumulation were not distinguishable between transformants and their segregant controls. Similarly, T2 EPYC1-dGFP WT plants (EpWT) showed no significant differences compared to T2 segregant lines (AzWT). Due to the reduced Rubisco content in the S2Cr background and differences in Rubisco catalytic characteristics, the growth performance of S2Cr lines was slightly decreased compared to WT plants. The observed differences in growth were in line with those reported previously for S2Cr and WT plants in the absence of EPYC121.

Fig. 4: EPYC1-mediated condensation of Rubisco has no negative impact on growth and photosynthesis.
figure 4

a Fresh and dry weights of three T2 EPYC1-dGFP S2Cr transgenic lines (Ep1−3) and an EPYC1-dGFP WT transformant (EpWT) (both in green) with their respective azygous segregants (Az1−3 and AzWT) (in grey). Plants were measured after 32 days of growth under 200 μmol photons m−2 s−1 light. The mean ± SEM are shown for n = 10−26 individual plants for each line. b Rosette expansion of S2Cr and WT lines in (a). c Net CO2 assimilation (A) based on intercellular [CO2] (Ci) under saturating light (1500 μmol photons m−2 s−1). Values show the mean ± SEM of measurements made on individual leaves from individual rosettes (n = 5−8). d Variables derived from gas exchange data include maximum rate of Rubisco carboxylation (Vcmax), maximum electron transport rate (Jmax), stomatal conductance (Gs), mesophyll conductance (Gm) and the net CO2 assimilation rate at ambient concentrations of CO2 normalised to Rubisco (ARubisco). Letters indicate significant difference (p < 0.05) of EpWT lines compared to Ep lines as determined by one-way ANOVA followed by Tukey’s honestly significant difference (HSD) post-hoc tests (a, d). e Algal CCM components required for enhancing photosynthesis. Generating a pyrenoid-like condensate in a plant chloroplast provides a platform for introducing bicarbonate (HCO3) channels/pumps at the chloroplast envelope (e.g. LCIA, shown in red)34 and thylakoid membrane (e.g. BST1−3, shown in orange)35, a lumenal carbonic anhydrase to convert HCO3 to CO2 for release into the surrounding Rubisco condensate (CAH3, shown in blue)36, mechanisms to capture CO2 as HCO3 (LCIB and LCIC, shown in purple)37,38 and traversing thylakoid membranes33. Current models suggest that introduction of a functional biophysical CCM into a C3 plant could lead to productivity gains of up to 60% 29,30,31. Source data are provided as a Source Data file.

Photosynthetic parameters derived from response curves of CO2 assimilation rate to the intercellular CO2 concentration under saturating light were similar between respective Ep and Az lines (Fig. 4c, d and Extended Data Table 1). The presence of condensates did not influence the maximum achievable rates of Rubisco carboxylation (Vcmax). Notably, the CO2 assimilation rates at ambient concentrations of CO2 for Ep and Az lines were comparable to WT lines when normalised for Rubisco content (ARubisco) (Fig. 4d). This suggests that the modest reductions in Rubisco turnover rate (kcatc) and specificity (SC/O) for the plant-algal hybrid Rubisco in S2Cr compared to WT plants have only a mild impact on the efficiency of photosynthetic CO2 assimilation and that the observed differences in growth rates are more associated with the reduced levels of Rubisco in S2Cr plants21. Mesophyll conductance (gm) levels were also reduced in all S2Cr lines compared to WT plants, which is consistent with the impact of reduced Rubisco content on gm observed in transplastomic tobacco28. Condensate formation in the 1AAtMOD background, where catalytic characteristics of the hybrid Rubisco are indistinguishable from that of WT Rubisco, indicates that the SSU can be further engineered to optimise phase separation, and Rubisco content and performance (Fig. 1c)18. Measurements of the maximum electron transport rate (Jmax) and the maximum potential quantum efficiency of Photosystem II (Fv/Fm) were also indistinguishable between transformant and segregant lines (Extended Data Table 1). Thus, the apparent displacement of the thylakoid membrane matrix by the condensates (Fig. 1e) had no obvious impact on the efficiency of the light reactions of photosynthesis.

Discussion

Our findings show that EPYC1 and specific residues on the SSU are sufficient to aggregate Rubisco into a single proto-pyrenoid condensate, which has no apparent negative impact on plant growth. The overall photosynthetic performances of S2Cr transgenic lines appeared unaffected by the condensate, which suggests that conditions inside higher plant chloroplasts are highly compatible with the presence of pyrenoid-type bodies. Our data are arguably the key step for the assembly of a pyrenoid-based CCM into plants that could increase crop yield potentials by up to 60% (Fig. 4e)29,30,31. Previously described approaches for engineering the cyanobacterial carboxysome-based CCM require engineering of the chloroplast-encoded Rubisco large subunit, which is not generally feasible in major grain crops such as wheat and rice32, although expression of the large subunit from the nucleus of maize plants has been recently demonstrated2. Furthermore, transgenic plants expressing cyanobacterial Rubisco require high concentrations of CO2 to grow. Here we demonstrate that condensation of Rubisco is achievable through modification of the nuclear-encoded SSU, which is significantly more amenable to genetic modification. Future work will focus on progressing towards a minimal functional algal CCM by tethering the condensate to a thylakoid membrane for bicarbonate delivery using appropriate transporters33,34,35, as well as introduction of a lumenal carbonic anhydrase and a stromal CO2 salvage system36,37,38.

Methods

Plant material and growth conditions

Arabidopsis (Arabidopsis thaliana, Col-0 background) seeds were sown on compost, stratified for 3 days at 4 °C and grown at 20 °C, ambient CO2 and 70% relative humidity under either 200 or 900 μmol photons m−2 s−1 supplied by cool white LED lights (Percival SE-41AR3cLED, CLF PlantClimatics GmbH, Wertingen, Germany) in 12 h light, 12 h dark. For comparisons of different genotypes, plants were grown from seeds of the same age and storage history, harvested from plants grown in the same environmental conditions.

Construct design and transformation

The coding sequence of EPYC1 was codon optimised for expression in higher plants as in Atkinson et al.18. Mature EPYC1 was cloned directly into the level 0 acceptor vector pAGM1299 of the Plant MoClo system39. To generate fusion proteins, gene expression constructs were assembled into binary level 2 acceptor vectors. The 35S cauliflower mosaic virus (CaMV) promoter and CsVMV (cassava vein mosaic virus) promoter were used to drive expression. Level 2 vectors were transformed into Agrobacterium tumefaciens (AGL1) for stable insertion in Arabidopsis plants by floral dipping40. Homozygous transgenic and azygous lines were identified in the T2 generation using the pFAST-R selection cassette41.

Protein analyses

Soluble protein was extracted from frozen leaf material of 21-day-old plants (sixth and seventh leaf) in protein extraction buffer (50 mM HEPES-KOH pH 7.5 with 17.4% glycerol, 2% Triton X-100 and cOmplete Mini ethylenediaminetetraacetic acid (EDTA)-free Protease Inhibitor Cocktail (Roche, Basel, Switzerland). Samples were heated at 70 °C for 15 min with 1× Bolt LDS sample buffer (ThermoFisher Scientific, UK) and 200 mM dithiothreitol (DTT). Extracts were centrifuged and the supernatants subjected to SDS-PAGE on a 12% (w/v) polyacrylamide gel and transferred to a nitrocellulose membrane. Membranes were probed with rabbit serum raised against wheat Rubisco at 1:10,000 dilution42, the SSU RbcS2 from Chlamydomonas (CrRbcS2) (raised to the C-terminal region of the SSU (KSARDWQPANKRSV) by Eurogentec, Southampton, UK) at 1:1000 dilution, Actin (66009-1-Ig, Proteintech, UK) at 1:1000 dilution and EPYC1 at 1:2000 dilution16, followed by IRDye 800CW goat anti-rabbit IgG (LI-COR Biotechnology, Cambridge, UK) at 1:10,000 dilution, and visualised using the Odyssey CLx imaging system (LI-COR Biotechnology), or by HRP-linked goat anti-rabbit IgG (Abcam) at 1:10,000 dilution, and visualised using Pierce ECL Western Blotting Substrate (Life Technologies).

qRT-PCR analysis

Total RNA was isolated from leaves of 21-day-old plants (as above) using the RNeasy plant mini kit (Qiagen, USA). Isolated RNA was treated with DNase (Qiagen, USA) and reverse transcribed with random primers using the GoScript Reverse Transcription kit (Promega, USA). Reverse transcription quantitative PCR (RT-qPCR) was carried out with Takyon No ROX SYBR mastermix (Eurogentec, Belgium) as in Atkinson et al.12, and run on a LightCycler 480 (Roche, Switzerland). Gene-specific primers were designed to amplify either eGFP (5′-CAGATTACGCCGTGAGCAAG-3′ and 5′-GCTGAACTTGTGGCCGTTTA-3′) or tGFP (5′-GCATCAGGGGTCTTGAAAGC-3′ and 5′-TCCTTCAAAACGGTGGACCT-3′) (IDT, Belgium). Amplification efficiency was determined with a calibration curve for each primer pair. Two reference genes At4g26410 (RHIP1) and At1g13320 (PP2A) were used for normalisation43. The relative expression of eGFP and tGFP was calculated according to Vandesompele et al.44.

Condensate extraction

Soluble protein was extracted as before, then filtered through Miracloth (Merck Millipore, Burlington, Massachusetts, USA), and centrifuged at 500 ×g for 3 min at 4 °C, as in Mackinder et al.16. The pellet was discarded, and the extract centrifuged again for 12 min. The resulting pellet was washed once in protein extraction buffer, then resuspended in a small volume of buffer and centrifuged again for 5 min. Finally, the pellet was resuspended in 25 µl of extraction buffer and used in confocal analysis or SDS-PAGE electrophoresis.

Growth analysis and photosynthetic measurements

Rosette growth rates were quantified using an in-house imaging system45. Maximum quantum yield of photosystem II (PSII) (Fv/Fm) was measured on 32-day-old plants using a Hansatech Handy PEA continuous excitation chlorophyll fluorimeter (Hansatech Instruments Ltd, King’s Lynn, UK)46. Gas exchange and chlorophyll fluorescence were determined using a LI-COR LI-6400 (LI-COR, Lincoln, Nebraska, USA) portable infra-red gas analyser with a 6400-40 leaf chamber on either the sixth or seventh leaf of 35- to 45-day-old non-flowering rosettes grown in large pots under 200 μmol photons m−2 s−1 to generate leaf area sufficient for gas exchange measurements47. The response of A to the intercellular CO2 concentration (Ci) was measured at various CO2 concentrations (50, 100, 150, 200, 250, 300, 350, 400, 600, 800, 1000 and 1200 µmol mol−1) under saturating light (1500 μmol photons m−2 s−1). For all gas exchange experiments, the flow rate was kept at 200 μmol mol−1, leaf temperature was controlled at 25 °C and chamber relative humidity was maintained at ca. 70%. Measurements were performed after net assimilation and stomatal conductance had reached steady state. Gas exchange data were corrected for CO2 diffusion from the measuring chamber as in Bellasio et al.48. To estimate Vcmax, Jmax, ARubisco, the CO2 compensation point (Γ) and mesophyll conductance to CO2 (gm) the A/Ci data were fitted to the C3 photosynthesis model as in Ethier and Livingston49,50 using the catalytic parameters Kcair and affinity for O2 (Ko) values for wild-type Arabidopsis Rubisco at 25 °C and the Rubisco content of WT and S2Cr lines21.

Confocal laser scanning and fluorescence recovery after photobleaching

Leaves were imaged with a Leica TCS SP8 laser scanning confocal microscope (Leica Microsystems, Milton Keynes, UK) as in Atkinson et al.12. Processing of images was done with Leica LAS AF Lite software. Condensate and chloroplast dimensions were measured from confocal images using Fiji (ImageJ, v1.52n)51. Condensate volume was calculated as a sphere, while chloroplast volume was calculated as an ellipsoid, where depth was estimated as 25% of the measured width. Chloroplast volumes varied between 24 and 102 µm3, which was within the expected size range and distribution for Arabidopsis chloroplasts52. Comparative measurements of pyrenoid area were performed using Fiji on TEM cross-section images of WT C. reinhardtii cells (cMJ030) as described in Itakura et al.23. FRAP was carried out on live leaf tissue and tissue that had been fixed by infiltrating with 4% (v/v) formaldehyde for 90 min in a vacuum chamber, then infiltrating three times for 5 min each with phosphate-buffered saline (PBS) at pH 7.4. The SP8 microscope was used with a ×40 water immersion objective and a photomultiplier tube (PMT) detector. The 488 nm laser was set to 2% power for pre- and post-bleach images, and 25% for the bleaching step. Pre-bleach images were captured at 220 ms intervals (6 in total) and post-bleach images were captured at 400 ms intervals (at least 120). For photobleaching the laser was directed to a region with a diameter of 0.5−0.6 µm on one side of the EPYC1 aggregate. Recovery time was calculated by comparing GFP expression to an unbleached region of the same size.

Super-resolution image microscopy

Super-resolution images were acquired using structured illumination microscopy. Samples were prepared on high precision cover-glass (Zeiss, Jena, Germany). 3D SIM images were acquired on an N-SIM (Nikon Instruments, UK) using a ×100 1.49NA lens and refractive index matched immersion oil (Nikon Instruments). Samples were imaged using a Nikon Plan Apo TIRF objective (NA 1.49, oil immersion) and an Andor DU-897X-5254 camera using a 488 nm laser line. Z-step size for z stacks was set to 0.120 µm as required by the manufacturer’s software. For each focal plane, 15 images (5 phases, 3 angles) were captured with the NIS-Elements software. SIM image processing, reconstruction and analysis were carried out using the N-SIM module of the NIS-Element Advanced Research software. Images were checked for artefacts using the SIMcheck software (www.micron.ox.ac.uk/software/SIMCheck.php). Images were reconstructed using NiS Elements software (v4.6, Nikon Instruments) from a z stack comprising of no less than 1 µm of optical sections. In all SIM image reconstructions, the Wiener and Apodization filter parameters were kept constant.

Immunogold labelling and electron microscopy

Leaf samples were taken from 21-day-old S2Cr plants and S2Cr transgenic lines expressing EPYC1-dGFP and fixed with with 4% (v/v) paraformaldehyde, 0.5% (v/v) glutaraldehyde and 0.05 M sodium cacodylate (pH 7.2). Leaf strips (1 mm wide) were vacuum infiltrated with fixative three times for 15 min, then rotated overnight at 4 °C. Samples were rinsed three times with PBS (pH 7.4) then dehydrated sequentially by vacuum infiltrating with 50%, 70%, 80% and 90% ethanol (v/v) for 1 h each, then three times with 100%. Samples were infiltrated with increasing concentrations of LR White Resin (30%, 50%, 70% [w/v]) mixed with ethanol for 1 h each, then 100% resin three times. The resin was polymerised in capsules at 50 °C overnight. Sections (1-μm thick) were cut on a Leica Ultracut ultramicrotome, stained with Toluidine Blue, and viewed in a light microscope to select suitable areas for investigation. Ultrathin sections (60-nm thick) were cut from selected areas and mounted onto plastic-coated copper grids. Grids were blocked with 1% (w/v) BSA in TBSTT (Tris-buffered saline with 0.05% [v/v] Triton X-100 and 0.05% [v/v] Tween 20), incubated overnight with anti-Rubisco antibody in TBSTT at 1:250 dilution or anti-CrRbcS2 antibody at 1:50 dilution, and washed twice each with TBSTT and water. Incubation with 15 nm gold particle-conjugated goat anti-rabbit secondary antibody (Abcam, Cambridge, UK) in TBSTT was carried out for 1 h at 1:200 dilution for Rubisco labelling or 1:10 for CrRbcS2 labelling, before washing as before. Grids were stained in 2% (w/v) uranyl acetate then viewed in a JEOL JEM-1400 Plus TEM (JEOL, Peabody, Massachusetts, USA). Images were collected on a GATAN OneView camera (GATAN, Pleasanton, California, USA).

Statistical analyses

Results were subjected to one-way analysis of variance (ANOVA) to determine the significance of the difference between sample groups. When one-way ANOVA was performed, Tukey’s honestly significant difference (HSD) post-hoc tests were conducted to determine the differences between the individual treatments (IBM SPSS Statistics, Ver. 26.0).

Reporting summary

Further information on experimental design is available in the Nature Research Reporting Summary linked to this paper.