A cybergenetic framework for engineering intein-mediated integral feedback control systems

Anastassov, Stanislav; Filo, Maurice; Chang, Ching-Hsiang; Khammash, Mustafa

doi:10.1038/s41467-023-36863-9

Download PDF

Article
Open access
Published: 11 March 2023

A cybergenetic framework for engineering intein-mediated integral feedback control systems

Nature Communications volume 14, Article number: 1337 (2023) Cite this article

3798 Accesses
10 Citations
15 Altmetric
Metrics details

Subjects

Abstract

The ability of biological systems to tightly regulate targeted variables, despite external and internal disturbances, is known as Robust Perfect Adaptation (RPA). Achieved frequently through biomolecular integral feedback controllers at the cellular level, RPA has important implications for biotechnology and its various applications. In this study, we identify inteins as a versatile class of genetic components suitable for implementing these controllers and present a systematic approach for their design. We develop a theoretical foundation for screening intein-based RPA-achieving controllers and a simplified approach for modeling them. We then genetically engineer and test intein-based controllers using commonly used transcription factors in mammalian cells and demonstrate their exceptional adaptation properties over a wide dynamic range. The small size, flexibility, and applicability of inteins across life forms allow us to create a diversity of genetic RPA-achieving integral feedback control systems that can be used in various applications, including metabolic engineering and cell-based therapy.

Development of multiplexed orthogonal base editor (MOBE) systems

Article 21 May 2024

CRISPRi-based circuits to control gene expression in plants

Article 20 May 2024

Single molecule delivery into living cells

Article Open access 23 May 2024

Introduction

One of the essential features of living systems is their ability to maintain a robust behavior despite disturbances coming from their external uncertain and noisy environments. This feature is referred to as homeostasis, which is typically achieved via endogenous feedback regulatory mechanisms shaped by billions of years of evolution. Pathological diseases are often linked to loss of homeostasis^1,2. As a result, restoring homeostasis has become a major focus of research in the emerging field of cybergenetics³, which combines control theory and synthetic biology. In particular, the rational design and implementation of biomolecular feedback controllers^{4,5,6,7,8,9,10,11,12,13} offers promising candidates that may accompany or even replace such failed mechanisms^14,15,16.

A notion, which is similar to homeostasis, but more stringent, is Robust Perfect Adaptation (RPA) (see e.g.^17,18) which is the biological analogue of the well-known notion of robust steady-state tracking in control theory. A controller succeeds in achieving RPA if it drives the steady state of a variable of interest to a prescribed level despite varying initial conditions, uncertainties and/or constant disturbances. Motivated by the internal model principle¹⁹, which establishes that the designed controller must implement an integral feedback component to be able to achieve RPA, the antithetic integral feedback (AIF) controller²⁰ was brought forward. The basic antithetic integral feedback motif is depicted in Fig. 1(a). It is comprised of two species Z₁ and Z₂ whose end goal is to robustly steer the concentration of the output species of interest X_L to a prescribed level, referred to as the setpoint, in spite of disturbances and uncertainties in the regulated network — represented here as the various reactions occurring between species X₁ through X_L. RPA is achieved via four controller reaction channels. First, Z₁ is constitutively produced at a rate μ to encode for the setpoint. Second, Z₂ is catalytically produced from the output species X_L at a rate θx_L to sense its concentration. The third reaction is the annihilation or sequestration reaction between Z₁ and Z₂ occurring at a rate ηz₁z₂. The sequestration reaction encodes a comparison operation and produces an inactive complex that has no function and thus its concentration need not be mathematically tracked. Finally, the feedback control action (actuation) is encrypted as a production reaction of the species X₁, which acts as the input of the regulated network, at a rate kz₁ proportional to the concentration of controller species Z₁. The underlying Ordinary Differential Equations (ODEs) governing the dynamics of the concentrations of Z₁ and Z₂ are shown in Fig. 1(a). Throughout the paper, bold uppercase letters (e.g. Z₁) denote the names of biochemical species, while their corresponding lowercase letters (e.g. z₁) denote their concentrations. By looking at the dynamics of z₁−z₂, it is straightforward to reveal the integral control action where the temporal error μ/θ−x_L(t) at time t, or deviation of the output concentration from the setpoint μ/θ, is mathematically integrated. This establishes that, as long as the closed-loop system is stable (i.e. asymptotically converges to some fixed point), the output concentration x_L will converge to the prescribed setpoint μ/θ which is independent of the regulated network and initial conditions, and thus achieves RPA. While RPA is a steady-state property, the transient dynamic properties and tuning of the antithetic integral controller are also extensively studied as well^21,22,23.

In vivo antithetic integral feedback controllers have been previously built in both bacteria^4,6 and mammalian cells⁷, where RPA is experimentally demonstrated. A quasi-integral controller using a slight variant of the antithetic controller was also demonstrated in Escherichia coli⁹. More recently, a protein based antithetic controller in mammalian cells was also recently proposed²⁴. In⁶, the controller is implemented in E. coli using sigma/anti-sigma factors as the basic parts that realize the sequestration reaction — the heart of the antithetic integral motif. In⁷, the controller is implemented in HEK293T cells using sense/anti-sense mRNAs. The sequestration reactions in both designs are achieved by the heterodimerization of Z₁ and Z₂. In the case of sigma/anti-sigma factors, the heterodimerization reaction is reversible, a fact that may lead to reduced performance in certain operating regimes. Moreover, when present in high quantities, xenogenic sigma factors may be toxic due to their inherent property of sequestering RNA polymerases from housekeeping sigma factors²⁵. Finally, sigma factors are specific to the transcription mechanism of bacteria and cannot be easily transferred to other domains of life, which is why the sense/anti-sense RNA controller⁷ was developed for mammalian cells. At the same time, sense/anti-sense hybridizations produce double-stranded RNAs that, in high abundance, may initiate global translational repression²⁶, leading to a reduction in the effective dynamic range of operation. These constraints give rise to the need for genetic parts that are nontoxic, are transferable between different forms of life, and enjoy wider dynamic ranges of operation. Nevertheless, the suitable choice of genetic parts is a difficult task because they need to adhere to the strict design rules of the basic antithetic integral feedback motif. In this paper, we show that split inteins serve as the ideal candidate parts that are capable of doing both: adhering to the design rules and avoiding the aforementioned disadvantages. We build on the universality result of the antithetic motif⁶ to examine more complex integral controller designs for RPA as demonstrated in Fig. 1(b). While more complex mathematical topologies do not have to be necessarily more difficult to implement, they certainly broaden the biological design space. This expansion, in fact, becomes necessary due to the biological implementation constraints.

An intein is a protein segment that is capable of autocatalytically excising itself from the protein while re-ligating the remaining segments, called exteins, via forming new peptide bonds²⁷ (see Supplementary Information, Fig. 22). Inteins are universal as they can be naturally found in all domains of life spanning eukaryotes, bacteria, archaea and viruses^28,29. Split inteins - a subset class of inteins - are, as the name suggests, inteins split into two halves commonly referred to as Int^N and Int^C. Split inteins have been widely studied and characterized due to their extensive usage in various life science disciplines and their ability to perform fast, reliable and irreversible post-translational modifications^{30,31,32,33,34}. Small split inteins like Gp41-1^C are comprised of around 40 amino acids³⁵ and are well within the size range of synthetic protein linkers³⁶. It is then possible to use them as “functional” linkers to connect different protein segments. The split inteins, when active, are capable of heterodimerizing and performing protein splicing reactions on their own where they irreversibly break and form new peptide bonds in a strict stoichiometric ratio of one to one. We shall refer to these reactions as “intein-splicing reactions” where molecules containing active Int^C segments react with molecules containing active Int^N segments to undergo a particular splicing mechanism. When two molecules undergo an intein-splicing reaction, the Int^N and Int^C segments are permanently inactivated as they are unable to perform further splicing reactions due to the alteration of their respective biochemical structures. However the products of such a reaction may still have other functions such as activating or repressing gene expression due to the presence of other protein domains that may not be affected by the splicing reaction. Split inteins can be exploited to exchange, cleave or ligate amino acid sequences (see Fig. 1(c)). These features serve as the basis of realizing the sequestration reaction of the antithetic integral motif. A selection of antithetic “sequestrations” based on functional conversion, spatial separation, inactivation, degradation and intein removal are shown in Fig. 1(c) to emphasize the modularity and the vast flexibility of intein-based designs. Nonetheless, this high design flexibility comes with a price: simple intein-based implementations may lead to complicated network topologies very quickly as illustrated in Fig. 1(d). Here we exploit a time-scale separation argument to establish a structural model reduction result which provides an easy-to-use recipe to simplify the underlying models. This facilitates the mathematical analysis of the otherwise complicated controller network, and allows us to uncover the underlying controller structure which is not necessarily limited to integral control only.

Integral control is the fundamental building block in most controllers spanning a broad range of industrial applications in the fields of electrical, mechanical and chemical engineering; however, it is rarely used alone. In fact, Integral (I) controllers are typically augmented with Proportional (P) and/or Derivative (D) controllers to obtain PI/PID controllers that offer more flexibility in enhancing the dynamic performance while maintaining the RPA property. Recently, more advanced molecular controllers such as PI/PID controllers found their way to molecular biology^{7,37,38,39,40,41,42}. Ideally, pure proportional control is achieved via instantaneous negative feedback from the output X_L to the input species X₁ and it is shown that it is not only capable of enhancing the transient dynamic performance, but also reducing cell-to-cell variability^37,40. The first biomolecular (filtered) PI controller was genetically engineered in⁷ where additional genetic parts are appended to the antithetic integral motif to realize the proportional component. Here, we establish that a filtered PI controller can be built without introducing additional genetic parts by harnessing the sequestration products of the split inteins³⁸.

Besides proposing intein-based implementation strategies for RPA-achieving controllers and laying down the necessary theoretical foundation, we have also selected, built and tested five structurally different controller topologies for experimental verification of RPA. All circuits were tested in HEK293T cells and range from pure I to filtered PI controllers based on the functional conversion, inactivation and intein removal strategies illustrated in Fig. 1(c).

Split inteins offer a high degree of flexibility in realizing biomolecular integral feedback controllers. This flexibility is mainly a consequence of their compatibility with essentially any transcription factor (TF). In fact, the particular structure of the expressed transcription factor including the choices of the Activation Domain (AD), Dimerization Domain (DD), DNA-Binding Domain (DBD) and insertion position of the split intein (Int^C) open the possibilities to a broad design space of controllers. Specifically, dimeric transcription factors, such as tetracycline transactivator (tTA), give rise to multiple homo- and hetero-dimerization reactions as well as multiple sequestration reactions and thus make the controller network more complex to mathematically analyze. To this end, we develop a theoretical framework tailored to mathematically analyze and simplify complex intein-based controller networks that generalize the basic antithetic integral motif which has no dimerization reactions and a single sequestration reaction. We refer the reader to Supplementary Table 1 for a list of all the abbreviations used in this paper.

Results

Achieving robust perfect adaptation using inteins

In this section, we establish a theoretical framework embodied as a set of simple rules that allows us to design biomolecular controllers enabling RPA using split inteins. Consider the general closed-loop network depicted in Fig. 2 where an arbitrary network comprised of L species ${{{{{\bf{X}}}}}}:\!\!\!=\left\{{{{{{{\bf{X}}}}}}}_{{{{{{\bf{1}}}}}}},\cdots \,,{{{{{{\bf{X}}}}}}}_{{{{{{\bf{L}}}}}}}\right\}$, referred to as the regulated network, is in a feedback interconnection with the controller network comprised of M species ${{{{{\bf{Z}}}}}}:\!\!\!=\left\{{{{{{{\bf{Z}}}}}}}_{{{{{{\bf{1}}}}}}},\cdots \,,{{{{{{\bf{Z}}}}}}}_{{{{{{\bf{M}}}}}}}\right\}$. The overall objective of the feedback controller network is to achieve RPA of the regulated output species X_L by automatically actuating (producing and/or degrading) the input species X₁. Each controller species Z_i, for i=1,2,⋯,M, belongs to one of three classes: ${{{{{{{\mathcal{C}}}}}}}}$-class, ${{{{{{{\mathcal{N}}}}}}}}$-class and ${{{{{{{\mathcal{S}}}}}}}}$-class. These classes separate the controller network into three subnetworks as depicted in Fig. 2. The classification of the controller species and the allowed reactions follow the rules that are listed in Fig. 2. In particular, the setpoint and sensing of the regulated output species X_L are encoded in the constitutive and/or catalytic production reactions following Reaction Rule 1 given by

$$\mathop{{{{{{{{\bf{Setpoint/Sensing}}}}}}}}}\limits_{{{{{{{{\bf{Rxns}}}}}}}}}:\,\,\,\varnothing \mathop{---\longrightarrow }\limits^{{\mu }_{i}+{\theta }_{i}{x}_{L}}{{{{{{{{\bf{Z}}}}}}}}}_{{{{{{{{\boldsymbol{i}}}}}}}}}\quad (i=1,\cdots,\,M),$$

(1)

with at least one μ_i and one θ_i strictly positive. The following theorem provides a guarantee for RPA of the regulated output species when controlled with intein-based controllers.

**Fig. 2: A theoretical framework for RPA-achieving intein-based integral controllers.**

Theorem 1

Consider the closed-loop network depicted in Fig. 2 where the controller network respects the set of listed rules. Let ${q}_{i}^{+}$ and ${q}_{i}^{-}$ respectively denote the number of active Int^C and Int^N segments present in controller species Z_i for i=1, ⋯, M. Define the vector $q:\!\!\!={\left[\begin{array}{ccc}{q}_{1}^{+}-{q}_{1}^{-}&\cdots &{q}_{M}^{+}-{q}_{M}^{-}\end{array}\right]}^{T}$. Then, if the closed-loop network is stable, the controller network ensures RPA of X_L with

$$\mathop{\lim }\limits_{t\to \infty }{x}_{L}(t)=-\frac{{q}^{T}\mu }{{q}^{T}\theta } \, > \,0,$$

(2)

where $\mu :\!\!\!={\left[\begin{array}{ccc}{\mu }_{1}&\cdots &{\mu }_{M}\end{array}\right]}^{T}$ and $\theta :\!\!\!={\left[\begin{array}{ccc}{\theta }_{1}&\cdots &{\theta }_{M}\end{array}\right]}^{T}$. Furthermore, the integrated variable is given by z_I := q^Tz which reveals the underlying integral controller given by

$${z}_{I}(t)={z}_{I}(0)+\int\nolimits_{0}^{t}\left[{\mu }^{{{{{{{{\rm{eff}}}}}}}}}-{\theta }^{{{{{{{{\rm{eff}}}}}}}}}{x}_{L}(\tau )\right]d\tau,$$

(3)

with μ^eff := q^Tμ and θ^eff := −q^Tθ.

The proof of Theorem 1 can be found in Supplementary Information Section 2. Before we proceed, we provide two remarks.

Remark 1.1

Theorem 1 is a special case of a more general theorem (see Supplementary Theorem 1) which can be also applied to any non-intein-based biomolecular controller with similar structure as demonstrated in the example of Box 3. This more general theorem interprets q⁺ and q⁻ as the number of positive and negative charges (where, here, the number of inteins is an instantiation of the charge analogy) and extends the RPA sufficiency result in⁶ to the case of multiple sensing and setpoint reactions. In fact, if Z₁ is the only controller species that is constitutively produced and Z₂ is the only controller species that is catalytically produced by the regulated output species X_L, then μ₁, θ₂ > 0, μ_i = θ_j = 0 for (i,j) ≠ (1,2) and $q={\left[\begin{array}{ccccc}1&-1&&\star &\end{array}\right]}^{T}$ which yields $\mathop{\lim }\limits_{t\to \infty }{x}_{L}(t)={\mu }_{1}/{\theta }_{2}$ — the RPA result in⁶.

Remark 1.2

Although the result presented in Theorem 1 is for the deterministic setting, it also holds in the stochastic setting. It is shown in Supplementary Information Section 2 that, under the assumption that the closed-loop network is ergodic — a stochastic notion of stability, the steady-state (stationary) expectation of the regulated output is also given by ${{\mathbb{E}}}_{\pi }[{X}_{L}]=-\frac{{q}^{T}\mu }{{q}^{T}\theta }$.

Remark 1.3

The catalytic sensing terms θ_ix_L for i=1,⋯,M shown in Fig. 2 and (1) do not necessarily have to be linear in the deterministic setting. In fact, these terms can be replaced by more general nonlinear functions f_i(x_L) such as Hill-type functions that are allowed to be monotonically decreasing to account for repressive sensing. These sensing mechanisms will preserve RPA in the deterministic setting, but the setpoint expression will be different from (2).

Implementations using various transcription factors

So far, we have described, theoretically, how split inteins can be exploited to build a broad class of biomolecular integral controllers capable of achieving RPA. Here, we demonstrate how commonly used transcription factors can be converted into controller species that respect the rules of Fig. 2 and, as a result, enables RPA according to Theorem 1. In particular, we use three common DNA binding domains: zinc finger (ZF)^43,44, tetracycline repressor (TetR)⁴⁵ and Gal4 to construct four structurally different biomolecular controllers (Fig. 3), that serve as instantiations of the class of controllers described in Theorem 1. We also provide experimental proof (Fig. 4) that these intein-based controllers are indeed capable of achieving RPA and thus rejecting perturbations over a wide dynamic range.

**Fig. 3: Intein-based implementation of RPA-achieving integral controllers using ZF, TetR and Gal4 as DBDs.**

**Fig. 4: Experimental demonstration of RPA.**

There are a few considerations that have to be taken into account to successfully build intein-based integral controllers.

These considerations, given in Box 1, should be experimentally verified for any intein-based controller to function properly. In particular, to minimize the impairment of the protein of interest as per Building Consideration 1, we use the smaller Int^C of the fast reacting intein Gp41-1⁴⁶ for all our modified activators.

Next, we provide a detailed description of the four different controller circuits depicted in Fig. 3. We start with the ZF controller which has the simplest topological structure. It is obtained by using ZF as the DBD, and introducing the split intein in the floppy linker between the AD VP64 and the DBD ZF. This TF, denoted by Z₁, is constitutively produced at a rate μ₁ and is capable of actuating the regulated network of interest by activating the expression of the input X₁. The regulated output X_L produces the second split intein Int^N, denoted by Z₂, at a rate θx_L that is proportional to the regulated output concentration. The intein-splicing reaction between Z₁ and Z₂ occurs at a rate η and leads to a cleavage within the TF, which separates the AD from the DBD. The resulting free floating AD is not tracked due to its inability to initiate transcription on its own. The other spliced product is the DBD, denoted by Z₃, which competes with Z₁ for the promoter binding sites, and thus exerts a repressive actuation.

The second controller design, labeled as intra dimerization domain (intraDD), is based on TetR whose goal is to illustrate that it is possible to build intein-based antithetic integral controllers without functional spliced products. This controller is obtained by introducing the split intein within the DD of TetR without disrupting it. The transcription factor Z₁ is generated by fusing VPR to the modified TetR. The dimer Z₄ comprised of two molecules of Z₁ acts as the actuating controller species. Unlike the previous controller, Int^N, denoted by Z₂, can now undergo an intein-splicing reaction with either the monomer Z₁ or the dimer Z₄. The intein-splicing reaction with Z₁ leads to the cleavage of the protein sequence next to the Int^C, which is acting as a linker holding the two halves of the split DD together. This results in two products: the AD VPR with part of the disabled DD, denoted by Z₃, and a monomeric TetR with the rest of the disabled DD (not tracked in Fig. 3 due to its inactivity). Neither of them are able to further interact with the controller or the regulated network. Similarly, the intein-splicing reaction with the dimer Z₄ leads to the cleavage of one of the monomers within the DD. This results in the immediate falling apart of the dimer into one Z₁ and one Z₃.

The third controller design is obtained by inserting an Int^C segment between TetR and the AD. The expressed TF, denoted by Z₁, has to dimerize to form Z₅ to be able to actuate the regulated network. Int^N, denoted by Z₂, can undergo an intein-splicing reaction with either the monomer Z₁ or the dimer Z₅. The intein-splicing reaction between Z₁ and Z₂ leads to the separation of the AD from the remaining DBD and DD to produce Z₃ and a free floating AD which is not tracked anymore due to its inactivity. The spliced product Z₃ can still heterodimerize with Z₁ to yield a TetR dimer with only one AD, denoted by Z₆, which is sufficient to bind to the promoter and initiate transcription. Note that Z₆ can be also obtained via the intein-splicing reaction between the fully intact dimer Z₅ and Z₂. Furthermore, since Z₆ still has one functional Int^C segment, it is able to perform a second intein-splicing reaction with Z₂, which removes the last AD by cleavage and hence forms a tetR dimer, denoted by Z₄. Note that, this dimer can be also obtained via the homodimerization of Z₃. The dimer Z₄ can recognize and bind to the promoter, but can not initiate transcription unlike the other two dimers Z₅ and Z₆. It therefore, competes with Z₅ and Z₆ for the promoter binding sites and, as a result, acts as a repressor.

The last controller design of Fig. 3 is based on the yeast derived DBD Gal4 and is thus labeled as the Gal4 controller. Here, we introduced an Int^C segement between the DBD and the DD. Similar to TetR, Gal4 needs to be a dimer (Z₅) in order to bind to the promoter and actuate the regulated network. Once again, Int^N, denote by Z₂, can undergo an intein-splicing reaction with either Z₅ or Z₁. The intein-splicing reaction with Z₁ leads to the separation of the DBD from the remaining DD and the AD to produce Z₃. As already mentioned, Gal4 cannot bind to the promoter as a monomer, and so we do not track this species due to its inactivity. Furthermore, the intein-splicing reaction with Z₅ leads to the removal of one DBD from the dimer through cleavage, which renders the entire complex unable of binding to the DNA. This truncated dimer, denoted by Z₆, can perform a second intein splicing reaction with Z₂ to remove the second DBD and form a new dimer denoted by Z₄ which is also incapable of acting directly on the regulated network. However, it is able to disassociate into its monomers, Z₃, which are able to reversibly sequester Z₁ through a heterodimerization reaction yielding the non-functional dimer Z₆.

It is fairly straight forward to verify that all the reaction rules listed in Fig. 2 are respected by all of the proposed four controllers. As a result, by applying Theorem 1, we conclude that all four proposed controllers achieve RPA (as long as the closed-loop network is stable) such that the concentration of the regulated output x_L converges to μ₁/θ₂ at steady state. Next, we provide an experimental verification to back up our developed theory. To do so, all of the four proposed controller circuits were first tested for the three Building Considerations. In fact, to test for Building Consideration 1, we expressed all of the modified activators constitutively and compared their ability to transcribe a fluorophore. We observed a drop in activity for all modified ZF, tetR and GAL4 based TFs ranging from significant to minor (see Supplementary Information, Fig. 24). To this end, strong impairments were partially compensated by using stronger activation domains like VPR. Intein insertions within floppy linkers were relatively straight forward; however, insertions within functional protein domains, as was the case for the intraDD-Circuit (see Fig. 3), required some screening (see Supplementary Information, Fig. 23). Next we tested for Building Considerations 2 and 3 by constitutively expressing the modified activator carrying Int^C together with the second split intein (Int^N) and observed the levels of a fluorescent reporter. If the Building Considerations are satisfied the fluorescent output will decrease with increased levels of Int^N. We were able to reach background levels for every controller type upon a high expression of the second split intein (see Supplementary Information, Fig. 25). This indicates that the intein-splicing reaction is indeed happening as expected.

After making sure that all Building Considerations were fulfilled, we proceeded with characterizing the controllers in the closed-loop setting. We opted for a simple two-plasmid, closed-loop system for testing the controller performance as demonstrated in Fig. 4(a). This allowed us to focus on the controller behavior without having to worry about potential cross-talks⁴⁷, resource burden⁴⁸ or saturation⁴⁹ which might appear in larger circuits. The first plasmid encodes for the modified transcription factor Z₁ and the other one encodes for either Int^C for the open-loop circuit or Int^N for the closed-loop circuit. In both cases, the split intein was encoded with a P2A-T2A linker and the fluorophore mVenus. Note that the P2A-T2A linker leads to the translation of two separate proteins (Int^N and mVenus) in a fixed ratio from a common mRNA due to ribosome skipping⁵⁰. The fluorophore is used as a proxy for its own mRNA, which is the regulated species expected to exhibit RPA. The advantage of this setup is that changing the copy numbers of the two transfected plasmids can be conveniently used to characterize the controllers. More precisely, μ₁ and hence the setpoint can be easily tuned by altering the amount of the plasmid encoding for the activator. Furthermore, the translation rate θ₂ of the mRNA is independent from the plasmid copy numbers in the cell. Perturbing the copy numbers of plasmid 2 only leads to an increase in the transcription rate of the output mRNA and should be rejected if the integral controller works as expected. Hence, to experimentally test the four controllers for RPA, we perturb the regulated network by increasing the copy number of plasmid 2 as it does not affect the setpoint parameters μ₁ and θ₂. The experimental results, depicted in Fig. 4, detail the steady-state measurements of the reporter, serving as a proxy for the regulated output (mRNA) for all four controllers. The measurements were taken for all the circuits operating in both open and closed loop, with and without disturbance. All four circuits were able to reject the disturbance over a wide titration of plasmid 1, which defines the output setpoint through tuning μ₁. The best performance was observed with the ZF circuit, which succeeded in rejecting the disturbances over the entire range from the detection limit to the onset of burden (see Supplementary Information, Fig. 27).

We have used so far only the split inteins of Gp41-1, and we have successfully shown the implementation of intein-based RPA-achieving integral controllers using different TFs. Many split intein pairs with different properties have been described in literature with some of them being orthogonal to each other⁵¹. To demonstrate that intein-based integral controllers are not limited to Gp41-1, and that it is possible to have multiple orthogonal intein-based integral controllers within the same cell, we have modified our ZF controller accordingly. In particular, we exchanged the Gp41-1, for NrdJ-1 Int^C, one of the many orthogonal split inteins characterized by Pinto et al.⁵¹ and closed the loop with the corresponding Int^N of NrdJ-1. However, instead of using Int^C for the open-loop circuit, we used the Int^N that corresponds to Gp41-1. Finally, we performed the experiment with the same plasmid ratios, which was deemed suitable for the previous Gp41-1 ZF experiment. The disturbance rejection was only visible for the compatible intein pair, and the dynamic range was similar to the experiment performed with the Gp41-1 containing ZF (see Supplementary Information, Fig. 26).

Model reduction

The broad class of intein-based, RPA-achieving controllers introduced in Theorem 1 gives rise to a high degree of design flexibility and thus allows topologies that may possibly involve a large number of controller species Z_i. Furthermore, these species are allowed to react among each other via multiple binding, conversion and intein-splicing reactions according to the Reaction Rules listed in Fig. 2. This possible large number of control species and reactions may lead to complex mathematical models of high dimensions whose dynamics are not easy to understand. In this section, we consider a subset of the general RPA-achieving controllers of Theorem 1 to provide a model reduction result that makes the otherwise complex dynamics more transparent and easy to analyze. Our model reduction result is structural in the sense that its validity is independent of the particular values of the rate parameters.

Consider the Species and Reaction Rules of Fig. 2 and replace Reaction Rule 8 with five additional rules given in Box 2.

Note that Rule 9 makes Rule 2 stricter in the sense that the intein-splicing reactions are not optional anymore so that any two active intein pairs have a strictly positive propensity to undergo an intein-splicing reaction. Rule 12 takes into account the more realistic situation where δ > 0 which implies that RPA is not exact anymore; however, robust adaptation remains practically satisfactory as long as the dilution rate is small compared to the other rates in the network (see^6,52). Finally, Rule 13 relates the intein-splicing rate to the number of participating active inteins. The following theorem provides a recipe for model reduction of (possibly complex) intein-based controllers. The model reduction result is valid in both the ideal (δ = 0) and non-ideal (δ > 0) settings and for any rate-parameter regimes.

Theorem 2

Consider the closed-loop network depicted in Fig. 2 where the controller network respects Species Rules 1-3 and Reaction Rules 1-7,9-13. Let ${q}_{i}^{+}$ and ${q}_{i}^{-}$ respectively denote the number of active Int^C and Int^N segments present in controller species Z_i for i = 1, ⋯ , M. Let ${q}_{i}^{0}$ denote the number of monomers in species Z_i with no active inteins, and construct the three vectors ${q}^{k}:\!\!\!={\left[\begin{array}{ccc}{q}_{1}^{k}&\cdots &{q}_{M}^{k}\end{array}\right]}^{T}$, for k ∈ { + , − , 0}. Furthermore, let (S_B, S_C) and (λ_B(z), λ_C(z)) respectively denote the stoichiometry matrices and total propensity functions associated with the reversible binding and conversion reactions that are assumed to be fast enough. If the following conditions are satisfied:

S_B is full-column rank.
The columns of S_C are linearly independent from those of S_B.
p + rank(S_C) = M − 3,

where p is the number of reversible binding reactions, then all controller networks respecting the structure described in Fig. 2 reduce to the simple motif, depicted in Fig. 5, which is governed by only three effective species Z⁺, Z⁻ and Z⁰ whose concentrations are linear combinations of the controller species Z_i for i = 1, ⋯ , M.

**Fig. 5: A model reduction recipe for Intein-based controllers.**

The proof of Theorem 2 can be found in Supplementary Information Section 2. Before we proceed, we provide five remarks.

Remark 2.1

Once again, Theorem 2 is a special case of a more general theorem (see Supplementary Theorem 2) which can be also applied to any non-intein-based biomolecular controller with similar structure as demonstrated in Box 3. The proof essentially invokes the deficiency-zero theorem⁵³ and singular perturbation theory⁵⁴.

Remark 2.2

The dynamics of the reduced model are depicted in the box of Fig. 5, in general, as a set of Differential Algebraic Equations (DAEs) comprised of only three differential equations (describing the basic effective motif) and a set of M − 3 algebraic equations that should be solved for $\tilde{z}\ge 0$. In certain cases, these algebraic equations can be explicitly solved and thus further reducing the dynamics to a set of three ODEs (see Fig. 6). Otherwise, the algebraic equations can be left in their implicit form.

**Fig. 6: Reduced models for the ZF, intraDD, TetR and Gal4 controllers.**

Remark 2.3

Unlike the effective species Z⁺ and Z⁻, Z⁰ has an extra production term, in general, that is equal to ${\delta }_{0}{\left[{\mathbb{1}}({q}^{+}+{q}^{-})\circ {q}^{0}\right]}^{T}\psi ({z}^{{{{{{{{\rm{tot}}}}}}}}})$, where ${\mathbb{1}}(.)$ is the indicator function, ∘ is the Hadamard (elementwise) product and ψ(z^tot) is given implicitly in Fig. 5. This production term is zero in two cases: (1) if there are no degradation reactions (δ₀ = 0), or (2) if no controller species simultaneously hold both an active intein and a monomer with no active inteins (${\mathbb{1}}({q}^{+}+{q}^{-})\circ {q}^{0}=0$). Intuitively, this extra production term can be explained as follows. Controller species holding both an active intein and a monomer with no active inteins belong to either the ${{{{{{{\mathcal{C}}}}}}}}$- or ${{{{{{{\mathcal{N}}}}}}}}$-class (Species Rules), and are thus not allowed to degrade (Reaction Rules). Nevertheless, these species are still represented within Z⁰ since they hold monomers with no active inteins. As a result, the extra production term compensates for those species that do not degrade yet are represented by Z⁰ which degrades at a rate δ₀.

Remark 2.4

Observe that no matter what the original controller network in Fig. 1 is and as long as it satisfies the conditions of Theorem 2, the underlying effective motif is the same and is dictated by the three effective species Z⁺, Z⁻ and Z⁰ as depicted in Fig. 5. However, different controller networks give rise to different actuation functions ${{{{{{{{\mathcal{U}}}}}}}}}^{\pm }$ and production functions ψ. The forms of these functions lead to different control designs that may offer different tuning knobs capable of enhancing the overall performance.

Remark 2.5

Unlike Theorem 1, it is unclear whether Theorem 2 can be extended to the stochastic setting. While a mathematically rigorous approach is left for future work, we have conducted a simulation-based case study which revealed that the reduced model was capable of accurately capturing the stochastic dynamics of the full model. See Supplementary Information Section 6 for more details.

Next, we apply Theorem 2 to the four controller circuits of Fig. 3 to obtain a reduced mathematical model for each. Here, we consider the more practical scenario where all controller species dilute at a rate δ > 0. Furthermore, we assume, for simplicity, that the degradation of the various proteins are negligible compared to the dilution rate; however, this assumption can be easily relaxed (see Supplementary Information Section 3). The model reduction results are compactly depicted in Fig. 6 for all four controllers. The underlying reduced motif, as illustrated in Fig. 6, is the same for all four controller circuits and is comprised of only three effective species Z⁺, Z⁻ and Z⁰ whose concentrations are linear combinations of the biological species Z_i. The differences between the reduced models of each controller circuit is encrypted in the effective control action $u={{{{{{{\mathcal{U}}}}}}}}({z}^{+}\!\!,\,{z}^{0})$ which is a function of the concentrations of Z⁺ and Z⁰. Observe that the control action is given in an explicit form for the ZF and intraDD controllers; whereas, for the TetR and Gal4 controllers, it is given implicitly as a set of three algebraic equations. Once these algebraic equations are solved for $\left({\tilde{z}}_{1},\,{\tilde{z}}_{2},\,{\tilde{z}}_{3}\right)\ge 0$, the control actions can be directly computed as a function of z⁺ and z⁰. The topology of the reduced models is clearly simpler to analyze compared to the full models described in Fig. 3, and thus the underlying control architecture can be uncovered more easily. In fact, the intraDD controller realizes a standalone antithetic integral controller since the control action $u={{{{{{{\mathcal{U}}}}}}}}({z}^{+})$ depends (monotonically) on Z⁺ only. On the other hand, it is shown in Supplementary Information Sections 3.D and 3.B that the control action $u={{{{{{{\mathcal{U}}}}}}}}({z}^{+}\!\!,\, {z}^{0})$ of the ZF- and Gal4-Circuits depends on both Z⁺ and Z⁰, such that ${{{{{{{\mathcal{U}}}}}}}}$ is monotonically increasing (resp. decreasing) in z⁺ (resp. z⁰). This particular topology can be shown to realize a filtered Proportional-Integral (PI) controller, where the proportional component can be used as an additional knob to enhance the dynamic performance (see³⁸ for a thorough analysis). Finally, it is shown in Supplementary Information Section 3.A that the control action $u={{{{{{{\mathcal{U}}}}}}}}({z}^{+}\!\!,\, {z}^{0})$ of the TetR controller also depends on both Z⁺ and Z⁰. Nevertheless, ${{{{{{{\mathcal{U}}}}}}}}$ is a monotonically increasing function of z⁺, but its monotonicity switches from increasing (at low levels of z⁰ and z⁺) to decreasing (at higher levels of z⁰ and z⁺). We refer the reader to Supplementary Information Section 3.A for more details on the exact monotonicity analysis of ${{{{{{{\mathcal{U}}}}}}}}$. Interestingly, this architecture realizes a filtered PI controller whose proportional component switches from positive to negative gain. This gives rise to a nice feature that initially speeds up the response when the concentrations of the controller species are low, and then switches to negative feedback as the concentrations rise and thus favoring closed-loop stability. The various reduced models are validated via simulations that demonstrate the highly accurate matching between the dynamics of the full and reduced models in Supplementary Information Section 4.

Box 2 Additional reaction rules

9. All controller species in the ${{{{{{{\mathcal{C}}}}}}}}$-class undergo an intein-splicing reaction with all controller species in the ${{{{{{{\mathcal{N}}}}}}}}$-class.

10. Binding and conversion reactions are reversible and conserve the number of inactive (bound) Int^C-Int^N complexes.

11. All controller species belonging to the ${{{{{{{\mathcal{S}}}}}}}}$-class degrade at a rate δ₀ ≥ 0.

12. All controller species dilute at a rate δ ≥ 0.

13. The intein-splicing rate between Z_i and Z_j is ${\eta} .{\max} ({q}_{i}^{+},{q}_{j}^{-})$ with η > 0.

Integral controllers with competing sequestrations

In this section, we demonstrate that Theorem 1 can be applied to controller circuits that are more general compared to those of Theorem 2. That is, there are certain intein-based controllers that can be easily tested for RPA using Theorem 1; however, their model reduction cannot be carried out by applying Theorem 2. We do so by considering the circuit depicted in Fig. 7(a), where two independent controller species (active Int^N denoted by Z₂ versus inactive Int^N denoted by Z₄ in Fig. 7(a)) stoichiometrically compete to sequester another controller species (Z₁ in Fig. 7(a)). In this circuit, we constructed two genes encoding for an AD fused to an active Int^C (expressing Z₁) and a DBD-DD fused to an inactive Int^N (expressing Z₄). Although the inactive Int^N lacks essential amino acids to undergo the intein-splicing reaction⁵⁵, Z₄ can still reversibly bind to Z₁ to form a heterodimeric transcription factor. In this controller design, the intein-splicing reaction can occur only between the expressed Int^N, denoted by Z₂, and Z₁, because Z₁ is the only controller species that contains an active Int^C segment in its unbound state. In fact, although the other controller species containing active Int^C segments (Z₆, Z₇ and Z₈) belong to the ${{{{{{{\mathcal{C}}}}}}}}$-class, they cannot directly undergo intein-splicing reactions since they are bound to the inactive Int^N. This results in a violation of Reaction Rule 9 rendering the model reduction recipe of Theorem 2 inapplicable. Nonetheless, it is straightforward to check that the conditions of Theorem 1 still apply and, as a result, RPA is still guaranteed as long as the closed-loop system is stable. Furthermore, applying (2), by noting that $q={\left[\begin{array}{cccccccc}1&-1&0&0&0&1&2&1\end{array}\right]}^{T}$, yields the setpoint expression given by μ₁/θ₂ (see Fig. 7(a)). Observe that the rate of expression μ₄ of Z₄ does not affect the setpoint — a result that is not immediate without resorting to (2). Similar to Fig. 4, the experimental results depicted in Fig. 7(b) demonstrate that the controller indeed ensures RPA yielding an average steady-state error of 3.9% over a wide dynamic range of setpoints compared to an error of 40.9% when operating in open loop.

**Fig. 7: Inactive-intein controller: theoretical and experimental analysis.**

Although the model reduction recipe provided in Theorem 2 cannot be applied here, one can still invoke singular perturbation theory to this particular controller circuit to obtain the reduced mathematical model depicted in Fig. 7(c). The model reduction here assumes, once again, that the reversible binding reactions are fast. Observe that, unlike the previous controllers, the reduced model is four dimensional. Intuitively, this is a result of an additional conservation law imposed by the inactive inteins which introduce an additional (fourth) vector q^⋆ required to carry out the state transformation. Hence, the reduced mathematical model is described by the set of four ODEs for $\left({z}^{+},{z}^{-},{z}^{0},{z}^{\star }\right)$ shown in Fig. 7(c) where the functions ϕ and ψ are implicitly given in Supplementary Information Section 3.E. A “fictitious network” describing the ODEs is also depicted in Fig. 7(c) to emphasize that the reduced model is mainly mathematical and cannot be easily translated to a simple motif. This highlights that controller circuits not adhering to the conditions of Theorem 2 fail to reduce to the simple motif given in Fig. 2. The reduced model is validated by the simulation results shown in Fig. 7(d) for four different setpoints and by applying a disturbance.

Discussion

In this paper, we introduced a theoretical and experimental framework to design, build and analyze a broad class of biomolecular integral feedback controllers that achieve RPA. The framework is based on custom-built split inteins that are shown to be capable of realizing the sequestration reaction — the heart of the basic antithetic integral feedback motif — via protein splicing. The sequestration reaction in previously proposed^{20,37,39,40,52} and built^6,7,9,13 integral controllers, whether in vivo, or in vitro, relies on the complete stoichiometric annihilation of two controller species (see Z₁ and Z₂ in Fig. 1(a)). Here, we relax this requirement by establishing that the sequestration reaction does not have to fully annihilate the participating controller species, and, in fact, it suffices to stoichiometrically annihilate sub-components within these two controller species. Indeed, this is precisely what intein-splicing reactions do: active split inteins inserted in two target proteins are inactivated by undergoing the splicing reaction. While the function of the active split inteins is indeed annihilated, the spliced target proteins are still allowed to have specific functions. In fact, we showed that one can harness the function of the spliced proteins to augment the standalone integral controller with a filtered proportional component to yield a PI controller. We previously computationally demonstrated (see³⁸) that the resulting filtered PI controller adds an extra degree of freedom which enables the enhancement of the transient performance and the reduction of cell-to-cell variability while maintaining the RPA property. However, it is left for future work to back up this theory with experimental demonstrations. It is worth to mention that the realization of a molecular PI controller in mammalian cells is not new. Ideally, a proportional component can be theoretically achieved by appending the integral controller with an instantaneous negative feedback from the regulated output species X_L onto the input species X₁ (see e.g.^37,40). This requires the output species X_L to have multiple functions including the production of Z₂ for sensing and the inhibition of the input species X₁ to realize the proportional component. In practice this might not be possible as the output species is determined by the biological application. In⁷, this was circumvented by introducing additional genetic parts to express a proxy to the regulated output upon which the proportional control action is based on. Here, in contrast, the design flexibility and modularity offered by inteins allowed us to implement PI controllers by simply choosing an actuator and a suitable insertion site of the split-intein (see Fig. 3 and 6) without adding additional genetic parts and without requiring the regulated output X_L to have multiple functions.

The simple antithetic integral feedback control topology was first introduced in²⁰, and more recently a generalized antithetic topology was introduced in⁶ which characterizes all RPA-achieving controllers involving exactly one sensing and one setpoint-encoding reaction. This characterization lead to simple algebraic conditions that enable RPA and are expressed in terms of quantities that are referred to as “charges”. The general charge analogy borrowed from electronics was made due to the lack of biological parts capable of respecting the algebraic conditions. This is exactly where inteins came in, because they naturally satisfy the RPA algebraic conditions and act as “charges” neutralizing each other via the intein-splicing reactions. Indeed, split inteins are typically charged at the locations where they interact⁵⁶. This makes the charge analogy biologically suitable. In fact, Theorem 1, which is a direct application of Supplementary Theorem 1 (tailored towards inteins), is a generalization of the RPA sufficiency result of⁶ such that multiple sensing and setpoint-encoding reactions are now allowed. Theorem 1 facilitates the screening of controller circuit designs for RPA. Furthermore, we went one step further here, beyond establishing RPA, to provide an easy-to-apply recipe for model reduction. The recipe is given in Theorem 2 which is, once again, a direct application of Supplementary Theorem 2 tailored towards inteins (see Box Box 3 for an application example of these theorems in a purely mathematical and more general context, that is, without an intein-based interpretation). The model reduction result presented here exploits the time-scale separation imposed by fast reversible binding and conversion reactions and is established by invoking singular perturbation theory⁵⁴ and the deficiency zero theorem⁵³ to prove structural (rate-independent) stability of the slow manifold.

The five controller circuit implementations presented in this paper (see Fig. 3 and 7) are based on the widely used DNA binding domains TetR, ZF and Gal4. For the experimental verification of RPA, we used a simple regulated network (see Fig. 4(a)) that resulted in a two (resp. three) plasmid closed-loop system depicted in Fig. 4 (resp. 7). The regulated network was intentionally chosen to be simple here, in order to minimize possible cross talks which might emerge from larger networks (e.g. burden)⁴⁸. This allowed us to focus our study on the controllers themselves instead of possible undesirable behaviors incurred by larger networks — an important topic that is not within the scope of the current study and is left for future work. Note that with this experimental setup, we were not able to directly detect the regulated output which is an mRNA (see Fig. 4(a)). To circumvent this, we used a fluorescent reporter which, unlike the regulated (mRNA) species, is not robust to translational burden. This implies that although RPA is not observed at high setpoints by the reporter, it may actually be achieved by the mRNA.

The controller circuits that are designed, built and analyzed in this paper are all based on controller species generated using TFs. However, split inteins can also be introduced in other protein classes such as proteases (Supplementary Information, Fig. 18) and receptors (Supplementary Information, Fig. 19). Split inteins can be even introduced in endogenous proteins to convert them into controller species. This has an attractive advantage of exploiting parts of the regulated network to realize the controller and, as a result, requiring less to no additional genes. From a protein engineering point of view, such designs may be more challenging than designs based on the well-characterized TFs used in this study. Besides tinkering with insertion sites, linker lengths and split-intein pairs, it is also possible to use more systematic approaches like transposon screens with inteins as performed by Ho et al.⁵⁷ or computationally-guided optimizations by Dolber et al.⁵⁸.

The remarkable flexibility offered by inteins for building integral controllers opens the doors to many possible future research directions. For instance, it is easy to think of regulated networks with negative gain, in other words, producing more input species X₁ leads to a lower concentration of the regulated output species X_L. For example, producing more insulin leads to a lower concentration of glucose in the blood. As a result, to realize an overall negative feedback, the actuation direction of the controller species Z₁ would have to be flipped, that is instead of having Z₁ upregulating X₁ (like in Fig. 1(a) and, in fact, all previously built antithetic integral controllers), Z₁ would have to downregulate X₁ (see Supplementary Information, Fig. 18). Intein-based realizations of such “negative actuation” mechanisms can be easily carried out using repressors or proteases. Furthermore, n inteins (with n = 1, 2, 3,⋯) can be embedded sequentially in a single controller species leading to the scaling of the setpoint by an integer n (see Supplementary Information, Fig. 11 and 12). Note that other functional domains can be placed between inteins to alter the functionality of the various spliced products (see Supplementary Information, Fig. 13 and 14). The flexibility offered by inteins also allows us to freely design the (multi)functionality of the spliced products as activators and/or repressors (e.g. Supplementary Information, Fig. 15, 16 and 17).

Another possible future direction is intein-based implementations of more advanced controllers. For example, one can easily add functional domains to the controller species Z₂, which was comprised of a standalone Int^N segment in all the controller circuits proposed here. These added domains enable the implementation of the rein controller introduced in⁵⁹ which is capable of enhancing the overall performance. Another example is the implementations of more advanced biomolecular Proportional-Integral-Derivative (PID) controllers³⁷ that are capable of shaping the transient response and reducing cell-to-cell variability. In particular, the wide library of orthogonal split inteins⁵¹ allows one to implement the fourth order PID controller³⁷ that is comprised of two antithetic motifs: antithetic integrator and antithetic differentiator.

In conclusion, rather than providing another way of implementing antithetic integral controllers, we propose here a systematic (theoretical and experimental) approach of designing, building and analyzing a broad class of biomolecular integral controllers that are capable of achieving RPA. The key of our approach is the exploitation of the splicing reactions that occur between split inteins. Due to their simplicity, modularity, irreversibility, lack of side effects and applicability across species, we believe that inteins will revolutionize biomolecular controllers and partake in filling the gap between theory and experiments.

Box 3 Example application of the RPA and model reduction theorems to a general controller network

The goal of this illustrative example is to reduce the full model of the controller network comprised of M = 6 species Z₁, Z₂, ⋯ , Z₆ to the simple motif comprised of Z⁺, Z⁻ and Z⁰ which is generally depicted in Fig.5. The full model here is chosen to be purely mathematical, i.e. with no relevance to inteins, in order to demonstrate that the theorems are not restricted to intein-based controllers only. Essentially, the q and (q⁺, q⁻, q⁰) vectors in Theorem 1 and 2 that were related to the number of inteins present in the controller species, are now more generally referred to as charge vectors inspired by⁶. For establishing RPA, computing q: = q⁺ − q⁻ is enough; whereas to carry out the model reduction technique, all individual positive, negative and neutral charge vectors (q⁺, q⁻, q⁰) need to be computed. Unlike the intein-based controller networks where the charge vectors are simply computed by counting intein segments, here we lay down general rules for constructing these charge vectors. Once these charge vectors are available, then one can easily apply Theorems 1 and 2 for establishing RPA and performing the model reduction, respectively. Note that in this example, we include three reversibly binding reactions but no conversion reactions for simplicity, and thus the conditions of Theorem 2 boil down to requiring the stoichiometry matrix of the binding network to be full-column rank and equal to M − 3 = 3. Furthermore, for simplicity, we start by assuming that Z₁ and Z₂ carries only one positive and negative charge, respectively. This assumption can however be relaxed to a more general number of charges (see Supplementary Information, Theorem 2). Two other mathematical controller topologies are also proposed and similarly analyzed in Supplementary Information, Section 4: one controller involves conversion reactions, and the other involves a trimer.

Methods

Plasmid construction

All plasmids were generated with a mammalian adaptation of the modular cloning (MoClo) yeast toolkit standard⁶⁰. All individual parts were generated by PCR amplification (Phusion Flash High-Fidelity PCR Master Mix; Thermo Scientific) or synthesized with Twist Bioscience. PCR primers were obtained from Sigma-Aldrich and Integrated DNA Technologies. The parts were then assembled with golden gate assembly. All enzymes for plasmid construction were obtained from New England Biolabs (NEB). Constructs were chemically transformed into E. coli Top10 strains (Invitrogen). The plasmid list and protein sequences can be found in Supplementary Information Section 9. DNA and oligo sequences can be found in the Data Source file.

Cell culture

All experiments were performed with HEK293T cells (ATCC, strain number CRL-3216, LGC standards). The cells were cultured in Dulbecco’s modified Eagle’s medium (DMEM; Gibco) supplemented with 10 % FBS (Sigma-Aldrich), 1x GlutaMAX (Gibco), 1 mm Sodium Pyruvate (Gibco), penicillin (100U/μL), and streptomycin (100 μg/mL) (Gibco) at 37^∘ with 5 % CO₂. The cell culture was passaged into a fresh T25 flask (Axon Lab) every 2 to 3 days. Upon detachment some part of the cell suspension was used for the transfection.

Transfection

All plasmids were isolated using ZR Plasmid Miniprep-Classic (Zymo Research). The plasmids were introduced to the HEK293T cells via suspension transfection. A transfection solution in Opti-MEM I (Gibco) was prepared using Polyethylenimine (PEI) “MAX” (MW 40000; Polysciences, Inc.) at a 1:3 (μg DNA to μg PEI) ratio while the culture was detached with Trypsin-EDTA (Gibco). The cell density was assessed with the automated cell counter Countess II FL (Invitrogen). 100 μL of culture with 26’000 cells was transferred in each well of the plate Nunc Edge 96-well plate (Thermo Scientific). The transfection mixture was added to the cells once it has incubated for approximately 30 min. All transfection tables can be found in Supplementary Information Section 9.

Flow cytometry

The cells were detached approximately 48 h after transfection on the Eppendorf ThermoMixer C at 25 ^∘C at 700 rpm with 53 μL Accutase solution (Sigma-Aldrich) per well for 20 min. The fluorescence data was collected on the Beckman Coulter CytoFLEX S flow cytometer with the 488 nm excitation with a 525/40+OD1 bandpass filter and the 638 nm excitation with a 660/10 bandpass filter. All data was processed with the CytExpert 2.3 software. A representative example of the gating strategy can be found in Supplementary Information, Fig. 28. The data was visualized with GraphPad 8.2.0.

Numerical simulations and visualizations

All simulations are carried out in MATLAB R2021a (academic use). Stochastic simulations shown in the supplementary information file are carried out on the Euler cluster (https://scicomp.ethz.ch/wiki/Euler). Manuscript figures were structured and formatted on Illustrator (2022 26.5), MATLAB and TexStudio (v3.1.1, open source).

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The fluorescence measurements data of Fig. 4 and 7 are provided in the Supplementary Information File under Section 9. Plasmid and oligo sequences can be found in the Source Data file. Plasmids are available from the corresponding author upon request. Source data are provided with this paper.

Code availability

The MATLAB code for generating all the figures is available at a dedicated GitHub repository: https://github.com/Maurice-Filo/Inteins-in-the-Loop⁶¹.

References

Kotas, M. E. & Medzhitov, R. Homeostasis, inflammation, and disease susceptibility. Cell 160, 816–827 (2015).
Article CAS PubMed PubMed Central Google Scholar
Mullur, R., Liu, Yan-Yun & Brent, G. A. Thyroid hormone regulation of metabolism. Physiol. Rev. 94, 355–382 (2014).
Article CAS PubMed PubMed Central Google Scholar
M, Khammash, Mario Di, Bernardo, and Diego Di, Bernardo Cybergenetics: Theory and methods for genetic control system. In 2019 IEEE 58th Conference on Decision and Control (CDC), pages 916–926. IEEE, (2019).
Lillacci G., Aoki S., Schweingruber D., & Khammash M. A synthetic integral feedback controller for robust tunable regulation in bacteria. BioRxiv. 170951, https://doi.org/10.1101/170951 (2017).
Lillacci, G., Benenson, Y. & Khammash, M. Synthetic control systems for high performance gene expression in mammalian cells. Nucl. Acids Res. 46, 9855–9863 (2018).
Article CAS PubMed PubMed Central Google Scholar
Aoki, S. K. et al. A universal biomolecular integral feedback controller for robust perfect adaptation. Nature 570, 533–537 (2019).
Article CAS PubMed Google Scholar
Frei, T., Chang, Ching-Hsiang, Filo, M., Arampatzis, A. & Khammash, M. A genetic mammalian proportional–integral feedback control circuit for robust and precise gene regulation. Proc. Nat Acad. Sci. 119, e2122132119 (2022).
Article CAS PubMed PubMed Central Google Scholar
Franco, E., Giordano, G., Forsberg, Per-Ola & Murray, R. M. Negative autoregulation matches production and demand in synthetic transcriptional networks. ACS Synthetic Biol. 3, 589–599 (2014).
Article CAS Google Scholar
Huang, Hsin-Ho, Qian, Y. & Del Vecchio, D. A quasi-integral controller for adaptation of genetic modules to variable ribosome demand. Nat. Commun. 9, 1–12 (2018).
Article Google Scholar
Ng, A. H. et al. Modular and tunable biological feedback control using a de novo protein switch. Nature 572, 265–269 (2019).
Article ADS CAS PubMed Google Scholar
Hsiao, V., De Los Santos, EmmanuelL. C., Whitaker, W. R., Dueber, J. E. & Murray, R. M. Design and implementation of a biomolecular concentration tracker. ACS Synthetic Biol. 4, 150–161 (2015).
Article CAS Google Scholar
Kelly, CiaránL. et al. Synthetic negative feedback circuits using engineered small rnas. Nucl. Acids Res. 46, 9875–9889 (2018).
Article CAS PubMed PubMed Central Google Scholar
Agrawal, D. K., Marshall, R., Noireaux, V. & Sontag, E. D. In vitro implementation of robust gene regulation in a synthetic biomolecular integral controller. Nat. Commun. 10, 1–12 (2019).
Article Google Scholar
Kemmer, C. et al. Self-sufficient control of urate homeostasis in mice by a synthetic circuit. Nat. Biotechnol. 28, 355–360 (2010).
Article CAS PubMed Google Scholar
Rössger, K., Charpin-El-Hamri, G. & Fussenegger, M. A closed-loop synthetic gene circuit for the treatment of diet-induced obesity in mice. Nat. Commun. 4, 1–9 (2013).
Article Google Scholar
Xie, M. et al. β-cell–mimetic designer cells provide closed-loop glycemic control. Science 354, 1296–1301 (2016).
Article ADS CAS PubMed Google Scholar
F., Xiao and J. C., Doyle. Robust perfect adaptation in biomolecular reaction networks. In 2018 IEEE Conference on Decision and Control (CDC), pages 4345–4352. IEEE, (2018).
Khammash, M. H. Perfect adaptation in biology. Cell Sys. 12, 509–521 (2021).
Article CAS Google Scholar
Francis, B. A. & Wonham, WalterMurray The internal model principle of control theory. Automatica 12, 457–465 (1976).
Article MathSciNet MATH Google Scholar
Briat, C., Gupta, A. & Khammash, M. Antithetic integral feedback ensures robust perfect adaptation in noisy biomolecular networks. Cell Sys. 2, 15–26 (2016).
Article CAS Google Scholar
Olsman, N., Xiao, F. & Doyle, J. C. Architectural principles for characterizing the performance of antithetic integral feedback networks. Iscience 14, 277–291 (2019).
Article ADS PubMed PubMed Central Google Scholar
Olsman, N. et al. Hard limits and performance tradeoffs in a class of antithetic integral feedback networks. Cell Sys. 9, 49–63 (2019).
Article CAS Google Scholar
M., Filo and M., Khammash. Optimal parameter tuning of feedback controllers with application to biomolecular antithetic integral control. In 2019 IEEE 58th Conference on Decision and Control (CDC), pages 951–957. IEEE, (2019).
Bernardo D.di. Engineering the next generation of mammalian whole cell biosensors for advanced biotechnological applications. 2022 International Mammalian Synthetic Biology Workshop, (2022).
Yudkin, M. D. The sigma-like product of sporulation gene spoiiac of bacillus subtilis is toxic to escherichia coli. Mol. General Genetics MGG. 202, 55–57 (1986).
Article CAS Google Scholar
Nejepinska, J., Malik, R., Wagner, S. & Svoboda, P. Reporters transiently transfected into mammalian cells are highly sensitive to translational repression induced by dsrna expression. PloS One. 9, e87517 (2014).
Article ADS PubMed PubMed Central Google Scholar
Sarmiento, C. & Camarero, J. A. Biotechnological applications of protein splicing. Current Protein Peptide Sci. 20, 408–424 (2019).
Article CAS Google Scholar
Lennon, C. W. & Belfort, M. Inteins. Current Biol. 27, R204–R206 (2017).
Article Google Scholar
Shah, N. H. & Muir, T. W. Inteins: nature’s gift to protein chemists. Chem. Sci. 5, 446–461 (2014).
Article CAS PubMed Google Scholar
Muldoon, J. J. et al. Model-guided design of mammalian genetic programs. Sci. Adv. 7, eabe9375 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Palanisamy, N. et al. Split intein-mediated selection of cells containing two plasmids using a single antibiotic. Nat. Commun. 10, 1–15 (2019).
Article Google Scholar
Jillette, N., Du, M., Zhu, JacquelineJufen, Cardoz, P. & Wu Cheng, A. Split selectable markers. Nat. Commun. 10, 1–8 (2019).
Article Google Scholar
Chen, Hao-Shan et al. An intein-split transactivator for intersectional neural imaging and optogenetic manipulation. Nat. Commun. 13, 1–14 (2022).
Google Scholar
Wang H., Wang L., Zhong B., & Dai, Z. Protein splicing of inteins: A powerful tool in synthetic biology.Frontiers in Bioengineering and Biotechnology. Front Bioeng. Biotechnol. 10, 810180 (2022).
Dassa, B., London, N., Stoddard, B. L., Schueler-Furman, O. & Pietrokovski, S. Fractured genes: a novel genomic arrangement involving new split inteins and a new homing endonuclease family. Nucl. Acids Res. 37, 2560–2573 (2009).
Article CAS PubMed PubMed Central Google Scholar
Klein, J. S., Jiang, S., Galimidi, R. P., Keeffe, J. R. & Bjorkman, P. J. Design and characterization of structured protein linkers with differing flexibilities. Protein Eng. Des. Sel. 27, 325–330 (2014).
Article CAS PubMed PubMed Central Google Scholar
Filo, M., Kumar, S. & Khammash, M. A hierarchy of biomolecular proportional-integral-derivative feedback controllers for robust perfect adaptation and dynamic performance. Nat. Commun. 13, 1–19 (2022).
Article Google Scholar
Filo, M., Kumar, S., Anastassov, S. & Khammash, M. Exploiting the nonlinear structure of the antithetic integral controller to enhance dynamic performance. In 2022 IEEE 61st Conference on Decision and Control (CDC), 1294–1299 (IEEE, 2022).
Chevalier, M., Gómez-Schiavon, M., Ng, A. H. & El-Samad, H. Design and analysis of a proportional-integral-derivative controller with biological molecules. Cell Sys. 9, 338–353 (2019).
Article CAS Google Scholar
Briat, C., Gupta, A. & Khammash, M. Antithetic proportional-integral feedback for reduced variance and improved control performance of stochastic reaction networks. J. Royal Society Interface. 15, 20180079 (2018).
Article Google Scholar
Samaniego, ChristianCuba & Franco, E. Ultrasensitive molecular controllers for quasi-integral feedback. Cell Sys. 12, 272–288 (2021).
Article Google Scholar
V., Martinelli, D., Salzano, D., Fiore, and M., di Bernardo. Multicellular pi control for gene regulation in microbial consortia. bioRxiv (2022).
Donahue, P. S. et al. The comet toolkit for composing customizable genetic programs in mammalian cells. Nat. Commun. 11, 1–19 (2020).
Article Google Scholar
Khalil, A. S. et al. A synthetic biology framework for programming eukaryotic transcription functions. Cell 150, 647–658 (2012).
Article CAS PubMed PubMed Central Google Scholar
Gossen, M. & Bujard, H. Tight control of gene expression in mammalian cells by tetracycline-responsive promoters. Proc. Natl. Acad. Sci. 89, 5547–5551 (1992).
Article ADS CAS PubMed PubMed Central Google Scholar
Carvajal-Vallejos, P., Pallissé, R., Mootz, H. D. & Schmidt, S. R. Unprecedented rates and efficiencies revealed for new natural split inteins from metagenomic sources. J. Biol. Chem. 287, 28686–28696 (2012).
Article CAS PubMed PubMed Central Google Scholar
E Müller, I. et al. Gene networks that compensate for crosstalk with crosstalk. Nat. Commun. 10, 1–8 (2019).
Article ADS Google Scholar
Frei, T. et al. Characterization and mitigation of gene expression burden in mammalian cells. Nat. Commun. 11, 1–14 (2020).
Article ADS Google Scholar
Nóbel, F., De Battista, H., Nuñez, S. & Picó, J. Reference conditioning anti-windup for the biomolecular antithetic controller. IFAC-PapersOnLine. 52, 156–162 (2019).
Article MathSciNet Google Scholar
Sharma, P. et al. 2a peptides provide distinct solutions to driving stop-carry on translational recoding. Nucl. Acids Res. 40, 3143–3151 (2012).
Article CAS PubMed Google Scholar
Pinto, F., Thornton, EllaLucille & Wang, B. An expanded library of orthogonal split inteins enables modular multi-peptide assemblies. Nat. Commun. 11, 1–15 (2020).
Article ADS PubMed PubMed Central Google Scholar
Qian, Y. & Del Vecchio, D. Realizing ‘integral control’in living cells: how to overcome leaky integration due to dilution? J. Royal Society Inter. 15, 20170902 (2018).
Article Google Scholar
Feinberg, M. Chemical reaction network structure and the stability of complex isothermal reactors-i. the deficiency zero and deficiency one theorems. Chem. Eng. sci. 42, 2229–2268 (1987).
Article CAS Google Scholar
H. K., Khalil. Nonlinear systems third edition. Patience Hall 115, (2002).
Stevens, A. J. et al. Design of a split intein with exceptional protein splicing activity. J. American Chem. Society. 138, 2162–2165 (2016).
Article CAS Google Scholar
Beyer, H. M., Mikula, K. M., Li, M., Wlodawer, A. & Iwai, H. The crystal structure of the naturally split gp41-1 intein guides the engineering of orthogonal split inteins from cis-splicing inteins. FEBS J. 287, 1886–1898 (2020).
Article CAS PubMed Google Scholar
Ho, TrevorY. H. et al. A systematic approach to inserting split inteins for boolean logic gate engineering and basal activity reduction. Nat. Commun. 12, 1–12 (2021).
Article ADS Google Scholar
Dolberg, T. B. et al. Computation-guided optimization of split protein systems. Nat. Chem. Biol. 17, 531–539 (2021).
Article CAS PubMed PubMed Central Google Scholar
A., Gupta and M., Khammash. An antithetic integral rein controller for bio-molecular networks. In 2019 IEEE 58th Conference on Decision and Control (CDC), pages 2808–2813. IEEE, (2019).
Lee, M. E., DeLoache, W. C., Cervantes, B. & Dueber, J. E. A highly characterized yeast toolkit for modular, multipart assembly. ACS Synthetic Biol. 4, 975–986 (2015).
Article CAS Google Scholar
M., Filo. Inteins-in-the-Loop: A Framework for Engineering Advanced Biomolecular Controllers for Robust Perfect Adaptation. Github repository: Inteins-in-the-Loop, (2022). https://doi.org/10.5281/zenodo.7048552

Download references

Acknowledgements

This project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (CyberGenetics; grant agreement 743269). We acknowledge the ETH Zurich D-BSSE Single Cell Facility for its excellent support and assistance. We would also like to thank Drs. Zhou Fang, Timothy Frei and Stephanie Aoki for the insightful discussions.

Author information

These authors contributed equally: Stanislav Anastassov, Maurice Filo.

Authors and Affiliations

Department of Biosystems Science and Engineering, ETH Zürich, 4058, Basel, Switzerland
Stanislav Anastassov, Maurice Filo, Ching-Hsiang Chang & Mustafa Khammash

Authors

Stanislav Anastassov
View author publications
You can also search for this author in PubMed Google Scholar
Maurice Filo
View author publications
You can also search for this author in PubMed Google Scholar
Ching-Hsiang Chang
View author publications
You can also search for this author in PubMed Google Scholar
Mustafa Khammash
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.A. and M.F. conceived the study. M.F. developed the theorems and carried out the mathematical analyses and simulations. S.A. carried out the parts characterizations. C.-H.C. designed the two-plasmid testing framework for RPA. S.A. and C.-H.C designed, built and experimentally validated the controller circuits and performed the data analysis; all authors wrote the paper. M.K. supervised the project and secured funding.

Corresponding author

Correspondence to Mustafa Khammash.

Ethics declarations

Competing interests

ETH Zurich has filed a patent application on behalf of the inventors S.A., M.F., C.-H.C. and M.K. on the genetic circuit designs described (application no. EP22186956.3).

Peer review

Peer review information

Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Anastassov, S., Filo, M., Chang, CH. et al. A cybergenetic framework for engineering intein-mediated integral feedback control systems. Nat Commun 14, 1337 (2023). https://doi.org/10.1038/s41467-023-36863-9

Download citation

Received: 06 September 2022
Accepted: 21 February 2023
Published: 11 March 2023
DOI: https://doi.org/10.1038/s41467-023-36863-9

This article is cited by

Competition and evolutionary selection among core regulatory motifs in gene expression control
- Andras Gyorgy
Nature Communications (2023)
Customizing cellular signal processing by synthetic multi-level regulatory circuits
- Yuanli Gao
- Lei Wang
- Baojun Wang
Nature Communications (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.