An Allee-based distributed algorithm for microbial whole-cell sensors

Cravo, Fabricio; Függer, Matthias; Nowak, Thomas

doi:10.1038/s41540-024-00363-3

Download PDF

Article
Open access
Published: 22 April 2024

An Allee-based distributed algorithm for microbial whole-cell sensors

npj Systems Biology and Applications volume 10, Article number: 43 (2024) Cite this article

213 Accesses
1 Altmetric
Metrics details

Subjects

Abstract

Reliable detection of substances present at potentially low concentrations is a problem common to many biomedical applications. Complementary to well-established enzyme-, antibody-antigen-, and sequencing-based approaches, so-called microbial whole-cell sensors, i.e., synthetically engineered microbial cells that sense and report substances, have been proposed as alternatives. Typically these cells operate independently: a cell reports an analyte upon local detection.

In this work, we analyze a distributed algorithm for microbial whole-cell sensors, where cells communicate to coordinate if an analyte has been detected. The algorithm, inspired by the Allee effect in biological populations, causes cells to alternate between a logical 0 and 1 state in response to reacting with the particle of interest. When the cells in the logical 1 state exceed a threshold, the algorithm converts the remaining cells to the logical 1 state, representing an easily-detectable output signal. We validate the algorithm through mathematical analysis and simulations, demonstrating that it works correctly even in noisy cellular environments.

Bioelectronic control of a microbial community using surface-assembled electrogenetic cells to route signals

Article 29 March 2021

Redox-enabled electronic interrogation and feedback control of hierarchical and networked biological systems

Article Open access 21 December 2023

A new paradigm of reliable sensing with field-deployed electrochemical sensors integrating data redundancy and source credibility

Article Open access 22 February 2023

Introduction

Numerous disease indicators are based on detecting that the abundance of a particular substance exceeds a threshold concentration^1,2,3. Widely-adapted techniques are sequencing for genetic information and antibody-based detection for proteins⁴. Recently microbial whole-cell sensors (MWCS), i.e., cells engineered to sense and report substances, emerge as an easy-to-use and cost-effective alternative to these classical detection methods⁵. MWCS have been demonstrated to successfully sense pollutants⁶, detect inflammation in mice models⁷, and provide means for environmental monitoring^8,9,10.

To detect analytes at low concentrations, MWCS use techniques such as optimizing the cellular sensing circuitry^7,11 and sensing multiple, correlated analytes⁷. These approaches are examples for local sensor designs, where engineered cells locally sense and report analytes. In this case, the population-level readout is obtained as the cumulative single-cell responses, which inherently limits the population-level response: Assume a population C of n cells, a small fraction α ∈ [0, 1] of which detect the analyte of interest, and an ideal local cell response out_c(in_c) that maps the presence (in_c = 1) or absence (in_c = 0) of a detection event at cell c to a local cell output. In presence of an ideal local cell response that outputs a (normalized) 1 if in_c = 1 and 0 otherwise, the population-level response out_pop is given by

$$\begin{array}{r}{{{\mbox{out}}}}_{{{{\rm{pop}}}}}=\frac{1}{n}\mathop{\sum}\limits_{c\in C}{{{\mbox{out}}}}_{c}({{{\mbox{in}}}}_{c})=\alpha ,\end{array}$$

(1)

i.e., remains linear in the fraction α of cells that detect the analyte.

By contrast, population-based designs use communication between cells to potentially achieve improved threshold-like population responses. In terms of the example before this is achieved by allowing out_c to depend not only on in_c, but also on the other cells’ (communicated) inputs: the population-level response

$$\begin{array}{r}{{{\mbox{out}}}}_{{{{\rm{pop}}}}}=\frac{1}{n}\mathop{\sum}\limits_{c\in C}{{{\mbox{out}}}}_{c}({{{\mbox{in}}}}_{1},\ldots ,{{{\mbox{in}}}}_{n})\end{array}$$

(2)

is not necessarily proportional to α in this case.

A commonly used mechanism to communicate is via quorum sensing (QS) molecules^12,13, which diffuse through the cell membrane and, if present in sufficiently high concentrations, allow cells to trigger a population-level response^8,9,10. For example, the circuit by Hsu, Chen, Hu, and Chen¹⁴ uses QS for population-level signal amplification: when cells sense metal ions, they start to secrete the QS molecule. This molecule, then diffuses into the surrounding medium and inside the population’s cells. If a cell’s internal concentration exceeds a certain threshold, a reporting pathway is triggered. Further examples for population-based designs based on QS are the detection of mercury¹⁵ and phenolic compounds¹¹.

In this article, we analyze a simple distributed algorithm that acts as a distributed amplification circuit and, together with a local sensory and reporting circuit, yields a population-based MWCS design. In our algorithm, cells transition from a low (L) state to a high (H) state upon reacting with rare-event substances of interest. When a specific number of cells successfully alter their state within a predetermined time frame, the algorithm guarantees the production of an amplified reporter signal, indicating the presence of rare events. Conversely, if this threshold is not met, the reporter signal is guaranteed to remain low and is not amplified at population-level. Given the high noise levels in biological circuits¹⁶, the threshold is designed to mitigate the number of false positives compared to naive broadcasting methods, offering noise protection.

The algorithm is inspired by the Allee effect¹⁷ as observed in biological populations: While in populations that compete for a shared resource, lower densities are supposedly more likely to thrive¹⁸, the Allee effect describes the phenomenon that the fitness of small populations often decreases, e.g., due to the reliance on cooperation strategies within the population^19,20,21. The algorithm is designed such that the population of H cells shows an Allee-like behavior: for low H densities, the “birth” rate, i.e., the rate by which L cells are transformed into H cells, is compensated by their “death” rate, i.e., the rate by which H cells are transformed into L cells. Above a certain threshold cell density, the situation is reversed, and the birth rate outweighs the death rate.

Further, differently than other QS amplification circuits^11,14,15, our algorithm autonomously maintains the amplified state indefinitely. This is achieved through a positive feedback loop, where the signal responsible for amplification promotes itself.

Results

The concentration of the analyte of interest is denoted by $A(t)\in {{\mathbb{R}}}_{+}$. Cells of the detection algorithm are in either of two states: low (L), voting for the absence of the analyte of interest, or high (H), voting for its presence. We denote the density of cells in L at time t by $L(t)\in {{\mathbb{R}}}_{+}$, and the density of cells in H by $H(t)\in {{\mathbb{R}}}_{+}$. We write P(t) = H(t) + L(t) for the total population size. Assuming that the population size is within a steady state, we neglect replication and cell-death reactions.

A naive algorithm to obtain a non-linear population-level response would be to broadcast any detection of the analyte by a cell to all other cells. Such an algorithm comprises of two reactions

$${{{\rm{L}}}}+{{{\rm{A}}}}\longrightarrow {{{\rm{H}}}}+{{{\rm{A}}}}$$

(3)

$${{{\rm{L}}}}+{{{\rm{H}}}}\longrightarrow 2\,{\cdot}\,{\rm{H}}$$

(4)

the first of which models detection of an analyte by a cell and the second the broadcast of such an event to all other cells. However, as we show later, this algorithm does not tolerate incorrect detections, i.e., cells that incorrectly transition from state L to state H in absence of an analyte; a problem any biological implementation will necessarily have.

To address the problem of the broadcasting algorithm to deal with faulty state transitions, we propose an algorithm that tolerates erroneous detection of the analyte up to a certain rate. Our algorithm comprises of three reactions that determine when a cell switches state: (i) Reaction (Detect): A cell in state L changes to state H upon local detection of the analyte. We assume that this happens with a rate ${\sigma }_{A}\in {{\mathbb{R}}}_{+}$. (ii) Reaction (Hold): A cell in state L also switches to state H with a rate that depends on the density H according to a Hill function with parameters $\kappa \in {{\mathbb{R}}}_{\ > \ 0},K\in {{\mathbb{R}}}_{\ > \ 0}$, and $n\in {\mathbb{R}}$ with n > 1. The Hill function models the fact that this reaction is triggered by a QS molecule which is secreted by cells in the H state; see, e.g., ref. ²² for a Hill-function model of a QS circuit. (iii) Reaction (Reset): A cell in state H switches back to state L with a certain reset rate$\rho \in {{\mathbb{R}}}_{\ > \ 0}$. Intuitively, this is to prevent accumulation of incorrect detection events in the system.

$${{\mbox{(Detect):}}}\,{{{\rm{L}}}}+{{{\rm{A}}}}\longrightarrow {{{\rm{H}}}}+{{{\rm{A}}}}\quad \left[{\sigma }_{A}\cdot A\cdot L\right]$$

(5)

$$({\mbox{Hold}})\!:\,{{{\rm{L}}}}+{{{\rm{H}}}}\longrightarrow 2\,{\cdot}\, {{{\rm{H}}}}\quad \left[\kappa\, \cdot \,\frac{{H}^{n}}{{H}^{n}+{K}^{n}}\,\cdot \,L\right]$$

(6)

$$({\mbox{Reset}})\!:\,{{{\rm{H}}}}\longrightarrow {{{\rm{L}}}}\,\quad \,\left[\rho \,\cdot \,H\right]$$

(7)

Cells may incorrectly detect the analyte with a rate ${\sigma }_{{{{\rm{err}}}}}\in {{\mathbb{R}}}_{\ > \ 0}$, accounted for in the additionally reaction1

$$({\mbox{Error}})\!:\,{{{\rm{L}}}}\longrightarrow {{{\rm{H}}}}\left[{\sigma }_{{{{\rm{err}}}}}\,\cdot \,L\right]$$

(8)

For the purpose of analysis, unless stated otherwise, we will subsume both (5) and (8) into the single reaction

$$({\mbox{Set}})\!:\,{{{\rm{L}}}}\longrightarrow {{{\rm{H}}}}\quad \left[\sigma \,\cdot \,L\right]$$

(9)

calling σ = σ_A ⋅ A + σ_err the rare-event detection rate.

Figure 1 illustrates the algorithm’s reactions (Fig. 1a) and three modes of operation: in absence of the analyte (Fig. 1b), in its presence (Fig. 1c), and after detection of the analyte with the analyte potentially being absent (Fig. 1d). Observe that the high density of H cells is maintained in the latter case.

**Fig. 1: Illustration of the algorithm.**

Figure 2 visualizes the Allee-like behavior of H cells: Fig. 2a, b show the “birth” rate of H cells, i.e., the sum of the rates in (6) and in (9), as well as the “death” rate of H cells, i.e., the rate in (7), over the density of H cells. In the absence of the analyte (Fig. 2a), the rare-event detection rate σ is low, and stable steady states for the H density are either low (close to 0 mL⁻¹) or high (about 1.5 ⋅ 10⁸ mL⁻¹). We show later that the low-density steady state is reached if the initial H density was below a threshold, and the high-density steady state if it was above this threshold, thus guaranteeing the memory effect for a once detected analyte. Conversely, in presence of the analyte, and a consequently high rare event detection rate σ, the H cell density converges to a high value (Fig. 2b).

**Fig. 2: Allee-like behavior of the H cells.**

We now argue correctness of the algorithm. We can write an ordinary differential equation (ODE) for the cell densities H and P from reactions (9), (7), and (6), obtaining

$$\frac{dH}{dt}=\sigma \cdot (P-H)+\kappa \frac{{H}^{n}}{{H}^{n}+{K}^{n}}(P-H)-\rho H.$$

(10)

Using our proof strategy, we can establish the correctness of the Allee-based algorithm by showing that under certain conditions of its parameters κ, K, n, and the total population size P, the algorithm guarantees: (i) convergence to a low density of H cells if the rare event detection rate σ is below a critical rate σ_c and the initial density of H cells is low (no memorized detection happened), and (ii) to a high density of H cells either if the initial population of H cells was high (a detection was memorized), or the rare event rate σ exceeds the critical threshold rate σ_c (the analyte is being detected).

Theorem 1

If ${\max}_{H\in [0,P]}(\kappa \frac{{H}^{n}}{{H}^{n}+{K}^{n}}(P-H)-\rho H)\, > \,0$, then there exist α_i, α_f in $\left]0,1\right[$ with α_i < α_f and σ_c > 0 such that:

If σ < σ_c, there exists a critical point H_c,σ such that:
1. 1.
  If H(0) < H_c,σ, then H(t) converges to a value in $\left[\right.0,{\alpha }_{i}P\left[\right.$.
2. 2.
  If H(0) > H_c,σ, then H(t) converges to a value in $\left[\right.{\alpha }_{f}P,P\left[\right.$.
If σ > σ_c, then H(t) converges to a value in $\left.\right]{\alpha }_{f}P,P\left[\right.$.

The following two corollaries immediately follow from Theorem 1 and establish the correctness of detection (Corollary 1) and memorization of a previously detected analyte (Corollary 2).

Corollary 1

(Detection). If the conditions for Theorem 1 hold, and with α_i, α_f, a nd σ_c as defined in Theorem 1, if H(0) = 0 then H(t) converges to a value in $\left[0,{\alpha }_{i}P\right[$ if σ < σ_c, and to a value in $\left[\right.{\alpha }_{f}P,P\left[\right.$ if σ > σ_c.

Corollary 2

(Memory). If the conditions for Theorem 1 hold, and under the notation for Theorem 1, if H(0) > α_iP, then H(t) converges to a value in $\left[\right.{\alpha }_{f}P,P\left[\right.$.

We finally establish an upper bound on the time the algorithm needs to converge to a high density of H cells in presence of an analyte. The proof is given in the Supplementary Material, for which the convergence time is established.

Theorem 2

(Convergence time). Let σ > σ_c. Let ${\max}_{H\in [0,P]}(\kappa \frac{{H}^{n}}{{H}^{n}+{K}^{n}}(P-H)-\rho H)\, > \,0$. Let H_c,0 be the second lowest non-negative root solution of $\kappa \frac{{H}^{n}}{{H}^{n}+{K}^{n}}(P-H)-\rho H=0$. If ${\max}_{H\in [0,P]}(\kappa \frac{{H}^{n}}{{H}^{n}+{K}^{n}}(P-H)-\rho H)\, > \,0$, there exists a time ${t}_{l}={\beta }_{{\sigma }_{1}}\ln (1-{R}_{{\sigma }_{1}})$ with β_σ and R_σ functions of σ, such that H(t) > H_c,0 for any t > t_l.

To validate that the algorithm performs well within realistic parameter ranges, we estimated parameters and ran simulations from a potential genetic circuit implementation (Fig. 3). Following previous QS circuit designs in synthetic biology^22,23, we use an N-acyl homoserine lactone (AHL) as the QS molecule. The AHL molecule is synthesized by LuxI under the control of a promoter (p1 in Fig. 3) that is activated by the binding of an LuxR- AHL complex. LuxR is consituently expressed by the circuit (not shown in the figure). Additionally, the detection of the analyte by the cell is assumed to activate promoter p1.

**Fig. 3: Genetic circuit implementation of the Allee-based algorithm.**

We next outline how this circuit implements the Allee-based algorithm: The algorithm’s cell states L and H model cells with low internal LuxI, respectively, high internal LuxI concentrations. Reaction (9) models the fact that an anlyte leads to an increasing internal concentration of LuxI, thus converting an L cell to an H cell. Also, LuxI is degraded and diluted within the cell, accounting for reaction (7). Finally, H cells synthesize AHL that diffuses into the medium and from there into surrounding cells. The so-formed LuxR-AHL complex consequently activates promoter p1 that shows a Hill-type activation profile. The promoter’s activation again leads to expression of LuxI, making an L cell switch to an H cell, as required by reaction (6).

For simulations we parametrized the algorithm’s reactions with rate parameters from literature (Table 1). For the previously discussed implementation, κ corresponds to the expression of LuxI controlled by p1. From the model by ref. ²² [supplementary information], we choose κ = 35 h⁻¹. The reaction rate constant ρ corresponds to the degradation rate constant of LuxI and was set to 14 h^[−122. The Hill coefficient n for activation of p1 via LuxR-AHL was set to 4²². The threshold parameter K for the activation via LuxR-AHL was set to a relatively high value of 8 ⋅ 10⁷ mL⁻¹, reported in a circuit by Smith and Schuster²⁴. The total cell density was set to a value larger than 2-times the threshold K, and well in the range of reachable E. coli cell densities; we used P = 1.5 ⋅ 10⁸ mL⁻¹. The parameters in Table 1 fulfill the condition of Theorem 1.

Table 1 Reaction and population parameters used in simulations

Full size table

The critical threshold rate σ_c was determined through binary search. We start from the interval $I=[0,{\sigma }_{{{{\rm{utr}}}}}]$, where ${\sigma }_{{{{\rm{utr}}}}}\cdot (P-H)+\kappa \frac{{H}^{n}}{{H}^{n}+{K}^{n}}(P-H)-\rho H=0$ has only one solution for H. From Theorem 1, the equation $\kappa \frac{{H}^{n}}{{H}^{n}+{K}^{n}}(P-H)-\rho H=0$ necessarily has three solutions for H. We then repeatedly determine the midpoint of the interval, and verify if the above equation has three or one solution with the midpoint as σ. In case of three solutions, we replace the left bound of I by the midpoint, and in case of one solution we replace the right bound by the midpoint. In case two solutions or a sufficient precision is reached, the search terminates. We refer the reader to the Supplementary Material for more details. For our setting, we find that σ_c ≈ 3.04 h⁻¹, about 22% of the system’s lowest rate constant, which is ρ.

The parameters in Table 1 were used to obtain the transient and steady-state simulation results. Figure 4a shows $\frac{dH}{dt}$, i.e., the net birth rate of H cells, versus H for different rare event detection rates σ. While larger values of σ lead to a single equilibrium points, lower values (0 h⁻¹ and 1.5 h⁻¹ in the figure) result in three equilibrium points. Of the three points, the smallest and the largest are stable and are seen to have different H densities for different σ: the smallest equilibrium point corresponds to a negative detection result and the largest to a positive detection result. Further, choosing σ = 3.04 h⁻¹ close to the critical σ_c, results in a net birth rate function that barely touches the x-axis.

We next ran transient simulations for the same σ values as in Fig. 4a over a time range of 0.6 h simulated time (Fig. 4b). One observes, the convergence of H(t) to a high density of H cells for a σ of 4 h⁻¹ and 5 h⁻¹, and to a low density of H cells for a σ of 0 h⁻¹ and 1.5 h⁻¹; in agreement with Theorem 1. The transient simulation for σ = 3.04 h⁻¹ close to the critical σ_c does not visibly converge in the simulated time. Increasing the simulation time, however, shows that it converges to a high H density.

he memorization of the presence of an analyte that has been removed thereafter. In Fig. 4c we varied the exposure time of the cells to the analyte, while in Fig. 4d the concentration of the analyte was varied. The memorization above a certain critical rate σ_c(t) = σ_A ⋅ A(t) is observed in both cases, which is consistent with Corollary 2.

We finally ran simulations for an extended duration to determine steady-state values for different settings of rare event detection rates σ (Table 2). The results are consistent with Theorem 1. When σ is above the critical rate σ_c, larger values of σ lead to faster convergence of H(t) to its steady state. As shown in Table 2, the highest convergence time is 0.61 h, which occurs for σ = 3.04 h⁻¹, a value close to σ_c. For a slightly higher σ = 4 h⁻¹, the convergence time is already reduced to 0.44 h. For a mathematical analysis of the convergence times, we refer the reader to the Supplementary Material (Lemmas 7 and 19).

Table 2 Steady-state density of H(t) and convergence times for different rare event detection rates

Full size table

To demonstrate the effectiveness of the Allee-based algorithm, we compare its performance to other algorithms, including two natural adaptations and one presented by ref. ¹⁴. In the comparison we focus on the thresholding behavior of the algorithms: ideally the detection algorithm shows a strong amplification of its detection output around a threshold concentration of the analyte, below of which the output is strong negative, and above of which it is strong positive.

A natural simplification of the Allee-based algorithm is to remove the (6) reaction, and only keep the reactions that transform cells to H cells in presence of the analyte, as well as reset H cells to L cells with a certain reset rate ρ:

$$\,{{\mbox{(Set):}}}\,{{{\rm{L}}}}\longrightarrow {{{\rm{H}}}}\quad \left[\sigma \cdot L\right]$$

(11)

$$\,{{\mbox{(Reset):}}}\,{{{\rm{H}}}}\longrightarrow {{{\rm{L}}}}\quad \left[\rho \cdot H\right]$$

(12)

Steady-state analysis of H cells via setting $\frac{dH}{dt}=0$ and subsequent algebraic manipulation yields, $H=\frac{\sigma }{\rho }P/(1+\frac{\sigma }{\rho })=P\frac{\sigma }{\rho +\sigma }$ as the unique steady-state. Since it is unique, the algorithm lacks the possibility to memorize previous presence of the analyte. Further, for most applications we expect σ to be small compared to the other rates (and in particular ρ), implying a low amplification from the analyte concentration A to the output H.

A natural distributed algorithm that solves the problem of detecting an analyte is to broadcast any detection of the analyte to all other cells that relay this broadcast. Here, relay is obtained by a cell in state H that had been informed of the presence of the analyte, to pass this information to any L cell it interacts with. In terms of reactions, and referring to κ > 0 as the broadcasting rate, this algorithm can be written as

$$\,{{\mbox{(Set):}}}\,{{{\rm{L}}}}\longrightarrow {{{\rm{H}}}}\quad \left[\sigma \cdot L\right]$$

(13)

$$({\mbox{Relay}}\!:)\,{{{\rm{L}}}}+{{{\rm{H}}}}\longrightarrow 2\,{\cdot}\,{{{\rm{H}}}}\quad \left[\kappa\,\cdot \,L\,\cdot \,H\right]$$

(14)

and its dynamics are $\frac{dH}{dt}=\sigma \cdot (P-H)+\kappa H(P-H)$.

Since 0 ≤ H ≤ P, one has that H is bounded. We now perform a case distinction between the cases where σ > 0 and σ = 0. Case σ = 0: For any $H\in \left[0,P\right[$, one has $\frac{dH}{dt}\, > \,0$. From the monotonicity of H and its boundedness, it follows that H converges to a finite steady-state. Since $\frac{dH}{dt}=0$ when H = P, one has that H(t) converges to P. Case σ > 0: For any $H\in \left]0,P\right[$, one has $\frac{dH}{dt}\, > \,0$. If H(0) is in $\left]0,P\right]$, from the monotonicity of H in $\left]0,P\right]$ and its boundedness, it follows that H converges to a finite steady-state. Since $\frac{dH}{dt}=0$ when H = P, one has that H(t) converges to P. If H(0) = 0, since, one has that H(t) = 0 for all t. Indeed, two steady states, obtained by setting $\frac{dH}{dt}=0$, are possible: (i) σ = 0 and H = 0, and (ii) σ > 0 and H = P.

While the presence of the two steady-states shows that the algorithm can memorize previously detected analytes, the steady-states also show that the algorithm cannot tolerate incorrect transitions of L cells to H cells: for an arbitrarily small σ, all cells switch to state H.

Ref. ¹⁴ proposed a circuit that uses distributed amplification via a QS pathway: Cells that detect the analyte synthesize the QS molecule. Similar to the model for the Allee-based algorithm, we abstract this via two cell types: L cells with low internal concentrations of LuxI and H cells with high concentrations of LuxI. Any cell whose QS threshold is triggered, expresses a reporter molecule S (e.g., YTP). In terms of reactions, this algorithm is expressed as

$$\,{{\mbox{(Set):}}}\,{{{\rm{L}}}}\longrightarrow {{{\rm{H}}}}\quad \left[\sigma \cdot L\right]$$

(15)

$$\,{{\mbox{(Reset):}}}\,{{{\rm{H}}}}\longrightarrow {{{\rm{L}}}}\quad \left[\rho \cdot H\right]$$

(16)

$$({\mbox{Signal-Secretion}})\!\!:\,\forall C\in \{L,H\}\!:\,{{{\rm{C}}}}+{{{\rm{H}}}}\longrightarrow {{{\rm{C}}}}+{{{\rm{H}}}}+{{{\rm{S}}}}\quad \left[\kappa \,\cdot \,\frac{{H}^{n}}{{H}^{n}+{K}^{n}}\cdot C\right]$$

(17)

$$\,{{\mbox{(Signal-Decay):}}}\,{{{\rm{S}}}}\longrightarrow {{\emptyset}}\quad \left[\rho \cdot S\right]$$

(18)

where S is the reporter molecule, and the reaction (18) accounts for the decay of S.

To compare the thresholding behavior of the distributed amplification, the Set-reset, and the broadcasting algorithm to the Allee-based algorithm, we ran simulations in presence of the same analyte concentration A for all four algorithms (Fig. 5). Simulation parameters where chosen identically for similar reactions. One observes the strong amplification of the Allee-based algorithm around a non-zero critical concentration of A. The distributed amplification algorithm shows a threshold behavior, but with a weaker amplification. The broadcasting algorithm has a strong threshold at 0, and the Set-reset algorithm shows no thresholding behavior.

**Fig. 5: Comparison of the four algorithms: Allee-based, Set-reset, broadcasting, and distributed amplification algorithm.**

The non-zero thresholding behavior around a critical threshold σ_c as shown in Theorem 1 and demonstrated by simulations in Fig. 5 suggests that the Allee-based algorithm is robust to incorrectly detected analytes by L cells (σ_err > 0 in our model). To demonstrate that this is the case, also for stochastically varying incorrect detections as exprected in a real genetic circuit implementation, we ran simulations where we stochastically varied σ over time, simulating the effect of a stochastic σ_err in absence of an anlyte. For any such simulation, an ideal algorithm is expected to keep H cell densities low, and thus not wrongly signal the detection of an analyte.

Simulations were carried out in MobsPy²⁵ with a simulated time of 1000 h. For the stochastic model of wrongly detected analytes we chose a stochastic birth-death process of H cells with a birth rate β of 0.5 h⁻¹ and a death rate γ of 2 h⁻¹. The parameters have been set to the leaky expression rate and the decay rate of LacI²² to reflect a realistic parameter range for leaky expression resulting in incorrect L to H transitions of a cell. The so-obtained stochastic rates were then fed into a deterministic transient-time simulation of the Allee-based algorithm. Figure 6 shows the resulting densities H(t) as well as the rate σ(t) over time t for a simulated time of 100 h. We can observe that while the stochastic event detection rate is capable of increasing the density H(t), the algorithm does not amplify the H cells further, thus preventing the cells from incorrectly detecting the analyte.

**Fig. 6: Robustness of the Allee-based algorithm to an incorrect detection of the analyte.**

To examine the impact of parameters like the population density P, as well as the rate parameter κ from (6) and ρ from (7), on the algorithm’s critical threshold σ_c, we determined σ_c for parameter sweeps (Fig. 7). The parameter ranges that violate the condition of Theorem 1 are marked with setting σ_c = 0 in the heat map.

**Fig. 7: Parameter variations showing σ_c for different reaction rate parameters κ and ρ.**

In all the heat maps, we can observe a distinct linear boundary separating a region where σ_c = 0 indicating a violation of the condition of 1 and a valid region. For instance, in Fig. 7c, the region marked as R3 falls in this category where the reset rate constant ρ is significantly larger than the hold rate constant κ, leading to the inability of the amplification process to trigger. Consequently, the region possesses only a stable equilibrium state with H = 0 and no stable equilibrium state for H with a high cell density. This is expected as the (7) reaction dominates the system behavior in this case.

The heat maps also reveal that when the (6) rate parameter κ is higher than the (7) rate constant ρ, the critical threshold σ_c is often close to zero. This is evident in the region marked as R1 in Fig. 7c. Due to the higher hold rate, the algorithm can produces wrong positives for a lower σ.

Discussion

We presented and discussed a distributed algorithm to detect rare events, such as the presence of a rare analyte, by a population of engineered cells. The algorithm is intended to be used in combination with sensory and reporting circuits within MWCS. The algorithm is inspired by the Allee effect observed in natural systems: it uses the fact that a certain critical threshold cell density that signal the presence of an analyte is hard to reach initially, but once it is reached, the fact that an analyte has been detected it quickly propagated to the whole cell population.

We have established conditions under which the algorithm provably works as intended (Theorem 1). Numerical simulations of a proof-of-concept circuit demonstrate that the algorithm shows strong amplification of near a critical threshold concentration of the analyte (Fig. 5). This is in contrast to three other natural algorithms that have been discussed in this work (Fig. 5). Additionally, hybrid stochastic–deterministic simulations (Fig. 6) were carried out to demonstrate the robustness of the algorithm to cells that wrongly detect the analyte.

Since the detection thresholds and the targeted total cell populations may differ significantly per application, one may need to adapt the reaction parameters for these cases. We speculate that the simplicity of the detection algorithm as well as the mechanistic understanding of all parameters greatly simplifies this adaption, e.g., via plasmid copy number manipulation to affect the reaction rates²⁶ and gene removal to alter the parameter K²⁴. We leave the impact of inter-cellular differences and changing population sizes to future work.

Finally, as has been show by the robustness simulations, the choice of the critical threshold rate σ_c balances the capability of sensing rare events and robustness: while a low threshold favors early detection, a high threshold tolerates a larger concentration of wrongly detected analytes.

Methods

Model construction

The model was constructed using the ODE CRN formulation²⁷. In this formulation, considering no inflow and outflow of matter into the system, the rate of change over time for the concentration/density X(t) for any species X is expressed as:

$$\frac{dX(t)}{dt}=\,{{\mbox{Generative reaction rates of X}}}-{{\mbox{Consuming reaction rates of X}}}\,$$

(19)

Here, the “generative reaction rates of X” refer to the rates of reactions where X is produced times its stoichiometry in each reaction, while the “consuming reaction rates of X” refer to the rates of reactions where X is consumed times its stoichiometry in each reaction.

For this model, reaction rates from Reactions (5) and (9) are defined using mass-action kinetics²⁸, which assumes that the reaction rate is the result of the product of the reactants’ concentration and a constant reaction rate. Conversely, Reaction (6) uses the Hill formulation²⁹, which accounts for cooperativity among multiple ligand binding sites.

Taking X as H, writing the ODE associated with the proposed chemical reaction network by inserting the proposed rate expressions, and assuming a constant steady-state population value such that L = P − H yields Equation (10).

Proof strategy

Theorem 1 is proven with the following strategy: From Equation (10), algebraic manipulation of $\frac{dH}{dt}=0$ yields up to three potential equilibrium points for H, the smallest of which is stable, the middle one unstable, and the largest one being stable (if they exist). Using the monotonicity of $\frac{dH}{dt}$ within certain subdomains of H, one can show that for the interval I between the first, stable, and the second, unstable, equilibrium point, $\frac{dH}{dt}$ is negative for H ∈ I, and $I={{\emptyset}}$ for rare event detection rates σ larger than a critical rate σ_c. The proof we give in the Supplementary Material is based on the intermediate value theorem. For sufficiently large values of σ, only one stable equilibrium point remains and its value can be bounded away from 0. The proof in the Supplementary Material uses a quadratic Lyapunov function to show convergence to this fixed point.

Theorem 2 is proven with the following strategy: We begin by establishing a lower bound ordinary differential equation (ODE), $\frac{d{H}^{* }}{dt} < \frac{dH}{dt}$, known for its exponential convergence to a fixed point. In parallel, we define an interval I starting at zero. If, at any point in time t_c, the value of H(t_c) falls outside this interval, it indicates that H(t) will converge to the amplified state. Furthermore, leveraging the ODE’s lower bound and the initial conditions H^*(0) = H(0) = 0, if there exists a specific time instance t_l where H^*(t_l) exits the I interval, it implies that H(t_l) must also leave this interval.

Simulation model

We used the Python simulation framework MobsPy²⁵ to obtain transient and steady-state simulation results for the parameters in Table 2. The core of the simulation code is shown in Listing 1.

The robustness simulation was separated into two parts. First, we generated a birth and death process to simulate noisy event rates introduced to the system, as shown in code Listing 2.

Subsequently, the generated data was incorporated into another model using simulation events, which represent changes in species values during simulation. Its core code is shown in Listing 3.

# s = sigma, k = kappa, and p = rho

L, H = BaseSpecies(2)

L > > H [s] # Set

H > > L [p] # Reset

L + H > > 2 * H [lambda l, h: f'{k}*{l}*1/(1 + ({K}/{h})^{n})'] # Hold

H(0), L(P) # initially, H(0) = 0 and L(0) = P

MySim = Simulation(L ∣ H)

Listing 1: MobsPy simulation code for the Alle-effect-based algorithm. Initialization of reaction and population parameters is according to Table 1. Parameter σ (s in the code) was varied in the simulations.

A = BaseSpecies(1)

Zero > > A [pr]

A > > Zero [dr]

MySim = Simulation(E)

MySim.simulation_method = ‘stochastic’

Listing 2: MobsPy simulation code for the birth and death noise generation. The simulation parameters were set to pr = 0.5 and dr = 2.

with open('noise.pkl', 'rb') as file: noise = pickle.load(file) L, H, A = BaseSpecies(3)

L + A > > H + A[1]

H > > L[p]

L + H > > 2 * H[lambda l, h: f'{k}*{l}*1/(1 + ({K}/{h})^{n})']

L(P) S1 = Simulation(H ∣ L ∣ A) for time, data in zip(noise['Time'], noise['Data']): with S1.event_time(time): A(data)

S1.simulation_method = ‘stochastic’

Listing 3: MobsPy simulation code for the robustness test. Initialization of reaction and population parameters is according to Table 1. The noise.pkl file contains the results from Listing 2.

All simulations were run in Python version 3.10 and MobsPy version 2.2.0. The hardware used was a MacBook Air with 1,1 GHz Quad-Core Intel Core i5, Intel Iris Plus Graphics 1536 MB, and 8 GB 3733 MHz LPDDR4X. The operating system was Mac OS.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

All data generated from the simulations was generated by the code available at https://github.com/BioDisCo/alle_effect_for_rare_event.

References

Anker, P., Mulcahy, H., Qi Chen, X. & Stroun, M. Detection of circulating tumour dna in the blood (plasma/serum) of cancer patients. Cancer Metastasis Rev. 18, 65–73 (1999).
Article CAS PubMed Google Scholar
Vestergaard, J. et al. Hedgehog signaling in small-cell lung cancer: frequent in vivo but a rare event in vitro. Lung Cancer 52, 281–290 (2006).
Article PubMed Google Scholar
Zimmerlin, L., Donnenberg, V. S. & Donnenberg, A. D. Rare event detection and analysis in flow cytometry: bone marrow mesenchymal stem cells, breast cancer stem/progenitor cells in malignant effusions, and pericytes in disaggregated adipose tissue. Flow Cytometry Protoc. 251–273 https://link.springer.com/protocol/10.1007/978-1-61737-950-5_12 (2011).
Sidransky, D. Emerging molecular markers of cancer. Nat. Rev. Cancer 2, 210–219 (2002).
Article CAS PubMed Google Scholar
Andreescu, S. & Sadik, O. A. Trends and challenges in biochemical sensors for clinical and environmental monitoring. Pure Appl. Chem. 76, 861–878 (2004).
Article CAS Google Scholar
Belkin, S. Microbial whole-cell sensing systems of environmental pollutants. Curr. Opin. Microbiol. 6, 206–212 (2003).
Article CAS PubMed Google Scholar
Woo, S.-G. et al. A designed whole-cell biosensor for live diagnosis of gut inflammation through nitrate sensing. Biosens. Bioelectron. 168, 112523 (2020).
Article CAS PubMed Google Scholar
Miller, M. B. & Bassler, B. L. Quorum sensing in bacteria. Annu. Rev. Microbiol. 55, 165–199 (2001).
Article CAS PubMed Google Scholar
Dong, Y.-H. & Zhang, L.-H. Quorum sensing and quorum-quenching enzymes. J. Microbiol. 43, 101–109 (2005).
CAS PubMed Google Scholar
Pu, L., Yang, S., Xia, A. & Jin, F. Optogenetics manipulation enables prevention of biofilm formation of engineered pseudomonas aeruginosa on surfaces. ACS Synth. Biol. 7, 200–208 (2018).
Article CAS PubMed Google Scholar
He, J., Zhang, X., Qian, Y., Wang, Q. & Bai, Y. An engineered quorum-sensing-based whole-cell biosensor for active degradation of organophosphates. Biosens. Bioelectron. 206, 114085 (2022).
Article CAS PubMed Google Scholar
Wu, Y., Wang, C.-W., Wang, D. & Wei, N. A whole-cell biosensor for point-of-care detection of waterborne bacterial pathogens. ACS Synth. Biol. 10, 333–344 (2021).
Article CAS PubMed Google Scholar
Moraskie, M. et al. Microbial whole-cell biosensors: current applications, challenges, and future perspectives. Biosens. Bioelectron. 191, 113359 (2021).
Article CAS PubMed PubMed Central Google Scholar
Hsu, C.-Y., Chen, B.-K., Hu, R.-H. & Chen, B.-S. Systematic design of a quorum sensing-based biosensor for enhanced detection of metal ion in escherichia coli. IEEE Transact. Biomed. Circ. Syst. 10, 593–601 (2016).
Article Google Scholar
Cai, S. et al. Engineering highly sensitive whole-cell mercury biosensors based on positive feedback loops from quorum-sensing systems. Analyst 143, 630–634 (2018).
Article CAS PubMed Google Scholar
Macía, J., Posas, F. & Solé, R. V. Distributed computation: the new wave of synthetic biology devices. Trends Biotechnol. 30, 342–349 (2012).
Article PubMed Google Scholar
Allee, W. & Bowen, E. S. Studies in animal aggregations: mass protection against colloidal silver among goldfishes. J. Exp. Zool. 61, 185–207 (1932).
Article CAS Google Scholar
Courchamp, F., Berec, L. & Gascoigne, J. Allee effects in ecology and conservation (OUP Oxford, 2008).
Berec, L., Angulo, E. & Courchamp, F. Multiple allee effects and population management. Trends Ecol. Evol. 22, 185–191 (2007).
Article PubMed Google Scholar
Mooring, M. S., Fitzpatrick, T. A., Nishihira, T. T. & Reisig, D. D. Vigilance, predation risk, and the allee effect in desert bighorn sheep. J. Wildl. Manag. 68, 519–532 (2004).
Article Google Scholar
Clutton-Brock, T. et al. Predation, group size and mortality in a cooperative mongoose, suricata suricatta. J. Animal Ecol. 68, 672–683 (1999).
Article Google Scholar
Din, M. O. et al. Synchronized cycles of bacterial lysis for in vivo delivery. Nature 536, 81–85 (2016).
Article CAS PubMed PubMed Central Google Scholar
Danino, T., Mondragón-Palomino, O., Tsimring, L. & Hasty, J. A synchronized quorum of genetic clocks. Nature 463, 326–330 (2010).
Article CAS PubMed PubMed Central Google Scholar
Smith, P. & Schuster, M. Antiactivators prevent self-sensing in pseudomonas aeruginosa quorum sensing. Proc. Natl. Acad. Sci. 119, e2201242119 (2022).
Article CAS PubMed PubMed Central Google Scholar
Cravo, F., Függer, M., Nowak, T. & Prakash, G. Mobspy: a meta-species language for chemical reaction networks. In International Conference on Computational Methods in Systems Biology, 277–285 (Springer, 2022).
Tsao, K.-L. & Waugh, D. S. Balancing the production of two recombinant proteins inescherichia coliby manipulating plasmid copy number: high-level expression of heterodimeric ras farnesyltransferase. Protein Expr. Purif. 11, 233–240 (1997).
Article CAS PubMed Google Scholar
Feinberg, M. Foundations of chemical reaction network theory (Springer, 2019).
Horn, F. & Jackson, R. General mass action kinetics. Arch. Ration. Mech. Anal. 47, 81–116 (1972).
Article Google Scholar
Weiss, J. N. The hill equation revisited: uses and misuses. FASEB J. 11, 835–841 (1997).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

This work was supported by the ANR project DREAMY (ANR-21-CE48-0003) (M.F., T.N.) .

Author information

Authors and Affiliations

LMF, Université Paris-Saclay, CNRS, ENS Paris-Saclay, Gif-sur-Yvette, France
Fabricio Cravo, Matthias Függer & Thomas Nowak
LISN, Université Paris-Saclay, CNRS, Gif-sur-Yvette, France
Fabricio Cravo
Institut Universitaire de France, Paris, France
Thomas Nowak

Authors

Fabricio Cravo
View author publications
You can also search for this author in PubMed Google Scholar
Matthias Függer
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Nowak
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

F.C., M.F., T.N. designed and conceived the algorithm, F.C., M.F., T.N. simulated and generated data, F.C., M.F., T.N. carried out the mathematical analysis, F.C., M.F., T.N. wrote and revised the final manuscript.

Corresponding authors

Correspondence to Matthias Függer or Thomas Nowak.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Material

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Cravo, F., Függer, M. & Nowak, T. An Allee-based distributed algorithm for microbial whole-cell sensors. npj Syst Biol Appl 10, 43 (2024). https://doi.org/10.1038/s41540-024-00363-3

Download citation

Received: 25 September 2023
Accepted: 27 March 2024
Published: 22 April 2024
DOI: https://doi.org/10.1038/s41540-024-00363-3