Introduction

Injection-induced seismicity has recently drawn much scientific, societal, and political attention owing to the recent surge of globally reported events1,2,3. Fluid injections into the subsurface, associated with wastewater disposal, hydraulic fracturing, carbon dioxide (CO2) sequestration, or geothermal heat extraction operations at energy production sites, can reduce the shear strength of prestressed faults, triggering earthquakes4,5. Although fluid injections may routinely cause none or some minor microseismicity in the injection area6, in other cases, intense microseismicity or even larger magnitude events with damaging effects can be induced2,7. While such incidents have long been known4,8,9, the increasing demand for low-carbon energy production, the rapid advancement of new drilling and exploitation technologies and new larger-scale industrial projects and injection sites will likely cause induced seismicity cases to proliferate.

The main driving mechanism of injection-induced seismicity is attributed to an increase in pore pressure in proximity to the injection area, which can reduce the effective normal stress acting across fault planes, allowing them to slip4. However, poroelastic stresses acting on the broader injection area10, stress interactions between induced events11, thermoelastic-induced stress changes12 and pore-fluid pressure-driven aseismic slip13,14 can also play a prime role in inducing earthquakes. In addition, the spatiotemporal occurrence of induced events can be erratic, expanding in larger than expected distances away from the injection site at timescales that may vary from days up to months or even years15,16,17. To effectively mitigate the associated risk, the main challenge is to better understand the in situ and operational conditions that may lead to larger and potentially hazardous events and to effectively delineate the spatiotemporal evolution of injection-induced seismicity.

Where, when and how many induced earthquakes may occur above a given threshold magnitude are generally obscure owing to the complexity of the injection site, the local geological conditions, the regional stress field, and the injection parameters18. Given the large heterogeneities within the Earth’s crust, the complex and frequently unknown fracture and fault network and the highly anisotropic and dynamically changing permeability field that may regionally vary over several orders of magnitude, the front of the pore-fluid pressure increase can migrate at large distances away from the injection site, bringing critically stressed faults closer to failure, even in the far field15,16,17,18. In addition, poroelastic stresses can be even more remote, expanding well outside the hydraulically connected area where the pore-fluid pressure increases16,19. Stress interactions and thermal effects further add to the complexity of the physical processes occurring in the subsurface following fluid injection, making any prognostic evaluation of the evolution of induced seismicity extremely difficult.

Considering the complexity that may characterize the evolution of injection-induced seismicity, the large uncertainties regarding the in situ physical properties in the subsurface and the wide range of spatial and temporal scales that may appear, we consider a purely stochastic framework that generates the essential properties of the process. Along this line, we build on the well-established continuous-time random walk (CTRW) theory and the fractional diffusion equation to map the spatiotemporal evolution of injection-induced seismicity and to make predictions regarding the conditional probabilities of earthquake occurrence. To test and demonstrate the applicability of the model, we use data from the 2003 and 2012 hydraulic stimulations that were performed at the Cooper Basin geothermal field (South Australia). Fluid injections in the Cooper Basin, aimed to enhance the hydraulic permeability of the hot granitic basement and develop an enhanced geothermal system (EGS), induced thousands of microearthquakes and provided some of the most prolific seismic datasets in EGSs. The analysis and results within the CTRW framework demonstrate the applicability of the model in such cases of anomalous earthquake diffusion, as discussed analytically in the following section.

Methods

Within the CTRW context, we consider seismicity as a point process in time and space marked by the magnitude of the event. We also consider that, starting from the origin (x0 = 0, t0 = 0), seismicity undergoes a random walk in time and space, where each new event site is the new position of the random walk that occurs after waiting some time τ at the previous site. Then, we calculate the 3D Euclidean distance x(t) between each event and the origin, taken as the injection point, and estimate the mean square displacement (MSD) \(\left\langle {x^{2} \left( t \right)} \right\rangle\) of seismicity with time according to20:

$$\left\langle {x^{2} \left( t \right)} \right\rangle = \frac{1}{N}\mathop \sum \limits_{n = 1}^{N} x_{n}^{2} \left( t \right).$$
(1)

where N is the total number of events and xn(t) is the distance of the nth event from the origin that occurred at time t from the initiation of injection. In various complex systems, the MSD grows nonlinearly with time t, frequently taking the form of the power-law function21,22:

$$\left\langle {x^{2} \left( t \right)} \right\rangle \propto t^{a} .$$
(2)

In such cases, anomalous diffusion arises for a ≠ 1, distinguishing superdiffusion for a > 1 and subdiffusion for 0 < a < 1, while normal diffusion is recovered for a = 1. In the following, we use the term “anomalous” to characterize the diffusive process that deviates from the linear growth of the MSD with time (a ≠ 1) and eventually from the linear diffusion equation (see “Supplementary Methods”).

In addition, the subdiffusive regime in the CTRW model, which is the relevant one for the cases that we study (see “Results” section), corresponds to broad waiting-time distributions with divergent characteristic waiting times T (see “Supplementary Methods”). To approximate the scaling behavior of the waiting-time distribution, we use the q-generalized gamma function that has been found suitable for nonstationary earthquake time series23. The q-generalized gamma function reads as follows:

$$f\left( \tau \right) = C\left( {\frac{\tau }{{\tau_{0} }}} \right)^{\gamma - 1} \exp_{q} \left( { - \frac{\tau }{{\tau_{0} }}} \right),$$
(3)

where C is a normalization constant, τ0 is a positive scaling parameter and γ is the scaling exponent. The last term on the right-hand side of Eq. (3) is the q-exponential function defined as follows:

$$\exp_{q} \left( x \right) = \left[ {1 + \left( {1 - q} \right)x} \right]^{{1/\left( {1 - q} \right)}}$$
(4)

The q-exponential function is associated with generalized statistical mechanics, as it maximizes the nonadditive entropy Sq under appropriate constraints24, while its applicability to earthquake dynamics has been demonstrated in various studies (e.g.,25,26 and references therein). For q > 1, the q-exponential function exhibits asymptotic power-law behavior, while at the q → 1 limit, the ordinary exponential function is exactly recovered. Hence, for q > 1, the q-generalized gamma function (Eq. (3)) exhibits double power-law behavior, where short and large τ scale as power laws according to \(\sim \tau^{\gamma - 1}\) and \(\sim \tau^{{\left( {1 - \gamma } \right)/\left( {1 - q} \right)}}\), respectively, while for q → 1, the ordinary gamma function is exactly recovered23.

Moreover, the subdiffusive regime of the CTRW model is interchangeable with the time-fractional diffusion equation (TFDE)27:

$$\frac{\partial }{\partial t}P\left( {x,t} \right) = {}_{0}D_{t}^{1 - a} K_{a} \frac{{\partial^{2} }}{{\partial x^{2} }}P\left( {x,t} \right),$$
(5)

where P(x,t) is the probability density function of the walker being at some position x after time t, Ka the generalized diffusion coefficient and \({}_{0}D_{t}^{1 - a}\) the Riemann–Liouville fractional operator of order 1–a (0 < a < 1). In the “Supplementary Methods”, we describe the derivation of the TFDE and its asymptotic solution in terms of the standard theorem of the Fox hypergeometric functions. For large x \(\left( {x > \sqrt {K_{a} t^{a} } } \right)\), the asymptotic behavior of F(x,t), i.e., the probability density of the number of earthquakes that have just occurred at some position x from the origin after some time t, is given by20,28:

$$F\left( {x,t} \right)\sim \frac{{\left( {\tau^{\prime}} \right)^{ - a} }}{{\sqrt {{K}_{\alpha } } t^{{1 - \left( {a/2} \right)}} }}\left( {\frac{\left| x \right|}{{\sqrt {K_{a} t^{a} } }}} \right)^{{d\left( {1 - a} \right)/\left( {2 - a} \right)}} \times \exp \left[ { - \left( {1 - \frac{a}{2}} \right)\left( \frac{a}{2} \right)^{{a/\left( {2 - a} \right)}} \left( {\frac{\left| x \right|}{{\sqrt {K_{a} t^{a} } }}} \right)^{{2/\left( {2 - a} \right)}} } \right]$$
(6)

where a is the diffusion exponent (Eq. (2)), d is the spatial dimension and \(\tau^{\prime} = T\left( {{\Gamma }\left( {1 - a} \right)} \right)^{1/a}\) (for a < 1). The previous asymptotic solution of the TFDE model indicates a power-law increase in F(x,t) for short x, with a slope of \(d\left( {1 - a} \right)/\left( {2 - a} \right)\), while the second term on the right-hand side of Eq. (6) indicates the asymptotic exponential decay of F(x,t) for long x.

Results

Hydraulic stimulations and induced seismicity in the Habanero field

Hydraulic stimulations at the Cooper Basin initiated in early November 2003 with the main stimulation of the Habanero-1 well. Fluid injections commenced on 30 November 2003 and lasted approximately 9.5 days (Supplementary Fig. S1). During stimulation, more than 20,000 m3 of water was injected into the 4421-m-deep well, indicating a distinct flow zone at a depth of ~ 4254 m29. Flow rates increased stepwise during the main stimulation period, reaching a maximum rate of ~ 45 L/s and wellhead pressure peak values of ~ 65 MPa (Supplementary Fig. S1). Fluid injections induced more than 28,000 microseismic events, with 15,709 recorded during the main stimulation period. Induced microseismicity showed a general tendency to migrate away from the well with continuation of injection (Supplementary Fig. S2), while the alignment of the events along a zone of vertical thickness ~ 100–150 m indicated a single fracture zone as the focus of microseismicity29.

The 4204-m-deep Habanero-4 well was hydraulically stimulated in 2012 to further extend the previous stimulated reservoir. Fluid injections commenced on 14 November 2012, while the main stimulation period lasted approximately 14 days, between 16 and 30th November 2012 (Supplementary Fig. S1). In total, 34,000 m3 of water was injected into the well, inducing more than 29,000 microseismic events during a 2-month period30. During the main stimulation of the well, the flow rate reached a maximum value of ~ 60 L/s, and the wellhead pressure reached a peak value of ~ 50 MPa (Supplementary Fig. S1). The recorded microseismicity reached 17,266 events, indicating a general tendency to migrate away from the injection point, particularly toward the north (Supplementary Fig. S2). Vertical location errors on the order of ~ 100 m, as well as image logs from the Habanero-3 well, indicate that a few-meter-thick single planar fault zone, shallowly dipping to the west—southwest, dominates the occurrence of fluid flow and induced seismicity in the Habanero field30.

To proceed with the analysis, we used the microseismic data collected during the main stimulation periods, i.e., 30/11–09/12/2003 for Habanero-1 and 16/11–30/11/2012 for Habanero-4 (Supplementary Fig. S1). We estimate the magnitude of completeness (Mc) for each earthquake catalog using the nonparametric method of median-based analysis of the segment slope (mbass;31), which is considered more adequate for gradually curved frequency–magnitude distributions32. The implementation of the mbass method indicates an apparent breakpoint in the discrete frequency-magnitude distributions of the Habanero-1 and Habanero-4 earthquake catalogs at M –0.7 and M –0.6, respectively (Supplementary Fig. S3), which are further considered as the Mc for the two catalogs. For M ≥ Mc, the final catalogs that are further used in the analysis consist of 9491 and 7708 events for Habanero-1 and Habanero-4, respectively.

Earthquake diffusion properties

To quantify the rate of earthquake diffusion during the two hydraulic stimulations in terms of the MSD of induced seismicity with time (Eq. (2)), we take as the origin the intersection point of the open hole sections in the two wells and the distinct flow zones (− 4254 m, Habanero-1; − 4160 m, Habanero-4). We then calculate \(\left\langle {x^{2} \left( t \right)} \right\rangle\) according to Eq. (1) for the events with M ≥ Mc.

The MSD of induced seismicity with time for Habanero-1 and Habanero-4 is shown in Fig. 1 (for logarithmically binned data and on log–log axes). For both cases, the MSD grows almost continuously with time, highlighting the diffusive process. The increase in the MSD with time approximates the power-law relationship of Eq. (2), with diffusion exponents a = 0.45 ± 0.04 (R2 = 0.97) for Habanero-1 and a = 0.70 ± 0.05 (R2 = 0.98) for Habanero-4 (Fig. 1), with uncertainties referring to 95% confidence intervals. A lower than unity exponent a, observed for both Habanero-1 and Habanero-4, indicates anomalous diffusion of induced seismicity according to a subdiffusive process. The rate of earthquake diffusion, as quantified by the exponent a, is greater for Habanero-4 (a = 0.70) than for Habanero-1 (a = 0.45). Note, however, that the latter does not necessarily signify longer distances of induced events from the origin. For instance, in Fig. 1, we observe that the instantaneous spatial response of induced seismicity during the first days of injection is greater in Habanero-1 than in Habanero-4. For Habanero-4, we also observe that during the last two days of injection, the MSD departs from the trend ~ t0.7 and instead shows faster growth, as the northern part of the stimulated volume, located further away from the well, becomes seismically active (Supplementary Fig. S2).

Figure 1
figure 1

MSD of induced seismicity with time for Habanero-1 and Habanero-4 (filled squares), in logarithmically spaced bins and on double logarithmic axes. The solid lines represent the best-fitting solutions according to the power-law relationship of Eq. (2). The dashed line represents the trend (a = 1) for normal diffusion.

The waiting-time distribution

To validate the CTRW model and to extract further information regarding the temporal structure of induced seismicity during the two hydraulic stimulations, we investigate the distribution of waiting times τ, i.e., the time intervals between successive earthquakes, defined as τi = ti+1ti, with ti being the time of occurrence of the ith event. To ensure homogeneity in the analysis, we only consider earthquakes with magnitudes M ≥ Mc. We estimate the probability density P(τ) of waiting times τ by counting the number of τ that fall into logarithmically spaced bins and then normalize this value by dividing this number by the bin width and by the total number of counts so that the probabilities of occupation sum to one. The analysis is restricted to waiting times τ ≥ 10 s due to significant scatter that appears in the estimated P(τ) for τ < 10 s, particularly for the Habanero-4 earthquake catalog, which may bias the scaling analysis.

The normalized probability densities P(τ) for Habanero-1 and Habanero-4 are shown in Fig. 2. For both datasets, we observe a bimodal behavior of P(τ). For short τ, P(τ) decays slowly up to a characteristic waiting time, where a gradual crossover to faster decaying probabilities appears for larger τ (Fig. 2). We approximate this scaling trend with Eq. (3), using a weighted nonlinear least squares algorithm to estimate the model parameters and their associated uncertainties (95% confidence intervals). Regression yields the parameter values C = 0.21 ± 0.02, γ = 0.20 ± 0.03, τ0 = 131.21 ± 38.69 and q = 1.19 ± 0.07, with a root-mean-square error (RMSE) of 0.0043 for Habanero-1 and the values C = 0.38 ± 0.03, γ = –0.07 ± 0.03, τ0 = 459.63 ± 93.4 and q = 1.36 ± 0.08, with a RMSE of 0.0037, for Habanero-4 (Fig. 2). The observed P(τ) for both Habanero-1 and Habanero-4 can well be approximated with the q-generalized gamma function, contrasting with random Poissonian behavior and suggesting clustering effects at all time scales in the evolution of injection-induced seismicity in the Habanero field. We recall that Gaussian propagation is associated with asymptotic exponential scaling in the time domain33 so that a long-tailed waiting-time distribution with asymptotic power-law scaling is an additional manifestation of anomalous diffusion, particularly subdiffusion.

Figure 2
figure 2

Normalized probability densities P(τ) of the waiting times τ (filled circles) on double logarithmic axes for Habanero-1 (top) and Habanero-4 (bottom). The model (solid line) represents Eq. (3) for the parameter values referred to in the main text, while the dotted lines the standard gamma function fitted to the data.

Propagation of induced seismicity

In the previous sections, we established the subdiffusive character of the seismicity induced during the main stimulations of the Habanero-1 and Habanero-4 wells. In the current section, we proceed one step further to model the propagation of induced seismicity in space and time in terms of the pdf F(x,t), or else the concentration profile, which expresses the probability density of the number of earthquakes that have just occurred at some position x from the origin after some time t (see also “Supplementary Methods”). To model F(x,t), we use the time-fractional diffusion equation (TFDE) (Eq. (5)) and its asymptotic solution (Eq. (6)).

To estimate F(x,t), we construct the histogram of the absolute 3D distances between each induced event (for M ≥ Mc) and the origin for various time periods. We then normalize F(x,t) such that \(\smallint dxF\left( {x,t} \right) = 1\). The normalized probability densities F(x,t) for 50 m wide consecutive bins and for two time periods are shown in Fig. 3 for Habanero-1 and in Fig. 4 for Habanero-4. For Habanero-1, we estimate F(x,t) for t = 5 days and for t = 10 days, which mark approximately the time of occurrence of 50% of the induced events and the termination of injection, respectively, while for Habanero-4, we estimate F(x,t) for t = 9 days (45% of the induced events) and for t = 14 days (termination of injection). From Figs. 3 and 4, we can immediately verify for both cases the departure of F(x,t) from the classic bell-shaped Gaussian function that signifies normal diffusion (see also “Supplementary Methods”). Instead, we observe a peak closer to the origin, i.e., the injection point, and a tail that stretches toward greater distances.

Figure 3
figure 3

Normalized probability density of the absolute 3D distances between the induced events and the origin during the main stimulation of the Habanero-1 well, represented as a stem plot for t = 5 days (top) and t = 10 days (bottom). The solid line and the shaded area indicate the asymptotic solution of the TFDE (Eq. (6)) and the corresponding confidence intervals (see text), respectively.

Figure 4
figure 4

Normalized probability density of the absolute 3D distances between the induced events and the origin during the main stimulation of the Habanero-4 well, represented as a stem plot for t = 9 days (top) and t = 14 days (bottom). The solid line and the shaded area indicate the asymptotic solution of the TFDE (Eq. (6)) and the corresponding confidence intervals, respectively.

To model the observed F(x,t) with the solution of the TFDE (Eq. (6)), we use the model parameters that emerged from the previous analysis of the waiting-time distribution and the MSD of seismicity with time. In particular, for Habanero-1, the mean waiting time was T = 89.42 s, and the diffusion exponent a = 0.45; for Habanero-4, the mean waiting time was T = 143.63 s and a = 0.70. In addition, we set the spatial dimensions to d = 3, as we analyze 3D distances between earthquake hypocenters and the origin. We then use the generalized diffusion coefficient Ka as the only free parameter to optimize the model. The nonlinear least squares algorithm was applied to optimize Eq. (6) for Ka and to estimate the associated 95% confidence intervals, yielding Ka = 23.67·103 ± 6.1·103 (for t = 10 days) for Habanero-1 and Ka = 8.42·103 ± 2·103 (for t = 14 days) for Habanero-4. As shown in Figs. 3 and 4, the model successfully captures the main features of earthquake occurrence, namely, the peak concentration close to the injection site, the narrowing and broadening of the peak with time and the stretched relaxation of seismicity with distance. However, some discrepancies are still present, as we can observe for Habanero-1 (Fig. 3), where the model fails to predict the second greater peak at ~ 500–550 m for t = 5 days and the quite broad peak (~ 300–650 m) for t = 10 days. For Habanero-4 and for t = 9 days, the model slightly underpredicts the peak earthquake concentration and overpredicts the concentration of distant events, while for t = 14 days, it successfully predicts the peak concentration and the spatial relaxation of seismicity (Fig. 4).

Propagation along the principal axes

To account for anisotropic hydraulic diffusivities inside the stimulated volumes, as implied by the elongation of the seismicity clouds along a general NS direction (Supplementary Fig. S2), and to further validate the applicability of the model, we downscaled the analysis to 1D and studied the spatial propagation of seismicity along the principal axes of an ellipsoid that best fit the seismicity clouds. The latter enables the estimation of the spatial density and the average propagation of seismicity in terms of the centered MSD along the main spreading directions. To perform this analysis, we initially use the statistical method of principal component analysis (PCA) to estimate the principal components of the seismicity cloud and to define the principal axes of the ellipsoid.

The approximate 2D geometry of the seismicity cloud during stimulation of the Habanero-1 and Habanero-4 wells29,30 is further confirmed with PCA, as the eigenvalues of the two main principal components can explain more than 98.5% of the total observed spatial variance of seismicity in 3D space. Hence, we can approximate the spatial extent of seismicity clouds with an ellipse rather than an ellipsoid, assuming that 95% of all events are located inside this ellipse. The estimated ellipses for the two seismicity clouds, centered to the two wells, define areas of approximately 3.27 km2 and 3 km2, with the first principal axes oriented 22.5° and 354° from the North for Habanero-1 and Habanero-4, respectively (Supplementary Fig. S4).

We rotate the seismicity clouds around the two origin points so that the x- and y-axes coincide with the principal axes. We then calculated the 1D absolute distances of the rotated events from the two origin points to estimate the MSD and the spatial density of the observed seismicity along the two principal axes. As previously described, the MSD grows as a power-law with time with diffusion exponents less than unity in all cases. In particular, for Habanero-1, a = 0.39 ± 0.06 and a = 0.43 ± 0.06 along the first and second principal axes, respectively, for t in the range between 2 and 10 days (Supplementary Fig. S5). For Habanero-4, a = 0.68 ± 0.09 and a = 0.83 ± 0.11 along the first and second principal axes, respectively, for t > 3 days (Supplementary Fig. S5). By further setting d = 1 in Eq. (6) and by using Ka as the only free parameter to optimize the model, we estimate the asymptotic solution of the TFDE fitted to the data. As previously described in 3D space, the model manages to capture the main properties of earthquake progression in time and space, particularly for Habanero-4 (Supplementary Figs. S6 and S7).

Furthermore, we use the estimated pdfs F(x,t) to model the propagation of induced seismicity in 2D space with Eq. (6). In this case, we estimate the joint probability density function as the product of the marginal pdfs F(x,t). The results are presented in Figs. 5 and 6 for Habanero-1 and Habanero-4, respectively, and for two time periods. In particular, Figs. 5 and 6 show the spatial distributions of induced seismicity along the two principal axes of the ellipse that resulted from PCA and the joint probability density of earthquake occurrence according to Eq. (6). For Habanero-1 and for both t = 5 days and t = 10 days, the model successfully predicts the area of greater earthquake occurrence close to the origin, the greater extent of seismicity along the first principal axis and the decay of earthquake occurrence with distance from the origin (Fig. 5). A similar image was obtained for Habanero-4 (Fig. 6). In this case, for both t = 9 days and t = 14 days, the model successfully predicts the peak earthquake occurrence close to the origin, which remains nearly fixed with increasing time, and the gradual decay of seismicity with distance from the origin, while it seems to slightly underpredict the occurrence of seismicity at distances greater than ~ 900 m along the first principal axis for t = 14 days (see also Supplementary Fig. S7).

Figure 5
figure 5

Joint probability density of earthquake occurrence at Habanero-1 during the main stimulation period, after 5 (top) and 10 days (bottom) from initiation of injections. The absolute earthquake locations along the two principal axes, estimated with PCA, are shown with black dots. Figures were created with MATLAB.

Figure 6
figure 6

Joint probability density of earthquake occurrence at Habanero-4 during the main stimulation period, after 9 (top) and 14 days (bottom) from initiation of injections. The absolute earthquake locations along the two principal axes, estimated with PCA, are shown with black dots. Figures were created with MATLAB.

Discussion

Injection-induced seismicity represents a major challenge for the development of EGSs, and its effective management is vital for public consensus and successful completion34,35. Under these conditions, deciphering the actual physical mechanisms that underlie the spatiotemporal evolution of induced seismicity and distinguishing between normal and anomalous earthquake diffusion are fundamental for predicting the essential characteristics of the system.

Pursuant to the latter, we presented the application of a purely stochastic framework based on the CTRW model and the TFDE to the spatiotemporal evolution of injection-induced seismicity in EGSs. The key quantity in the CTRW model is ψ(x,t), which in our case expresses the joint probability density for an earthquake occurring at position x after time t. Having defined ψ(x,t), the spatiotemporal evolution of seismicity can be determined without further assumptions with the appropriate master equation (see “Supplementary Methods”).

The applicability of the model was demonstrated for two case studies associated with hydraulic stimulations conducted in the Cooper Basin geothermal field in 2003 and 2012. In both cases, the MSD of induced seismicity away from the injection well scales as a fractional order power-law with time with diffusion exponents less than unity, signifying subdiffusion of induced seismicity. Subdiffusion is most commonly encountered in triggered earthquake diffusion despite the driving mechanism20,36,37,38,39,40,41,42,43, indicating that nonlinear inelastic relaxation of the crust occurs in response to a stress perturbation. However, the whole hierarchy of diffusion exponents reported manifests the variability of the physical process, and further studies are needed to decipher the exact details (triggering mechanism, geologic conditions, rheology, stress regime, etc.) that give rise to particular diffusion regimes.

Subdiffusion is further inferred from the power-law dependence of ψ(x,t) on time t. In both cases that we studied, the waiting-time distributions scale according to the q-generalized gamma function presenting asymptotic power-law behavior. The latter indicates long-term memory effects in the temporal evolution of injection-induced seismicity in the Habanero field, contrasting a random (Poissonian) occurrence. Furthermore, this scaling behavior implies that in the long-term limit, the characteristic waiting time T diverges, signifying the emergence of even longer waiting times between successive earthquakes. The latter suggests that sporadic earthquakes between long quiescent periods will continue to occur in the stimulated volume long after termination of injection, in agreement with observations from various EGS where induced earthquakes are observed years after hydraulic stimulation18.

Subdiffusion further implies that the concentration profile of seismicity is highly non-Gaussian but instead presents a skewed distribution, with the peak concentration close to the initial source of the stress perturbation (i.e., the injection well) slowly moving away with time. This is exactly the scaling behavior observed in both studied cases. For both Habanero-1 and Habanero-4, the peak earthquake occurrence is centered close to the two injection wells, while their spatial density gradually decays with increasing distance stretching out the concentration profiles. To model the observations, we used the TFDE, which is interchangeable with the subdiffusive regime of the CTRW model. In such cases of anomalous diffusion, fractional derivatives naturally arise to account for memory effects and long-range correlations often encountered in diffusion processes in complex heterogeneous media44. The asymptotic solution of the TFDE applied to the data shows good agreement, capturing the essential characteristics of earthquake occurrence. As might be expected, some discrepancy between the model and the data exists owing to the complexity of the process and to the nature of the modeling approach, as the theory is for an ensemble average, while the recorded seismicity represents only one “realization” responding to the hydraulic stimulation.

Goebel and Brodsky19 argued that the spatial occurrence of injection-induced seismicity is represented by two populations, the first showing an abrupt exponential decay associated with a dominant pore-pressure triggering mechanism and the second showing a power-law spatial decay associated with poroelastic effects. As previously discussed, the concentration profiles of seismicity in the Habanero field show a peak event concentration close to the two injection wells that spreads slowly with time and an exponential decay of seismicity away from the wells. The latter is in agreement with the results of19 regarding the spatial exponential decay of induced seismicity in the Habanero field and implies that the relaxation of fluid overpressures is the dominant triggering mechanism of induced seismicity in this case, in further agreement with Refs.29,45. In such cases, anomalous diffusion of injection-induced seismicity can emerge from a “non-Fickian” fluid transport process that channels along complex flow pathways with anisotropic hydraulic properties and within a heterogeneous and dynamically evolving permeability field that may vary over several orders of magnitude within the medium. Similar phenomena have been discussed for fluid flow in fractured media46,47,48 and fault damage zones49.

Finally, our results further suggest that the CTRW model can be directly incorporated into EGS-induced seismicity forecasts. Once the diffusion exponent of seismicity is defined, the model can make reasonable and computationally inexpensive predictions regarding the ensemble average of the spatial expansion of seismicity with time, an essential assessment for designing appropriate mitigation and regulatory strategies. This exact stochastic nature, which does not need to incorporate finer details of the system, has established the CTWR model during the last 5 decades quite compelling for modeling anomalous diffusion phenomena.