Assessing effect of best management practices in unmonitored watersheds using the coupled SWAT-BiLSTM approach

Zhang, Xianqi; Qi, Yu; Li, Haiyang; Sun, Shifeng; Yin, Qiuwen

doi:10.1038/s41598-023-44531-7


Article
Open access
Published: 11 October 2023


Scientific Reports volume 13, Article number: 17168 (2023)


Abstract

In order to enhance the simulation of BMPs (Best Management Practices) reduction effects in unmonitored watersheds, in this study, we combined the physically-based hydrological model Soil & Water Assessment Tool (SWAT) and the data-driven model Bi-directional Long Short-Term Memory (Bi-LSTM), using the very-high-resolution (VHR) Land Use and Land Cover (LULC) dataset SinoLC-1 as data input, to evaluate the feasibility of constructing a water environment model for the Ba-River Basin (BRB) in central China and improving streamflow prediction performance. In the SWAT-BiLSTM model, we calibrated the top five SWAT parameters sorted by P-Value, allowing SWAT to act as a transfer function to convert meteorological data into base flow and storm flow, serving as the data input for the Bi-LSTM model. This optimization improved the Bi-LSTM's learning process for the relationship between the target and explanatory variables. The daily streamflow prediction results showed that the hybrid model had 9 regions rated as "Very good," 2 as "Good," 2 as "Satisfactory," and 1 as "Unsatisfactory" among the 14 regions. The model achieved an NSE of 0.86, R² of 0.85, and PBIAS of −2.71% for the overall daily streamflow prediction performance during the verification period of the BRB. This indicates that the hybrid model has high predictive accuracy and no significant systematic bias, providing a sound hydrodynamic environment for water quality simulation. The simulation results of different BMPs scenarios showed that in the scenarios with only one BMP measure, stubble mulch had the best reduction effect, with average reductions of 17.83% for TN and 36.17% for TP. In the scenarios with a combination of multiple BMP measures, the combination of stubble mulch, soil testing and formula fertilization, and vegetative filter strip performed the best, achieving average reductions of 42.71% for TN and 50.40% for TP. 
The hybrid model provides a novel approach to simulate BMPs' reduction effects in regions without measured hydrological data and has the potential for wide application in BMP-related decision-making.

Introduction

With the development of water environment management technology and practices, it is widely recognized that, besides direct discharge of wastewater, the main causes of water quality deterioration and eutrophication in rivers and lakes are human activities, including agricultural activities and urban emissions. These activities disrupt the structure and functioning of watershed ecosystems and degrade their intrinsic elements, leading to non-point source (NPS) pollution by carbon, nitrogen, and phosphorus¹. In some data-scarce regions, modeling the water environment to assess the effectiveness of BMPs is a challenging task². Due to limited hydrological monitoring stations and the increasing and diversified demand for hydrological data with socio-economic development, the issue of data scarcity is expected to persist. In this context, the development of improved watershed hydrological models to enhance the simulation of streamflow and water quality in data-scarce areas has become an urgent necessity. NPS pollution is characterized by a wide range of sources, strong randomness, and high concentrations of pollutants, making its control strategies significantly different from those for point source pollution. Currently, BMPs have been proven to be one of the most effective measures for managing NPS pollution³. BMPs are divided into agricultural BMPs and structural BMPs⁴, which manage the water environment through management and engineering measures, respectively. In order to achieve environmental goals for NPS pollution control, such as reducing nitrogen and phosphorus loads by 30%, a certain amount of economic cost is required for construction and management. However, under the same economic cost, different spatial configuration schemes have different environmental benefits. To develop the most efficient plan (minimizing economic costs and maximizing pollution reduction), it is essential to quantitatively evaluate its benefits before implementation⁵. 
Therefore, enhancing the understanding of BMPs' performance is crucial.

Watershed models are mathematical representations of hydrological, ecological, erosion, and nutrient cycling processes within a watershed. Based on their approach and the processes they simulate, they are typically classified as empirical models and physical models (or process models)⁶. Process models, also known as hydrological process-based models, are built on hydrological processes such as rainfall, evaporation, infiltration, and runoff, and they describe the transport and transfer of pollutants using water as a carrier. Additionally, they simulate processes like vegetation growth, soil erosion, and nutrient cycling. Compared to empirical models, process models can better describe the migration paths and transformation mechanisms of pollutants⁷. Therefore, in recent years, physical models including Area Nonpoint Source Watershed Environment Simulation (ANSWERS), LOAD ESTimator (LOADEST), SWAT and Hydrological River Basin Environment Assessment Model (HydroBEAM) have been widely used in various water environment studies^8,9,10,11. These models have different structures and mechanisms and use different equations to describe BMPs¹². In general, most models tend to focus on simulating only one or a few processes within the watershed, such as hydrology, soil erosion, or nutrient cycling, with only a few models, including SWAT, considering various processes within the watershed¹³. Moreover, SWAT's built-in equations provide a more detailed description of agricultural activities and BMPs¹⁴. Previous research has shown that SWAT can effectively study the performance of BMPs due to changes in hydro-meteorological characteristics, land use and land cover (LULC), and soil properties¹⁰. However, SWAT requires various types of input data (such as precipitation, temperature, evaporation, topography, soil properties, and LULC), demanding higher temporal and spatial resolutions, and its performance is highly dependent on the quality of input data and parameters¹⁵. 
Additionally, the calibration of parameters in SWAT is subject to complex uncertainties due to the intricate issue of equifinality¹⁶, increasing the modeling difficulty and consuming a significant amount of researchers' time.

In recent years, data-driven models have been widely applied in various water environment studies, and their reliability has been validated¹⁷. Essentially, data-driven models aim to derive the linear or nonlinear relationships between explanatory and target variables based on a large amount of input data, without considering the physical characteristics of the variables. The Bi-LSTM model is one type of data-driven model, consisting of two Long Short-Term Memory (LSTM) models running in opposite directions, and it has been shown to outperform single-directional LSTM models in many aspects^18,19. Using Bi-LSTM to simulate watershed runoff can bypass the complex and uncertain calibration process, significantly reducing modeling difficulty²⁰. However, its main challenge lies in the high requirement for representativeness of the training data: once events fall outside the range of the training data, the predictive performance of the model deteriorates significantly²¹. Additionally, data-driven models like Bi-LSTM cannot account for the impact of the spatiotemporal characteristics of rainfall on the runoff generation process in the watershed, because they use rainfall time series from different meteorological stations as input data and overlook the potential influence of spatial variability on runoff variations in the study area. For example, Jiang et al.²² observed that when the rainfall center is close to the outlet of the watershed, the water level at the outlet section rises rapidly.

However, the performance of both conceptual hydrological models and machine learning models in data-scarce regions remains unsatisfactory. In a streamflow simulation study of a sub-basin in the Tonle Sap Basin of Cambodia, where no actual measured data were available, researchers calibrated and validated the SWAT model using daily runoff observations at the watershed outlet. After numerous attempts, the model achieved an NSE of 0.38 and a PBIAS of −78.38% for daily streamflow simulation results during the validation period. This indicates that hydrological models based on physical processes, such as SWAT, perform inadequately in data-scarce regions²³. Moreover, due to the absence of observed hydrological data for training, machine learning models cannot be directly applied to data-scarce watersheds. In this study, to enhance streamflow and water quality simulation in data-scarce watersheds, and to overcome the limitations inherent in both conceptual hydrological models and machine learning models, we developed a coupled SWAT-BiLSTM model. In this hybrid model, the SWAT model serves as a transfer function, combining meteorological information, including temperature, precipitation, wind speed, and humidity, with topographic, soil, and LULC data to transform them into two hydrological variables: baseflow and quickflow. Bi-LSTM, on the other hand, captures linear or nonlinear underlying relationships between the two hydrological variables (explanatory variables) and observed streamflow data (target variable), ultimately enabling streamflow prediction in data-scarce watersheds. This provides a new approach for modeling water environments in areas without measured hydrological data, thus reducing the difficulty in evaluating the reduction effects of BMPs schemes in these regions.

In this study, to assess the reduction effects of different BMPs scenarios in areas without measured hydrological data, we selected the BRB in Shaanxi Province, China, which is known for its severe water pollution, as the case study area. The objectives of this study were as follows: (1) to establish a hybrid model combining SWAT and Bi-LSTM and use it to predict the streamflow in the assumed data-scarce areas; (2) to evaluate the predictive performance of the hybrid model in different regions of BRB; (3) to simulate and evaluate the reduction effects of different BMPs scenarios.

Materials and methods

Study area

The Ba-River is located in the southeastern part of Xi'an City, Shaanxi Province, China. It originates from the northern slope of the Qinling Mountains, north of Lantian County, and flows through Baqiao District and Weiyang District before joining the Yellow River's main tributary, the Wei-River, in Gaoling County. The river has a total length of 109 km and is the largest tributary on the south bank of the Wei River. The Ba-River Basin (33°50′ N–34°27′ N, 109°00′ E–109°47′ E) covers an area of 2581 km², with its topography mainly composed of mountains, ranging in elevation from 357 to 2424 m (Fig. 1). The southern and eastern parts of the basin are mainly covered by forests, while the central part is dominated by extensive farmland. Villages are distributed along both sides of the river, and the northern part is primarily used for urban construction. The predominant soil types in the basin are yellow–brown soil and brown soil. The BRB experiences a warm temperate semi-humid continental monsoon climate with significant seasonal characteristics. The majority of heavy rainfall occurs from July to September, often in the form of continuous rainy days and heavy storms, with a spatial distribution that is generally more rain in the south and less in the north. The total annual precipitation ranges from 502 to 873 mm, with an average of 697 mm over multiple years. The average annual temperature in the basin is between 13.0 and 14.8 ℃, with a multi-year average of 13.7 ℃. The average annual evaporation is 776 mm²⁴. The overall groundwater quality in the BRB is good, but in the Ba-River Ecological Zone, human activities have led to fluoride and total coliform exceeding the standards in some areas, resulting in poor water quality. The pollution sources in different locations of the Ba-River are influenced by the hydrological characteristics, uneven population distribution, and regional economic disparities. 
The upper and middle reaches of the river are mainly affected by livestock farming discharge, domestic sewage, and agricultural pollution, while the lower reaches receive concentrated urban domestic sewage and industrial wastewater. In recent years, with the construction of sewage treatment facilities in the BRB, the pollution load from point sources has been continuously reduced, but NPS pollution has become increasingly prominent²⁵. Therefore, there is an urgent need to conduct research on the effectiveness of NPS pollution emission control schemes in this area.

Soil and water assessment tool (SWAT)

SWAT is a physically-based semi-distributed watershed hydrological model developed by Dr. Arnold of the Agricultural Research Service (ARS) of the United States Department of Agriculture (USDA). Initially, SWAT was applied to large-scale and complex watersheds with different soil types, land use, and management conditions to predict and evaluate the long-term impacts of human activities, such as land use management, on the water cycle, sediment, and agricultural pollutant transport in the watershed²⁶. SWAT model is based on the Simulator for Water Resources in Rural Basins (SWRRB) model and incorporates several characteristics of ARS models. The improvement of the SWRRB model originated from the daily rainfall hydrological model of Chemicals, Runoff, and Erosion from Agricultural Management Systems (CREAMS). In the late 1980s, for water quality assessment, the SWRRB model incorporated pesticide components from the Groundwater Loading Effects of Agricultural Management Systems (GLEAMS) model, the SCS curve method, and newly developed sediment yield calculation equations to address watershed management issues. The SWAT model can simulate the movement of water in evapotranspiration, groundwater, and soil based on empirical equations and the principle of water balance. It not only simulates the water cycle process but also studies the processes of soil erosion, nutrient transport, pesticide, and pathogen cycling using the water cycle as a carrier. In recent years, the model has also been widely used in various aspects such as non-point source pollution detection and control, mechanistic process exploration simulation, and spatial–temporal distribution of pollution load^27,28. In the SWAT modeling process, a watershed is first divided into several sub-basins, and then, combining with data such as land use types and soil types, the sub-basins are further divided into different Hydrological Response Units (HRUs). 
The Soil Conservation Service (SCS) method is used to independently calculate water infiltration and surface runoff in each HRU, and the surface water is calculated at the outlet of the sub-basin, and finally, the routing process is calculated using a simulation computation method. Table 1 shows the data and sources used to establish the SWAT model in the BRB. Figure 2 illustrates the Station ID of the 14 hydrological stations in the SWAT database and the corresponding geographical locations.

Table 1 Data inputs for the SWAT model.

It is worth mentioning that this study used the first 1-m resolution national-scale land-cover map of China created with the deep learning framework to improve modeling accuracy. This dataset is derived from the State Key Laboratory of Information Engineering in Surveying, Mapping, and Remote Sensing (LIESMARS), Wuhan University, with a resolution of 1 m^27,28. Figure 3 shows the comparison of SinoLC-1 with other LULC datasets at a larger spatial scale. Based on the analysis of the VHR satellite image in Fig. 3a, the land cover performance of ESRI_GLC10 in Fig. 3e and GlobeLand30 in Fig. 3g is the most blurred, with farmland, buildings, and forests in urban areas being severely confused. GLC_FCS30 performs the worst in terms of forest cover, transportation roads, rivers, and runoff. FROM_GLC10 shows accurate performance on water bodies (such as artificial lakes and rivers), but its performance in forest cover types does not meet expectations. ESA_GLC10 relatively performs better compared to other comparative products, but its performance in water bodies is still inadequate. In comparison, SinoLC-1 has the best overall performance, accurately representing fine details of land cover such as small rivers, artificial lakes, small ponds, vegetation, and buildings. It can precisely identify the boundaries of different land use types, which significantly reduces the phenomenon of confusion between different LULC types during the SWAT modeling process and contributes to the accurate delineation of HRUs. To quantitatively assess the performance differences between SinoLC-1 and five other widely used large-scale land cover products, a total of 106,852 random samples extracted from each LULC product were compared and analyzed against official land survey reports provided by the Chinese government. 
The validation results indicated that SinoLC-1 achieved an overall accuracy of 91.7% and a kappa coefficient of 0.7595, indicating a high level of consistency between SinoLC-1 and actual LULC data. Information for the other five LULC products is presented in Table 2²⁸.

Table 2 Information for the comparative land-cover products.

To simulate the effects of BMPs, the BRB was divided into 23 sub-basins based on the terrain and real river network vector data, using a threshold of 50 km². Based on the SinoLC-1 dataset and soil type data, 21 sub-basins were further subdivided into 713 HRUs by setting thresholds for LULC, soil type, and slope at 13%, 20%, and 20%, respectively. The daily runoff records from the hydrological station at the BRB outlet were used to calibrate and validate the SWAT model. The calibration period was from January 1, 2015, to December 31, 2019, and the validation period was from January 1, 2020, to December 31, 2022. Considering the issue of equifinality involved in the model calibration process, simultaneous calibration of a large number of parameters can lead to significant uncertainty¹⁶. Based on previous research in the WRB region²⁹, we selected different types of parameters ranked in the top five P-Values in sensitivity analysis for calibration. The calibrated parameters and their values are presented in Table 3. This process was performed using the Sequential Uncertainty Fitting 2 (SUFI-2) algorithm built into SWAT-CUP.

Table 3 Calibrated parameters in SWAT.

Bi-LSTM

LSTM is an improved type of Recurrent Neural Network (RNN) that addresses the issue of long-term dependencies encountered in traditional RNNs³⁰. In the structure of LSTM, the hidden layer neurons are equipped with input gates, forget gates, and output gates. These gates, determined by Sigmoid functions and element-wise multiplication, decide which information should be remembered, giving LSTM the ability of long-term memory and effectively overcoming the vanishing and exploding gradient problems encountered in traditional RNNs³¹. The internal mechanism of a single LSTM neuron is illustrated in Fig. 4, and the mechanisms of the three gates are represented by the following equations:

Forget gate:

$${f}_{t}=\sigma \left({W}_{f}\cdot \left[{h}_{t-1},{x}_{t}\right]+{b}_{f}\right)$$

(1)

Input gate:

$${i}_{t}=\sigma \left({W}_{i}\cdot \left[{h}_{t-1},{x}_{t}\right]+{b}_{i}\right)$$

(2)

$${\widetilde{C}}_{t}=\mathit{tanh}\left({W}_{C}\cdot \left[{h}_{t-1},{x}_{t}\right]+{b}_{C}\right)$$

(3)

$${C}_{t}={f}_{t}\odot {C}_{t-1}+{i}_{t}\odot {\widetilde{C}}_{t}$$

(4)

Output gate:

$${o}_{t}=\sigma \left({W}_{O}\cdot \left[{h}_{t-1},{x}_{t}\right]+{b}_{O}\right)$$

(5)

$${h}_{t}={o}_{t}\odot \mathit{tanh}\left({C}_{t}\right)$$

(6)

Here, ${f}_{t}$, ${i}_{t}$, and ${o}_{t}$ represent the forget gate, input gate, and output gate, respectively; ${C}_{t}$ is the memory cell; ${h}_{t}$ is the output of the neuron's short-term memory at time $t$; ${\widetilde{C}}_{t}$ represents the memory from the new input; $h$ is the hidden vector; $\sigma$ is the activation function; $W$ is the weight matrix; $b$ is the bias term; [M, N] denotes the concatenation of two vectors; ⨀ represents element-wise multiplication.
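To make the gate equations concrete, the following is a minimal sketch of a single LSTM time step written directly from Eqs. (1)–(6). It is an illustration only, not the implementation used in the study: the function names (`sigmoid`, `affine`, `lstm_step`), the dictionary keys for the weights, and the list-of-rows matrix layout are all assumptions made for this example.

```python
import math

def sigmoid(z):
    """Logistic activation used by the three gates."""
    return 1.0 / (1.0 + math.exp(-z))

def affine(W, v, b):
    """Compute W · v + b, with W stored as a list of rows."""
    return [sum(w * x for w, x in zip(row, v)) + b_k for row, b_k in zip(W, b)]

def lstm_step(x_t, h_prev, c_prev, params):
    """One LSTM time step following Eqs. (1)-(6).

    x_t, h_prev, c_prev are plain Python lists; params is a dict holding
    hypothetical keys W_f, b_f, W_i, b_i, W_C, b_C, W_o, b_o.
    """
    z = h_prev + x_t  # concatenation [h_{t-1}, x_t]
    f_t = [sigmoid(a) for a in affine(params["W_f"], z, params["b_f"])]      # Eq. (1)
    i_t = [sigmoid(a) for a in affine(params["W_i"], z, params["b_i"])]      # Eq. (2)
    c_hat = [math.tanh(a) for a in affine(params["W_C"], z, params["b_C"])]  # Eq. (3)
    c_t = [f * c + i * g for f, c, i, g in zip(f_t, c_prev, i_t, c_hat)]     # Eq. (4)
    o_t = [sigmoid(a) for a in affine(params["W_o"], z, params["b_o"])]      # Eq. (5)
    h_t = [o * math.tanh(c) for o, c in zip(o_t, c_t)]                       # Eq. (6)
    return h_t, c_t
```

With a hidden size of 1 and two inputs (for instance baseflow and stormflow), each weight matrix has one row of three entries, one for the previous hidden state and one for each input.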

Although LSTM has the ability of long-term memory, it can only perform forward learning and extract information from unidirectional time series, which limits its learning capacity. Bi-LSTM, by stacking two LSTM networks in opposite directions, utilizes time series data twice and can more fully explore the potential correlation information between the input variables and the target variables. To determine the appropriate network structure and hyperparameters for optimizing Bi-LSTM's performance, we used the Firefly optimizer (FHO) to find the best combination of different hyperparameters. The population size, maximum number of iterations, extinction coefficient, and attraction coefficient were set to 60, 1000, 0.7, and 4, respectively. FHO is an evolutionary algorithm inspired by the foraging behavior of the Black kite, the Maroon Oriole, and the Brown Falcon, with strong global search capabilities³². Compared to traditional gradient-based optimization algorithms, FHO does not rely on the gradient information of the objective function, making it suitable for optimizing problems involving non-continuous, non-smooth, and even black-box functions³³. After conducting multiple experiments by using the hyperparameters of Bi-LSTM as the search dimensions of FHO, the optimal Bi-LSTM model was found to have 512 neurons in the first hidden layer and four dense layers with 256, 78, 32, and 1 neurons, respectively. To prevent overfitting, a Dropout rate of 0.3 was set for the model. Additionally, Rectified Linear Unit (ReLU) was used as the activation function for the hidden layers to reduce computation and avoid the vanishing gradient problem³⁴.
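The bidirectional idea described above can be sketched independently of any deep learning library. In the example below, `toy_cell` is a hypothetical single-unit recurrent cell standing in for a full LSTM, and its weights are arbitrary; the point is only to show how a forward pass and a backward pass over the same sequence are aligned and paired per time step. It does not reproduce the network structure or hyperparameters reported in this study.

```python
import math

def toy_cell(h_prev, x_t, w_h=0.5, w_x=1.0):
    """Hypothetical stand-in for an LSTM cell: one tanh recurrent unit."""
    return math.tanh(w_h * h_prev + w_x * x_t)

def run_direction(sequence, reverse=False):
    """Return one hidden state per time step, aligned with the input order."""
    order = list(reversed(sequence)) if reverse else list(sequence)
    h, states = 0.0, []
    for x_t in order:
        h = toy_cell(h, x_t)
        states.append(h)
    # Re-align the backward pass so that index t refers to the same time step.
    return states[::-1] if reverse else states

def bidirectional(sequence):
    """Pair the forward and backward hidden states at every time step."""
    forward = run_direction(sequence)
    backward = run_direction(sequence, reverse=True)
    return list(zip(forward, backward))
```

At each time step the forward state summarizes the inputs up to that step, while the backward state summarizes the inputs from that step onward, which is why the sequence is effectively used twice.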

Coupling SWAT with Bi-LSTM

In this study, to establish a water environment model in data-scarce regions, we used a hybrid model combining SWAT and Bi-LSTM. SWAT is responsible for simulating baseflow and stormflow generated by precipitation events. The simulated results are then used to train the Bi-LSTM model to predict daily streamflow during the simulation period. In this process, only some parameters of SWAT are calibrated, which significantly reduces the uncertainty and unnecessary time and effort invested in the modeling process while ensuring model performance. Essentially, SWAT acts as a transfer function, transforming input variables such as terrain, soil type, weather, and LULC into two output variables: baseflow and stormflow. To validate the performance of the hybrid model in different regions, we adopted a cross-validation approach. There are a total of 14 hydrological stations in the BRB. We iteratively excluded one station and used the daily streamflow data from the remaining stations to train an independent Bi-LSTM model. Finally, the excluded station was used to verify the performance of the trained model, and the performance of each model in the target region was obtained. In this process, a total of 14 Bi-LSTM models were trained. The flowchart of this approach is shown in Fig. 5.
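The station-wise cross-validation described above can be sketched as a leave-one-out loop. This is a minimal outline under stated assumptions: the station labels are placeholders (the real Station IDs are those in Fig. 2), and `train_fn` and `score_fn` are hypothetical callables standing in for Bi-LSTM training and evaluation, which are not shown here.

```python
def leave_one_station_out(station_ids):
    """Yield (training_ids, held_out_id) pairs, one pair per station."""
    for held_out in station_ids:
        training = [s for s in station_ids if s != held_out]
        yield training, held_out

def cross_validate(station_ids, train_fn, score_fn):
    """Train one model per held-out station and collect its score.

    train_fn(training_ids) -> model and score_fn(model, held_out_id) -> score
    are placeholders supplied by the caller.
    """
    scores = {}
    for training, held_out in leave_one_station_out(station_ids):
        model = train_fn(training)
        scores[held_out] = score_fn(model, held_out)
    return scores

# Placeholder labels for the 14 hydrological stations in the BRB.
stations = [f"station_{k:02d}" for k in range(1, 15)]
folds = list(leave_one_station_out(stations))
```

Each of the 14 folds trains on 13 stations and evaluates on the one that was excluded, so every station serves once as the assumed no-data region.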

Model performance evaluation

Three metrics were used to evaluate the performance of the established SWAT and Bi-LSTM hybrid model in predicting streamflow in data-scarce regions. They are the Nash–Sutcliffe efficiency (NSE), coefficient of determination (R²), and Percent Bias (PBIAS). NSE and R² reflect the degree of collinearity between observed and simulated values, while PBIAS reflects the systematic bias between simulated and observed values. Their calculation formulas are as follows:

$$NSE=1-\frac{\sum_{i=1}^{N}{\left({M}_{i}-{S}_{i}\right)}^{2}}{\sum_{i=1}^{N}{\left({M}_{i}-\overline{M}\right)}^{2}}$$

(7)

$${R}^{2}=\frac{{\left[\sum_{i=1}^{N}\left({M}_{i}-\overline{M}\right)\left({S}_{i}-\overline{S}\right)\right]}^{2}}{\sum_{i=1}^{N}{\left({M}_{i}-\overline{M}\right)}^{2}\sum_{i=1}^{N}{\left({S}_{i}-\overline{S}\right)}^{2}}$$

(8)

$$PBIAS=\frac{\sum_{i=1}^{N}\left({M}_{i}-{S}_{i}\right)}{\sum_{i=1}^{N}{M}_{i}}\times 100$$

(9)

where ${M}_{i}$ and ${S}_{i}$ represent the observed values and simulated values, respectively; $\overline{M}$ and $\overline{S}$ represent the mean of observed values and the mean of simulated values, respectively; and $N$ is the number of paired values. Furthermore, to demonstrate the daily streamflow prediction performance of the hybrid model trained based on neighboring areas when different regions in the BRB became assumed no-data regions, we utilized the ranking method presented in Table 4 to assess the model performance in different areas. These ranking criteria are derived from previous research that employed machine learning models for simulating and predicting water environments³⁵. However, in the SWAT-BiLSTM coupled model, SWAT is used with default parameters solely as a transfer function. Therefore, we did not opt for performance grading standards biased towards conceptual hydrological models. During the evaluation process, model performance is ranked based on the worse of the two metrics.
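The three metrics can be computed in a few lines. The sketch below assumes that R² is the squared Pearson correlation between observed and simulated values and that PBIAS uses the sign convention of Eq. (9), so a negative PBIAS means the simulated total exceeds the observed total. The function names are chosen for this example only.

```python
def nse(obs, sim):
    """Nash-Sutcliffe efficiency, Eq. (7)."""
    mean_obs = sum(obs) / len(obs)
    residual = sum((m - s) ** 2 for m, s in zip(obs, sim))
    variance = sum((m - mean_obs) ** 2 for m in obs)
    return 1.0 - residual / variance

def r_squared(obs, sim):
    """Coefficient of determination as squared Pearson correlation, Eq. (8)."""
    mean_obs = sum(obs) / len(obs)
    mean_sim = sum(sim) / len(sim)
    cov = sum((m - mean_obs) * (s - mean_sim) for m, s in zip(obs, sim))
    var_obs = sum((m - mean_obs) ** 2 for m in obs)
    var_sim = sum((s - mean_sim) ** 2 for s in sim)
    return cov ** 2 / (var_obs * var_sim)

def pbias(obs, sim):
    """Percent bias, Eq. (9); negative values indicate overestimation."""
    return 100.0 * sum(m - s for m, s in zip(obs, sim)) / sum(obs)
```

A simulation that is offset from the observations by a constant illustrates why the metrics are reported together: R² stays at 1 because the two series remain perfectly correlated, while NSE and PBIAS both penalize the offset.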

Table 4 Performance ranking criteria.

BMP scenario settings

BMPs have been widely used for the prevention and control of NPS pollution and have shown significant effects. However, different BMPs have distinct spatial variations in their reduction effects on NPS pollution, requiring tailored management measures that suit the actual characteristics of the watershed. Based on the natural characteristics of the BRB (LULC, soil types, slope, and topography), socio-economic development (population density and water quality), and current NPS pollution status, we set two major categories of measures: Agricultural BMPs and Structural BMPs. Agricultural BMPs include formula fertilization by soil testing and stubble mulch, while Structural BMPs encompass vegetative filter strips and grassed waterways. The pollutant reduction effects of these four BMPs have been proven in previous studies³⁶. Table 5 presents the information on each BMP and the parameters that need adjustment in SWAT.

Table 5 The description and the simulation method of each BMP.

We obtained relevant information on fertilizer application in the watershed through on-site field surveys of farmers. The cultivated area in the BRB is 401.01 km², with wheat and corn as the main crops. The commonly used fertilizers are nitrogen-based (mainly urea and ammonium bicarbonate) and phosphate-based (mainly calcium superphosphate). The average application rates of chemical fertilizers for wheat and corn are 1125 kg/ha and 750 kg/ha, respectively. The total annual application of chemical fertilizers in the cultivated land of the watershed is 50,126 tons. The fertilization method is mainly broadcasting, resulting in lower fertilizer utilization efficiency and significant nitrogen and phosphorus nutrient loss due to rainfall runoff. To address this, we implemented the measure of formula fertilization by soil testing to reduce the amount of chemical fertilizers while maintaining crop yields and reducing pollution loads³⁷. Formula fertilization by soil testing was achieved by reducing FRT_KG by 20% in SWAT parameters. Stubble mulch is an effective agricultural measure in reducing nitrogen and phosphorus losses. The pollutant reduction mechanisms of stubble mulch primarily come from two processes: (1) stubble mulch favors the accumulation of organic matter in the soil, improving soil water-holding capacity, reducing soil erosion, and lowering the risk of nitrogen and phosphorus nutrient loss; (2) stubble mulch reduces soil permeability and promotes the accumulation of reactive substances, effectively facilitating denitrification processes in the soil, leading to more nitrogen being released in the form of gas rather than being discharged into the rivers³⁸. Stubble mulch was implemented by adjusting SWAT parameters as follows: USLE_P was set to 0.29, USLE_C was set to 0.7, and OV_N was set to 0.3. 
Vegetative filter strip (VFS) refers to vegetated areas with gentle slopes that slow down surface runoff and remove pollutants and sediments from runoff through vegetation interception and soil infiltration³⁹. In this study, VFSRATIO was set to 40, VFSCON was set to 0.5, and VFSCH was set to 0. Grassed waterways mainly use vegetation to trap and store runoff, reduce flow velocity, and control the migration and transformation of pollutants in runoff, thereby reducing pollutant levels⁴⁰. The width of the grassed waterways was set to 5 m. VFS and grassed waterways were implemented along the entire length of the river reach. In this study, we conducted individual scenario simulations and combination scenario simulations for the four BMPs (Table 6). In all combination scenario simulations, each BMP was set as in the individual scenario simulations. In this study, the pollutant reduction effect of BMPs was expressed as the annual removal rate, defined as follows:

Table 6 Scenario settings.

$$r=\frac{LOA{D}_{Pre}-LOA{D}_{post}}{LOA{D}_{Pre}}\times 100\%$$

(10)

Here, $LOA{D}_{Pre}$ represents the annual pollution load before implementing the BMP, and $LOA{D}_{post}$ represents the annual pollution load after implementing the BMP.
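As a rough illustration of how the scenario settings and Eq. (10) fit together, the sketch below encodes the parameter adjustments stated in the text and computes the removal rate. The dictionary layout, the BMP labels, the key `grassed_waterway_width_m` (a descriptive placeholder, not a SWAT parameter name), and the baseline values used in any call are assumptions for this example; it does not run SWAT.

```python
# Parameter adjustments per BMP, taken from the scenario description.
# ("scale", x) multiplies the baseline value; ("set", x) overwrites it.
BMP_SETTINGS = {
    "soil_testing_fertilization": {"FRT_KG": ("scale", 0.8)},  # 20% reduction
    "stubble_mulch": {
        "USLE_P": ("set", 0.29),
        "USLE_C": ("set", 0.7),
        "OV_N": ("set", 0.3),
    },
    "vegetative_filter_strip": {
        "VFSRATIO": ("set", 40),
        "VFSCON": ("set", 0.5),
        "VFSCH": ("set", 0),
    },
    # Placeholder key: the text gives a 5 m width but no parameter name.
    "grassed_waterway": {"grassed_waterway_width_m": ("set", 5)},
}

def apply_bmps(baseline, bmp_names):
    """Return a copy of the baseline parameters with the chosen BMPs applied."""
    params = dict(baseline)
    for name in bmp_names:
        for key, (mode, value) in BMP_SETTINGS[name].items():
            if mode == "scale":
                params[key] = params[key] * value
            else:
                params[key] = value
    return params

def removal_rate(load_pre, load_post):
    """Annual removal rate in percent, Eq. (10)."""
    return (load_pre - load_post) / load_pre * 100.0
```

A combination scenario is then just a list of BMP names passed to `apply_bmps`, which mirrors the statement that each BMP in a combination keeps its individual-scenario settings.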

Results and discussion

Simulation performance comparison

In this study, we sequentially excluded the data of one hydrological station from the training dataset and used it to validate the model's streamflow prediction performance in a hypothetical area with no measured data. This process generated 14 training datasets, each containing data from 13 hydrological stations, along with corresponding validation data. Table 7 compares the performance of the hybrid model in predicting daily streamflow in different regions of the BRB during the calibration period (January 1, 2015, to December 31, 2017) and the validation period (January 1, 2018, to December 31, 2022). In the validation data from these 14 stations, the absolute values of PBIAS for more than half of the stations are below 10%, with only three stations exceeding 15%. This indicates that the hybrid model's predictions of streamflow in areas without data did not exhibit significant systematic biases. However, there are still four stations with absolute PBIAS values exceeding 10%, and two of them even exceed 20%, suggesting that the hybrid model's predictive performance of daily streamflow is relatively poor in certain specific areas due to spatial factors such as terrain, LULC, and soil types. The performance ratings of the hybrid model in regions with different soil types, terrains, and LULC are shown in Fig. 6. The soil names and brief descriptions corresponding to the soil codes are displayed in Table 8.

Table 7 Comparison of daily streamflow prediction performance among the 14 hydrological stations.


Table 8 Soil type information.


Based on the analysis of terrain, soil type, and LULC data, imperviousness in the test areas rated as "Very good" ranged from 19 to 54%, and forest cover ranged from 2 to 27%. In the test areas rated as "Good," imperviousness ranged from 18 to 51%, and forest cover ranged from 2 to 35%. Since most of the hydrological data used to train the Bi-LSTM model came from urbanized areas with relatively flat terrain, the Bi-LSTM model showed better daily streamflow prediction performance in such regions. As forest cover increased, the predictive performance of the hybrid model gradually declined, and the performance rating started to drop once forest cover in the test area exceeded 20%. Conversely, the hybrid model performed better in test areas with higher imperviousness and predicted daily streamflow more accurately once imperviousness exceeded 30%, indicating better predictive accuracy for highly urbanized watersheds. Among the 14 test areas, the most accurate prediction had an NSE of 0.92 and a PBIAS of 1.34%, corresponding to regions with forest cover ranging from 2 to 5% and imperviousness ranging from 47 to 54%. This indicates that the hybrid model meets the demand for daily streamflow prediction in this area, showing high predictive accuracy with no significant systematic bias, and can provide a good hydrodynamic environment for subsequent simulations of the pollution reduction effects of BMPs.

Figure 7 shows the simulation performance of the hybrid model for total streamflow in the BRB during both the calibration and validation periods. The figure also displays the fitted linear regression line and R² between simulated and observed data. Throughout the simulation, the hybrid model tended to underestimate daily streamflow exceeding 200 m³/s. For daily streamflow below 200 m³/s, the hybrid model performed worse during the validation period than during the calibration period, with the flow data points scattered more widely around the 1:1 line. As a data-driven model, Bi-LSTM is strongly influenced by the input data used to train it. If the training data are not representative, the performance of Bi-LSTM may not meet expectations. In this study, only daily streamflow data from the calibration period were used to train the Bi-LSTM model, which may be one reason for the hybrid model's poorer performance during the validation period. Nevertheless, the daily streamflow data points are, overall, evenly scattered on both sides of the 1:1 line without any significant systematic bias, which still meets the requirements for establishing a water environment model.

Accurate streamflow simulation results contribute to better estimates of pollutant loads and ensure the precise modeling of pollutant transport processes, which are essential input data for water quality models. Table 9 presents the ensemble-average performance metrics of the SWAT-BiLSTM model for the water quality simulation results in the BRB. During the calibration and validation periods, the model achieved NSE and R² values exceeding 0.8 for the TN and TP simulation results, indicating a high level of consistency between the simulated and observed values. Additionally, the absolute values of PBIAS were all below 7%, suggesting that the model did not exhibit significant systematic bias in simulating water quality.
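For readers who want to reproduce the three metrics used here, the following is a minimal sketch in plain Python. It assumes the commonly used definitions: NSE as one minus the ratio of squared errors to the variance of the observations, PBIAS as the summed difference (observed minus simulated) relative to the summed observations, and R² as the squared Pearson correlation. The PBIAS sign convention in particular is an assumption, since this section does not state which convention was used, and the sample values are hypothetical.

```python
from typing import Sequence


def nse(obs: Sequence[float], sim: Sequence[float]) -> float:
    """Nash-Sutcliffe efficiency (1.0 is a perfect fit)."""
    mean_obs = sum(obs) / len(obs)
    num = sum((o - s) ** 2 for o, s in zip(obs, sim))
    den = sum((o - mean_obs) ** 2 for o in obs)
    return 1.0 - num / den


def pbias(obs: Sequence[float], sim: Sequence[float]) -> float:
    """Percent bias; assumed convention: positive means underestimation."""
    return 100.0 * sum(o - s for o, s in zip(obs, sim)) / sum(obs)


def r_squared(obs: Sequence[float], sim: Sequence[float]) -> float:
    """Coefficient of determination as the squared Pearson correlation."""
    mean_obs = sum(obs) / len(obs)
    mean_sim = sum(sim) / len(sim)
    cov = sum((o - mean_obs) * (s - mean_sim) for o, s in zip(obs, sim))
    var_obs = sum((o - mean_obs) ** 2 for o in obs)
    var_sim = sum((s - mean_sim) ** 2 for s in sim)
    return cov ** 2 / (var_obs * var_sim)


# Hypothetical observed and simulated values, for illustration only.
observed = [10.0, 20.0, 30.0, 40.0]
simulated = [12.0, 18.0, 33.0, 37.0]
print(round(nse(observed, simulated), 3))
print(round(pbias(observed, simulated), 2))
print(round(r_squared(observed, simulated), 3))
```

Reporting all three together is useful because NSE and R² measure agreement in dynamics, while PBIAS exposes a systematic over- or underestimation that the other two can hide.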

Table 9 Summary of calibration and validation statistics for TP and TN.


Efficiencies of individual BMP scenarios in reducing NPS pollution loads

Figure 8 illustrates the reduction effects of BMPs on TN and TP under the four scenarios with a single BMP measure. In scenario 1, formula fertilization by soil testing reduced the TN and TP pollutant loads across the entire watershed by an average of 5.36% and 9.18%, respectively. Overall, this measure showed a better reduction effect on TP than on TN. The main reason for this result is that the major land use type in the BRB is rainfed agriculture, which is more prone to phosphorus runoff. The results indicate that formula fertilization by soil testing can reasonably reduce the use of chemical fertilizers with minimal impact on crop yields while reducing the pollution load. However, its overall reduction effect on TN and TP is limited, with average reduction rates below 10% throughout the watershed. In scenario 2, stubble mulch resulted in an average reduction rate of 17.83% for TN and 36.17% for TP across the entire watershed, indicating a significant reduction in nitrogen and phosphorus losses. This measure also reduced TP more than TN, which may be attributed to the dominance of soluble phosphorus pollution in the study area, carried into water bodies by surface runoff. Previous research suggested that stubble mulch can reduce surface runoff by about 60%, preventing pollutants from entering water bodies with it, which explains the stronger TP reduction in the study area. In scenarios 3 and 4, VFS and grassed waterways achieved average reduction efficiencies of 19.07% and 10.95% for TN, and 22.02% and 10.52% for TP, respectively, across the entire watershed. The results indicate that VFS were more effective than grassed waterways, possibly because of their significant sediment interception effect, which reduces the entry of sediment-attached particulate pollutants into water bodies.

Among all single BMP scenarios, stubble mulch showed the best overall reduction effect in the BRB, with by far the largest TP reduction and a TN reduction only slightly below that of VFS, and it is considered one of the BMP strategies that farmers can easily implement. It can be prioritized when planning to adopt a single BMP measure for controlling NPS pollution emission in the BRB.

Efficiencies of combined BMP scenarios in reducing NPS pollution loads

Because BMP optimization often involves implementing multiple BMPs to meet reduction goals for several pollutants, BMP combination scenarios were designed to assess the overall effectiveness of various BMPs. The reduction rates of TN and TP under combined BMP scenarios are shown in Fig. 9. Among the five combination scenarios, the combination of formula fertilization by soil testing, stubble mulch, and VFS demonstrated the best reduction effect, achieving reduction rates of 42.71% for TN and 50.40% for TP. Other combination scenarios also showed favorable reduction effects. Overall, the combined BMP scenarios exhibited higher average reduction rates for TN and TP than the single BMP scenarios, with increases of 13.75% and 15.27%, respectively. The combination of agricultural BMPs and structural BMPs proved to be more effective in controlling NPS pollution in the BRB.
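The comparison between single and combined scenarios can be checked with simple arithmetic on the reduction rates reported for the four single-BMP scenarios. The sketch below computes their means and then, under the assumption that the reported increases of 13.75% and 15.27% are percentage-point differences between scenario means (the text does not state this explicitly), derives the implied mean reduction rates of the combined scenarios. The implied values are an illustration of that assumption, not results reported by the study.

```python
# Reported average reduction rates (%) for the single-BMP scenarios: (TN, TP).
single = {
    "formula fertilization by soil testing": (5.36, 9.18),
    "stubble mulch": (17.83, 36.17),
    "VFS": (19.07, 22.02),
    "grassed waterways": (10.95, 10.52),
}

mean_tn = sum(tn for tn, _ in single.values()) / len(single)
mean_tp = sum(tp for _, tp in single.values()) / len(single)

# Assumption: the reported increases are percentage-point differences.
implied_combined_tn = mean_tn + 13.75
implied_combined_tp = mean_tp + 15.27

print(f"single mean: TN {mean_tn:.2f}%, TP {mean_tp:.2f}%")
print(f"implied combined mean: TN {implied_combined_tn:.2f}%, "
      f"TP {implied_combined_tp:.2f}%")
```

Under this reading, the best combination (42.71% for TN and 50.40% for TP) sits well above the implied combined means, which is consistent with it being singled out as the strongest of the five combinations.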

Conclusion

In this study, a framework combining the SWAT and Bi-LSTM models was developed to explore the effectiveness of different BMP scenarios in reducing NPS pollution in areas without measured runoff data. In this approach, SWAT served as a transfer function to convert meteorological data into baseflow and stormflow, which were then used as inputs for the Bi-LSTM model. The model performance was evaluated using three metrics: NSE, R², and PBIAS. The results showed that the hybrid model achieved an NSE and R² of 0.88 and 0.89, respectively, during the calibration period, and of 0.86 and 0.85, respectively, during the validation period. The maximum absolute PBIAS was 2.71%, indicating that the hybrid model has high predictive accuracy without significant systematic bias, meeting the demand for simulating NPS pollution emission control schemes. The partial calibration of SWAT model parameters and coupling with the Bi-LSTM model helped address the uncertainty caused by equifinality in the SWAT calibration process. This framework provides a promising approach for simulating NPS pollution emission control schemes in other regions without measured streamflow data.
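To illustrate how SWAT outputs can feed a sequence model, the sketch below arranges daily baseflow and stormflow into fixed-length input windows paired with the observed streamflow on the last day of each window. The function name, the `lookback` length, and the sample values are all hypothetical; the window length and exact input layout used in the study are not described in this section, and the Bi-LSTM itself is omitted.

```python
from typing import List, Sequence, Tuple


def make_windows(
    baseflow: Sequence[float],
    stormflow: Sequence[float],
    streamflow: Sequence[float],
    lookback: int,
) -> Tuple[List[List[List[float]]], List[float]]:
    """Build (samples, targets) for a sequence model.

    Each sample has shape [lookback][2] with features (baseflow, stormflow);
    its target is the observed streamflow on the window's last day.
    """
    if not (len(baseflow) == len(stormflow) == len(streamflow)):
        raise ValueError("all series must have the same length")
    if lookback < 1:
        raise ValueError("lookback must be at least 1")
    samples, targets = [], []
    for end in range(lookback, len(streamflow) + 1):
        window = [[baseflow[t], stormflow[t]] for t in range(end - lookback, end)]
        samples.append(window)
        targets.append(streamflow[end - 1])
    return samples, targets


# Hypothetical five-day series, for illustration only.
X, y = make_windows(
    baseflow=[1.0, 2.0, 3.0, 4.0, 5.0],
    stormflow=[0.0, 1.0, 0.0, 1.0, 0.0],
    streamflow=[10.0, 20.0, 30.0, 40.0, 50.0],
    lookback=3,
)
print(len(X), y)  # 3 [30.0, 40.0, 50.0]
```

A bidirectional model reads each such window both forward and backward in time, which is why the full window is supplied at once rather than one day at a time.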

Based on the hybrid model, the hydrodynamic environment was established, and the control effect of different BMP scenarios on NPS pollution in the BRB was evaluated. The results showed that stubble mulch and vegetative filter strips were more effective in reducing pollutants than formula fertilization by soil testing and grassed waterways, reducing TN loads by 17.83% and 19.07%, and TP loads by 36.17% and 22.02%, respectively. Stubble mulch demonstrated the best overall reduction effect across TN and TP; it is also easy for farmers to implement and can be prioritized in NPS pollution control plans based on a single BMP. Furthermore, compared to single BMP scenarios, combined BMP scenarios increased the average reduction rates of TN and TP by 13.75% and 15.27%, respectively. The combination of VFS, formula fertilization by soil testing, and stubble mulch showed the best reduction effect, with reduction rates of 42.71% for TN and 50.40% for TP. These results provide supporting evidence for decision-makers in formulating NPS pollution emission control schemes for the BRB.

The hybrid model combining SWAT and Bi-LSTM simplifies the hydrological processes and relies on several assumptions, which introduces uncertainty into its predictions. In the future, more advanced deep learning models or hybrid models could be explored, combining various modeling methods to better simulate complex hydrological processes and achieve more accurate predictions in areas without measured streamflow data. Additionally, more types of BMPs and pollutants can be considered in further research to promote practical applications. Coupling with models such as vine copulas could also be used to predict the probability of achieving emission control goals under various combined BMP scenarios.

Data availability

The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.

References

1. Ravikumar, Y., Yun, J., Zhang, G., Zabed, H. M. & Qi, X. A review on constructed wetlands-based removal of pharmaceutical contaminants derived from non-point source pollution. Environ. Technol. Innov. 26, 102504 (2022).
2. Moges, E., Demissie, Y., Larsen, L. & Yassin, F. Sources of hydrological model uncertainties and advances in their analysis. Water 13(1), 28 (2021).
3. Liu, Y. et al. A review on effectiveness of best management practices in improving hydrology and water quality: Needs and opportunities. Sci. Total Environ. 601, 580–593 (2017).
4. Horner, R., May, C., Livingston, E., Blaha, D., Scoggins, M., Tims, J. & Maxted, J. Structural and non-structural BMPs for protecting streams. In Linking Stormwater BMP Designs and Performance to Receiving Water Impact Mitigation. 60–77 (2002).
5. Ricci, G. F., D’Ambrosio, E., De Girolamo, A. M. & Gentile, F. Efficiency and feasibility of best management practices to reduce nutrient loads in an agricultural river basin. Agric. Water Manag. 259, 107241 (2022).
6. Abdulkareem, J. H., Pradhan, B., Sulaiman, W. N. A. & Jamil, N. R. Review of studies on hydrological modelling in Malaysia. Model. Earth Syst. Environ. 4, 1577–1605 (2018).
7. Tong, X. et al. Source, fate, transport and modelling of selected emerging contaminants in the aquatic environment: Current status and future perspectives. Water Res. 217, 118418 (2022).
8. Xue, J., Wang, Q. & Zhang, M. A review of non-point source water pollution modeling for the urban–rural transitional areas of China: Research status and prospect. Sci. Total Environ. 826, 154146 (2022).
9. Jin, Y., Zhang, B., Hao, Y., Wu, J. & Lyu, J. Dynamic analysis of river nitrogen and phosphorus pollution based on LOADEST model and wavelet transform. Acta Agric. Zhejiangensis 32(9), 1692 (2020).
10. Aloui, S. et al. A review of soil and water assessment tool (SWAT) studies of Mediterranean catchments: Applications, feasibility, and future directions. J. Environ. Manag. 326, 116799 (2023).
11. Abdelmoneim, H., Soliman, M. R. & Moghazy, H. M. Evaluation of TRMM 3B42V7 and CHIRPS satellite precipitation products as an input for hydrological model over Eastern Nile Basin. Earth Syst. Environ. 4, 685–698 (2020).
12. Devia, G. K., Ganasri, B. P. & Dwarakish, G. S. A review on hydrological models. Aquat. Proc. 4, 1001–1007 (2015).
13. Al Khoury, I., Boithias, L. & Labat, D. A review of the application of the soil and water assessment tool (SWAT) in karst watersheds. Water 15(5), 954 (2023).
14. Liu, Y. et al. Evaluating efficiencies and cost-effectiveness of best management practices in improving agricultural water quality using integrated SWAT and cost evaluation tool. J. Hydrol. 577, 123965 (2019).
15. Ait M’Barek, S., Bouslihim, Y., Rochdi, A. & Miftah, A. Effect of LULC data resolution on hydrological and erosion modeling using SWAT model. Model. Earth Syst. Environ. 9(1), 831–846 (2023).
16. Shah, S. et al. Evaluating the added value of multi-variable calibration of SWAT with remotely sensed evapotranspiration data for improving hydrological modeling. J. Hydrol. 603, 127046 (2021).
17. Chen, J. et al. Improved data splitting methods for data-driven hydrological model development based on a large number of catchment samples. J. Hydrol. 613, 128340 (2022).
18. Siami-Namini, S., Tavakoli, N. & Namin, A. S. The performance of LSTM and BiLSTM in forecasting time series. In 2019 IEEE International Conference on Big Data (Big Data). 3285–3292 (IEEE, 2019).
19. Kowsher, M. et al. LSTM-ANN & BiLSTM-ANN: Hybrid deep learning models for enhanced classification accuracy. Proc. Comput. Sci. 193, 131–140 (2021).
20. Yang, S. et al. Coupling SWAT and Bi-LSTM for improving daily-scale hydro-climatic simulation and climate change impact assessment in a tropical river basin. J. Environ. Manag. 330, 117244 (2023).
21. Jajarmizadeh, M., Kakaei Lafdani, E., Harun, S. & Ahmadi, A. Application of SVM and SWAT models for monthly streamflow prediction, a case study in South of Iran. KSCE J. Civ. Eng. 19, 345–357 (2015).
22. Jiang, S., Zheng, Y., Babovic, V., Tian, Y. & Han, F. A computer vision-based approach to fusing spatiotemporal data for hydrological modeling. J. Hydrol. 567, 25–40 (2018).
23. Ang, R. & Oeurng, C. Simulating streamflow in an ungauged catchment of Tonlesap Lake Basin in Cambodia using soil and water assessment tool (SWAT) model. Water Sci. 32(1), 89–101 (2018).
24. Zhang, T. et al. Evaluation of the impacts of human activities on propagation from meteorological drought to hydrological drought in the Weihe River Basin, China. Sci. Total Environ. 819, 153030 (2022).
25. Yang, T. et al. Comprehensive ecological risk assessment for semi-arid basin based on conceptual model of risk response and improved TOPSIS model—A case study of Wei River Basin, China. Sci. Total Environ. 719, 137502 (2020).
26. Wang, Y., Jiang, R., Xie, J., Zhao, Y., Yan, D. & Yang, S. Soil and water assessment tool (SWAT) model: A systemic review. J. Coast. Res. 93(1), 22–30 (2019).
27. Li, Y. et al. Applying water environment capacity to assess the non-point source pollution risks in watersheds. Water Res. 240, 120092 (2023).
28. Li, Z. et al. SinoLC-1: The first 1-meter resolution national-scale land-cover map of China created with the deep learning framework and open-access data. Earth Syst. Sci. Data Discuss. 2023, 1–38 (2023).
29. Xu, R., Qiu, D., Wu, C., Mu, X., Zhao, G., Sun, W. & Gao, P. Quantifying climate and anthropogenic impacts on runoff using the SWAT model, a Budyko-based approach and empirical methods. Hydrol. Sci. J. (just-accepted) (2023).
30. Yu, Y., Si, X., Hu, C. & Zhang, J. A review of recurrent neural networks: LSTM cells and network architectures. Neural Comput. 31(7), 1235–1270 (2019).
31. Hochreiter, S. & Schmidhuber, J. Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997).
32. Azizi, M., Talatahari, S. & Gandomi, A. H. Fire hawk optimizer: A novel metaheuristic algorithm. Artif. Intell. Rev. 56(1), 287–363 (2023).
33. Shishehgarkhaneh, M. B., Azizi, M., Basiri, M. & Moehler, R. C. BIM-based resource tradeoff in project scheduling using fire hawk optimizer (FHO). Buildings 12(9), 1472 (2022).
34. Yarotsky, D. Error bounds for approximations with deep ReLU networks. Neural Netw. 94, 103–114 (2017).
35. Kalin, L., Isik, S., Schoonover, J. E. & Lockaby, B. G. Predicting water quality in unmonitored watersheds using artificial neural networks. J. Environ. Qual. 39(4), 1429–1440 (2010).
36. Liu, T., Bruins, R. J. & Heberling, M. T. Factors influencing farmers’ adoption of best management practices: A review and synthesis. Sustainability 10(2), 432 (2018).
37. Wu, H., Li, J. & Ge, Y. Ambiguity preference, social learning and adoption of soil testing and formula fertilization technology. Technol. Forecast. Soc. Change 184, 122037 (2022).
38. Li, S., Li, J., Hao, G. & Li, Y. Evaluation of Best Management Practices for non-point source pollution based on the SWAT model in the Hanjiang River Basin, China. Water Supply 21(8), 4563–4580 (2021).
39. Krutz, L. J., Senseman, S. A., Zablotowicz, R. M. & Matocha, M. A. Reducing herbicide runoff from agricultural fields with vegetative filter strips: A review. Weed Sci. 53(3), 353–367 (2005).
40. Leh, M. D., Sharpley, A. N., Singh, G. & Matlock, M. D. Assessing the impact of the MRBI program in a data limited Arkansas watershed using the SWAT model. Agric. Water Manag. 202, 202–219 (2018).


Author information

Authors and Affiliations

Water Conservancy College, North China University of Water Resources and Electric Power, Zhengzhou, 450046, China
Xianqi Zhang, Yu Qi, Haiyang Li, Shifeng Sun & Qiuwen Yin
Collaborative Innovation Center of Water Resources Efficient Utilization and Protection Engineering, Zhengzhou, 450046, China
Xianqi Zhang
Technology Research Center of Water Conservancy and Marine Traffic Engineering, Zhengzhou, 450046, Henan, China
Xianqi Zhang


Contributions

X.Z. and Y.Q. wrote the main manuscript text. Q.Y., H.L. and S.S. prepared all figures. All authors reviewed the manuscript.

Corresponding author

Correspondence to Yu Qi.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.


About this article

Cite this article

Zhang, X., Qi, Y., Li, H. et al. Assessing effect of best management practices in unmonitored watersheds using the coupled SWAT-BiLSTM approach. Sci Rep 13, 17168 (2023). https://doi.org/10.1038/s41598-023-44531-7


Received: 01 August 2023
Accepted: 10 October 2023
Published: 11 October 2023
DOI: https://doi.org/10.1038/s41598-023-44531-7
