Seasonally varying effects of environmental factors on phytoplankton abundance in the regulated rivers

Kim, Jun Song; Seo, Il Won; Baek, Donghae

doi:10.1038/s41598-019-45621-1

Download PDF

Article
Open access
Published: 25 June 2019

Seasonally varying effects of environmental factors on phytoplankton abundance in the regulated rivers

Jun Song Kim^1,3,
Il Won Seo² &
Donghae Baek²

Scientific Reports volume 9, Article number: 9266 (2019) Cite this article

2551 Accesses
15 Citations
1 Altmetric
Metrics details

Subjects

Abstract

This study investigates a seasonally varying response of phytoplankton biomass to environmental factors in rivers. Artificial neural network (ANN) models incorporated with a clustering technique, the clustered ANN models, were employed to analyze the relationship between chlorophyll a (Chl-a) and the explanatory variables in the regulated Nakdong River, South Korea. The results show that weir discharge (Q) and total phosphorus (TP) were the most influential factors on temporal dynamics of Chl-a. The relative importance of both variables increased up to higher than 30% for low water temperature seasons with dominance of diatoms. While, during summer when cyanobacteria predominated, the significance of Q increased up to 45%, while that of TP declined to about 10%. These tendencies highlight that the effects of the river environmental factors on phytoplankton abundance was temporally inhomogeneous. In harmful algal bloom mitigation scenarios, the clustered ANN models reveals that the optimal weir discharge was 400 m³/s which was 67% of the value derived from the non-clustered ANN models. At the immediate downstream of confluence of the Kumho River, the optimal weir discharge should increase up to about 1.5 times because of the increase in the tributary pollutant loads attributed to electrical conductivity (EC).

Spatio-temporal dynamics of phytoplankton community in a well-mixed temperate estuary (Sado Estuary, Portugal)

Article Open access 30 September 2022

Numerical analysis of the relationship between mixing regime, nutrient status, and climatic variables in Lake Biwa

Article Open access 16 November 2022

Consistent stoichiometric long-term relationships between nutrients and chlorophyll-a across shallow lakes

Article Open access 27 January 2024

Introduction

Riverine ecosystems are significantly impacted by consequences of human activities such as effluents introduced from wastewater treatment plants and flow regulation associated with hydraulic structures even if rivers serve as essential sources of drinking water and provide habitats for freshwater fishes and invertebrates^1,2. Phytoplankton have commonly been used as ecological indicators to assess these human effects on freshwater environments because phytoplankton blooms are usually results of excessive nutrient loading and extended water residence time induced by the artificial flow control^3,4.

During the summer season, surface water quality is prone to be contaminated by cyanobacterial blooms that degrade water clarity, and even produce a variety of taste-and-odor compounds and toxins⁵. From early winter to late spring, high accumulation of diatoms adversely affects water intake activities by causing filter clogging⁶. For monitoring phytoplankton abundance, chlorophyll a (Chl-a) has widely been accepted as a measure of phytoplankton population in rivers and lakes⁷. Therefore, it is indispensable to understand a relationship between Chl-a and river environmental factors in order to predict its seasonal fluctuation and prepare countermeasures against phytoplankton blooms.

The Chl-a dynamics is conventionally simulated using numerical models based on an advection-dispersion-reaction equation. These kinds of physics-based models usually require not only extensive data related to boundary conditions, bathymetry and parameter estimation but also specialized knowledge of physical processes accounting for fate and transport of various water quality constituents⁸. As an alternative to the numerical approaches, the numerous studies have increasingly adopted data-driven techniques leveraged on artificial neural networks (ANNs) for modeling water quality^9,10,11. Olden (2000) reported that Chl-a abundance was directly associated with nutrient and zooplankton variables through the ANN application to the prediction of phytoplankton succession¹². Jeong et al. (2001, 2006) and Kim et al. (2014) evaluated the variable contribution of the ANN models to the predicted Chl-a, and their results indicated that Chl-a was most sensitive to pH and chemical oxygen demand (COD) rather than growth limiting factors generally known as water temperature, phosphorus and nitrogen^13,14,15. Wu et al.¹⁶ also documented that total phosphorus and dissolved inorganic nitrogen considerably influenced the daily dynamics of Chl-a by performing sensitivity analysis with explanatory variables of the ANN models¹⁷.

The previous studies hypothesized that the relationship between Chl-a and the explanatory variables is seasonally homogeneous. In riverine and lacustrine systems, high water temperature favors cyanobacterial growth, whereas diatoms usually proliferate at low water temperature^14,18. Thus, the transition of dominant phytoplankton species occurs with change in water temperature. Here, each of these phytoplankton groups exhibits its own growth characteristic as different species have different values of growth rate, half-saturation constant of nutrients such as phosphorus, nitrogen and silica, and optimal water temperature and light intensity^19,20,21,22. Hence, an invariant stimulus can result in heterogeneous responses of Chl-a due to the inherent growth features inherent of phytoplankton species predominated in the water body, which changes with water temperature.

The conventional ANN models usually adopted water quality constituents that are byproducts of phytoplankton bloom like dissolved oxygen (DO), COD, turbidity and pH, as estimators of Chl-a²³. In water bodies, phytoplankton produce and consume DO through photosynthesis and respiration, respectively. Simultaneously, phytoplankton detritus increases COD and herein resulted in DO depletion through bacterial decomposition²⁴. In addition, the rapid accumulation in phytoplankton biomass degrades water transparency with increasing turbidity and cause increase in pH by consuming hydrogen during the photosynthesis^25,26. However, the ANN modeling with the above water quality variables not physically affecting the growth dynamics of phytoplankton could be prone to mask or distort the cause and effect relationship between Chl-a and the growth limiting factors such as water temperature, residence time and nutrient concentrations.

This study is aimed at understanding seasonally varying response of Chl-a to the river environmental factors that directly contribute to phytoplankton growth mechanism in riverine systems. In this work, ANN models incorporated with a cluster technique (clustered ANN models) were used to consider the effects of seasonal transition of phytoplankton communities on relationships between Chl-a and input variables by partitioning field data according to different ranges of water temperature. The model performance was evaluated by comparison with that of the conventional ANN model without clustering. Using the clustered ANN models, this study estimated the relative importance of the environmental variables on Chl-a prediction and furthermore performed the scenario-based simulations to propose an optimal flow condition for suppressing the phytoplankton bloom in regulated rivers, where flow discharge is artificially controlled by hydraulic structures.

Materials and Methods

Study area

The Nakdong River is one of the major rivers in South Korea, which is about 525 km long and includes large tributaries including the Kumho River. This large river is served as important water supply sources for about 10 million residents in the south eastern area, passing through the major cities of Busan and Daegu. The drainage area is 23,817 km² with an average channel width and water depth of about 250 m and 7.4 m, respectively. The 5-year average precipitation of the Nakdong River watershed was 958 mm during 2013–2017, and 18.4% and 1.9% of the total watershed area are used as agricultural and industrial complex areas, respectively. The non-point source with rainfall-runoff events contributes to about 60% of total phosphorus loading in the study area²⁷.

Corresponding to the latest climate change, multi-purpose weirs were constructed across the Nakdong River in 2012 to achieve the flood protection and drought combat, as shown in Fig. 1. After the weir construction, this regulated river has suffered the aberrant proliferation of phytoplankton with the change in seasonal patterns of river flow²⁸. In the regulated Nakdong River, toxic cyanobacteria such as Microcystis, Aphanizomenon and Anabaena have been abundant, and herein resulted in the significant level of cyanobacterial cell counts around 10,000–20,000 cells/ml during summer owing to the water temperature higher than 25 °C and the extended water residence time arisen from the artificial flow control by the weirs²⁹. From winter to early spring, diatoms with the prevalence of Stephanodiscus have usually predominated in the study area³⁰.

Field data

This study selected water temperature (WT), total phosphorus (TP) and total nitrogen (TN) as predictors of Chl-a in order to only take into account water quality factors physically associated with the phytoplankton growth dynamics. The flow velocity representing water residence time can moreover be deemed to the significant factor for predicting the phytoplankton abundance. The extended water residence time attributed to flow velocity facilitates the phytoplankton accumulation as well as stabilizes the water column to result in the thermal stratification, which favors cyanobacterial bloom during summer, in the regulated rivers^31,32. In the Nakdong River, the weirs has not only fixed water surface elevation to the design level but also controlled flow rate using movable weir gates. Herein, flow velocity of the river can be estimated by the discharge from the weirs since the water surface elevation and channel width remain constant. For this reason, this study additionally considered weir discharge (Q) as the hydraulic predictor of Chl-a.

In the midstream of the Nakdong River, the large urban-industrial complexes are located along the tributaries, where wastewater effluents containing excessive loadings of phosphorus and nitrogen are continuously spilled into the Nakdong River. Due to this tributary-driven contamination, the water quality after the confluence of the tributaries has severely deteriorated³³. Especially, phytoplankton blooms have frequently propagated from the merging point of the Kumho River which is the largest tributary located in the middle reach of the Nakdong River³⁴. The 5-year average electrical conductivity (EC) and Chl-a of the Kumho River are 710 μS/cm (678–780 μS/cm) and 48.8 mg/m³ (35.6–61 mg/m³), respectively for 2013–2017. These levels are higher than twice the values observed at the monitoring stations, particularly Dasan station which is located before the confluence of this tributary, as shown in Table 1. Here, EC can be the appropriate indicator to explain the tributary effects on the seasonal variation of Chl-a in the study area because the tributary influx, which is the external source of Chl-a, usually exhibits high concentration of EC induced by the industrial facilities³⁵. Therefore, WT, EC, TN, TP and Q were determined as the input variables of the ANN models to derive the relationship between the river environmental factors and Chl-a using the field-based monitoring data.

Table 1 Daily water quality and discharge data collected at monitoring stations in the Nakdong River for 5 years (2013–2017).

Full size table

The ANN models were constructed at several water quality monitoring stations such as Shinam station, Dasan station, and Sangdong station located at upstream, midstream and downstream of the Nakdong River, respectively, as shown in Fig. 1. The 5-year (2013–2017) daily data for WT, EC, TN, TP and Chl-a were available at these monitoring stations, and the corresponded data for Q were retrieved from the streamflow gauge stations at the multi-purpose weirs located upstream of each monitoring station. Goryung station was additionally selected as the prediction site, which is located downstream of the confluence of the Kumho River, to analyze the contribution of the tributary effluents with the high concentration of EC introduced from the tributary to the phytoplankton biomass. The input and output data of the ANN models are presented in Table 1. From this table, one can notice the abrupt rise in average values of EC, TN, TP and Chl-a at Goryung station due to the tributary effect.

Model description

The architecture of ANN models was composed of the input layer, the single hidden layers, and output layer. In these networks, the input variables in the input layer were multiplied by the weight factors to link the connections between the input layer and the hidden layer, and then the bias was added as:

$${H}_{j}=\sum _{i=1}^{N}{x}_{i}{W}_{ij}+{b}_{j}$$

(1)

where i indicates the node in the input layer; x_i is the input variable at i; j indicates the node in the hidden layer; W_ij is the weight factors at j; b_j is the bias at j; and N is the number of nodes in the input layer. The values of input variable such as WT, EC, TN, TP, and Q had different ranges so that they were scaled to the uniform ranges using the min-max normalization technique as following:

$${X}_{i}=\frac{{x}_{i}-\,\min ({x}_{i})}{\max ({x}_{i})-\,\min ({x}_{i})}$$

(2)

where X_i is the normalized value of the input variables. W_ij is the key parameter of the ANN models, and these values were optimized during the training phase by minimizing the error between prediction and target. In this process, the initial values of W_ij were selected by Xavier initialization to initiate the training of the ANN models, in which b_j was initially set to be zero³⁶. Using this technique, the weight factors were randomly selected from the uniform distribution with the interval of $\pm 1/\sqrt{N}$.

The values of H_j in the hidden layer were then transferred to the output layer passing through the activation function. With this treatment, the ANN models can generate the nonlinear relationship between the input and output values. This study adopted the tangent hyperbolic function as the activation function, which can be described as:

$$\delta ({H}_{j})=\,\tanh ({H}_{j})$$

(3)

The values processed with the activation function in the hidden layer were multiplied by the weight factors that are located between the hidden layer and the output layer. The output layer included the single node which indicates Chl-a predicted by the ANN models and can be obtained as:

$$\hat{C}=\sum _{j=1}^{M}\delta ({H}_{j}){W}_{jk}+{b}_{k}$$

(4)

where $\hat{C}$ is the predicted Chl-a; k denotes the node in the output layer; W_jk is the weight factor at k; b_k is the bias at k; and M is the number of the nodes in the hidden layer. The ANN models were optimized using the cost function which represents the sum of squared errors calculated as:

$${\rm{SSE}}=\frac{1}{2}\sum _{i=1}^{n}{({\hat{C}}_{i}-{C}_{i})}^{2}$$

(5)

where n is the number of data; and ${\hat{C}}_{i}$ and C_i are the i^th predicted and observed Chl-a, respectively. To minimize the cost function, all trainable parameters including the weight factors and the bias in the ANN models were updated at each epoch using the adaptive gradient descent algorithm³⁷. In this process, the ANN models were trained using 70% of the total datasets to estimate the trainable parameters. The remaining 30% datasets were used for testing the trained ANN models. Using the optimized parameters, the contribution of the river environmental factors to the Chl-a prediction can be calculated as³⁸:

$$R{I}_{i}=\sum _{i=1}^{N}\frac{|{W}_{ij}{W}_{jk}|}{\sum _{j=1}^{M}|{W}_{ij}{W}_{jk}|}\times 100 \% $$

(6)

where RI_i is the relative importance of the i^th input variable. This method determines the relative importance of each input variable in the ANN models by partitioning the neural network connection weights³⁹.

Data clustering

As aforementioned, the response of Chl-a to the explanatory variables changes according to the phytoplankton species dominant in rivers. For this reason, the ANN models need to be structured separately for capturing the temporally varying relationship between the river environmental factors and Chl-a. In the Nakdong River, the phytoplankton communities such as cyanobacteria and diatoms shifts seasonally so that their succession can be explained by the change of the water temperature⁴⁰. However, it is not usually feasible to identify the thresholds of the water temperature for differentiating cyanobacteria and diatoms from the time-series data of Chl-a. Hence, the total datasets were partitioned into several clusters based on the water temperature by adopting K-means clustering in order to construct the multiple ANN models corresponding to the number of the clusters. This clustering technique is the algorithm to split the datasets into K clusters, in which the partitioned datasets belong to each cluster with the nearest mean. The partition of the datasets was processed by minimizing the sum of squares of distances between data and the corresponding cluster centroid as following⁴¹:

$$E={\sum _{l=1}^{K}\sum _{{x}_{i}\in {C}_{l}}\Vert {x}_{i}-{\mu }_{l}\Vert }^{2}$$

(7)

where K indicates the number of the clusters; l is the cluster; $\Vert {x}_{i}-{\bar{\mu }}_{l}\Vert $ is the Euclidean distance between x_i and μ_l; x_m is the i^th observation data; and μ_l is the centroid of l^th cluster. Table 2 summarizes the statistics of the input and output data before and after clustering the datasets at Goryung station. In this table for K = 3, Cluster 31, Cluster 32 and Cluster 33 represent the datasets belonging to low, intermediate and high water temperature, respectively. The datasets collected at other monitoring stations were also clustered following the same manner as demonstrated in Table 2.

Table 2 Summary of clustered inputs (WT, EC, TN, TP, Q) and output (Chl-a) data collected at Goryung station.

Full size table

Model performance criteria

To evaluate the performance of the trained ANN models by comparing predictions with observations, the statistical performance measures were used as follows:

$${{\rm{R}}}^{2}=1-\frac{\sum _{i=1}^{n}{({C}_{i}-{\hat{C}}_{i})}^{2}}{\sum _{i=1}^{n}{({C}_{i}-{\bar{C}}_{i})}^{2}}$$

(8)

$${\rm{RSR}}=\frac{\sqrt{\sum _{i=1}^{n}{({C}_{i}-{\hat{C}}_{i})}^{2}}}{\sqrt{\sum _{i=1}^{n}{({C}_{i}-{\bar{C}}_{i})}^{2}}}$$

(9)

$${\rm{APBIAS}}=\frac{\sum _{i=1}^{n}|{C}_{i}-{\hat{C}}_{i}|}{\sum _{i=1}^{n}{C}_{i}}\times 100 \% $$

(10)

R² ranges from 0 to 1 and higher values indicate better model performance. RSR is ratio of RMSE and standard deviation of the observed data. APBIAS measures the average tendency of the prediction results to be more deviated than the observation data. The optimal value of RSR and PBIAS is 0, which indicates the perfect model prediction.

Results and Discussion

ANN models without clustering

Before implementing the ANN modeling incorporated with K-means clustering, the ANN models without clusters, the non-clustered ANN models (K = 1) were first constructed at Shinam station, Dasan station, Goryung station, and Sangdong station. This study employed k-fold cross validation to determine the optimal number of the hidden neuron for the ANN models, and the optimal number was 15 for all prediction sites. Table 3 summarizes the training and testing results at each monitoring station, in which SH-1, DA-1, GO-1, and SA-1 denote the non-clustered models for Shinam station, Dasan station, Goryung station and Sangdong station, respectively. According to this table, the trained ANN models showed R² of 0.507–0.624 at the prediction sites. As a result of testing the trained ANN models, the prediction accuracy was R² of 0.345, 0.428, 0.465, and 0.367 of at Shinam station, Dasan station, Goryung station and Sangdong station, respectively. Figure 2 is evident that the predicted values of Chl-a tended to be relatively underestimated compared to the observations at all prediction stations.

Table 3 Prediction accuracy of ANN models with different numbers of clusters.

Full size table

ANN models with clustering

The clustered ANN models were constructed by partitioning the input and output data into 2 and 3 cluster groups (K = 2 and 3) according to the ranges of the water temperature using K-means clustering. The optimal number of the hidden neuron was 15 for both ANN models with 2 and 3 clusters, referred to the 2-cluster ANN models and 3-cluster ANN models, respectively at the monitoring stations. Figure 2 indicates that the 2-cluster ANN models (SH-2, DA-2, GO-2 and SA-2) showed the enhanced prediction accuracy by comparison with that of the non-clustered ANN models. The underestimated predictions of Chl-a, caused by the non-clustered ANN models, was diminished with the 2-cluster ANN models. The prediction accuracy was even more improved when using the 3-cluster ANN models (SH-3, DA-3, GO-3 and SA-3) that distinctively minimized the deviations from the perfect linear line, as shown in Fig. 2 as the 3-cluster models partitioned the datasets into three different groups indicating the diatom season (Cluster 31), transition season (Cluster 32) and cyanobacteria season (Cluster 33). The training results of the 3-cluster ANN models showed R² of 0.690–0.806, and the trained ANN models resulted in R² of 0.485, 0.618, 0.634 and 0.545 at Shinam station, Dasan station, Goryung station and Sangdong station, respectively, as shown in Table 3. Moreover, RSR and APBIAS decreased by about 0.10–0.15 and 6–10%, respectively, compared to those resulted from the non-clustered ANN models.

As a result of the 3-cluster ANN modeling, the prediction accuracy for Chl-a in Cluster 32 and Cluster 33 was generally lower than that in Cluster 31 at all monitoring stations, as shown in Fig. 3. Ha et al. (2003) and Kim et al. (2018a) reported that centric diatoms, Stephanodiscus usually occupy about 85% of total phytoplankton population in the Nakdong River from late fall to spring (about 5–15 °C)^18,30. Thus, the ANN models adequately explained the seasonal behavior of Chl-a in the low water temperature below about 15 °C as the concentration of Chl-a in Cluster 31 was governed by the population of the single diatom assemblage. Whereas, in the warm seasons, the multiple communities of cyanobacteria such as Microcystis, Anabaena and Aphanizomenon constitute the biomass of total phytoplankton in the study area⁴². The complex dynamics of the cyanobacteria therefore hampered the accurate prediction for Chl-a belonging to Cluster 32 and Cluster 33.

Relative importance of explanatory variables

Figure 4 shows the relative importance of the explanatory variables on Chl-a prediction, which was calculated using Eq. (6), at the target stations using both non-clustered and clustered ANN models. According to Fig. 4(a,d,g,j), at all prediction sites, the non-clustered ANN models revealed that Q was the most dominant factor in predicting the seasonal variation of Chl-a as this hydraulic variable contributed 32.5–46.7% to the prediction. On the other hand, less than 20% was contributed to the prediction by each of other input variables such as WT, EC, TN, and TP. Since the multi-purpose weirs controlled the river discharge, Q accounted for the water residence time which affects the intensity and timing of the phytoplankton bloom, and thereby substantially influenced the seasonal dynamics of Chl-a.

Similar to the results of the non-clustered ANN modeling, with the 2-cluster ANN models, Q was the most influential input variable on the daily fluctuation of Chl-a. Here, the influence of Q was stronger in Cluster 22 than in Cluster 21 while its relative importance increased up to 52.7%, as depicted in Fig. 4(b,e,h,k). Among the nutrient variables, TP exhibited the high values of the relative importance up to 32.2% when predicting Chl-a belonging to Cluster 21. However, for the prediction of Chl-a in Cluster 22, the relative importance of TP drastically decreased to about 10% which was occasionally lower than that of TN. These results indicated that TP did not always act as the limiting nutrient for the phytoplankton growth despite it has generally been known that the phytoplankton growth in the freshwater systems was limited by TP concentration^43,44. This seasonal variation in the relative importance of the river environmental factors was not captured with the conventional ANN approaches like the non-clustered ANN models that assume the temporal homogeneity in the input-output relationship.

The results with the 3-cluster ANN models revealed that the significance of Q was amplified with increase in water temperature as the reach-averaged relative importance of Q was 26.0, 38.4 and 46.2% for Cluster 31, Cluster 32 and Cluster 33, respectively. In the meanwhile, the significance of TP declined as water temperature increased as the relative importance of TP ranged 18.0–29.3%, 12.9–19.9% and 7.3–13.4% for Cluster 31, Cluster 32 and Cluster 33, respectively, as shown in Fig. 4(c,f,i,l). In the Nakdong River, Chl-a in Cluster 31 represents the diatom concentration while that in Cluster 33 usually constitutes the cyanobacterial biomass. Thus, from these results, one can notice that TP played a role as the limiting factor for the growth of not cyanobacteria but diatoms, while the seasonal variation of cyanobacteria was highly affected by the retention time rather than the nutrient concentration. The previous studies reported that the forming of cyanobacteria was strongly impacted by the artificial mixing stimulated by the increase in Q as the enhanced vertical mixing destructs the thermal stratification as well as disturbs the buoyancy mechanism to alleviate the intensity of the cyanobacterial bloom^4,45,46. Hence, the results of the clustered ANN modeling elucidated that the relationship between Chl-a and the river environmental factors was temporally heterogeneous and dependent on the seasonal succession of the phytoplankton community.

The results also showed that the significance of the river environmental factors changed according to the location of the monitoring stations because the relationship between the target water quality and the explanatory variables can vary spatially due to the biogeochemical characteristics inherent to the specific locations³⁵. As aforementioned, the water quality variables including Chl-a observed at Goryung station was strongly influenced by the effluents introduced from the Kumho River, which constitute the high concentration of EC. Figure 4(i) represents that the seasonal variation of Chl-a at this monitoring station was considerably explained by EC which contributed 23.6–30.8% to the prediction with the 3-cluster ANN models as the water quality of this hypereutrophic Kumho River acted as the external sources of Chl-a. Whereas, the relative importance of EC was almost less than 20% at other monitoring stations. The tributary inflow therefore needs to be regarded as the explanatory variable for the Chl-a prediction in the rivers including the confluence zone.

Impact assessment of weir discharge on cyanobacterial bloom

To propose the countermeasures against toxic cyanobacterial bloom prevailing in summer, also known as harmful algal bloom (HAB) which severely impacts the ecosystem functioning in the Nakdong River, this study performed the scenario-based simulation using the 3- cluster ANN models. The mitigation approaches primarily focused on the effects of the weir discharge control on the reduction of HAB as Q was the dominant factor over other input variables in the prediction of Chl-a belonging to high water temperature (Cluster 33) at all monitoring stations. For the scenario generation, Q increased from 20 to 600 m³/s with the increment of 20 m³/s, and the remaining input variables were fixed to the average values of the datasets corresponding to Cluster 33 of each monitoring station.

As a result of the scenario-based simulation, with the non-clustered ANN models, the concentrations of Chl-a diminished to 50% of the maximum values at all monitoring stations when Q increased up to 600 m³/s, and no distinct change in Chl-a was simulated in the further increase in Q, as shown in Fig. 5(a). On the other hand, the 3-cluster ANN models demonstrated that the same level of the Chl-a reduction was obtained with Q of 400 m³/s which was only 67% of the amount estimated by the non-clustered ANN models, as illustrated in Fig. 5(b). Consequently, the non-clustered ANN tended to overestimate the optimal level of Q for the HAB control, which could mislead the flow management in the Nakdong River.

The seasonal fluctuation of Chl-a at the specific site like Goryung station was very sensitive to the change in EC, as demonstrated in Fig. 4(i). The additional scenario-based simulation was thus conducted to investigate the effect of EC on determining the optimal value of Q for the HAB reduction. The scenarios were prepared as follows; Q was set to 250–600 m³/s with the increment of 50 m³/s, and EC ranged 140–500 μS/cm with the increment of 20 μS/cm. Other explanatory variables were constant to the average values of the datasets belonging to Cluster 33 at all target sites, as shown in Table 3.

The simulation results showed that, in the case of Goryung station, the additional amount of Q was required to control Chl-a with the increase in EC. Figure 6(a) indicates the abrupt rise in Chl-a in the region of EC ranging 360–600 μS/cm, which corresponds to 48% of the total EC data at the target site. For example, Chl-a of 40 mg/m³ was simulated when EC and Q were set to 360 μS/cm which is the average value of Cluster 33 and 400 m³/s derived from Fig. 5(b), respectively. However, if EC increased to 460 μS/cm, the corresponded amount of Q was 560 m³/s to retain the same value for Chl-a (40 mg/m³) as obtained in the former case. In contrast, in the case of Sangdong station with Q of 400 m³/s, Chl-a did not change significantly and remained relatively constant to around 15–25 m³/s in the range of EC less than 300 μS/cm, which corresponds to 85% of the total EC data at this monitoring station, as shown in Fig. 6(b). Hence, in the river reaches adjacent to the confluence of the contaminated tributaries, the water quality variable accounting for the tributary effluents should be considered as the crucial factor in predicting the phytoplankton abundance and furthermore assessing the optimal level of Q for the HAB mitigation during the summer season.

Conclusions

This study applied the clustering technique to the ANN models for predicting the seasonal variation of Chl-a by capturing the relationship between the river environmental factors and Chl-a at the water quality monitoring stations located in the Nakdong River, of which the discharge was regulated by a number of the weirs. The results demonstrated that the clustered ANN models more accurately reproduced the temporal distribution of Chl-a. With the 3-cluster ANN models, R² increased to 0.485, 0.618, 0.634 and 0.545 from 0.345, 0.428, 0.465 and 0.367 at Shinam station, Dasan station, Goryung station and Sangdong station, respectively for 5 years (2013–2017), compared to the non-clustered ANN models.

Using the clustered ANN models, this study assessed the relative importance of the river environmental variables on the seasonal dynamics of Chl-a. The results showed that Q contributed more than 45% to the predicted Chl-a because the water residence time played a vital role in governing the phytoplankton growth mechanism in the flowing water body. In addition, the influence of Q tended to amplify with the increase in the water temperature because the artificial mixing induced by the enhanced Q strongly impacted the timing and magnitude of cyanobacterial bloom occurring in summer. Whereas, the relative importance of TP declined to less than 10% as water temperature increased even if TP still worked as the key limiting factors for Chl-a belonging to the low water temperature, which represented the diatom abundance, as the relative importance of TP increased up to about 30%. Therefore, it can be concluded that the relationship between Chl-a and the river environmental factors is temporally heterogeneous in the study reach.

Furthermore, this study proposed the countermeasures against HAB associated with the cyanobacterial bloom, taking into account the effect of the flow control on the HAB reduction with the clustered ANN models. The results of the 3-cluster ANN models illustrated that Q of 400 m³/s was the optimal level for the HAB mitigation. However, at Goryung station, it was found that the mitigation discharge should increase up to 1.5 times the original level of Q owing to the increase in EC resulted from the excessive effluents from the Kumho River. Therefore, it was important not only to manage the weir discharge but also to reduce the tributary-induced pollutant sources of Chl-a in order to suppress the phytoplankton blooms in the regulated Nakdong River.

References

Anderson, D. M., Glibert, P. M. & Burkholder, J. M. Harmful algal blooms and eutrophication: nutrient sources, composition, and consequences. Estuaries 25(4), 704–726 (2002).
Article Google Scholar
Oliver, R. L. & Merrick, C. J. Partitioning of river metabolism identifies phytoplankton as a major contributor in the regulated Murray River (Australia). Freshwater Biology 51(6), 1131–1148 (2006).
Article CAS Google Scholar
Paerl, H. W., Valdes-Weaver, L. M., Joyner, A. R. & Winkelmann, V. Phytoplankton indicators of ecological change in the eutrophying Pamlico Sound system, North Carolina. Ecological Applications 17(sp5), S88–S101 (2007).
Article Google Scholar
Waylett, A. J., Hutchins, M. G., Johnson, A. C., Bowes, M. J. & Loewenthal, M. Physico-chemical factors alone cannot simulate phytoplankton behaviour in a lowland river. Journal of hydrology 497, 223–233 (2013).
Article ADS CAS Google Scholar
Watson, S. B., Ridal, J. & Boyer, G. L. Taste and odour and cyanobacterial toxins: impairment, prediction, and management in the Great Lakes. Canadian Journal of Fisheries and Aquatic Sciences 65(8), 1779–1796 (2008).
Article CAS Google Scholar
Hijnen, W. A. et al. Removal and fate of Cryptosporidium parvum, Clostridium perfringens and small-sized centric diatoms (Stephanodiscus hantzschii) in slow sand filters. Water research 41(10), 2151–2162 (2007).
Article CAS Google Scholar
Gregor, J. & Maršálek, B. Freshwater phytoplankton quantification by chlorophyll a: a comparative study of in vitro, in vivo and in situ methods. Water Research 38(3), 517–522 (2004).
Article CAS Google Scholar
Najah, A., Elshafie, A., Karim, O. A. & Jaffar, O. Prediction of Johor River water quality parameters using artificial neural networks. European Journal of Scientific Research 28(3), 422–435 (2009).
Google Scholar
Lek, S. & Guégan, J. F. Artificial neural networks as a tool in ecological modelling, an introduction. Ecological modelling 120(2-3), 65–73 (1999).
Article Google Scholar
Maier, H. R. & Dandy, G. C. Neural networks for the prediction and forecasting of water resources variables: a review of modelling issues and applications. Environmental modelling & software 15(1), 101–124 (2000).
Article Google Scholar
Recknagel, F. Applications of machine learning to ecological modelling. Ecological Modelling 146(1–3), 303–310 (2001).
Article Google Scholar
Olden, J. D. An artificial neural network approach for studying phytoplankton succession. Hydrobiologia 436(1–3), 131–143 (2000).
Article CAS Google Scholar
Jeong, K. S., Joo, G. J., Kim, H. W., Ha, K. & Recknagel, F. Prediction and elucidation of phytoplankton dynamics in the Nakdong River (Korea) by means of a recurrent artificial neural network. Ecological Modelling 146(1–3), 115–129 (2001).
Article CAS Google Scholar
Jeong, K. S., Recknagel, F. & Joo, G. J. Prediction and elucidation of population dynamics of the blue-green algae Microcystis aeruginosa and the diatom Stephanodiscus hantzschii in the Nakdong River-Reservoir System (South Korea) by a recurrent artificial neural network. Ecological Informatics (pp. 255–273). Springer, Berlin, Heidelberg (2006).
Kim, J., Kim, J. & Cho, Y. Establishing a predictive model for Chlorophyll-A concentration in lake daechung, Korea using multilinear statistical techniques. Journal of Environmental Engineering 141(2), 04014061 (2014).
Article Google Scholar
Wu, N., Huang, J., Schmalz, B. & Fohrer, N. Modeling daily chlorophyll a dynamics in a German lowland river using artificial neural networks and multiple linear regression approaches. Limnology 15(1), 47–56 (2014).
Article Google Scholar
Wehr, J. D. & Descy, J. P. Use of phytoplankton in large river management. Journal of Phycology 34(5), 741–749 (1998).
Article Google Scholar
Ha, K., Jang, M. H. & Joo, G. J. Winter Stephanodiscus bloom development in the Nakdong River regulated by an estuary dam and tributaries. Hydrobiologia 506(1–3), 221–227 (2003).
Article Google Scholar
Cottingham, K. L., Ewing, H. A., Greer, M. L., Carey, C. C. & Weathers, K. C. Cyanobacteria as biological drivers of lake nitrogen and phosphorus cycling. Ecosphere 6(1), 1–19 (2015).
Article Google Scholar
Dauta, A., Devaux, J., Piquemal, F. & Boumnich, L. Growth rate of four freshwater algae in relation to light and temperature. Hydrobiologia 207(1), 221–226 (1990).
Article Google Scholar
Litchman, E., Steiner, D. & Bossard, P. Photosynthetic and growth responses of three freshwater algae to phosphorus limitation and daylength. Freshwater Biology 48(12), 2141–2148 (2003).
Article CAS Google Scholar
Lürling, M., Eshetu, F., Faassen, E. J., Kosten, S. & Huszar, V. L. Comparison of cyanobacterial and green algal growth rates at different temperatures. Freshwater Biology 58(3), 552–559 (2013).
Article Google Scholar
Kuo, J. T., Hsieh, M. H., Lung, W. S. & She, N. Using artificial neural network for reservoir eutrophication prediction. Ecological modelling 200(1–2), 171–177 (2007).
Article Google Scholar
Forman, R. T. Urban ecology: science of cities. (Cambridge University Press, 2014).
Brown, C. D., Canfield, D. E. Jr., Bachmann, R. W. & Hoyer, M. V. Seasonal patterns of chlorophyll, nutrient concentrations and Secchi disk transparency in Florida lakes. Lake and Reservoir Management 14(1), 60–76 (1998).
Article Google Scholar
Tucker, C. S. & D’Abramo, L. R. Managing high pH in freshwater ponds. (Southern Regional Aquaculture Center, 2008).
Park, J., Wang, D. & Lee, W. H. Evaluation of weir construction on water quality related to algal blooms in the Nakdong River. Environmental earth sciences 77(11), 408 (2018).
Article Google Scholar
Cha, Y., Park, S. S., Lee, H. W. & Stow, C. A. A Bayesian hierarchical approach to model seasonal algal variability along an upstream to downstream river gradient. Water Resources Research 52(1), 348–357 (2016).
Article ADS Google Scholar
Kim, J. S., Seo, I. W. & Baek, D. Modeling spatial variability of harmful algal bloom in regulated rivers using a depth-averaged 2D numerical model. Journal of Hydro-environment Research 20, 63–76 (2018).
Article ADS Google Scholar
Kim, J. S., Seo, I. W., Lyu, S. & Kwak, S. Modeling water temperature effect in diatom (Stephanodiscus hantzschii) prediction in eutrophic rivers using a 2D contaminant transport model. Journal of Hydro-environment Research 19, 41–55 (2018).
Article Google Scholar
Mitrovic, S. M., Hardwick, L. & Dorani, F. Use of flow management to mitigate cyanobacterial blooms in the Lower Darling River, Australia. Journal of Plankton Research 33(2), 229–241 (2010).
Article Google Scholar
Tekile, A., Kim, I. & Kim, J. Mini-review on river eutrophication and bottom improvement techniques, with special emphasis on the Nakdong River. Journal of environmental sciences 30, 113–121 (2015).
Article Google Scholar
Yoon, T., Rhodes, C. & Shah, F. A. Upstream water resource management to address downstream pollution concerns: A policy framework with application to the Nakdong River basin in South Korea. Water Resources Research 51(2), 787–805 (2015).
Article ADS CAS Google Scholar
Hwang, S. J., Bae, H. K. & Kim, H. Y. The Effect of for Major River Project and Kumho River on Nakdong River’s Water Quality-Focused on Kangjung-Koryung Weir. Journal of Environmental Science International 22(6), 695–703 (2013).
Article Google Scholar
Kim, S. E., Seo, I. W. & Choi, S. Y. Assessment of water quality variation of a monitoring network using exploratory factor analysis and empirical orthogonal function. Environmental Modelling & Software 94, 21–35 (2017).
Article Google Scholar
Glorot, X. & Bengio, Y. Understanding the difficulty of training deep feedforward neural networks. Proceedings of the thirteenth international conference on artificial intelligence and statistics, 249–256 (2010).
Duchi, J., Hazan, E. & Singer, Y. Adaptive subgradient methods for online learning and stochastic optimization. Journal of Machine Learning Research 12(Jul), 2121–2159 (2011).
MathSciNet MATH Google Scholar
Garson, G. D. Interpreting neural-network connection weights. AI expert 6(4), 46–51 (1991).
Google Scholar
Millie, D. F. et al. Modeling phytoplankton abundance in Saginaw Bay, Lake Huron: Using artificial neural networks to discern functional influence of environmental variables and relevance to a great lake observing system 1. Journal of phycology 42(2), 336–349 (2006).
Article CAS Google Scholar
Yu, J. J. et al. Relations of nutrient concentrations on the seasonality of algal community in the Nakdong River, Korea. Journal of Korean Society on Water Environment 31(2), 110–119 (2015).
Article Google Scholar
Hartigan, J. A. & Wong, M. A. Algorithm AS 136: A k-means clustering algorithm. Journal of the Royal Statistical Society. Series C (Applied Statistics) 28(1), 100–108 (1979).
Google Scholar
Hur, M. et al. Temporal shifts in cyanobacterial communities at different sites on the Nakdong River in Korea. Water research 47(19), 6973–6982 (2013).
Article CAS Google Scholar
Conley, D. J. et al. Controlling eutrophication: nitrogen and phosphorus. Science 323(5917), 1014–1015 (2009).
Article CAS Google Scholar
Hecky, R. E. & Kilham, P. Nutrient limitation of phytoplankton in freshwater and marine environments: a review of recent evidence on the effects of enrichment 1. Limnology and Oceanography 33(4part2), 796–822 (1988).
Article ADS CAS Google Scholar
Paerl, H. W., Fulton, R. S., Moisander, P. H. & Dyble, J. Harmful freshwater algal blooms, with an emphasis on cyanobacteria. The Scientific World Journal 1, 76–113 (2001).
Article CAS Google Scholar
Visser, P. M., Ibelings, B. W., Bormans, M. & Huisman, J. Artificial mixing to control cyanobacterial blooms: a review. Aquatic Ecology 50(3), 423–441 (2016).
Article CAS Google Scholar

Download references

Acknowledgements

This research was supported by the BK21 PLUS research program of the National Research Foundation of Korea. This research work was conducted at the Institute of Engineering Research, and Institute of Construction and Environmental Engineering in Seoul National University, Seoul, Korea.

Author information

Authors and Affiliations

Department of Earth Sciences, University of Minnesota, Minneapolis, MN, 55455, USA
Jun Song Kim
Department of Civil and Environmental Engineering, Seoul National University, Seoul, 08826, South Korea
Il Won Seo & Donghae Baek
Institute of Engineering Research, Seoul National University, Seoul, 08826, South Korea
Jun Song Kim

Authors

Jun Song Kim
View author publications
You can also search for this author in PubMed Google Scholar
Il Won Seo
View author publications
You can also search for this author in PubMed Google Scholar
Donghae Baek
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.S.K. conceived the study framework, performed the data collection, model development and results analysis, and wrote the paper. I.W.S. supervised the study and validated the results. D.B. contributed to the model development. All authors contributed to the manuscript.

Corresponding author

Correspondence to Il Won Seo.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Kim, J.S., Seo, I.W. & Baek, D. Seasonally varying effects of environmental factors on phytoplankton abundance in the regulated rivers. Sci Rep 9, 9266 (2019). https://doi.org/10.1038/s41598-019-45621-1

Download citation

Received: 26 February 2019
Accepted: 11 June 2019
Published: 25 June 2019
DOI: https://doi.org/10.1038/s41598-019-45621-1

This article is cited by

Application and recent progress of inland water monitoring using remote sensing techniques
- Qi Cao
- Gongliang Yu
- Zhiyi Qiao
Environmental Monitoring and Assessment (2023)
Inter-annual and intra-annual variations in water quality and its response to water-level fluctuations in a river-connected lake, Dongting Lake, China
- Mingming Geng
- Yandong Niu
- Yonghong Xie
Environmental Science and Pollution Research (2022)
Dynamics of phytoplankton community in seasonally open and closed wetlands in the Teesta–Torsa basin, India, and management implications for sustainable utilization
- Pranab Gogoi
- Suman Kumari
- Basanta Kumar Das
Environmental Monitoring and Assessment (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.