Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Drought index prediction using advanced fuzzy logic model: Regional case study over Kumaon in India

  • Anurag Malik,

    Roles Conceptualization, Data curation, Investigation, Software, Writing – original draft, Writing – review & editing

    Affiliation Department of Soil and Water Conservation Engineering, College of Technology, G.B. Pant University of Agriculture & Technology, Uttarakhand, India

  • Anil Kumar,

    Roles Supervision, Writing – original draft, Writing – review & editing

    Affiliation Department of Soil and Water Conservation Engineering, College of Technology, G.B. Pant University of Agriculture & Technology, Uttarakhand, India

  • Sinan Q. Salih,

    Roles Investigation, Validation, Writing – original draft, Writing – review & editing

    Affiliation Institute of Research and Development, Duy Tan University, Da Nang, Vietnam

  • Sungwon Kim,

    Roles Funding acquisition, Investigation, Writing – original draft, Writing – review & editing

    Affiliation Department of Railroad Construction and Safety Engineering, Dongyang University, Yeongju, South Korea

  • Nam Won Kim,

    Roles Funding acquisition, Writing – original draft, Writing – review & editing

    Affiliation Department of Land, Water and Environment Research Institute: Korea Institute of Civil Engineering and Building Technology, Goyang, South Korea

  • Zaher Mundher Yaseen ,

    Roles Conceptualization, Investigation, Project administration, Validation, Visualization, Writing – original draft, Writing – review & editing

    yaseen@tdtu.edu.vn

    Affiliation Sustainable Developments in Civil Engineering Research Group, Faculty of Civil Engineering, Ton Duc Thang University, Ho Chi Minh City, Vietnam

  • Vijay P. Singh

    Roles Supervision, Writing – original draft, Writing – review & editing

    Affiliations Department of Biological and Agricultural Engineering and Zachry Department of Civil Engineering, Texas A&M University, Austin, Texas, United States of America, National Water Center, UAE University, Al Ein, United Arab Emirates

Abstract

A new version of the fuzzy logic model, called the co-active neuro fuzzy inference system (CANFIS), is introduced for predicting standardized precipitation index (SPI). Multiple scales of drought information at six meteorological stations located in Uttarakhand State, India, are used. Different lead times of SPI were computed for prediction, including 1, 3, 6, 9, 12, and 24 months, with inputs abstracted by autocorrelation function (ACF) and partial-ACF (PACF) analysis at 5% significance level. The proposed CANFIS model was validated against two models: classical artificial intelligence model (e.g., multilayer perceptron neural network (MLPNN)) and regression model (e.g., multiple linear regression (MLR)). Several performance evaluation metrices (root mean square error, Nash-Sutcliffe efficiency, coefficient of correlation, and Willmott index), and graphical visualizations (scatter plot and Taylor diagram) were computed for the evaluation of model performance. Results indicated that the CANFIS model predicted the SPI better than the other models and prediction results were different for different meteorological stations. The proposed model can build a reliable expert intelligent system for predicting meteorological drought at multi-time scales and decision making for remedial schemes to cope with meteorological drought at the study stations and can help to maintain sustainable water resources management.

1. Introduction

Drought is among the natural hazards and a recurrent climatic feature observed in most climatic regions in the world. Factors determining the impact of drought include its severity, areal extent, frequency and duration [1]. Drought, as one of the environmental disasters, has received much attention in various fields, including environment, ecology, hydrology, meteorology, geology, and agriculture. Drought accounts for almost half of all the natural disasters since 1967 and has affected almost 2.8 billion people in the world. About 1.3 million deaths out of the estimated 3.5 million disaster-related deaths are either directly or indirectly related to droughts [2]. Accurate drought prediction is dependent on the right selection of the input variables and determines the type of drought to be predicted [3,4]. The analysis of meteorological drought (MD) requires accurate precipitation data while hydrologic drought can only be analyzed with accurate lake level, streamflow, and reservoir. Regarding groundwater drought, its analysis is dependent on the level of groundwater while agricultural drought analysis is dependent on accurate crop yield, soil moisture, and a combination of elements such as precipitation, soil moisture, and temperature data. Accurate forecasting of drought is therefore essential for multiple water resources planning, optimal operation of the irrigation system, drought preparedness, and mitigation.

Over recent decades, machine learning (ML) and autoregressive integrated moving average (ARIMA; time-series) models have been utilized in prediction of MD using metrics such as standardized precipitation-evapotranspiration index (SPEI), standardized precipitation index (SPI), and effective drought index (EDI). Rezaeian-Zadeh and Tabari [5] used multi-scale SPI values to develop an MLPNN model for forecast meteorological drought in Iran. They found that the MLPNN model forecasted drought at SPI-12 and SPI-24 more accurately. Shirmohammadi et al. [6] predicted meteorological drought in Iran using several versions of artificial intelligence (AI) models, including standalone ANFIS and ANN models and their complementary versions by integrating with wavelet transformation (WT) with time-series data processing. They showed the potential of complementary approaches over the standalone models. Belayneh et al. [7] developed ARIMA, ANN, support vector regression (SVR), WT-ANN, and WT-SVR models for predicting drought in Awash river basin of Ethiopia using 12 and 24 SPI scales. Results showed that the complementary models performed superior to other models for both drought scales.

Danandeh Mehr et al. [8] predicted long-lead-time drought using linear genetic programming (LGP), WT-LGP, and WT fuzzy-logic based on synoptic climate variables over Texas. Results of analysis demonstrated that the LGP model poorly performed to represent the stochasticity of the 3-month SPI. However, the WA-LGP effectively predicted drought for 3-, 6-, and 12-month lead times. Deo & Şahin [9] predicted drought in eastern Australia using extreme learning machine (ELM) and ANN models based on EDI. The prediction was performed using monthly precipitation data from 1957 to 2011. The results of the analysis illustrated that the ELM model performed superior to the ANN model. Deo and Şahin [10] studied the feasibility of using ANN for meteorological drought prediction in eastern Australia based on SPI and SPEI. The outcome of the study found the ANN model capable of predicting the SPEI and SPI over the considered area. Nguyen et al. [11] focused on the prediction of drought at short-term and long term basis in Cai river basin, Vietnam using the ANFIS model; the prediction was based on SPI & SPEI. From the result, SPI and SPEI were found useful in predicting short- and long-term drought, respectively. Rafiei-Sardooi et al. [12] applied neuro-fuzzy (NF) and ARIMA models to predict meteorological drought in Iran, using 3- and 12-month SPI. Results indicated that the NF model acceptably predicted SPI-2 and SPI-12 scales.

Currently, the fuzzy logic-based model has been applied to diverse fields of engineering sciences for multiple risk assessments [1316]. Khalil et al. [14] applied cascaded fuzzy logic-layer of protection analysis (CFL-LOPA) model for risk management in the natural gas industry, and they found the superior performance of CFL-LOPA model for maintaining the safety integrity level. Yan et al. [15] proposed the set pair analysis-layer of protection (SPA-LOPA) model to assess the severity of gas leakage in the biomass gasification (BG) system. Results expose the better performance of the SPA-LOPA model in the evaluation for independent protection layers of the BG system.

Recently, a number of studies have used ML models for predicting meteorological droughts using various drought indices. Mokhtarzad et al. [17] evaluated the possibility of using ANN, ANFIS, & SVM models for prediction of meteorological drought at Bojnourd, Tehran based on SPI. They confirmed the capability of the SVM model over other models. Nguyen et al. [18] assessed the ANFIS model for meteorological drought prediction using SPI and SPEI in Khanhhoa Province Vietnam. Results showed SPI & SPEI suitable for the prediction task in the study region using the ANFIS model. Zhang et al. [19] forecasted drought using the ARIMA, ANN, WA-ANN, and SVR models using 3- and 6-month SPI values in the Haihe River basin, China. The forecasted results of SPI-3 and SPI-6 revealed that the WA-ANN model better predicted than did the ANN model. Ali et al. [20] focused on multi-scalar SPI-based meteorological drought prediction in Pakistan using three different models (M5Tree, ensemble-ANFIS, & minimax probability machine regression (MPMR)). From the results, the ensemble-ANFIS model was found to outperform the other models in predicting SPI6 & SPI12 compared to SPI3 prediction. Liu et al. [21] applied ELM, online sequential ELM (OS-ELM), and self-adaptive evolutionary ELM (SAE-ELM) for drought forecasting based on SPI and SPEI in Khanhhoa Province, Vietnam. The study reported the SAE-ELM models to perform best compared to the other models. Mouatadid et al. [22] applied MLR, ELM, LSSVR, and ANN models for drought prediction over eastern Australia using multi-scalar SPI & SPEI. The study reported ELM and ANN models to perform best compared to MLR & LSSVR models in terms of drought prediction. Soh et al. [23] applied the WT-ARIMA-ANN and WT-ANFIS models for meteorological drought forecasting using 1-, 3-, and 6-month SPEI in the Langat River basin, Malaysia. Comparison of results reveals WT-ARIMA-ANN outperformed than the other for SPEI-3 and SPEI-6 prediction in the study region.

According to the literature, the exploration of new reliable and robust version of AI models is still ongoing. Also, AI models behave differently from one region to another. Hence, this is essential to understand the influence of synoptic climatological information on each station. The efficiency of the CANFIS model is investigated for drought index (SPI) forecasting. Two models (i.e., MLPNN and MLR) are developed for validation. Six meteorological stations, including Almora, Bageshwar, Champawat, Nainital, Pithoragarh, and Pantnagar, were selected for meteorological drought prediction, based on multiple SPI lead times (e.g., SPI-1, SPI-3, SPI-6, SPI-12, & SPI-24). Statistical modeling techniques (i.e., ACF and PACF) were employed for the abstraction of input based on correlated lag months.

2. Case study and applied models

2.1 Case study region and data description

The present study was conducted at six meteorological stations; Almora, Bageshwar, Champawat, Nainital, Pithoragarh, and Pantnagar positioned in the Kumaon region of Uttarakhand State, India (Fig 1, https://www.diva-gis.org/gdata). The altitude of the Kumaon region varies from 223m to 3669m above MSL with the geographical area of 21313 km2. Table 1 presents altitude, latitude, longitude, and data available all through the year in the region. The Uttarakhand State (28° 43' N to 31° 28' N latitudes, and 77° 34' E to 81° 03' E longitudes) sharing its northwest boundary with Himachal Pradesh, South boundary with Uttar Pradesh, the southeast boundary with Nepal, and the northeast boundary with China. The altitudes of Uttarakhand State ranges from 145m to 7796m above MSL and comprises with 13 districts, clustered into 2 administrative regions, (i) Garhwal region with 7 districts (Haridwar, Tehri Garhwal, Pauri Garhwal, Chamoli, Dehradun, Rudraprayag, and Uttarkashi), and (ii) Kumaon region with 6 districts (Almora, Bageshwar, Champawat, Nainital, Pithoragarh and Udham Singh Nagar (Pantnagar)). It is characterized by temperate climate, although the plains have a tropical climate, which has a temperature range of -0 to 43 °C with annual rainfall ranging from 260–3955 mm. Major rainfall events (60 to 85% of the annual total) have occurred from June to September (monsoon season).

thumbnail
Table 1. Details of study stations and rainfall data availability.

https://doi.org/10.1371/journal.pone.0233280.t001

The monthly scale if weather data (i.e., rainfall) for 5 stations; Almora, Bageshwar, Champawat, Nainital, and Pithoragarh were acquired from the Indian Meteorological Department (IMD), Pune (India). While, Pantnagar station data was obtained from the research crop Centre located at the G. B. Pant University of Agriculture and Technology, Pantnagar.

2.2 Calculation of the SPI

The standard index for defining, monitoring and analysing the meteorological drought (MD) conditions on multi-time scales is SPI, discovered by McKee et al. [24]. More than (≥ 30) years monthly precipitation data is required for computation of SPI for a given time-scale at any place by transforming the original precipitation series into a standardized normal distribution. Three probability distributions; normal, lognormal, and gamma were applied to the running sum of 1-, 3-, 6-, 9-, 12-, and 24-month rainfall series, and out of these three bests, one was decided though KS (Kolmogorov-Smirnov) test. The KS test revealed the gamma probability distributions fitted well to the running sum series of rainfall data. In the current study, the computation of SPI involved the use of gamma distribution at 1-, 3-, 6-, 9-, 12-, and 24-month time-scales over Almora, Bageshwar, Champawat, Nainital, Pithoragarh, and Pantnagar stations. For more information on the mathematical calculation of the SPI, one can refer to [2528].

2.3 Co-active neuro-fuzzy inference system (CANFIS)

Jang et al. [29] invented the basic concept of CANFIS model by extending the adaptive neuro-fuzzy inference system (ANFIS) to produce multiple outputs. It may be used as universal approximator of any nonlinear function. The CANFIS model assimilates the features of a fuzzy inference system (FIS) and artificial neural network (ANN) together in a single frame to process the complex systems rapidly and accurately. The dominant potential of CANFIS model stems from the pattern-dependent weights between the consequent layer and the fuzzy association layer. Fig 2a and 2b demonstrate the assembly of membership functions (MF) and CANFIS model with two input variables (x and y), one output (c), under first-order Takagi-Sugeno-Kang (TSK) model with IF-THEN for CANFIS model is as follows [30,31]: (1) (2) where, A1, A2, and B1, B2 = the MFs for the inputs x and y, respectively; p1, q1, r1 and p2, q2, r2 = the parameters of the consequent part (Fig 2a). The characteristics of each layer is described as follows:

  1. Layer 1 (fuzzification layer): The nodes of this layer are adaptive (square), generates membership function (or grades) of crisp input and each node output is computed as: (3) (4)
    where, O1,i = the output of the ith layer, Ai and Bi = the linguistic labels (small, medium, large etc.), x and y = the inputs to ith node, and and = the membership functions for Ai and Bi linguistic labels, respectively. The mathematical expression of the Gaussian MF is written as: (5) where, d and σ are the conditional parameters of the function. The parameters of this layer are stated as premise parameters.
  2. Layer 2 (rule layer): this node is circular and facilitated with Π operator. The output of this layer, called firing strengths, is the product of corresponding signals obtained from layer 1. For example: (6)
  3. Layer 3 (normalization layer): this layer is circular and characterized by an N operator. The main purpose of this layer is to normalize the signal of the previous layer and facilitated as normalized firing strength by: (7)
  4. Layer 4 (defuzzification layer): every node in this layer is square, and the parameters of this layer are mentioned as consequent parameters. The contribution of ith rule towards the total output is computed by Eq (8): (8)
  5. Layer 5 (summation layer): this layer is also known as output node, labeled as Σ. In this node the overall output is computed by summing all the incoming signals: (9)
thumbnail
Fig 2. (a) MFs of two input variables in TSK model, and (b) architecture of proposed CANFIS model.

https://doi.org/10.1371/journal.pone.0233280.g002

In this research, the CANFIS model was formulated with error-and-trail procedure using gaussian (Gauss) MF, TSK fuzzy model, hyperbolic tangent (Tanh) activation function, and delta-bar-delta (D-B-D) learning algorithm for multi-scalar SPI prediction at six study stations. NeuroSolutions 5.0 software [32] was utilized to calibrate (train) the CANFIS model with a threshold of 0.001 for 1000 iterations.

2.4 Multi-layer perceptron neural network (MLPNN) model

Haykin [33] was the first scholar introduced the concept of the MLPNN model. MLPNN model is a network of several layers of parallel processing units called neurons. In the MLPNN model, each layer is linked to the subsequent layer via interconnections called weights (W). A typical illustration of the feed forward MLPNN model, which consists of input (i), hidden (j) and output (k) layers through interconnected weights (Wij & Wjk) among the neuron layers is shown in Fig 3. The exact number of neurons and hidden layers are required for accurate mapping of the entire training dataset, which is problem-specific (the number of predictors and predictands). The correction of values of the initially estimated weights is progressively done through training by matching the predicted output with the pre-determined through backpropagation [34]. The explicit expression for an output value in the MLPNN model is written as: (10) where, Y is the output vector, Wij is the weight in hidden layer connecting the ith neuron in the input layer and jth neuron in the hidden layer, Wjk is the weight in the output layer connecting the jth neuron in the hidden layer and kth neuron in the output layer, Xi is ith input variable for input-layer, Ni and Nj are the neurons in the input and hidden layers, and fj and fk are activation function of hidden and output layer neurons, expressed as (11)

A supervised learning approach, which contains three layers of input/hidden/output, was used to design the architecture of the MLPNN model. Data normalization was realized using the Tanh activation function (varies from -1 to 1) with the D-B-D learning algorithm. This technique was considered fairly because of its quickness and robustness compared to the traditional gradient descent. Regarding the hidden layer, the optimal size of neurons was decided through 2n + 1 concept provided by [35,36]; here, n represents the number of inputs. The training of the MLPNN model was terminated after reaching 1000 epochs with a 0.001 threshold value. The designed MLPNN model was applied at different locations for MD prediction.

2.5 Multiple linear regression (MLR) model

Among several well-established regression models within the field of hydrology and climate MLR model is implemented widely [22]. The MLR model was selected as a second model to validate the capacity of the CANFIS model to predict the multi-scalar SPI. The MLR model module the collinearity among one target (dependent) variable and several (two or more) independent variables [37,38]. The regression equation of the MLR model can be written as: (12) where, SPI = the target variable at multi-time scales, SPIt−1, SPIt−2 to SPItn are input parameters, w0 is the intercept of the MLR equation, and w1 to wk are the weights of the MLR equation.

2.6 Optimal input nomination and model development

Nominating the appropriate input-output variables for modeling nonlinear hydrological processes is a tedious task. In this research, long-term monthly rainfall data were utilized to compute multi-time scale SPI (i.e., 1, 3, 6, 9, 12 and 24-month). The ACF and PACF analysis were performed for picking up the optimal inputs (significant lags) for target output [3941]. The ACF and PACF are calculated using the Eqs 13 and 14: (13) (14) where, N is the multi-scalar SPI observation in entire series, Yt and are the mean whole series, and k is the lag through series. Afterward, these PACF values were tested at 5% significance level (SL) by constructing the upper and lower critical limits (UCL and LCL) by Eq (15): (15)

Figs 4a–4f to 9a–9f demonstrate the PACF results of multi-scalar SPI at Almora, Bageshwar, Champawat, Nainital, Pithoragarh, and Pantnagar stations, respectively. The dotted red line in these figures indicates the UCL and LCL at 5% SL if PACF value crosses these limits counted statistically significant, and utilized for CANFIS, MLPNN, and MLR models development. Table 2 provides the details of developed models with inputs and outputs, while Table 3 summarizes the details of training (70%) and testing (30%) datasets percentages of multi-scalar SPI utilized by CANFIS, MLPNN and MLR models for MD prediction at six different study stations.

thumbnail
Fig 4. The statitstical calculation of the partial autocorrelation function PACF for (a) SPI-1, (b) SPI-3, (c) SPI-6, (d) SPI-9, (e) SPI-12, and (f) SPI-24 at Alomra station.

https://doi.org/10.1371/journal.pone.0233280.g004

thumbnail
Fig 5. The statitstical calculation of the partial autocorrelation function (a) SPI-1, (b) SPI-3, (c) SPI-6, (d) SPI-9, (e) SPI-12, and (f) SPI-24 at Bageshwar station.

https://doi.org/10.1371/journal.pone.0233280.g005

thumbnail
Fig 6. The statitstical calculation of the partial autocorrelation function PACF for (a) SPI-1, (b) SPI-3, (c) SPI-6, (d) SPI-9, (e) SPI-12, and (f) SPI-24 at Champawat station.

https://doi.org/10.1371/journal.pone.0233280.g006

thumbnail
Fig 7. The statitstical calculation of the partial autocorrelation function PACF for (a) SPI-1, (b) SPI-3, (c) SPI-6, (d) SPI-9, (e) SPI-12, and (f) SPI-24 at Nainital station.

https://doi.org/10.1371/journal.pone.0233280.g007

thumbnail
Fig 8. The statitstical calculation of the partial autocorrelation function PACF for (a) SPI-1, (b) SPI-3, (c) SPI-6, (d) SPI-9, (e) SPI-12, and (f) SPI-24 at Pithoragarh station.

https://doi.org/10.1371/journal.pone.0233280.g008

thumbnail
Fig 9. The statitstical calculation of the partial autocorrelation function PACF for (a) SPI-1, (b) SPI-3, (c) SPI-6, (d) SPI-9, (e) SPI-12, and (f) SPI-24 at Pantnagar station.

https://doi.org/10.1371/journal.pone.0233280.g009

thumbnail
Table 2. Output-input relationship of SPI for prediction using CANFIS, MLPNN and MLR models at study stations.

https://doi.org/10.1371/journal.pone.0233280.t002

thumbnail
Table 3. Percentage of training and testing datasets for CANFIS, MLPNN and MLR models at study stations.

https://doi.org/10.1371/journal.pone.0233280.t003

2.7 Performance evaluation metrics

The predictive performance of proposed and other models (i.e., CANFIS, MLPNN, and MLR) were examined by using several performance evaluation metrices; the RMSE (root mean square error), NSE (Nash-Sutcliffe efficiency), COC (coefficient of correlation), and WI (Willmott index) [42], and by pictorial inspection through scatter plot and Taylor diagram [43]. Their mathematical expression can be written as:

  1. Root mean square error [44,45]: (16)
  2. Nash-Sutcliffe efficiency [46]: (17)
  3. Coefficient of correlation [47,48]: (18)
  4. Willmott index [49,50]: (19) where, SPIpre and SPIcal are the predicted and calculated multi time scale SPI values for the ith dataset, and are the average of predicted and calculated multi time scale SPI values, represent the absolute difference between predicted and calculated mean values, represent the absolute difference between calculated and their mean values, and N is the total number of observations in a dataset.

3. Results of application and discussion

The SPI was computed at multi time scales (1, 3, 6, 9, 12, and 24-months) for meteorological drought (MD) prediction in the Kumaon region by the application of relatively new AI model called CANFIS. Two predictive models (i.e., MLPNN and MLR) were established for validation. Six meteorological stations, including Almora, Bageshwar, Champawat, Nainital, Pithoragarh, and Pantnagar, were used for modeling. Optimal inputs (lags) were nominated through PACF at 5% SL for all SPI scales. Then models were evaluated statistically and graphically. The model having minimal absolute error measures (RMSE) and highest (NSE, COC, and WI) best-goodness-of-fit over the testing phase recognized healthier model for MD prediction over the study area. The MD prediction results of applied AI models are discussed in the following sub-sequent section.

The MD condition was predicted by finding the suitability of CANFIS, MLPNN and MLR models for all SPI scales at six study stations. All the formulated models were trained with 70% dataset, whereas the remaining 30% dataset was used for testing. Tables 4, 5 and 6 summarize the RMSE, NSE, COC and WI values of CANFIS, MLPNN, and MLR models over the testing phase. It was noticed from these tables that the CANFIS model produces best prediction for Almora station over SPI-1, SPI-3, SPI-6, SPI-9, and SPI-12 (RMSE = 0.952, 0.486, 0.267, 0.292, and 0.158; NSE = 0.136, 0.793, 0.942, 0.921, and 0.973; COC = 0.533, 0.932, 0.987, 0.967, and 0.989; WI = 0.373, 0.924, 0.982, 0.977, and 0.993), for Bageshwar station over all SPI scales (RMSE = 1.116, 0.562, 0.402, 0.298, 0.297, and 0.339; NSE = 0.074, 0.755, 0.886, 0.937, 0.941, and 0.912; COC = 0.383, 0.907, 0.984, 0.982, 0.989, and 0.981; WI = 0.234, 0.906, 0.960, 0.981, 0.982, and 0.971), for Champawat station over SPI-1, SPI-3, SPI-6, SPI-9, and SPI-12 (RMSE = 0.820, 0.472, 0.358, 0.302, and 0.369; NSE = 0.205, 0.809, 0.908, 0.923, and 0.879; COC = 0.539, 0.927, 0.984, 0.982, and 0.981; WI = 0.432, 0.932, 0.970, 0.977, and 0.962), for Nainital station over SPI-1, SPI-3, SPI-6, SPI-9, and SPI-24 (RMSE = 0.949, 0.524, 0.332, 0.266, and 0.328; NSE = 0.180, 0.782, 0.918, 0.946, and 0.892; COC = 0.644, 0.951, 0.988, 0.985, and 0.960; WI = 0.363, 0.915, 0.973, 0.984, and 0.967), for Pithoragarh station over SPI-6, and SPI-9 (RMSE = 0.380, and 0.392; NSE = 0.921, and 0.925; COC = 0.990, and 0.990; WI = 0.974, and 0.976), and for Pantnagar station over all SPI scales (RMSE = 0.744, 0.447, 0.272, 0.189, 0.077, and 0.061; NSE = 0.303, 0.809, 0.931, 0.954, 0.991, and 0.992; COC = 0.737, 0.950, 0.972, 0.977, 0.996, and 0.997; WI = 0.539, 0.928, 0.980, 0.988, 0.998, and 0.998). Similarly, MLPNN model best MD prediction for SPI-24 (RMSE = 0.197; NSE = 0.944; COC = 0.975; and WI = 0.984) at Almora, SPI-12 (RMSE = 0.193; NSE = 0.972; COC = 0.989; and WI = 0.992) at Nainital, and SPI-1 and SPI-3 (RMSE = 0.909, 0.518; NSE = 0.357, 0.820; COC = 0.781, 0.954; and WI = 0.585, 0.933) Pithoragarh station. The MLR model achieved poor results for all SPI scales at all stations, expect Champawat (SPI-24), and Pithoragarh (SPI-12, and SPI-24).

thumbnail
Table 4. RMSE, NSE, COC and WI values for multi-scalar SPI by CANFIS model during testing period at study stations.

https://doi.org/10.1371/journal.pone.0233280.t004

thumbnail
Table 5. RMSE, NSE, COC and WI values for multi-scalar SPI by MLPNN model during testing period at study stations.

https://doi.org/10.1371/journal.pone.0233280.t005

thumbnail
Table 6. RMSE, NSE, COC and WI values for multi-scalar SPI by MLR model during testing period at study stations.

https://doi.org/10.1371/journal.pone.0233280.t006

Figs 10a–10f to 15a–15f illustrate the temporal variation “scatter plot” among predicted vs calculated multi-time scale SPI observation generated by applied models (i.e., CANFIS, MLPNN and MLR) during the testing phase at six study stations. As seen from these figures the estimates of CANFIS model are adjacent to the 1:1 (best fit) line for SPI-1, SPI-3, SPI-6, SPI-9, and SPI-12 at Almora and Champawat stations, for SPI all scales at Bageshwar and Pantnagar stations, for SPI-1, SPI-3, SPI-6, SPI-9, and SPI-24 at Nainital station, and for SPI-6 and SPI-9 at Pithoragarh station. Additionally, these figures also show a similar pattern of results as mentioned Tables 4 to 6.

thumbnail
Fig 10. Scatter plots of predicted and calculated (a) SPI-1, (b) SPI-3, (c) SPI-6, (d) SPI-9, (e) SPI-12, and (f) SPI-24 values by CANFIS, MLPNN and MLR models in testing period at Almora station.

https://doi.org/10.1371/journal.pone.0233280.g010

thumbnail
Fig 11. Scatter plots of predicted and calculated (a) SPI-1, (b) SPI-3, (c) SPI-6, (d) SPI-9, (e) SPI-12, and (f) SPI-24 values by CANFIS, MLPNN and MLR models in testing period at Bageshwar station.

https://doi.org/10.1371/journal.pone.0233280.g011

thumbnail
Fig 12. Scatter plots of predicted and calculated (a) SPI-1, (b) SPI-3, (c) SPI-6, (d) SPI-9, (e) SPI-12, and (f) SPI-24 values by CANFIS, MLPNN and MLR models in testing period at Champawat station.

https://doi.org/10.1371/journal.pone.0233280.g012

thumbnail
Fig 13. Scatter plots of predicted and calculated (a) SPI-1, (b) SPI-3, (c) SPI-6, (d) SPI-9, (e) SPI-12, and (f) SPI-24 values by CANFIS, MLPNN and MLR models in testing period at Nainital station.

https://doi.org/10.1371/journal.pone.0233280.g013

thumbnail
Fig 14. Scatter plots of predicted and calculated (a) SPI-1, (b) SPI-3, (c) SPI-6, (d) SPI-9, (e) SPI-12, and (f) SPI-24 values by CANFIS, MLPNN and MLR models in testing period at Pithoragarh station.

https://doi.org/10.1371/journal.pone.0233280.g014

thumbnail
Fig 15. Scatter plots of predicted and calculated (a) SPI-1, (b) SPI-3, (c) SPI-6, (d) SPI-9, (e) SPI-12, and (f) SPI-24 values by CANFIS, MLPNN and MLR models in testing period at Pantnagar station.

https://doi.org/10.1371/journal.pone.0233280.g015

Taylor diagram [43] concept was utilized to map the spatial pattern of calculated (reference field) vs predicted (test field) multi-time scale SPI value by applied models (i.e., CANFIS, MLPNN, and MLR) through the testing phase over the study region. Taylor diagram is a 2-dimensional graphical presentation incorporated the RMSE, correlation coefficient, and standard deviation metrics together in one frame as the polar plot demonstrated in Figs 16a–16f to 21a–21f. It was recorded from these figures that the CANFIS, MLPNN and MLR models have a similar outline of results as observed in Tables 4 to 6 and Figs 10a–10f to 15a–15f. Therefore, it is suggested that the applied models with optimal lags can predict multi-time scale SPI effectively at six study stations.

thumbnail
Fig 16. Taylor diagram of predicted and calculated (a) SPI-1, (b) SPI-3, (c) SPI-6, (d) SPI-9, (e) SPI-12, and (f) SPI-24 values by CANFIS, MLPNN and MLR models in testing period at Almora station.

https://doi.org/10.1371/journal.pone.0233280.g016

thumbnail
Fig 17. Taylor diagram of predicted and calculated (a) SPI-1, (b) SPI-3, (c) SPI-6, (d) SPI-9, (e) SPI-12, and (f) SPI-24 values by CANFIS, MLPNN and MLR models in testing period at Bageshwar station.

https://doi.org/10.1371/journal.pone.0233280.g017

thumbnail
Fig 18. Taylor diagram of predicted and calculated (a) SPI-1, (b) SPI-3, (c) SPI-6, (d) SPI-9, (e) SPI-12, and (f) SPI-24 values by CANFIS, MLPNN and MLR models in testing period at Champawat station.

https://doi.org/10.1371/journal.pone.0233280.g018

thumbnail
Fig 19. Taylor diagram of predicted and calculated (a) SPI-1, (b) SPI-3, (c) SPI-6, (d) SPI-9, (e) SPI-12, and (f) SPI-24 values by CANFIS, MLPNN and MLR models in testing period at Nainital station.

https://doi.org/10.1371/journal.pone.0233280.g019

thumbnail
Fig 20. Taylor diagram of predicted and calculated (a) SPI-1, (b) SPI-3, (c) SPI-6, (d) SPI-9, (e) SPI-12, and (f) SPI-24 values by CANFIS, MLPNN and MLR models in testing period at Pithoragarh station.

https://doi.org/10.1371/journal.pone.0233280.g020

thumbnail
Fig 21. Taylor diagram of predicted and calculated (a) SPI-1, (b) SPI-3, (c) SPI-6, (d) SPI-9, (e) SPI-12, and (f) SPI-24 values by CANFIS, MLPNN and MLR models in testing period at Pantnagar station.

https://doi.org/10.1371/journal.pone.0233280.g021

The viability of relatively new artificial intelligence model called CANFIS model was assessed for predicting the MD at six stations; Almora, Bageshwar, Champawat, Nainital, Pithoragarh, and Pantnagar, based, based on multi-scalar standardized precipitation index (SPI). The input variables were selected, based on statistical analysis (i.e., ACF and PACF) of the most correlated lags to predict multiple SPI scale values. Based on the prediction accuracy of the proposed CANFIS model, the proposed model distinguished itself over the competing MLPNN and the MLR models. The MD prediction by the CANFIS model over the study stations displays the latent of the model (Table 7). It mimicked the actual trend of the SPI in this particular region and demonstrated an intelligent system that can be valuable for water resources managers and policymakers for drought mitigation.

thumbnail
Table 7. Comparison of CANFIS, MLPNN, and MLR results at study stations.

https://doi.org/10.1371/journal.pone.0233280.t007

The results of proposed model were compared and validated against the nature-inspired algorithm and stochastic (time-series) model built by numerous drought indices (DIs). For instance, there are studies conducted on the SPI prediction using various versions of AI models [40,5155]. Memarian et al. [56] applied the CANFIS model to predict the meteorological drought in Birjand, Iran using global climatic indicators and lagged values of SPI. They found a better predictive capability of the CANFIS model in the study region. Fung et al. [57] forecasted meteorological drought in Langat River basin, Malaysia using hybrid wavelet integrated with boosting-SVR (W-B-SVR), multi-input-fuzzy-SVR (W-MI-F-SVR), and weighted-fuzzy-SVR (W-WF-SVR) models based on 1, 3, and 6-month SPEI. Results reveal the superior multi-scales SPEI was forecasted by the W-WF-SVR model. Kisi et al. [58] examined the potential of hybrid ANFIS-PSO (particle swarm optimization), ANFIS-GA (genetic algorithm), ANFIS-ACO (ant colony optimizer), ANFIS-BOA (butterfly optimization algorithm) against classical ANFIS to forecast the meteorological drought at three synoptic stations located in Iran, based on multi-scalar SPI. They fund the superior performance of hybrid ANFIS models for forecasting SPI3, SPI6, SPI9, and SPI12 at study stations.

The reported literature evidenced the capability of ML models in drought metrological drought prediction. The overall finding of this research suggested that AI models (i.e., CANFIS & MLPNN) achieved better meteorological drought forecasting at different time scales at the considered stations. As future research devotion, sensitivity analysis can be conducted for the data, input variables and models to investigate the potential source influencing the modeling performance results.

4. Conclusion

This research implements a relatively new AI model (i.e., CANFIS) to predict meteorological drought using multiple SPI scales at Almora, Bageshwar, Champawat, Nainital, Pithoragarh and Pantnagar stations positioned in the Kumaon region of Uttarakhand State, India. The results yielded by the CANFIS model were compared against the MLPNN and MLR models for each study station through performance evaluation indicators (RMSE, NSE, COC, and WI), and visual explanation (i.e., scatter plot and Taylor diagram). According to the results of comparison, the best model were obtained with Gaussian MFs, TSK fuzzy model, Tanh activation function, D-B-D learning algorithm at Almora and Champawat stations (SPI-1, SPI-3, SPI-6, SPI-9, and SPI-12), at Bageshwar and Pantnagar stations (for all SPI scales), at Nainital station (SPI-1, SPI-3, SPI-6, SPI-9, and SPI-24), and at Pithoragarh station (SPI-6, and SPI-9). Consequently, the MLPNN model achieves the best prediction for SPI-24 (6-13-1) at Almora station, for SPI-12 (6-13-1) at Nainital station, and SPI-1 (6-8-1) and SPI-3 (9-18-1) at Pithoragarh station. The MLR model attains worst prediction at all stations and SPI scales, expect SPI-24 at Champawat station, and SPI-12 and 24 at Pithoragarh station for prediction of meteorological drought. Therefore, this study demonstrates the worth utility machine learning models; CANFIS and MLPNN for the magnificent prediction of current SPI based on antecedent phases. Furthermore, the MD prediction through multi-time scale SPI observations by machine learning models will hydrologists, agriculturists, water managers, and policymakers to project drought mitigation strategy for sustainable planning and management of water resources in the study region.

Acknowledgments

This research was supported by a grant (20200027-001) from a Strategic Research Project (Development of Hydrological Safety Assessment System for Hydraulic Structures) funded by the Korea Institute of Civil Engineering and Building Technology.

References

  1. 1. Mishra AK, Singh VP. Analysis of drought severity-area-frequency curves using a general circulation model and scenario uncertainty. Journal of Geophysical Research Atmospheres. 2009;
  2. 2. Obasi GOP. WMO’s Role in the International Decade for Natural Disaster Reduction. Bulletin of the American Meteorological Society. 2002;
  3. 3. Mishra AK, Singh VP. A review of drought concepts. Journal of Hydrology. 2010. pp. 202–216.
  4. 4. Mishra AK, Singh VP. Drought modeling—A review. Journal of Hydrology. 2011. pp. 157–175.
  5. 5. Rezaeian-Zadeh M, Tabari H. MLP-based drought forecasting in different climatic regions. Theoretical and Applied Climatology. 2012;109: 407–414.
  6. 6. Shirmohammadi B, Moradi H, Moosavi V, Semiromi MT, Zeinali A. Forecasting of meteorological drought using Wavelet-ANFIS hybrid model for different time steps (case study: Southeastern part of east Azerbaijan province, Iran). Natural Hazards. 2013;69: 389–402.
  7. 7. Belayneh a., Adamowski J, Khalil B, Ozga-Zielinski B. Long-term SPI drought forecasting in the Awash River Basin in Ethiopia using wavelet neural networks and wavelet support vector regression models. Journal of Hydrology. 2014;508: 418–429.
  8. 8. Danandeh Mehr A, Kahya E, Ozger M. A gene-wavelet model for long lead time drought forecasting. Journal of Hydrology. 2014;517: 691–699.
  9. 9. Deo RC, Şahin M. Application of the extreme learning machine algorithm for the prediction of monthly Effective Drought Index in eastern Australia. Atmospheric Research. 2015;153: 512–525.
  10. 10. Deo RC, Şahin M. Application of the Artificial Neural Network model for prediction of monthly Standardized Precipitation and Evapotranspiration Index using hydrometeorological parameters and climate indices in eastern Australia. Atmospheric Research. 2015;161–162: 65–81.
  11. 11. Nguyen LB, Li QF, Ngoc TA, Hiramatsu K. Adaptive Neuro-Fuzzy Inference System for drought forecasting in the Cai River basin in Vietnam. Journal of the Faculty of Agriculture, Kyushu University. 2015;
  12. 12. Rafiei-Sardooi E, Mohseni-Saravi M, Barkhori S, Azareh A, Choubin B, Jafari-Shalamzar M. Drought modeling: a comparative study between time series and neuro-fuzzy approaches. Arabian Journal of Geosciences. 2018;
  13. 13. Chen SM, Chen JH. Fuzzy risk analysis based on ranking generalized fuzzy numbers with different heights and different spreads. Expert Systems with Applications. 2009;
  14. 14. Khalil M, Abdou MA, Mansour MS, Farag HA, Ossman ME. A cascaded fuzzy-LOPA risk assessment model applied in natural gas industry. Journal of Loss Prevention in the Process Industries. 2012.
  15. 15. Yan F, Xu K. A set pair analysis based layer of protection analysis and its application in quantitative risk assessment. Journal of Loss Prevention in the Process Industries. 2018.
  16. 16. Yan F, Jin C, Li Z, Cao R, Xu K. Research and development of field theory-based three-dimensional risk assessment. Part I: Optimization of risk reduction. Safety Science. 2019;
  17. 17. Mokhtarzad M, Eskandari F, Jamshidi Vanjani N, Arabasadi A. Drought forecasting by ANN, ANFIS, and SVM and comparison of the models. Environmental Earth Sciences. 2017.
  18. 18. Nguyen V, Li Q, Nguyen L. Drought forecasting using ANFIS- a case study in drought prone area of Vietnam. Paddy and Water Environment. 2017;15: 605–616.
  19. 19. Zhang Y, Li W, Chen Q, Pu X, Xiang L. Multi-models for SPI drought forecasting in the north of Haihe River Basin, China. Stochastic Environmental Research and Risk Assessment. 2017.
  20. 20. Ali M, Deo RC, Downs NJ, Maraseni T. An ensemble-ANFIS based uncertainty assessment model for forecasting multi-scalar standardized precipitation index. Atmospheric Research. 2018;207: 155–180.
  21. 21. Liu ZN, Li QF, Nguyen LB, Xu GH. Comparing machine-learning models for drought forecasting in vietnam’s cai river basin. Polish Journal of Environmental Studies. 2018;
  22. 22. Mouatadid S, Raj N, Deo RC, Adamowski JF. Input selection and data-driven model performance optimization to predict the Standardized Precipitation and Evaporation Index in a drought-prone region. Atmospheric Research. 2018;212: 130–149.
  23. 23. Soh YW, Koo CH, Huang YF, Fung KF. Application of artificial intelligence models for the prediction of standardized precipitation evapotranspiration index (SPEI) at Langat River Basin, Malaysia. Computers and Electronics in Agriculture. 2018;144: 164–173.
  24. 24. Mckee TB, Doesken NJ, Kleist J. The relationship of drought frequency and duration to time scales. AMS 8th Conference on Applied Climatology. 1993; 179–184. citeulike-article-id:10490403
  25. 25. Lloyd-Hughes B, Saunders MA. A drought climatology for Europe. International Journal of Climatology. 2002;22: 1571–1592.
  26. 26. Angelidis P, Maris F, Kotsovinos N, Hrissanthou V. Computation of Drought Index SPI with Alternative Distribution Functions. Water Resources Management. 2012;
  27. 27. Spinoni J, Naumann G, Carrao H, Barbosa P, Vogt J. World drought frequency, duration, and severity for 1951–2010. International Journal of Climatology. 2014;34: 2792–2804.
  28. 28. Yacoub E, Tayfur G. Evaluation and Assessment of Meteorological Drought by Different Methods in Trarza Region, Mauritania. Water Resources Management. 2017;
  29. 29. Jang J-SR, Sun C-T, Mizutani E. Neuro-Fuzzy and Soft Computing A Computational Approach to Learning adn Machine Intelligence. PTR Prentice Hall. 1997.
  30. 30. Aytek A. Co-active neurofuzzy inference system for evapotranspiration modeling. Soft Computing. 2009;13: 691–700.
  31. 31. Tabari H, Talaee PH, Abghari H. Utility of coactive neuro-fuzzy inference system for pan evaporation modeling in comparison with multilayer perceptron. Meteorology and Atmospheric Physics. 2012;116: 147–154.
  32. 32. Principe J, Lefebvre C, Lynn G, Fancourt C, Wooten D. NeuroSolutions–Documentation. NeuroDimension, Inc: Gainesville, FL. 2005;
  33. 33. Haykin S. Neural networks-a comprehensive foundation (2nd eds.) London. Prentice-Hall, Upper Saddle River, NJ. 1999; 26–32.
  34. 34. Dawson CW, Wilby RL. Hydrological modelling using artificial neural networks. Progress in Physical Geography: Earth and Environment. 2001;25: 80–108.
  35. 35. Mishra AK, Desai VR. Drought forecasting using feed-forward recursive neural network. Ecological Modelling. 2006.
  36. 36. Belayneh A, Adamowski J. Standard Precipitation Index Drought Forecasting Using Neural Networks, Wavelet Neural Networks, and Support Vector Regression. Applied Computational Intelligence and Soft Computing. 2012;2012: 1–13.
  37. 37. Malik A, Kumar A. Pan Evaporation Simulation Based on Daily Meteorological Data Using Soft Computing Techniques and Multiple Linear Regression. Water Resources Management. 2015;29.
  38. 38. Malik A, Kumar A, Kisi O. Monthly pan-evaporation estimation in Indian central Himalayas using different heuristic approaches and climate based models. Computers and Electronics in Agriculture. 2017;143: 302–313.
  39. 39. Landeras G, Ortiz-Barredo A, López JJ. Forecasting Weekly Evapotranspiration with ARIMA and Artificial Neural Network Models. Journal of Irrigation and Drainage Engineering. 2009;
  40. 40. Deo RC, Kisi O, Singh VP. Drought forecasting in eastern Australia using multivariate adaptive regression spline, least square support vector machine and M5Tree model. Atmospheric Research. 2017;
  41. 41. Malik A, Kumar A, Singh RP. Application of Heuristic Approaches for Prediction of Hydrological Drought Using Multi-scalar Streamflow Drought Index. Water Resources Management. 2019;
  42. 42. Beyaztas U, Yaseen ZM. Drought Interval Simulation using Functional Data Analysis. Journal of Hydrology. 2019; 124141.
  43. 43. Taylor KE. Summarizing multiple aspects of model performance in a single diagram. Journal of Geophysical Research: Atmospheres. 2001;106: 7183–7192.
  44. 44. Malik A, Kumar A, Ghorbani MA, Kashani MH, Kisi O, Kim S. The viability of co-active fuzzy inference system model for monthly reference evapotranspiration estimation: case study of Uttarakhand State. Hydrology Research. 2019;
  45. 45. Tikhamarine Y, Malik A, Kumar A, Souag-Gamane D, Kisi O. Estimation of monthly reference evapotranspiration using novel hybrid machine learning approaches. Hydrological Sciences Journal. 2019; 1–19.
  46. 46. Nash JE, Sutcliffe J V. River flow forecasting through conceptual models part I—A discussion of principles. Journal of Hydrology. 1970;10: 282–290.
  47. 47. Adnan RM, Malik A, Kumar A, Parmar KS, Kisi O. Pan evaporation modeling by three different neuro-fuzzy intelligent systems using climatic inputs. Arabian Journal of Geosciences. 2019;12: 606.
  48. 48. Malik A, Kumar A, Kisi O, Shiri J. Evaluating the performance of four different heuristic approaches with Gamma test for daily suspended sediment concentration modeling. Environmental Science and Pollution Research. 2019; pmid:31172434
  49. 49. Tao H, Diop L, Bodian A, Djaman K, Ndiaye PM, Yaseen ZM. Reference evapotranspiration prediction using hybridized fuzzy model with firefly algorithm: Regional case study in Burkina Faso. Agricultural Water Management. 2018;
  50. 50. Willmott CJ, Robeson SM, Matsuura K. A refined index of model performance. International Journal of Climatology. 2012;32: 2088–2094.
  51. 51. Djerbouai S, Souag-Gamane D. Drought Forecasting Using Neural Networks, Wavelet Neural Networks, and Stochastic Models: Case of the Algerois Basin in North Algeria. Water Resources Management. 2016;
  52. 52. Komasi M, Sharghi S, Safavi HR. Wavelet and cuckoo search-support vector machine conjugation for drought forecasting using Standardized Precipitation Index (case study: Urmia Lake, Iran). Journal of Hydroinformatics. 2018;
  53. 53. Abbasi A, Khalili K, Behmanesh J, Shirzad A. Drought monitoring and prediction using SPEI index and gene expression programming model in the west of Urmia Lake. Theoretical and Applied Climatology. 2019;
  54. 54. Wang F, Wang Z, Yang H, Zhao Y, Zhang Z, Li Z, et al. Copula-based drought analysis using Standardized Precipitation Evapotranspiration Index: A case study in the Yellow river basin, China. Water (Switzerland). 2019;
  55. 55. Li Z, Chen T, Wu Q, Xia G, Chi D. Application of penalized linear regression and ensemble methods for drought forecasting in Northeast China. Meteorology and Atmospheric Physics. 2019;
  56. 56. Memarian H, Pourreza Bilondi M, Rezaei M. Drought prediction using co-active neuro-fuzzy inference system, validation, and uncertainty analysis (case study: Birjand, Iran). Theoretical and Applied Climatology. 2016;125: 541–554.
  57. 57. Fung KF, Huang YF, Koo CH. Coupling fuzzy–SVR and boosting–SVR models with wavelet decomposition for meteorological drought prediction. Environmental Earth Sciences. 2019;78: 693.
  58. 58. Kisi O, Docheshmeh Gorgij A, Zounemat-Kermani M, Mahdavi-Meymand A, Kim S. Drought forecasting using novel heuristic methods in a semi-arid environment. Journal of Hydrology. 2019;578: 124053.