Improving Signiﬁcant Wave Height Prediction Using a Neuro-Fuzzy Approach and Marine Predators Algorithm

: This study investigates the ability of a new hybrid neuro-fuzzy model by combining the neuro-fuzzy (ANFIS) approach with the marine predators’ algorithm (MPA) in predicting short-term (from 1 h ahead to 1 day ahead) signiﬁcant wave heights. Data from two stations, Cairns and Palm Beach buoy, were used in assessing the considered methods. The ANFIS-MPA was compared with two other hybrid methods, ANFIS with genetic algorithm (ANFIS-GA) and ANFIS with particle swarm optimization (ANFIS-PSO), in predicting signiﬁcant wave height for multiple lead times ranging from 1 h to 1 day. The multivariate adaptive regression spline was investigated in deciding the best input for prediction models. The ANFIS-MPA model generally offered better accuracy than the other hybrid models in predicting signiﬁcant wave height in both stations. It improved the accuracy of ANFIS-PSO and ANFIS-GA by 8.3% and 11.2% in root mean square errors in predicting a 1 h lead time in the test period.


Introduction
Significant wind wave height (SWH) is important for predicting seaquakes, tsunamis, and tropical cyclones; wave period and wave length is also needed for ships, maritime structures, and other business [1]. Accurate short-term SWH measurements are essential for planning protective measures against tsunamis, hydraulic structures, and wave energy facilities [2,3]. Hourly estimation of SWH is essential for short-term management, such as power generation [4]. Prediction of ship movements, construction of maritime structures, dredging operations, and disaster warnings are all examples of marine engineering that benefit from accurate real-time predictions of SWH characteristics. However, due to natural waves' unpredictable and irregular nature, predicting wave power and building wave power plants is difficult [2]. The height of the waves is affected by environmental factors and climatic variations [5]. Wind causes waves and is the most important meteorological factor in determining wave height. The accuracy of weather forecasts is affected by the non-stationarity and non-linearity of wind and wave properties [6]. In early wave models, nonlinear interactions and energy dissipation were not adequately accounted for, resulting in unpredictable wind fields that made predictions difficult [4]. Many wave height prediction methods rely on semi-analytical approaches such as the Pierson-Neumann-James and increase statistical variance and uncertainty due to limited input data for wave parameter prediction. To address this problem, the results of several different models can be combined using a multi-model approach. In this study, the performance of a refined neurofuzzy method was evaluated in conjunction with the algorithm used by marine predators. Adaptive Neuro-Fuzzy Inference System (ANFIS), ANFIS with genetic algorithm (ANFIS-GA), and ANFIS with particle swarm optimization were compared with this novel hybrid machine learning approach (ANFIS-MPA) (ANFIS-PSO). Half-hour time series data and a prediction method that looks several steps into the future were used for the analysis. This study is innovative in using the ANFIS-MPA approach to make multi-step predictions for SWH. Section 2 discusses the use of soft computing models and details the study area and data collection. Section 3 provides the main results and examines their implications for extending the newly tested model to additional climatic conditions. Section 4 summarizes the main results of this paper.

Case Study
For this study, two buoy sites were selected for wave monitoring in Queensland, namely the Cairns buoy and the Palm Beach buoy ( Figure 1). The Cairns buoy is located in southeast Queensland near Cairns with a 12 m monitoring depth at coordinates 16 • 43.830 south latitude and 145 • 42.910 east longitude. In contrast, the Palm Beach buoy is located in far north Queensland near Sydney with 23 m monitoring depth at coordinates 28 • 05.956 south latitude and 153 • 29.073 east longitude. The Queensland Government Meteorological Department collects significant wave height data at 30 min intervals. For this study, significant wave height data were downloaded from the Queensland Government website (https://www.data.qld.gov.au/dataset, accessed on 7 March 2023) at 30 min intervals for both buoy locations from 1 January 2022 to 31 December 2022. A brief statistical summary of the data used is provided in Table 1. This study uses the average hourly data with a split ratio of 75% training and 25% test to apply models to predict single-level and multi-level significant wave height (SWH). To predict SWH, only historical data of the SWH variable of both stations were used as inputs; it is worth noting that both stations were not connected as one was on the northern side, whereas the other station was on the southeast side of Queensland, as mentioned above. Data from 1 January 2022 to 30 September 2022 was adopted as the training dataset, whereas data from 1 October 2022 to 31 December 2022 was adopted as the testing dataset.

Multivariate Adaptive Regression Splines (MARS)
The Multivariate Adaptive Regression Splines model (MARS) allows for revealing a hidden nonlinear pattern in a dataset with many variables [21][22][23]. In this way, obtaining the estimator defined using a single method is possible, and combining several statistical methods is unnecessary. The basis of this method is based on functions called basis functions, which can be expressed for each explanatory variable as follows (Equation (1)): where t is called a node, and in practice, one of its observations is an explanatory variable. These functions are spline functions found in the reciprocal pair node t. The general shape of the MARS model is described by Equation (2): In this equation, is the estimated value of the response variable (here, it is estimated significant wave height), X is the vector ck of the basis function, and Bk is the explanatory variables or the equal elements of the Parajets coefficient determined by minimizing the sum of the squares of the residuals. Each basis function can be a form of a linear spline function or the product of two or more of them, indicating mutual effects. In the model MARS, the space of explanatory variables is divided into several separate regions by certain knots, which gives the greatest reduction of the sum of squared errors [24][25][26].
MARS model fitting is performed in two stages. Many basis functions with different nodes are gradually added to the model in the advanced stage. This process results in a more complicated model that is more fitting. In the second stage, the regression removal

Multivariate Adaptive Regression Splines (MARS)
The Multivariate Adaptive Regression Splines model (MARS) allows for revealing a hidden nonlinear pattern in a dataset with many variables [21][22][23]. In this way, obtaining the estimator defined using a single method is possible, and combining several statistical methods is unnecessary. The basis of this method is based on functions called basis functions, which can be expressed for each explanatory variable as follows (Equation (1)): where t is called a node, and in practice, one of its observations is an explanatory variable. These functions are spline functions found in the reciprocal pair node t. The general shape of the MARS model is described by Equation (2): In this equation, Y is the estimated value of the response variable (here, it is estimated significant wave height), X is the vector ck of the basis function, and B k is the explanatory variables or the equal elements of the Parajets coefficient determined by minimizing the sum of the squares of the residuals. Each basis function can be a form of a linear spline function or the product of two or more of them, indicating mutual effects. In the model MARS, the space of explanatory variables is divided into several separate regions by certain knots, which gives the greatest reduction of the sum of squared errors [24][25][26].
MARS model fitting is performed in two stages. Many basis functions with different nodes are gradually added to the model in the advanced stage. This process results in a more complicated model that is more fitting. In the second stage, the regression removal stage, the basis functions that are less important and influential for the estimation are removed. Finally, the most accurate model is selected based on the minimum Generalized Cross Validation (GCV) criterion. GCVk is assumed as the value of GCV for k of the model in the elimination phase. This quantity is defined by Equation (3): wheref k is the estimated model in the k th step of the elimination step; C(k) is the number of model sentences in the step k th + λm, where m represents the number of spline function nodes in the model; and λ is called the smoothing parameter and is usually chosen between 2 and 4 in practice [27,28].

Adaptive Neuro-Fuzzy Inference System (ANFIS)
Networks based on an adaptive neuro-fuzzy inference system [29][30][31] provide a practical approach to approximate functions. Fuzzy set theory is tailored to automated systems that efficiently manage complex operations [32,33]. Individual membership values should be used to map the elements of a fuzzy set to function theory. The components of the fuzzy theory can have real values anywhere in the range [0, 1]. Fuzzy inference systems (FIS) and ANFIS models rely heavily on partitioning the available data into training and test sets. The general structure of ANFIS is shown in Figure 2. In this present study, previous significant wave heights (SWH) were used as inputs, while the output f is the SWH value at t + 1 (one hour ahead) to t + 24 (one day ahead). In Figure 2, the p, q, and r refer to the consequent parameters, while the parameters of the membership functions (e.g., A and B, as seen from the figure) are called premise parameters. All these parameters were optimized using GA, PSO, and MPA algorithms, which are briefly explained in the following sections. In Figure 2, W refers to the membership degree corresponding to each value set of x and y inputs. In this study, x and y can be assumed as SWH t and SWH t−1 . stage, the basis functions that are less important and influential for the estimation are removed. Finally, the most accurate model is selected based on the minimum Generalized Cross Validation (GCV) criterion. GCVk is assumed as the value of GCV for k of the model in the elimination phase. This quantity is defined by Equation (3): where is the estimated model in the step of the elimination step; C(k) is the number of model sentences in the step + λm, where m represents the number of spline function nodes in the model; and λ is called the smoothing parameter and is usually chosen between 2 and 4 in practice [27,28].

Adaptive Neuro-Fuzzy Inference System (ANFIS)
Networks based on an adaptive neuro-fuzzy inference system [29][30][31] provide a practical approach to approximate functions. Fuzzy set theory is tailored to automated systems that efficiently manage complex operations [32,33]. Individual membership values should be used to map the elements of a fuzzy set to function theory. The components of the fuzzy theory can have real values anywhere in the range [0, 1]. Fuzzy inference systems (FIS) and ANFIS models rely heavily on partitioning the available data into training and test sets. The general structure of ANFIS is shown in Figure 2. In this present study, previous significant wave heights (SWH) were used as inputs, while the output f is the SWH value at t + 1 (one hour ahead) to t + 24 (one day ahead). In Figure 2, the p, q, and r refer to the consequent parameters, while the parameters of the membership functions (e.g., A and B, as seen from the figure) are called premise parameters. All these parameters were optimized using GA, PSO, and MPA algorithms, which are briefly explained in the following sections. In Figure 2, W refers to the membership degree corresponding to each value set of x and y inputs. In this study, x and y can be assumed as SWHt and SWHt−1.

Genetic Algorithm (GA)
Genetic algorithms (GAs) are machine learning and optimization methods like neural networks. In genetic programming, the existing blocks are first defined, which include the input and target variables and their connection function. Then, the appropriate structure of the model and its coefficients are determined [34][35][36]. This method includes a correlation equation between the input and output variables. Therefore, it can automatically select the appropriate variables of the model and delete unrelated variables that cause input dimension reduction [37][38][39]. The selection of appropriate model inputs is one of the most important issues that should be considered in this method. This issue is even more important when secondary input data are also used. This is because the presentation of unrelated input data leads to a decrease in the model's accuracy and to the creation of more complicated models, the interpretation of which is associated with more difficulties. In engineering applications, genetic programming is often used for modeling problems related to determining the structure of phenomena. The step-by-step process of genetic programming is as follows: 1.
An initial set of composite functions indicating predictive models is considered randomly.

2.
Each person in the above population is evaluated with appropriate functions.

3.
For each production, the following steps are followed to select a new population: (a) One of the transitions, mutation, and copy operators is selected.
An appropriate number of individuals from the current community are selected.
The selection operator is used to generate the descendants.
The named descendant enters a new society.
The considered model is evaluated using various adjustments.

4.
The third step is repeated until the maximum production is reached.
In this method, no dependent relation is considered at the beginning of the process. Therefore, it can optimize the structure of the model and its components [40,41].

Particle Swarm Optimization (PSO)
The PSO algorithm was developed by Kennedy and Eberhart [42] and is based on the activity of swarms of bees searching for flowers to pollinate. A swarm consists of particles, and each particle represents an answer to an optimization problem. This method starts with a collection of solutions initially in random locations and moving at random speeds. Particle swarm optimization (PSO) uses particles moving in hyperspace to find optimal solutions. Over time, these solutions gain knowledge through their own experiences and those of their peers. Each atom remembers its highest fitness level in the cloud, which it calls its pbest. The gbest or global best is the best value ever achieved by a particle in the population. Each particle in the system rapidly moves to its pbest and gbest positions when the PSO algorithm is run. For this purpose, the new velocity of each particle is determined by its distance from the pbest and gbest positions. Then, a new velocity value for that particle is calculated by randomly weighing the two best velocities, pbest and gbest, to determine its next place in the following iteration [43][44][45].
The relative simplicity of the process with only two equations is one of the main advantages of PSO over many other optimization techniques (i.e., the equation of motion and the equation for updating the velocity). The actual motion of the particles is determined via their unique velocity vector, which is used in the first equation. Conversely, the second determines how the velocity vector must shift to account for the presence of opposing forces [46,47]. The convergence rate of the PSO algorithm is further improved by introducing the inertia weight (w) (Figure 3).

Marine Predators Algorithm (MPA)
The MPA is a meta-heuristic (MH) algorithm based on the hunting techniques of marine predators. Like the other algorithms of MH, the MPA first explores the search space and obtains random intervals of initial solutions using a prey search technique. The primary structure of the algorithm is that the current position determines the future position of the solution [48,49]. Marine predators use Lévy and Brownian search techniques when searching for food, switching between the two depending on the availability of potential meals. In locations with lower prey concentration, predators use the Lévy movement. Instead, Brownian motion is used [50,51] when numerous potential victims are available. This study randomly chose initial solutions, updating the positions using Equation (4) as follows: where and are the upper and lower bounds of the design variables, respectively, and is also a random vector in the range [1 0]. In the MPA, there are two main matrices, the best-fit predator matrix (Best/Elite) and the prey matrix (Prey), given in Equations (5) and (6).
where is a vector of the most appropriate hunters repeated -times to organize the elite matrix. and refer to the number and dimensions of search factors. Elite is updated after each iteration of the prey by replacing the better hunters. The matrix is the basis on which the hunters update their position, and its dimensions are the same as those of the elite matrix. The prey matrix is expressed as follows: In y-ij., j-th. is next for i-th. by prey. MPA: During the search, it repeatedly uses random variables and operators to prevent the algorithm from getting stuck in local minimal

Marine Predators Algorithm (MPA)
The MPA is a meta-heuristic (MH) algorithm based on the hunting techniques of marine predators. Like the other algorithms of MH, the MPA first explores the search space and obtains random intervals of initial solutions using a prey search technique. The primary structure of the algorithm is that the current position determines the future position of the solution [48,49]. Marine predators use Lévy and Brownian search techniques when searching for food, switching between the two depending on the availability of potential meals. In locations with lower prey concentration, predators use the Lévy movement. Instead, Brownian motion is used [50,51] when numerous potential victims are available. This study randomly chose initial solutions, updating the positions using Equation (4) as follows: y 0 = y min + rand * (y max − y min ) where y max and y min are the upper and lower bounds of the design variables, respectively, and rand is also a random vector in the range [1 0]. In the MPA, there are two main matrices, the best-fit predator matrix (Best/Elite) and the prey matrix (Prey), given in Equations (5) and (6).
In y-ij., j-th. is next for i-th. by prey. MPA: During the search, it repeatedly uses random variables and operators to prevent the algorithm from getting stuck in local minimal points [52]. To better understand the MPA method, the structure of the proposed algorithm is shown in the flowchart in Figure 4. points [52]. To better understand the MPA method, the structure of the proposed algorithm is shown in the flowchart in Figure 4.

Development of Hybrid ANFIS Methods
To improve the accuracy of ANFIS method in predicting SWH for multiple horizons, three metaheuristic algorithms, PSO, GA, and MPA, were used to optimize the consequent (p, q, and r) and premise parameters (membership functions). In this present study, inputs are SWHt, SWHt−1, and SWHt−2, referring to the SWH at time t, t − 1, and t − 2, where t is hour and output is SWH at time t + 1 (one hour ahead) to t + 24 (one day ahead). Each input has Gaussian membership functions, and each membership function has two parameters: mean and standard deviation. Three metaheuristic algorithms, i.e., PSO, GA, and MPA, were used for optimizing linear (consequent) and nonlinear (premise) parameters of ANFIS. The procedure for developing hybrid ANFIS models, including ANFIS-PSO, ANFIS-GA, and ANFIS-MPA, is depicted in Figure 5.

Development of Hybrid ANFIS Methods
To improve the accuracy of ANFIS method in predicting SWH for multiple horizons, three metaheuristic algorithms, PSO, GA, and MPA, were used to optimize the consequent (p, q, and r) and premise parameters (membership functions). In this present study, inputs are SWH t , SWH t−1 , and SWH t−2 , referring to the SWH at time t, t − 1, and t − 2, where t is hour and output is SWH at time t + 1 (one hour ahead) to t + 24 (one day ahead). Each input has Gaussian membership functions, and each membership function has two parameters: mean and standard deviation. Three metaheuristic algorithms, i.e., PSO, GA, and MPA, were used for optimizing linear (consequent) and nonlinear (premise) parameters of ANFIS. The procedure for developing hybrid ANFIS models, including ANFIS-PSO, ANFIS-GA, and ANFIS-MPA, is depicted in Figure 5.

Accuracy Assessment
To achieve better prediction of SWH using historical SWH values as input, the primary goal of this work was to use a novel hybrid neuro-fuzzy approach: ANFIS-MPA. The root means square error, mean absolute error, and coefficient of determination (R2) were compared with two other hybrid ANFIS techniques, ANFIS-GA and ANFIS-PSO, to evaluate the results. Data from two sites were used for the applications to ensure that the techniques worked as intended. There are many ways to display the RMSE, MAE, and R2 statistics, which are demonstrated as follows [53][54][55][56]:

Accuracy Assessment
To achieve better prediction of SWH using historical SWH values as input, the primary goal of this work was to use a novel hybrid neuro-fuzzy approach: ANFIS-MPA. The root means square error, mean absolute error, and coefficient of determination (R2) were

Development of Hybrid ANFIS-PSO, ANFIS-GA, and ANFIS-MPA Models
In the final model developed, MARS was used to determine the best input combination, i.e., the best scenario for predicting SWH is evaluated. Each scenario considers different lagged SWH values. Then, all input combinations are analyzed for three hybrid ANFIS models, including ANFIS-PSO, ANFIS-GA, and ANFIS-MPA. Three statistical indices, including RMSE, MAE, and R2, are used for comparison suggestions.

Results and Discussion
This section compares the results of the MPA-based neuro-fuzzy approach in predicting significant wave heights for multiple horizons from t + 1 (one hour ahead) to t + 24 (one day ahead) with other hybrid neuro-fuzzy methods.

Results
In this study, we first apply the MARS method to determine the best input combination. The goal was to investigate whether this method can be applied to determine the best scenario for predicting SWH. This was then evaluated via hybrid ANFIS methods for all input combinations. The training and test results of the method MARS are shown in Table 2 for the first station. As seen from the input combinations, three lagged inputs were used because inputs beyond this lag did not improve the prediction accuracy, and our goal was to predict SWH for multiple horizons from t + 1 to t + 24. Table 3 shows that adding earlier lags slightly improves the accuracy of MARS. Therefore, three delayed inputs were selected as the best input combination. Then, this combination was used to predict SWH for other periods. As expected, the model's accuracy deteriorates as the prediction horizon increases. The RMSE and MAE decreased from 0.0325 and 0.0232 to 0.1410 and 0.1076, and R2 increased from 0.9748 to 0.5201 over the test period.  Tables 3-5 summarize the training and testing results of the hybrid models ANFIS-PSO, ANFIS-GA, and ANFIS-MPA in predicting the SWH of the first station. The accuracy of the implemented methods is consistent, and all three methods provide the best performance for the third input combination. The model ANFIS-MPA showed the lowest RMSE (0.0277) and MAE (0.0192) and the highest R2 (0.9831) during the test period; followed by ANFIS-GA with an RMSE, MAE, and R2 of 0.0302, 0.0216, and 0.9787; and ANFIS-PSO with an RMSE, MAE, and R2 of 0.0312, 0.0226, and 0.9753. From t + 1 (1 h ahead) to t + 24 (1 day ahead), the accuracy of ANFIS-MPA decreases significantly; the RMSE, MAE, and R2 range from 0.0277, 0.0192, and 0.9831 to 0.1344, 0.1019, and 0.5833, respectively. At all forecast horizons, ANFIS-MPA is superior to the other hybrid methods. The improvement in RMSE of ANFIS-GA and ANFIS-PSO at the 1 h lead time test period is 11.2% and 8.3%, respectively. In contrast, the corresponding improvement at one day lead time (t + 24) is 3.38% and 0.59%.  Table 6 shows the training and test results of the method MARS for the first station. Again, accuracy for this station decreased slightly when delayed inputs were added. Accuracy decreased significantly when the prediction horizon was increased from 1 h to 1 day (t + 24). The RMSE, MAE, and R2 range from 0.1067, 0.0824, and 0.9635 to 0.2928, 0.2029, and 0.7303 in the test period. The best accuracy is obtained by the model with inputs Hst, Hst-1, and Hst-2 with the lowest RMSE (0.1067) and MAE (0.0824) and the highest R2 (0.9635) in the test period. The training and test results of the hybrid ANFIS methods in predicting SWH at the second station are shown in Tables 7-9. This station also shows consistent accuracy of the implemented methods with MARS. The best performance is obtained at the third input combination. Again, ANFIS-MPA outperforms ANFIS-PSO and ANFIS-GA in the 1 h SWH prediction with the lowest RMSE (0.0689) and MAE (0.0475) and the highest R2 (0.9847) in the test period. The use of ANFIS-MPA improves the RMSE accuracy of ANFIS-PSO by about 7% in predicting SWH 1 h ahead. Similar to the first station, the accuracy of the hybrid methods decreases significantly. For example, the RMSE, MAE, and R2 of ANFIS-MPA range from 0.0689, 0.0475, and 0.9847 to 0.2640, 0.1962, and 0.7735, respectively, for forecast horizons t + 1 to t + 24. The ANFIS-MPA outperforms the other hybrid methods at all forecast horizons.  The hybrid neuro-fuzzy and MARS models for predicting SWH are compared in Figures 6 and 7 using scatter plots. The MPS-based ANFIS has the least scattered predictions, with the fitting equation closer to the exact line (y = x) and the highest R 2 in both stations. The models with three inputs (best models) are compared using Taylor diagrams in Figures 8 and 9. This type of graph is very useful for observing the accuracy of the models based on RMSE, standard deviation, and correlation. The plots show that the MPA-based ANFIS has the highest correlation and the lowest squared error in predicting the SWH of both stations. The violin charts in Figures 10 and 11 compare the SWH predictions and observations distributions. The figures show that the mean, median, and distribution of the MPA-based ANFIS are more like the observed values. Figure 12 illustrates the average RMSE and MAE errors of all implemented models in predicting the SWH of both stations. It is clearly seen from the bar charts that the ANFIS-MPA has fewer RMSE and MAE errors in the short-term prediction of SWH in both sites.

Discussion
This study uses a new hybrid neuro-fuzzy method (ANFIS-MPA) to predict SWH using previous values as input. The results are compared with other hybrid neuro-fuzzy models. The MPA-based model is observed to outperform the other models in predicting SWH for multiple horizons from 1 h to 1 day.
The best input combination is investigated using the MARS method. Next, hybrid ANFIS methods are applied to the same scenarios to see if MARS is suitable for determining the best input combination in predicting SWH. A similar trend is observed between the MARS and hybrid ANFIS methods, indicating that the MARS can successfully determine the best input combination in SWH prediction. The comparison of the two stations shows that the methods are more successful in predicting SWH at the second station. The main reason could be the higher autocorrelation of SWG at the second station. These results are consistent with the previous literature [48,50].        . Taylor diagrams of the predicted SWH by different models using the best input combination at Station 2. Figure 10. Violin charts of the predicted SWH by different models using the best input combination at Station 1. Figure 10. Violin charts of the predicted SWH by different models using the best input combination at Station 1.  Figure 11. Violin charts of the predicted SWH by different models using the best input combination at Station 2. Figure 11. Violin charts of the predicted SWH by different models using the best input combination at Station 2. Figure 11. Violin charts of the predicted SWH by different models using the best input combination at Station 2.  It is observed that with the increasing horizon from 1 h to 24 h, the models' accuracy highly deteriorates. However, the ANFIS-MPA generally provides superiority in such cases, which can be useful in monitoring SWH.
Machine learning allows us to find connections between physical parameters that we do not see or do not know. The formation of waves has a nonlinear and complex physical mechanism, and SWH is affected by different parameters, including wind speed, sea surface temperature, water depth, air humidity, and some other weather parameters. In this present study, only SWH data were used as inputs because of the unavailability of other influencing parameters.

Conclusions
This study examined the performance of a new hybrid neuro-fuzzy model, ANFIS-MPA, in predicting significant wave height in multiple horizons from 1 h to 1 day. Hourly data were obtained from two stations, Cairns and Palm Beach buoys, Australia. MARS as a simple tool was used to determine the best input of significant wave height for the much more complex hybrid ANFIS methods. This was also justified by employing hybrid methods for the same input combinations. It was observed that the MARS can be successfully used for selecting the best input combination in predicting significant wave height. The results of ANFIS-MPA were compared with those of the hybrid models ANFIS-PSO and ANFIS-GA. The results showed that the ANFIS-MPA model performed better than the other hybrid models in predicting significant wave height at both stations. At the second station, ANFIS-GA and ANFIS-PSO provided better accuracy than ANFIS-MPA for predicting the significant wave height 1 h ahead, while the latter model outperformed the ANFIS-GA and ANFIS-PSO for other forecasting horizons involving wave heights 2, 4, 8, 12, and 24 h ahead. Assessment criteria involving average RMSE and MAE and graphical inspections such as Taylor and Violin charts revealed that the ANFIS-MPA is superior to the other models in predicting SWH for multiple horizons. Overall results recommend the use of ANFIS-MPA in monitoring significant wave height for multiple time horizons, using only earlier values as inputs.
In this study, we used hourly data from two sites. The results can be generalized if data from other sites and other data intervals (daily or monthly) are used. The developed methods can also be compared with other hybrid machine learning methods to evaluate the accuracy of the implemented methods in predicting significant wave height. In this study, only previous SVH data were used as inputs, and in future studies, more effective parameters such as wind speed, sea surface temperature, water depth, and air humidity can be involved to develop more robust and accurate models in predicting SWH for multiple horizons.