Prediction of Mean Sea Level with GNSS-VLM Correction Using a Hybrid Deep Learning Model in Australia

: The prediction of sea level rise is extremely important for improved future climate change mitigation and adaptation strategies. This study uses a hybrid convolutional neural Network (CNN) and a bidirectional long short-term (BiLSTM) model with successive variational mode decomposition (SVMD) to predict the absolute sea level for two study sites in Australia (Port Kembla and Milner Bay). More importantly, the sea level measurements using a tide gauge were corrected using Global Navigation Satellite System (GNSS) measurements of the vertical land movement (VLM). The SVMD-CNN-BiLSTM model was benchmarked by a multi-layer perceptron (MLP), support vector regression (SVR) and gradient boosting (GB). The SVMD-CNN-BiLSTM model outperformed all the comparative models with high correlation values of more than 0.95 for Port Kembla and Milner Bay. Similarly, the SVMD-CNN-BiLSTM model achieved the highest values for the Willmott index, the Nash–Sutcliffe index and the Legates and McCabe index for both study sites. The projected linear trend showed the expected annual mean sea rise for 2030. Using the current trend, Port Kembla was projected to have an MSL value of 1.03 m with a rate rise of approx. 4.5 mm/year. The rate of the MSL for Milner Bay was comparatively lower with a value of approx. 2.75 mm/year and an expected MSL value of 1.27 m for the year 2030.


Introduction
The anticipated rise in sea level caused by climate change has many potential consequences, including flooding, shoreline retreat, ecosystem imbalance and the invasion of salt water into freshwater systems. This endangers 60% of the world's population that lives in coastal areas [1]. The situation is particularly acute for communities on small islands which may become uninhabitable [2,3]. These impacts motivate using long-term sea level rise modeling as a tool to understand the trends and accurately predict sea level rise in the future. However, the accuracy and reliability of these predictions should consider certain important aspects, such as the absolute sea level rise, which accounts for land subsidence. In this paper, we model the vertical land motion (VLM) [4]-corrected absolute sea level at two separate geographical locations in Australia using a novel deep learning artificial intelligence model based on successive variational mode decomposition (SVMD) [5], a convolutional neural network (CNN) [6] and bidirectional long short-term memory (BiL-STM) [7]. The VLM consideration is extremely important because the position of the sea floor changes over time, and therefore the measured sea level should be corrected for this variability for accurate estimation [4]. The VLM has many root causes, including tides, plate tectonic movements, earthquakes, glacial activity and human actions. While the typical values are significantly less than 10 mm year −1 , they can be as large as 100 mm year −1 in some extreme cases [8]. Given that the order of magnitude of typical VLM values is the same as the changes to the mean sea level, these values cannot be ignored and sea level measurements must be corrected.
LSTM models are well known to be reliable for modeling time series data such as the mean sea level. They are a type of recurrent neural network (RNN) in that they use feedback connections. However, unlike plain RNNs, they do not suffer from vanishing gradients during training, which affects the convergence of the model fit [9]. LSTM models are generally considered to be superior to traditional approaches, such as autoregressive integrated moving average (ARIMA) models [10] and bidirectional LSTM models (i.e., a BiLSTM), in which training data is fed in both directions through the model [11]. Previous research studies into sea level modeling have used different techniques. For example, in [12], the authors employed a simple linear regression with independent variables, including wind speed and pressure, to predict sea level rise. The model, while coarse, was able to explain 74% of the total observed subtidal frequency sea level in Atlantic City, New Jersey. The study in [13] compared chaos theory and ARIMA models for daily, weekly, 10-day and monthly sea levels at the Cocos Islands from 1992 to 2001 and found that the chaos theory models were slightly superior. Other studies have also used various AI models, such as artificial neural networks (ANNs), to model sea level. A study modeling the surface water levels in the Caspian Sea identified a similar accuracy between an ARIMA model and a basic 3-layer ANN model (comprising input, hidden and output layers) [14]. The study in [15] compared ARIMA, support vector regression (SVR) and LSTM models for sea levels, with the LSTM model exhibiting a significantly higher accuracy. Various other studies used different types of ANNs and deep learning to model sea level. However, no other study considered the VLM correction for absolute sea level, the SVMD data decomposition or a hybrid model. Hence, this study presents a new deep learning hybrid CNN-BiLSM model to predict the absolute sea level after GNSS-VLM for Australian sites with a data decomposition of the wave signals.

Study Area and Dataset
Two study sites situated in Australia were selected for this study. The two sites were located on different sides of Australia and provided a good comparison for absolute sea level rise. Table 1 and Figure 1 describe the location and geographical details. The datasets were extracted from the Australian Meteorological website (Australian Baseline Sea Level Monitoring Project (bom.gov.au) accessed on 15 May 2022).

Data Preprocessing
The datasets were corrected for missing values, which were indicated with '−9999'. This was processed in Python using the dataframe.interpolate function. The Australian Meteorological website (Australian Baseline Sea Level Monitoring Project (bom.gov.au) accessed on 15 May 2022) maintains high quality data and the relative proportion of missing or bad values in each file is small. After the correction of the missing values, the dataset was checked for stationarity. This is an important step in time series data analysis [16]. The following result showed the analysis of stationarity for the Milner Bay dataset using  [17,18]. The test statistic should be less than the critical values at 1%, 5% and 10% for the null hypothesis to be rejected. Based on the test statistic, a conclusion was made that the dataset was stationary. Table 2 shows that the test statistic was less than the critical values for the stationarity test. The dataset was then subjected to autocorrelation and partial autocorrelation analyses to determine the correlation lags, as shown in Figure 2. All the input parameters were checked for correlation with the mean sea level and computed as a matrix, as given in Figure 3. The dataset was normalized using Equation (1) and then denormalized using Equation (2) after modeling.

Data Preprocessing
The datasets were corrected for missing values, which were indicated with '−9999′. This was processed in Python using the dataframe.interpolate function. The Australian Meteorological website (Australian Baseline Sea Level Monitoring Project (bom.gov.au) accessed on 15 May 2022) maintains high quality data and the relative proportion of missing or bad values in each file is small. After the correction of the missing values, the dataset was checked for stationarity. This is an important step in time series data analysis [16]. The following result showed the analysis of stationarity for the Milner Bay dataset using the augmented Dickey-Fuller (ADF) test [17,18]. The test statistic should be less than the critical values at 1%, 5% and 10% for the null hypothesis to be rejected. Based on the test statistic, a conclusion was made that the dataset was stationary. Table 2 shows that the test statistic was less than the critical values for the stationarity test. The dataset was then subjected to autocorrelation and partial autocorrelation analyses to determine the correlation lags, as shown in Figure 2. All the input parameters were checked for correlation with the mean sea level and computed as a matrix, as given in Figure 3. The dataset was normalized using Equation (1) and then denormalized using Equation (2) after modeling.

GNSS-VLM Correction of Port Kembla and Milner Bay
The Global Positioning System (GPS) is part of the Global Navigation Satellite System (GNSS) which allows for accurate measurements of the Earth's surface [19]. It is used to locate the position of a receiver using a constellation of numerous artificial satellites [20,21]. This study uses the GNSS-VLM dataset in [22] derived from continuously operating GPS stations operated by the Nevada Geodetic Laboratory.

GNSS-VLM Correction of Port Kembla and Milner Bay
The Global Positioning System (GPS) is part of the Global Navigation Satellite System (GNSS) which allows for accurate measurements of the Earth's surface [19]. It is used to locate the position of a receiver using a constellation of numerous artificial satellites [20,21]. This study uses the GNSS-VLM dataset in [22] derived from continuously operating GPS stations operated by the Nevada Geodetic Laboratory. Table 3 shows the information for Port Kembla and Milner Bay.

GNSS-VLM Correction of Port Kembla and Milner Bay
The Global Positioning System (GPS) is part of the Global Navigation Satellite System (GNSS) which allows for accurate measurements of the Earth's surface [19]. It is used to locate the position of a receiver using a constellation of numerous artificial satellites [20,21]. This study uses the GNSS-VLM dataset in [22] derived from continuously operating GPS stations operated by the Nevada Geodetic Laboratory. Table 3 shows the information for Port Kembla and Milner Bay.

Signal Decomposition by Successive Variational Mode Decomposition (SVMD)
SVMD is an efficient signal decomposition technique which extracts underlying modes of the input signal into its intrinsic mode functions (IMFs) [23]. It has a lower computational complexity compared to VMD and is less sensitive to the initial values of the central frequencies of the modes [5]. These modes are orthogonal and are separated by the respective bands. The highest frequency is removed from the original signal iteratively until the residual is left as a monotonic function [24]. The process of signal decomposition is very important in data modeling as it helps to extract important features and improves the training model efficiency [25]. The Hs wave signal for both study sites were fed through the SVMD algorithm. The algorithm parameters of the compactness mode, step of dual ascend, tolerance of convergence, stopping criteria and sampling frequency were determined through trial runs before extracting the decomposed intrinsic modes. Figure 4 shows the Milner Bay signal decomposed into its IMFs using the SVMD technique for this study.

Signal Decomposition by Successive Variational Mode Decomposition (SVMD)
SVMD is an efficient signal decomposition technique which extracts underlying modes of the input signal into its intrinsic mode functions (IMFs) [23]. It has a lower computational complexity compared to VMD and is less sensitive to the initial values of the central frequencies of the modes [5]. These modes are orthogonal and are separated by the respective bands. The highest frequency is removed from the original signal iteratively until the residual is left as a monotonic function [24]. The process of signal decomposition is very important in data modeling as it helps to extract important features and improves the training model efficiency [25]. The Hs wave signal for both study sites were fed through the SVMD algorithm. The algorithm parameters of the compactness mode, step of dual ascend, tolerance of convergence, stopping criteria and sampling frequency were determined through trial runs before extracting the decomposed intrinsic modes. Figure  4 shows the Milner Bay signal decomposed into its IMFs using the SVMD technique for this study.

Data Partition
Data partitioning is an essential step in data modeling that ensures the optimum data for each sample in training, validation and testing [26]. Table 4 shows the 60% training, 20% validation and 20% testing dataset breakdown used in this study.

Data Partition
Data partitioning is an essential step in data modeling that ensures the optimum data for each sample in training, validation and testing [26]. Table 4 shows the 60% training, 20% validation and 20% testing dataset breakdown used in this study.

Objective Model and Modeling Process
The objective model selected for this study was the hybrid deep learning model CNN-BiLSTM. The CNN-BiLSTM model can utilize the benefits of the CNN algorithm and two layers of the LSTM model to process the data inputs effectively for data training and forecasting. A CNN is a deep learning convolutional neural network consisting of multiple layers of artificial neurons [27]. This study used a one-dimensional convolutional layer which added a filter to the model architecture for convolution. This process of onedimensional convolution helped to extract valuable information from the data inputs. The next stage of data processing was then conducted using the BiLSTM layers. The BiLSTM layers were widely used in time series analysis due to their ability to expand according to the sequence of time [28]. The BiLSTM platform in the study consisted of two LSTM layers (a forward and a reverse LSTM layer). Each BiLSTM layer had two LSTM networks, which processed the input dataset. A LSTM model is a network designed to overcome gradient explosion and gradient disappearance in a RNN [28,29]. The hybrid CNN-BiLSTM model had the additional capability of utilizing the CNN feature extraction superior ability and the BiLSTM architecture, which further processed these using the forward and backward neural layers. This approach of using the hybrid deep learning model performed better than the standalone models in multiple past studies [30][31][32]. Table 5 shows the model parameters, which were obtained using Grid-Search optimization for optimum results. Figure 5 illustrates the data analysis in Python and how the layers were arranged within the hybrid deep learning model architecture. Figure 6 shows the overall modeling process.

Objective Model and Modeling Process
The objective model selected for this study was the hybrid deep learning mode CNN-BiLSTM. The CNN-BiLSTM model can utilize the benefits of the CNN algorithm and two layers of the LSTM model to process the data inputs effectively for data training and forecasting. A CNN is a deep learning convolutional neural network consisting o multiple layers of artificial neurons [27]. This study used a one-dimensional convolutiona layer which added a filter to the model architecture for convolution. This process of one dimensional convolution helped to extract valuable information from the data inputs. Th next stage of data processing was then conducted using the BiLSTM layers. The BiLSTM layers were widely used in time series analysis due to their ability to expand according to the sequence of time [28]. The BiLSTM platform in the study consisted of two LSTM layer (a forward and a reverse LSTM layer). Each BiLSTM layer had two LSTM networks, which processed the input dataset. A LSTM model is a network designed to overcome gradien explosion and gradient disappearance in a RNN [28,29]. The hybrid CNN-BiLSTM mode had the additional capability of utilizing the CNN feature extraction superior ability and the BiLSTM architecture, which further processed these using the forward and backward neural layers. This approach of using the hybrid deep learning model performed bette than the standalone models in multiple past studies [30][31][32]. Table 5 shows the model parameters, which were obtained using Grid-Search opti mization for optimum results. Figure 5 illustrates the data analysis in Python and how th layers were arranged within the hybrid deep learning model architecture. Figure 6 show the overall modeling process.

Multi-Layer Perceptron
A multi-layer perceptron (MLP) is a supervised learning algorithm that learns on a nonlinear function by utilizing backpropagation [33]. It usually conists of at least three layers of nodes, i.e., an input layer, a hidden layer and an output layer. Apart from the input layer, each node consists of a nonlinear activation function. The data transfer is only conducted in a forward direction [34].

Gradient Boosting
Gradient boosting (GB) utilizes a set of weak learners that perform slightly better than random guessing to develop a single strong learner [35,36]. The regularization of the hyperparameters helps to control the additive process of gradient boosting. Shrinking is applied to reduce each gradient descent step to naturally achieve regularization [37].

Multi-Layer Perceptron
A multi-layer perceptron (MLP) is a supervised learning algorithm that learns on a nonlinear function by utilizing backpropagation [33]. It usually conists of at least three layers of nodes, i.e., an input layer, a hidden layer and an output layer. Apart from the input layer, each node consists of a nonlinear activation function. The data transfer is only conducted in a forward direction [34].

Gradient Boosting
Gradient boosting (GB) utilizes a set of weak learners that perform slightly better than random guessing to develop a single strong learner [35,36]. The regularization of the hyperparameters helps to control the additive process of gradient boosting. Shrinking is applied to reduce each gradient descent step to naturally achieve regularization [37].

Support Vector Regression
Rooted in the Vapnik-Chervonenkis (VC) theory, support vector regression (SVR) is characterized by the use of kernels, a sparse solution, a control of margin and support vectors [38]. It is a supervised learning approach where the dataset is trained using a symmetrical loss function, which equally penalizes high and low misestimates.

Performance Evaluation Metrics
Eight evaluation metrics were used for this study for the model performance comparison. Every metric shown below (Equations (3)-(9)) added an important aspect of performance evaluation and justified the accuracy of the model performance in sea level prediction for the study sites.

Mean Absolute Percentage Error (MAPE)
where DS i is the simulated data and DO i is the observed data.

Results and Discussion
The prediction results from the testing phase were compared with the observational dataset by computing the performance and error metrics. Tables 6-9 show the performance metrics for the objective model (SVMD-CNN-BiLSTM) and the three benchmark models (SVMD-MLP, SVMD-SVR, SBMD-GB). This study developed a hybrid SVMD-CNN-BiLSTM deep learning model for VLMcorrected mean sea level prediction. Other well-known AI models were also used for the prediction of the VLM-corrected sea level. Hence, to analyze and compare the efficiency of these models, all the model results were used to calculate the performance and error metrics to evaluate the prediction ability. Figures 7 and 8 Figures 11 and 12 show the model performance of the two study sites based on the computation of the correlation coefficient, Willmott's index, the Nash-Sutcliffe coefficient and the Legates and McCabe index. The correlation coefficient was an important statistical measure and showed the degree of association between the observed and predicted VLMcorrected mean sea level [39]. Willmott's index of agreement was a measure of the variability between the observed and predicted data and is commonly used in climate modeling [40]. Nash-Sutcliffe index is mostly used in water quality models and provided a   Figures 11 and 12 show the model performance of the two study sites based on the computation of the correlation coefficient, Willmott's index, the Nash-Sutcliffe coefficient and the Legates and McCabe index. The correlation coefficient was an important statistical measure and showed the degree of association between the observed and predicted VLMcorrected mean sea level [39]. Willmott's index of agreement was a measure of the variability between the observed and predicted data and is commonly used in climate modeling [40]. Nash-Sutcliffe index is mostly used in water quality models and provided a  Figures 11 and 12 show the model performance of the two study sites based on the computation of the correlation coefficient, Willmott's index, the Nash-Sutcliffe coefficient and the Legates and McCabe index. The correlation coefficient was an important statistical measure and showed the degree of association between the observed and predicted VLMcorrected mean sea level [39]. Willmott's index of agreement was a measure of the variability between the observed and predicted data and is commonly used in climate modeling [40].
Nash-Sutcliffe index is mostly used in water quality models and provided a goodness of fit for the evaluated models [41]. The Legates and McCabe index is a variant of the Nash-Sutcliffe index and is considered a more refined index of the model performance, which compared the agreement between the observed and predicted data [42]. All the models showed a high correlation with values of greater than 0.9 for both study sites. goodness of fit for the evaluated models [41]. The Legates and McCabe index is a variant of the Nash-Sutcliffe index and is considered a more refined index of the model performance, which compared the agreement between the observed and predicted data [42]. All the models showed a high correlation with values of greater than 0.9 for both study sites.
The   The computation of more than one error metric for evaluation was as important as the performance metrics to ensure a comprehensive evaluation [43]. The error results also goodness of fit for the evaluated models [41]. The Legates and McCabe index is a variant of the Nash-Sutcliffe index and is considered a more refined index of the model performance, which compared the agreement between the observed and predicted data [42]. All the models showed a high correlation with values of greater than 0.9 for both study sites.
The   The computation of more than one error metric for evaluation was as important as the performance metrics to ensure a comprehensive evaluation [43]. The error results also The computation of more than one error metric for evaluation was as important as the performance metrics to ensure a comprehensive evaluation [43]. The error results also         accuracy and its ability to predict the MSL values. Using the current trend, Port Kembla would have a MSL value of 1.03 m with a rate rise of about 4.5 mm/year, which was slightly lower than the values of 6.5 mm/year in [44]. This was due to the consideration of land subsidence and future projected values based on past trends. high accuracy and its ability to predict the MSL values. Using the current trend, Port Kembla would have a MSL value of 1.03 meters with a rate rise of about 4.5 mm/year, which was slightly lower than the values of 6.5 mm/year in [44]. This was due to the consideration of land subsidence and future projected values based on past trends. The rate of the MSL for Milner Bay was comparatively lower, with a value of about 2.75 mm/year and an expected MSL value of 1.27 meters for the year 2030.   high accuracy and its ability to predict the MSL values. Using the current trend, Port Kembla would have a MSL value of 1.03 meters with a rate rise of about 4.5 mm/year, which was slightly lower than the values of 6.5 mm/year in [44]. This was due to the consideration of land subsidence and future projected values based on past trends. The rate of the MSL for Milner Bay was comparatively lower, with a value of about 2.75 mm/year and an expected MSL value of 1.27 meters for the year 2030.

Conclusions
In this study, we developed a hybrid SVMD-CNN-BiLSTM deep learning model for VLM-corrected mean sea level prediction at two sites in Australia, namely Port Kembla and Milner Bay. This model outperformed the three other benchmark models (SVMD-MLP, SVMD-SVR and SBMD-GB) in terms of their metrics, such as the correlation coefficient, Willmott's index, the Nash-Sutcliffe coefficient and the Legates and McCabe index, thus supporting its superior performance. The study also confirmed the importance of using a data decomposition method to extract important features from wave signals. The prediction of the absolute mean sea level accurately assists with careful planning and decision making for the future. The study successfully provided an insight into the trend by allowing for the linear projection of the sea level rise. Using the current trend, Port Kembla will have an MSL rise of approx. 4.5 mm/year and 1.27 m for Milner Bay will reach a 1.27 m rise by the year 2030. This study can be extended to other areas for accurate predictions of the MSL.