Sea level variability and modeling in the Gulf of Guinea using supervised machine learning

Ayinde, Akeem Shola; Yu, Huaming; Wu, Kejian

doi:10.1038/s41598-023-48624-1

Download PDF

Article
Open access
Published: 03 December 2023

Sea level variability and modeling in the Gulf of Guinea using supervised machine learning

Akeem Shola Ayinde^1,2,3,
Huaming Yu^1,2 &
Kejian Wu^1,2

Scientific Reports volume 13, Article number: 21318 (2023) Cite this article

1019 Accesses
1 Altmetric
Metrics details

Subjects

Abstract

The rising sea levels due to climate change are a significant concern, particularly for vulnerable, low-lying coastal regions like the Gulf of Guinea (GoG). To effectively address this issue, it is crucial to gain a comprehensive understanding of historical sea level variability, and the influencing factors, and develop a reliable modeling system for future projections. This knowledge is essential for informed planning and mitigation strategies aimed at protecting coastal communities and ecosystems. This study presents a comprehensive analysis of mean sea level anomaly (MSLA) trends in the GoG between 1993 and 2020, covering three distinct periods (1993–2002, 2003–2012, and 2013–2020). It investigates the connections between interannual sea level variability and large-scale oceanic and atmospheric forcings. Furthermore, the study evaluates the performance of supervised machine learning techniques to optimize sea level modeling. The findings reveal a consistent rise in MSLA linear trends across the basin, particularly pronounced in the northern region, with a total linear trend of 88 mm over the entire period. The highest decadal trend (38.7 mm) emerged during 2013–2020, with the most substantial percentage increment (100%) occurring in 2003–2012. Spatial variation in decadal sea-level trends was influenced by subbasin physical forcings. Strong interannual signals in the spatial sea level distribution were identified, linked to large-scale oceanic and atmospheric phenomena. Seasonal variations in sea level trends are attributed to seasonal changes in the forcing factors. The evaluation of supervised learning modeling methods indicates that Random Forest Regression and Gradient Boosting Machines are the most accurate, reproducing interannual sea level patterns in the GoG with 97% and 96% accuracy. These models could be used to derive regional sea level projections via downscaling of climate models. These findings provide essential insights for effective coastal management and climate adaptation strategies in the GoG.

Predicting regional coastal sea level changes with machine learning

Article Open access 07 April 2021

Long-lead Prediction of ENSO Modoki Index using Machine Learning algorithms

Article Open access 15 January 2020

Exploring the long-term changes in the Madden Julian Oscillation using machine learning

Article Open access 29 October 2020

Introduction

Climate change is a pressing global concern, with its escalating impacts significantly affecting sea levels¹. By comprehending and modeling sea level trends, we can gain crucial insights into how climate change is impacting coastal regions, especially in the Gulf of Guinea, a wide expanse of land on the West African coast, where coastlines are predominantly low-lying. Therefore, there is an urgent need for sea level projection models to assess the potential impacts of sea level rise. These models are invaluable for policymakers and coastal planners, enabling them to proactively prepare for and mitigate the consequences on coastal communities and ecosystems. The examination of regional sea level variability and the identification of its driving factors are of paramount importance in understanding the consequences of climate change on coastal areas². The range of sea level variability encompasses a wide spectrum of challenges, with profound implications for coastal communities, infrastructure, and marine ecosystems. These challenges encompass elevated storm surges, coastal erosion, flooding, saltwater intrusion, disruption of marine ecosystems, and infrastructure damage, all of which carry substantial economic implications. The intensification of these impacts, particularly in low-lying coastal regions, underscores the critical need for comprehending the mechanisms underlying sea level variability and for establishing precise, cost-effective models to elucidate regional sea level drivers and projections.

Conventional numerical models, including deterministic numerical models which are based on mathematical equations that describe the physical processes governing a system (such as dynamical or physical models), have a long history of demonstrating their effectiveness in forecasting variations in sea state and sea level^3,4,5. Nevertheless, the practical deployment and execution of these models can entail substantial intricacies and expenses, meaning that they come with challenges and costs when it comes to implementing and using them in real-world applications. Lately, the emergence of machine learning techniques has offered promising avenues to enhance the prediction and forecasting capabilities of sea levels. These advancements extend beyond predictions and forecasts, with machine learning showcasing its potential in refining 'best-estimate' ensemble forecasts for ocean waves⁶ and predicting storm surges with precision⁷. Particularly, the performance of ANN matches that of deterministic hydrodynamic models in capturing extreme events. In recent times, ANN has been successfully employed for storm surge hindcasts in estuarine ports in the UK, enabling accurate coastal flood predictions⁸. Furthermore, the predictive aptitude of ANN extends to oceanic variables such as subsurface temperature (ST) and even climate phenomena like El Niño/Southern Oscillation (ENSO) over 1.5 years⁹. Impressively, ANN surpasses state-of-the-art dynamical forecast systems in forecasting ENSO, highlighting remarkable advancements in ENSO predictions¹⁰.

The ascendant integration of Artificial Intelligence (AI) in the scientific realm has necessitated the development of diverse machine learning approaches, encompassing Convolutional Neural Networks (CNN), Recurrent Neural Networks (RNN), Feedforward Neural Networks (FNN), and traditional regression methods. While CNNs excel in image recognition, object detection, and classification tasks^11,12,13, RNNs are well-suited for temporal analyses, natural language processing, and speech recognition^14,15. In contrast, FNNs are adept at pattern recognition, function approximation, and mapping input features to output targets. The efficacy of the machine learning models hinges upon data quality, quantity, and a solid understanding of the system to discern requisite input data (predictors/training data) for the analysis at hand. Marine meteorological data have been particularly successful as input variables, demonstrating effectiveness in modeling and forecasting sea level variability^16,17,18.

This study examines the spatial linear trend of sea levels and its drivers in the GoG, focusing on decadal trends. Additionally, it investigates seasonal variability and the impact of large-scale oceanic and atmospheric phenomena on interannual sea level fluctuations. We develop and evaluate five supervised machine learning models, including two artificial neural networks (ANN): Multi-layer Perceptron Regression (MLPR), which is an FNN, and Long Short-Term Memory (LSTM), an example of RNN. We also employ traditional regression models: Multiple Linear Regression (MLR), Random Forest Regression (RFR), and Gradient Boosting Machine (GBM). These models utilize marine meteorological and hydrological variables, including thermosteric sea level anomaly (TSLA), halosteric sea level anomaly (HSLA), wind stress curl (WSC), atmospheric pressure, net heat flux (NHF), precipitation, evaporation, and freshwater runoff. These variables have been widely recognized and studied in previous research, and their influence on sea level fluctuations is well-documented^19,20,21. The manuscript is structured as follows: Section "Results" introduces the study area, data sources, and variable parameterization. In Section "Discussion/conclusion", we detail the methodology and model specifications. Subsequently, in Section "Methodology", we present and discuss the experimental outcomes.

Results

Interannual-to-interdecadal spatial trends and variability of MSLA and its forcings

In this section, we analyzed the interannual-to-interdecadal spatial trends and variability of MSLA and its associated drivers, including SSLA, TSLA, HSLA, ocean heat content (OHC), WSC, air temperature, NHF, precipitation, evaporation, freshwater runoff, and atmospheric pressure. We aimed to investigate their potential contributions to MSLA during the period from 1993 to 2020 in the GoG. Subsequently, we compared the spatial trends and variability for the three distinct periods: 1993–2002, 2003–2012, and 2013–2020. This division was based on the need to capture and analyze long-term trends while avoiding the potential masking of significant shorter-term variability. This approach allows us to distinguish between gradual, sustained changes and shorter-term variations, and to assess how sea level variability and its drivers have evolved over time. In general, analysis of MSLA variability revealed a distinct spatial and temporal pattern across the basin, which was significantly influenced by subbasin-scale drivers.

Linear trends during 1993–2002

The spatial distribution of linear trends in MSLA from 1993 to 2002 reveals notable variations in trends ($11.8\,\mathrm{ mm }\,{\mathrm{decade}}^{-1}$) across the GoG basin (Fig. 1A). Substantial trends are evident along the northern coast, particularly pronounced ($\sim 55{-}60\,\mathrm{ mm }\,{\mathrm{decade}}^{-1}$) along Sierra Leone, Guinea Conakry, Guinea Bissau, Gambia, and Senegal. Comparatively higher trends are observed along the continental shelf, except for Nigeria, Benin, Togo, Ghana, Gabon, and Congo, which exhibit lower trends. The variability in these spatial trends is closely tied to the underlying sub-basin forcing factors. TSLA's basin-wide negative trend (-$19\,\mathrm{ mm }\,{\mathrm{decade}}^{-1}$) mirrors MSLA's coastal trend (Fig. 1A,C), while HSLA's basin-wide positive trend (0.6 $\mathrm{mm }\,{\mathrm{decade}}^{-1}$) exhibits pronounced variability on the northeast coast (Fig. 1D). Both TSLA and HSLA contribute to SSLA's overall spatial trend (-$18.6\,\mathrm{ mm }\,{\mathrm{decade}}^{-1}$), aligning closely with MSLA's trend along the northern coast (Fig. 1A–D). Notably, regions experiencing decreased evaporation trends coincide with increased MSLA trends, while areas with high (low) runoff and precipitation correspond to regions with high (low) MSLA, and vice versa (Fig. 1A,E,F,G). While there is no significant change in the WSC trend offshore, cyclonic WSC trends dominate the northern basin, influencing MSLA variations (Fig. 1J). Negative current velocity trends are linked with cyclonic WSC trends, leading to low MSLA, except along the Gambia coast, where high MSLA is observed (Fig. 1A,I,J). The role of cyclonic WSC in coastal upwelling and MSLA modulation is well-documented^22,23,24. Atmospheric pressure shows a strong spatial trend (4.9 $\mathrm{hPa }\,{\mathrm{decade}}^{-1}$), decreasing meridionally from the south to north basin. High atmospheric pressure in the south drives northward flow, raising (lowering) water levels respectively (Fig. 1A,L). The NHF's spatial linear trend is negative ($-7.2\,{\mathrm{Wm}}^{-2}\,{\mathrm{decade}}^{-1}$), signifying ocean heat release into the atmosphere and a net heat loss. This contributes to decreasing OHC ($-8.6 \times{ 10}^{8}\,{\mathrm{Wm}}^{-2}\,{\mathrm{decade}}^{-1}$), leading to a lower MSLA trend (Fig. 1A,H,K).

Linear trends during 2003–2012

The period from 2003 to 2012 witnessed distinctive changes in the spatial distribution of linear trends in MSLA and its driving forces, compared to 1993–2002, with heightened sea level trends in the northwestern basin (Fig. 2A). Specifically, the northwest basin along the coasts of Guinea Conakry, Guinea Bissau, Gambia, and Senegal experienced the highest trends, at approximately $100\,\mathrm{ mm }\,{\mathrm{decade}}^{-1}$, while the northeast basin and offshore areas showed lower trends. In general, MSLA exhibited a remarkable increase in spatial linear trend, with a value of $23.5\,\mathrm{ mm }\,{\mathrm{decade}}^{-1}$, constituting a 100% surge from the previous decade in the GoG. Similarly, TSLA and HSLA displayed overall increased spatial linear trends, with values of $-16.3\,\mathrm{ mm }\,{\mathrm{decade}}^{-1}$ decade and $0.81\,\mathrm{ mm }\,{\mathrm{decade}}^{-1}$, respectively, with varying magnitudes across subbasins (Fig. 2C,D). These trends notably contributed to SSLA and MSLA at the coasts (Fig. 2A,B). Hydrological variables, including precipitation, runoff, and evaporation, demonstrated consistent spatial linear trend patterns, which had a similar impact on MSLA compared to the previous decade. This was particularly evident along northern coasts, where high trends in these variables correlated with MSLA trends. However, an exception was observed on the Nigerian coast, where a higher evaporation trend led to lower HSLA and MSLA (Fig. 2A,D–G). The spatial linear trend of WSC and current velocities resembled the previous decade's pattern but with slightly lower overall linear trends ($0.004 \,{\mathrm{Nm}}^{-2}\,{\mathrm{decade}}^{-1}$ and $0.2 \,{\mathrm{ms}}^{-1}\,{\mathrm{decade}}^{-1}$ respectively). Conversely, atmospheric pressure showed a unique pattern, characterized by a decreased spatial linear trend of − 16.5 hPa, influencing MSLA in areas like Guinea Conakry, Guinea Bissau, Gambia, and Senegal (Fig. 2A,L). Low linear trends in WSC, current velocity, and atmospheric pressure negatively correlated with sea level, enhancing MSLA (Fig. 2A,I,J,L). Despite a negative NHF spatial linear trend ($-1.007 \,{\mathrm{Wm}}^{-2}\,{\mathrm{decade}}^{-1}$), it represented an 86% increase from the prior decade, impacting OHC trends ($-7.5 \times{ 10}^{8} \,{\mathrm{Wm}}^{-2}\,{\mathrm{decade}}^{-1}$) and further amplifying SSLA and MSLA trends during this period (Fig. 2A,B,H,K).

Linear trends during 2013–2020

In the period spanning 2013 to 2020, the behavior of MSLA diverges from the previous two decades, showcasing distinct spatial trends and magnitudes with elevated sea level trends in the eastern basin (Fig. 3A). Notably, a considerable upsurge in MSLA is observed across the basin, with a spatial linear trend of $31.7\,\mathrm{ mm }\,{\mathrm{decade}}^{-1}$. This trend marks a substantial increment of about 169% in comparison to the 1993–2002 periods, and a 32% increase from the 2003–2012 spans. The eastern basin stands out with a notable linear trend, particularly evident along the coasts of Cameroon and Equatorial Guinea. However, the trend in HSLA presents a contrasting scenario, experiencing a significant decrease (-$0.43\,\mathrm{ mm }\,{\mathrm{decade}}^{-1}$) in comparison to the previous two decades (Fig. 3D). This reduction amounts to approximately 28% from the first decade and 47% from the second, yet the distribution of spatial trends remains akin to that of MSLA. This decrease in HSLA predominantly stems from negative trends along specific coasts, particularly the Nigerian coast, a result of diminishing freshwater runoff and heightened evaporation in these areas. Meanwhile, TSLA displays an overall positive spatial linear trend of $13.6\,\mathrm{ mm }\,{\mathrm{decade}}^{-1}$, reflecting substantial increments of 183% and 172% from the preceding decades, respectively (Fig. 3C). However, the spatial pattern of TSLA slightly diverges from MSLA during this timeframe, with the highest trend observed in the western basin and particularly the northwestern shelf. The combined spatial trends of HSLA and TSLA significantly contribute to the spatial linear trend of SSLA (Fig. 3B–D), reaching a peak of $14.1\,\mathrm{ mm }\,{\mathrm{decade}}^{-1}$. This indicates an impressive increment of 191% and 175% in comparison to the previous decades, respectively. The linear trends of precipitation and runoff display a consistent pattern along the coast, counterbalancing evaporation, with the northern basin exhibiting a high precipitation trend corresponding to low evaporation in the southern basin (Fig. 3E–G). During this period, the spatial linear trend of both WSC ($0.44\,\mathrm{ mm }\,{\mathrm{decade}}^{-1}$), and current velocity ($-0.07 \,{\mathrm{ms}}^{-1}\,{\mathrm{decade}}^{-1}$) displays a negative trend, representing a decrease of about 300% for WSC and 158% for current velocity compared to the previous decade (Fig. 3I,J). Similarly, atmospheric pressure exhibits a negative trend of ($-5.5\,\mathrm{ hPa }\,{\mathrm{decade}}^{-1}$), signifying a 67% increase from the last decade, with a distinct distribution along the coasts and offshore regions (Fig. 3L). The spatial linear trend of NHF portrays a positive trend ($0.008 \,{\mathrm{Wm}}^{-2}\,{\mathrm{decade}}^{-1}$), indicating an increase of 108% from the previous decade (Fig. 3K). Similarly, OHC demonstrates a positive trend of $6.6 \times{ 10}^{8}\, {\mathrm{Wm}}^{-2}\,{\mathrm{decade}}^{-1}$, corresponding to a remarkable 188% increment (Fig. 3H). These affirmative trends contribute to the observed heightened SSLA and MSLA during this period. The summarized analysis is presented in Table 1.

Table 1 Summary of the magnitude and percentage changes in the trends of MSLA and its forcing factors for the periods 1993–2002, 2003- 2012, 2013–2020, and 1993–2020 in the GoG.

Full size table

Leading modes of interannual MSLA variability

The following section discusses the results of the EOF analysis to further investigate the climate and oceanic phenomena that dominate the interannual variability of MSLA in the GoG. This investigation was conducted using the detrended spatial MSLA dataset, which isolates interannual variability by removing long-term effects from the time series data.

The first mode, EOF1, which is the dominant mode (Fig. 4A), explains 57% of the total variance in MSLA. This mode exhibits a mountainous tripole pattern. The base of the mountain displays high variability of MSLA, culminating from the eastern coasts and extending up to the coasts of Côte d'Ivoire and parts of Liberia in the west. It has a shoulder extending to the equatorial region, with the slopes of the mountain displaying moderately high sea levels that decay with increasing latitudes on both sides of the slope (north and south basins). These features demonstrate the classical characteristic of the barotropic Kelvin waves (equatorial and boundary) as described by²⁵. “The wave propagates equatorward along the western boundary, poleward along the eastern boundary, and cyclonic circulation around a closed boundary (counterclockwise in the Northern Hemisphere and clockwise in the Southern Hemisphere). It has the highest amplitude at the boundary which exponentially decays away from the boundary. Furthermore, it consistently propagates eastward at the equator, attaining its maximum magnitude and subsequently decaying exponentially with increasing latitude”. This wave is usually wind-induced due to atmospheric pressure gradients acting on the ocean surface. This suggests that the teleconnectivity of the equatorial and eastward propagation of Kelvin waves modulate the interannual sea level in the GoG, with the highest contribution observed around 1998 and a persistent slowdown during the 2016–2020 periods.

The second mode, EOF2, accounts for 17% of the variance fraction and exhibits a tripole pattern. It shows low sea levels along the northwest and northeast coasts, surrounding a high sea level in the central north coasts. The low sea level extends to the equatorial basin, separating the two high sea levels in the central north coasts and the south basin. This pattern closely corresponds to the temperature distribution of the surface oceanic circulation in the GoG, primarily driven by wind (Fig. 4B).

The third EOF of the interannual variability of MSLA explains 12% of the total variance and exhibits a meridional dipole pattern with high variability observed in the northern basin, a pattern that bears the hallmark of Atlantic Meridional Overturning Circulation (AMOC). AMOC is a powerful oceanic current system that plays a crucial role in regulating the climate in the Atlantic by transporting warm surface water northward and cold deep water southward²⁶. The temperature distribution by AMOC consequently affects the sea level variability as observed in EOF3 (Fig. 4C).

Seasonal variability

Analysis of the spatial seasonal variability of MSLA in the GoG, as depicted in Fig. 5, shows a distinct spatial seasonal variability of sea level across the basin. The northern basin, which has the highest MSLA distribution throughout the seasons compared to the southern basin, records its lowest value towards the end of RONS, through the ROFFS seasons. Notably, Cameroon and Nigerian coasts have an overall highest seasonal sea level trend compared to other coasts with the highest during RONS season. This is because the highest sea level along the northwest coasts is observed in December (DMON). This seasonal sea level pattern in the northern basin is similar to that of the south, with the highest in RONS, specifically in the southwestern basin.

The overall seasonal trend of MSLA across the basin (Fig. 6) shows two distinct peaks (April and November) and troughs (July and August), with November and August being the highest and lowest troughs, respectively. These peaks and troughs correspond to the observed HSLA peaks and TSLA troughs (Fig. 6a). Noticeably, the first peak of MSLA was preceded by the current trough in March, and the current peak in July preceded the MSLA trough (Fig. 6b). However, OHC, which follows a similar pattern with SSLA, shows a striking resemblance to the seasonal pattern of MSLA, with only a deviation in the period of the observed highest MSLA peak. Meanwhile, the atmospheric pressure demonstrates an inverse relationship with MSLA, rising as the sea level falls and vice versa throughout the season (Fig. 6d). WSC exhibits a seasonal pattern similar to the ocean current as they lead MSLA by a month. The same is observed for NHF; however, the impact of NHF on MSLA contrasts with both current and WSC, as MSLA peaks precede NHF peaks.

Correlations between MSLA and the forcing variables

The results of the Pearson correlation and regression analysis conducted on the detrended and filtered MSLA and its forcings in GoG are presented in Fig. 7. Our analysis revealed a significant level of association between MSLA and its various forcing variables. This connection provides a comprehensive understanding of the intricate relationships at play within the GoG. For instance, we observed that MSLA exhibited a positive relationship with SSLA, TSLA, HSLA, OHC, NHF, precipitation, freshwater runoff, and air temperature. These positive associations highlight the interdependence of these variables, emphasizing how changes in one component can influence MSLA and, in turn, contribute to the sea level variability in the GoG. Conversely, we noted negative relationships between MSLA and certain other variables, including current, WSC, atmospheric pressure, and evaporation. These negative associations provide additional depth to our understanding of the complex interactions governing sea level changes. They reveal the counteracting forces at play, where these variables act in opposition to MSLA, influencing sea levels by exerting forces in different directions. Additional observations include strong positive correlations between current and WSC, evaporation and NHF, air temperature with OHC and TSLA, and vice versa. On the other hand, strong negative correlations are found between precipitation and evaporation, TSLA and OHC with current and WSC, and vice versa. These findings further elucidate the intricate relationships between MSLA and its forcing variables, shedding light on the complex dynamics of sea level variability in the GoG.

Model performance

Following the model procedures and evaluation metrics presented in the methodology section of this paper, we present the results of the model performance and their associated feature importance in Fig. 8. Interestingly, we found no significant difference in the models' performance between splitting the data and using the entire dataset for training and testing. Therefore, we depict the plots of the model where we use the entire dataset for training and testing to show the full temporal extent of the data. However, in practical terms, such as for model deployment, the splitting model is considered standard. Our observations show that RFR and GBM models with ${R}^{2}$ and RMSE of 0.97, 0.96, and 1.14, 1.36 respectively, exhibit the best performance among the evaluated models in this study. However, the relative importance of input features (predictor variables) in making predictions or explaining the target variable's variance varies among the models. While it was demonstrated that current and WSC are related in the previous section of this paper in the GoG, they play dominant roles in the performance of RFR and GBM models, respectively. Additionally, the LSTM model, outperforming MLR and MLPR models, closely follows RFR and GBM, with TSLA emerging as the most influential variable. Notably, freshwater runoff stands out by dominating over other features in both MLR and MLPR models. However, the MLPR model exhibits the lowest performance when compared to all the models considered in the study. In general, the performance of machine learning models depends on several factors, including the complexity of the model, data distribution, feature selection methods, hyperparameter tuning, and the model's assumptions about the datasets. The summary of the results of the models’ evaluation metrics is presented in Table 2.

Table 2 Summary of the result of model evaluations, including MLPR, MLR, FRF, GBR, and LSTM.

Full size table

Discussion/conclusion

In this study, we have undertaken a comprehensive analysis of the linear trend of MSLA, focusing on the changes in the decadal trend and their underlying drivers in the GoG. Our investigation involved separate examinations of decadal trends for the periods 1993–2002, 2003–2012, and 2013–2020. Additionally, we explored the seasonal variability of MSLA and investigated potential links between interannual sea level variability and large-scale oceanic and atmospheric forcings spanning from 1993 to 2020. To model sea levels in the GoG, we assessed various supervised machine learning models, including artificial neural networks such as LSTM and MLPR, alongside traditional regression methods like MLR, GBM, and RFR.

Our analysis revealed a consistent increase in the linear trend of MSLA across the entire basin, with the northern region exhibiting a more pronounced trend. The total linear trend from 1993 to 2020 amounts to approximately $88\, \mathrm{mm}$. The highest decadal trend ($38.7\, \mathrm{mm}$) was observed during 2013–2020, while the most substantial percentage increment occurred during 2003–2012 (100%). Zonal differences were evident in the variability of the linear trend of sea level across decades, with the western region showing unique behavior during the 2013–2020 period. The spatial variabilities of the linear decadal sea level trend across the basin are driven by variations in physical forcings within the sub-basins. These forcings, SSLA, TSLA, HSLA, OHC, WSC, air temperature, atmospheric pressure, NHF, precipitation, evaporation, and freshwater runoff, exert distinct impacts on sea levels. While WSC, current velocity, atmospheric pressure, and evaporation negatively correlate with sea levels, SSLA, TSLA, HSLA, OHC, air temperature, NHF, precipitation, and freshwater runoff exhibit positive associations.

Our study also highlighted the significant influence of large-scale oceanic and atmospheric phenomena on the spatial distribution of sea levels. The first three modes of Empirical Orthogonal Function (EOF) variability explained substantial proportions of variance, with the first mode reflecting teleconnections between equatorial and coastal Kelvin waves driven by atmospheric circulations. However, the role of Kelvin waves in modulating the interannual sea level variability near the coast is well-documented^27,28.

The second mode revealed the spatial variability of the current circulation system and its thermal characteristics, largely governed by the wind system. For instance, in the northern basin, the relatively cooled Canary Current (CC) flows southward along the African coasts between 30° and 10°N²⁹. Meanwhile, the warm North Equatorial Counter Current (NECC) flows between 3° and 10°N, acting as the northern limit for the South Equatorial Current (SEC), and the Guinea Current (GC) is a relatively warm eastern flowing current between 3°-5°N along the coasts of West Africa³⁰. In the southern basin, the SEC flows westward between 2°N and 4°S and is fed with relatively cool Benguela water, while the South Equatorial Counter Current (SECC) is a relatively warm eastward flowing current that moves below the SEC³¹. This confirms the temperature distribution of the surface circulation as the second leading mode of interannual variability of MSLA in the GoG. The highest variability was observed in 1994 and has consistently decreased during the 2017–2020 period.

The third mode captured the AMOC, a vital oceanic process controlled by density-driven currents. While there have been reports indicating a slowdown of AMOC in recent decades³², more recent research suggests that AMOC may already be recovering^33,34. For instance, the results of the analysis of the interannual variability of AMOC conducted between 2004 and 2018 by³³ along 26°N, which is slightly above our study area, show a significant decline between the period 2009–2010 and two peaks between the periods 2013–2014 and 2018. This result is consistent with the interannual variability of MSLA in the GoG, as observed in PC3. While the variability in the AMOC has been linked to the sea level variability in the GoG³⁵, to the best of our knowledge, no research has reported the interannual variability of AMOC in the GoG. Therefore, the work of Moat et al. could offer valuable comparative insights, as most of the variability in AMOC originates from the tropical Atlantic.

Furthermore, seasonal variability in sea level trends emerged due to seasonal changes in forcing factors. The opposing impact of current and WSC and the positive effect of NHF led MSLA by a month, just as other variables experience seasonal fluctuations with MSLA.

Despite the challenges inherent in sea level modeling and prediction, the integration of advanced artificial neural networks and machine learning techniques presents a promising solution. By harnessing extensive datasets encompassing ocean currents, WSC, freshwater runoff, TSLA, HSLA, NHF, and atmospheric pressure, these innovative approaches can unveil hidden relationships and underlying mechanisms shaping oceanic processes and sea levels. Such integration empowers the development of more realistic models, expanding projection capabilities over extended temporal ranges. Notably, our analysis highlighted the efficacy of RFR and GBM models, with accuracy rates of 97% and 96%, respectively, in reproducing interannual sea level patterns in the Gulf of Guinea. The implications of the findings of this work may be extended to other regions in terms of methodological transferability for regional sea level modeling, understanding the historical trend and their drivers, enabling proper environmental monitoring, climate adaptation, resilience, and data-driven decision-making.

Methodology

Study area

The Gulf of Guinea, situated along the western coast of Africa, stretches from Cape López near the equator to Cape Palmas, spanning longitudes ${17}^{o}W$ to ${11}^{o}E$, and latitude ${15}^{o}N$ to ${10}^{o}S$ (Fig. 9). Known for its predominantly low-lying coasts, the Gulf features warm tropical waters with relatively low salinity due to the influx of major rivers, including the Volta, Niger, Congo, Forcadoes, Ouémé, Delta, Sassandra, Tano, Nun, and Komoé, among others. Spanning approximately 6000 km, the Gulf boasts a diverse coastline, characterized by a nearly uniformly narrow continental shelf measuring about 100 nautical miles. The region is impacted by five principal ocean currents: the Benguela Current, Canary Current, South Equatorial Current, Counter Equatorial Current, and Guinea Current³⁶. The prevailing climate follows a monsoon pattern, particularly the West Africa monsoon (WAM), with two primary air masses: the Southwest and Northeast winds³⁷. The region experiences minimal dry season during the summer, leading to two distinct wet seasons annually³⁸, marked by the onset and conclusion of the rainy period. The initiation of the rainy season is often accompanied by low rainfall amounts, referred to as pre-rain onset³⁹. This study classifies seasons based on West Africa's continental rainfall quantity and timing. The winter season (DJF) is identified as the dry-monsoon (DMON), spring (MAM) as pre-rain onset (PRONS), summer (JJA) as rain onset (RONS), and autumn (SON) as the rain-offset season (ROFFS).

Data source

The temperature and salinity data utilized in this research are drawn from the high-resolution 3-D GLORYS12V1 products, with a spatial resolution of 1/12 degrees (~ 9 km), spanning the period from 1993 to 2020. These products originate from the Nucleus for European Modeling of the Ocean (NEMO) general circulation model⁴⁰, incorporating surface boundary conditions from the European Centre for Medium-Range Weather Forecasts (ECMWF) atmospheric reanalysis and forecasts. Through the assimilation of near-real-time observations, the NEMO model offers accurate estimates of the oceanic state in the GoG. This dataset has undergone rigorous validation against in situ observations and other sea surface temperature (SST) and salinity products, demonstrating robust consistency with independent data sources (e.g.,^41,42,43). Its applicability spans oceanographic and climate research domains (e.g.,^44,45,46,47). Accessible from the Copernicus Marine Environment Monitoring Services CMEMS data archive, the dataset covers the temporal span from 1993 to the present.

For sea level data, the monthly gridded sea surface height, hereafter referred to as MSLA, was acquired for the GoG from CMEMS. This dataset amalgamates observations from diverse altimetry missions, resulting in a consistent and unbiased dataset characterized by a 1/4° horizontal and vertical spatial resolution⁴⁸. Necessary geophysical corrections, including tidal corrections using the Finite Element Solution 2014 (FES2014) ocean tide model⁴⁹, were performed by the Data Unification and Altimeter Combination System (DUACS) to produce the dataset. Additionally, the dataset underwent further refinement for glacial isostatic adjustment (GIA) using the ICE5G-VM2 GIA model⁵⁰ to isolate oceanographic phenomena. Widely employed by researchers in investigating sea level variability, ocean dynamics, and coastal processes (e.g.,^24,51,52), the data was accessed from the CMEMS archive at http://marine.copernicus.eu/. Furthermore, supplementary datasets encompassing air temperature, u wind (10 m), v wind (10 m), total precipitation, evaporation, Atmospheric pressure, net shortwave radiation, net longwave radiation, surface latent heat flux, and sensible heat flux at a single pressure level were sourced from the European Centre for Medium-range Weather Forecasts⁵³ reanalysis era5 data, accessible at http://cds.climate.copernicus.eu/.

Parameterization

TSLA and HSLA were computed using the Thermodynamic Equation Of Seawater 2010 (TEOS-10), which comprises a set of standardized equations for determining the thermodynamic properties of seawater. TSLA and HSLA account for the individual impact of temperature and salinity, respectively, which can cause expansion or contraction of sea level depending on their respective values at a specific location and time. Their combined impact forms the steric sea level anomaly (SSLA), which measures how changes in water density affect the sea level. The steric sea level from the surface up to 1000 m is computed following^54,55):

$$h={h}_{T}+ {h}_{S} = {\int }_{-1000}^{0}\alpha \left(T- {T}_{o}\right)dz+{\int }_{-1000}^{0}\beta \left(S- {S}_{o}\right)dz$$

(1)

where h is the total steric sea level height, ${h}_{T}$ and ${h}_{S}$ are the thermohaline and halosteric components, respectively. T and S represent the temperature and salinity at each grid point, while ${T}_{o}$ and ${S}_{o}$ denote the reference temperature and salinity. α and β are the thermal expansion and haline contraction coefficients, respectively, calculated from the temperature and salinity using the (TEOS-10) equation.

Similarly, wind speed and wind stress were calculated from ERA5 zonal (u) and meridional (v) wind components data using the following equations:

$$G= {({u}^{2}+ {v}^{2})}^{0.5}$$

(2)

$$K={({\tau }_{x}+ {\tau }_{y})}^{0.5}$$

(3)

here u and v represent the u and v components of wind, G is the wind speed, K is the wind stress, and ${\tau }_{x}$ and ${\tau }_{y}$ are defined as ${\rho }_{air}cd \times G\times u$ and ${\rho }_{air}cd \times G\times v$, respectively, which are the wind stress of the u and v components. ${\rho }_{air}$ represents air density, and cd is the drag coefficient. The curl of the wind stress is computed as follows:

$${curl}_{z }K= \frac{{\partial \tau }_{y}}{\partial x}- \frac{{\partial \tau }_{x}}{\partial y}$$

(4)

where ${\tau }_{x}$ is the zonal wind stress component, ${\tau }_{y}$ is the meridional wind stress component, and x and y represent eastward and northward coordinates, respectively. Also, the ocean heat content (OHC) within 1000 m along the GoG is calculated following⁵⁶:

$$OHC= \rho {C}_{p}{\int }_{-1000}^{0}\left[T(z)\right]dz$$

(5)

where ρ is the seawater density calculated from temperature and salinity at each grid point following⁵⁷, ${C}_{p}$ is the specific heat capacity of seawater (4178 J kg⁻¹ °C⁻¹), and T(z) is the temperature (℃) at each grid point. Finally, the surface net heat flux (NHF) is estimated following⁵⁸:

$${\text{NHF}} = {\text{SWR}} + {\text{LWR}} + {\text{LHF}} + {\text{SHF}}$$

(6)

where the respective components of Eq. 6 are defined as follows: net shortwave radiation (SWR), net long-wave radiation (LWR), surface latent heat flux (LHF), and sensible heat flux (SHF).

Procedure

The spatial decadal linear trends of MSLA are calculated at each grid point by taking a ten-year average of the annual trends in the study domain using regression coefficients estimated by the ordinary least squares method. The significance of these trends is tested using a non-parametric Mann–Kendall (MK) trend test with a 99% confidence level^59,60. Thereafter, a non-parametric Theil-Sen’s slope estimator^61,62 was employed to estimate the magnitude and direction of trends in the time-series data. To isolate the interannual variability from the datasets, monthly mean values were extracted across the entire longitude and latitude. The climatological mean was then removed from the extracted time series data before they were detrended and filtered using a low-pass filter with a 13-month running mean. This approach considers that detrending isolates interannual variation in climate variables⁶³. A similar approach was employed to isolate the interannual spatial variability from the MSLA at each grid point. Subsequently, Empirical Orthogonal Function (EOF) analysis was performed. EOF analyses are often used for dimensionality reduction and the extraction of dominant spatial patterns of climate variability and how they change with time⁶⁴. This mathematical technique decomposes datasets into a set of orthogonal patterns (EOFs) and their corresponding time series Principal Components (PCs). Each EOF represents a spatial pattern of variability, and the PCs indicate how these patterns vary over time. In our study, EOF analysis was employed to gain insights into the spatial patterns and temporal variations of interannual MSLA and its connection with large climate and oceanic phenomena. Meanwhile, the spatial seasonal signal was extracted from the undetrended MSLA by calculating the mean value of sea level for each month at each grid point. Additionally, the linear seasonal signal was extracted from the undetrended MSLA and its forcings by calculating the mean value of the basin-average sea level for each month. Five different ANN and MLT models, namely MLPR, MLR, RFR, GBR, and LSTM, were developed to determine the best-performing model for sea level predictions in the GoG. The description of each model is provided below.

Model description

Multi-layer perceptron regression (MLPR)

MLPR is an ANN model specifically designed for regression tasks, It utilizes an FNN network model to predict continuous numerical values rather than discrete categories. The network comprises multiple hidden layers of neurons that apply nonlinear transformations to the input data, enabling the model to learn intricate patterns and relationships between input features and the target output. Its effectiveness in sea level prediction has been well-documented in previous studies^21,65. The MLPRegressor model architecture, depicted in Fig. 10, illustrates the key components, including the input layer, hidden layers, and output layer. Neurons are the fundamental building blocks of ANN that process information and facilitate the network's ability to learn complex patterns and make predictions based on input data. Each neuron applies an activation function, such as the rectified linear unit (ReLU), to the weighted sum of its inputs. This helps address the vanishing and exploding gradient problem⁶⁶ and is defined by the function in Eq. (7)

$$\mathrm{ReLU}=\mathrm{ max}\left(0,\mathrm{ x}\right)$$

(7)

where x represents the weighted sum of inputs to the neuron. The weighted sum of inputs to a neuron (Z), which captures relevant features and introduces non-linearity, is calculated by multiplying the input values with their corresponding weights and summing them up, along with the bias term, as represented by Eq. (8).

$$Z = (W_{i} .X_{i} ) + b$$

(8)

where ${W}_{i}$ represents the weights, ${X}_{i}$ represents the input values, and 'b' is the bias term. Feedforward is a fundamental concept in neural networks that defines the process of propagating input data to the output through the network's layers without any feedback connections. This sequential flow of information in a single direction allows the network to make predictions based solely on the provided input data. The feedforward process is aided by a series of layers of neurons, with the activation of each neuron determined by Eq. (9):

$${A}_{i}= \mathrm{ReLU}\left({Z}_{i}\right)$$

(9)

where ${A}_{i}$ represents the activation of neuron i, and ${Z}_{i}$ represents the weighted sum of inputs to neuron i. The output of the model is calculated based on the activations of the neurons in the output layer i.e. the weighted sum of activations without applying any additional activation function and it expresses as:

$$Y= \sum \left({W}_{i} . {A}_{i}\right)$$

(10)

where Y represents the predicted output, ${W}_{i}$ represents the weights connecting the neurons, and ${A}_{i}$ represents the activations of the neurons in the output layer.

Multiple linear regression (MLR)

MLR is a statistical technique that uses a linear equation to establish the relationship between multiple independent variables (predictors) and a dependent variable (response). It determines the best-fitting line between the dependent variable and the independent variables by employing a least squares technique to estimate the independent variables for predicting the dependent variable, assuming a linear relationship between the variables. The application of MLR in modeling and prediction tasks in the fields of oceanography and climate science has been extensively documented^67,68. However, MLR has some limitations, such as the linearity assumption, independence assumption, multicollinearity, sensitivity to outliers, and limited handling of non-normality^69,70,71. The MLR model equation can be represented as follows:

$$Y = C + \beta_{1} x_{1} + \beta_{2} x_{2} + \beta_{3} x_{3} + \ldots \beta_{n} x_{n} + \varepsilon$$

(11)

where Y represents the dependent variable (response variable), C represents the intercept (constant term), ${\beta }_{1}$, ${\beta }_{2}$, …, ${\beta }_{n}$ represent the regression coefficients associated with the independent variables ${x}_{1}$, ${x}_{2}$, …, ${x}_{n}$ and ɛ represents the error term, accounting for unexplained variation.

Random forrest regression (RFR)

RFR is an ensemble machine learning technique that addresses the limitations of individual decision trees by combining them through bagging. This results in a robust and accurate regression model. Figure 11 illustrates the architecture of RFR with multiple decision trees. The structural arrangement of the trees provides insights into the internal workings of the RFR model, enabling a better understanding and interpretation of its predictions. Unlike other machine learning models, each RFR decision tree in the ensemble operates independently and does not have explicit equations associated with its components. However, the RFR architecture involves three key steps: bootstrap sampling, feature subset selection, and the final prediction. Bootstrap sampling is used to create subsets of training data. Then, feature subset selection is performed to reduce the correlation between the trees, and the final prediction is obtained by aggregating the individual predictions of the decision trees, usually by taking the mean or median. This approach introduces randomness through bootstrapping and feature subset selection, effectively reducing overfitting and improving robustness. RFR is highly effective in handling noise, capturing complex relationships, and generating accurate predictions by leveraging the collective power of multiple trees⁷². Furthermore, the application of RFR for sea level prediction has been extensively documented in the literature^73,74,75.

Gradient boosting machine (GBM)

GBM, a machine learning algorithm leveraging an ensemble method called boosting, has gained significant popularity for its effectiveness in regression tasks, handling complex non-linear relationships and interactions between variables. Introduced by⁷⁶ and further elaborated in 2001, GBM generates a robust predictive model by combining multiple weaker models. The fundamental principle involves iteratively training a sequence of models, typically decision trees, to refine overall prediction accuracy by rectifying errors made by previous models. This iterative process ensures subsequent models focus on areas where previous ones underperformed, resulting in a powerful and accurate predictive model. GBM demonstrates robustness in handling noisy data, outliers, and missing values. Figure 12 illustrates the GBM model architecture, showcasing core concepts of gradient boosting and its sequential nature. It integrates multiple base learners to form a strong ensemble prediction, depicting the underlying mechanism. The GBM architecture consists of input data, multiple base learners capturing different patterns and relationships, additive outputs from each base learner, and the final prediction from the combined additive outputs. Weighted edges (w1, w2, and w3) connect nodes, determining each base learner's contribution to the additive outputs and the ultimate prediction. The base learner represented as a function ${F}_{i}\left(x\right)$, is denoted by index i and input data x. Thus, the base learner equation is expressed as:

$${F}_{i}\left(x\right)= {f}_{i}\left(x\right)$$

(12)

Here, ${f}_{i}\left(x\right)$ represents the prediction of the i-th base learner. The equation for the additive output of the i-th base learner (${H}_{i}\left(x\right)$) can be represented as:

$${H}_{i}\left(x\right)= {H}_{i-1}\left(x\right)+\upeta . {F}_{i}\left(x\right)$$

(13)

where ${H}_{i-1}\left(x\right)$ represents the cumulative additive output of the previous base learners, η is the learning rate controlling the contribution of each base learner, and ${F}_{i}\left(x\right)$ is the prediction of the i-th base learner. The final prediction ($Y\left(x\right)$) is obtained by summing the additive outputs of all the base learners:

$$Y\left(x\right)= \sum {H}_{i}\left(x\right)$$

(14)

This architecture demonstrates the iterative nature of gradient boosting, where each base learner improves upon its predecessors by focusing on residual errors, allowing gradual learning and adaptation to the data. GBM's flexibility through customizable hyperparameters, like the number of trees, learning rate, and tree depth, enables model performance optimization and addresses specific data-driven tasks⁷⁷. GBM has proven effective in predicting and modeling sea states^78,79.

Long short-term memory (LSTM)

RNNs are neural networks specifically designed to handle sequential data through recurrent connections. However, they are limited in capturing long-term dependencies due to the vanishing gradient problem, which can result in the loss of information over time. LSTM was introduced by⁸⁰ as an improvement over traditional RNNs. It addresses the limitation of capturing long-term dependencies in sequential data by introducing specialized memory cells that allow information to persist over time. LSTM networks, depicted in Fig. 13, consist of multiple LSTM cells denoted as LSTM 0 and LSTM 1. These cells serve as the memory units of the network, capturing and storing relevant information from the input data. Each LSTM cell has three gates: the input gate, the forget gate, and the output gate. These gates regulate the flow of information within the cell, controlling the input, forgetting, and output of information, respectively. The input gate determines the incorporation of new information into the current cell state, the forget gate decides which information from the previous cell state should be discarded, and the output gate determines the amount of information passed to the next cell or the output layer. The connections between the components indicate the flow of information. The input is fed into LSTM 0, which then passes information to the input, forget, and output gates of LSTM. Finally, the output is generated from LSTM 1 and sent to the output layer. Equations (15–20) provide further details of each of the LSTM components

$${i}_{t}= \sigma \left({W}_{i} . \left[{h}_{t-1}, {x}_{t}\right]+{b}_{i}\right)$$

(15)

$${f}_{t}= \sigma \left({W}_{f} . \left[{h}_{t-1}, {x}_{t}\right]+{b}_{f}\right)$$

(16)

$${o}_{t}= \sigma \left({W}_{o} . \left[{h}_{t-1}, {x}_{t}\right]+{b}_{o}\right)$$

(17)

$$\tilde{C}_{t} = \tanh \left( {W_{c} . \left[ {h_{t - 1} , x_{t} } \right] + b_{c} } \right)$$

(18)

$$C_{t} = f_{t} * C_{t - 1} + i_{t} . {\tilde{\text{C}}}_{t}$$

(19)

$${\mathrm{h}}_{t}= {\mathrm{o}}_{t}*\mathrm{tanh}\left({C}_{t}\right)$$

(20)

where ${i}_{t}$,${f}_{t}$ ${o}_{t}$, ${\tilde{\text{C}}}_{t}$, ${C}_{t}$, and ${\mathrm{h}}_{t}$ represent the input gate, forget gate, output gate, candidate cell state, cell state, and hidden state, respectively. Similarly,$(\left[{h}_{t-1}, {x}_{t}\right])$ represents the concatenation of the previous hidden state and current input, ${C}_{t-1}$ represents the previous cell state, (${W}_{i},{W}_{f},{W}_{o},{W}_{c}$), represents the weight matrices associated with the input gate, forget gate, output gate, and candidate cell state, respectively, (${b}_{i},{b}_{f},{b}_{o},{b}_{c}$) represents the bias terms associated with the input gate, forget gate, output gate, and candidate cell state, respectively, and ($\sigma , tanh$) represents the sigmoid and hyperbolic tangent activation functions, respectively. The interconnectedness of these equations enables the LSTM to capture and store relevant information over time, addressing the vanishing gradient problem in traditional RNNs. LSTM networks have proven successful in time series data applications, such as sea state and sea level modeling and prediction^81,82.

Model training

In the present study, the models were trained using the filtered and detrended 13-month running mean of marine meteorological and hydrological data. These datasets comprised variables such as TSLA, HSLA, OHC, WSC, NHF, precipitation, evaporation, freshwater runoff, and atmospheric pressure. Two methods were employed: (1) the data was split into an 80:20 ratio, with 80% and 20% used for training and testing, respectively—considered a best practice in machine learning; (2) the entire dataset was used for both training and testing. The former follows best practices in machine learning models, while the latter, prone to overfitting, was performed solely to depict the temporal extent of the dataset. Grid Search hyperparameter optimization, a technique for optimizing model performance and reducing the risk of overfitting by systematically exploring different hyperparameter combinations, was employed to find the optimal combination of hyperparameters for the models. The models were then instantiated, and the training data was fed into them to capture the underlying patterns and relationships between the variables.

Model performance evaluation

Moving forward, we present the model prediction performance evaluation metrics used in the present work to assess the reliability and accuracy of the models in predicting the MSLA in the GoG. The coefficient of determination (${R}^{2}$), a metric that assesses the proportion of the total variance in the observed data that can be explained by the model predictions, and the root mean square error (RMSE), a metric for assessing the model's predictive skill, were employed. As a standard model evaluation approach, the R² value ranges between 0 and 1. A value of 0 indicates that the model does not explain any variability in the data, indicating poor performance. Conversely, a value of 1 signifies that the model perfectly explains all the variability in the data, indicating better performance. However, a lower RMSE value corresponds to higher prediction accuracy and better model performance. Conversely, a higher RMSE value suggests lower accuracy and poorer model performance. The two evaluation metrics are expressed as follows:

$$R^{2} = 1 - \frac{{\mathop \sum \nolimits_{i = 1}^{N} \left( {Y_{i} - X_{i} } \right)^{2} }}{{\mathop \sum \nolimits_{i = 1}^{N} \left( {Y_{i} - {\tilde{\text{Y}}}} \right)^{2} }}$$

(21)

$$RMSE= \sqrt{\frac{1}{N}\sum_{n=1}^{N}{\left({X}_{i}-{Y}_{i}\right)}^{2}}$$

(22)

where N is the number of data points in the sample, ${Y}_{i}$ represents the observed values of the dependent variable, ${X}_{i}$ represents the predicted values of the dependent variable based on the regression model, and ${\tilde{\text{Y}}}$ represents the mean of the observed values of the dependent variable.

Data availability

The datasets analyzed during the current study are: sea surface height (SSH) hereafter referred to as mean sea level anomaly (MSLA), sea surface temperature (SST), sea surface salinity (SSS) data provided by the Copernicus Marine Environment Management Services (CMEMS), available at http://marine.copernicus.eu/product. Additionally, air temperature, u wind (10 m), v wind (10 m), Total precipitation, Evaporation, Atmospheric pressure, Net Shortwave Radiation , Net Longwave Radiation, Surface Latent Heat Flux, and Sensible Heat Flux at a single pressure level provided by the European Centre for Medium-range Weather Forecasts (ECMWF, 2011) reanalysis era5 data, available at http://cds.climate.copernicus.eu/.

References

IPCC. In: Field, C. B., Barros, V. R., Dokken, D. J., Mach, K. J., Mastrandrea, M. D., Bilir, T. E., Chatterjee, M., Ebi, K. L., Estrada, Y. O., Genova, R. C., Girma, B., Kissel, E. S., Levy, A. N., MacCracken, S., Mastrandrea, P. R., & White, L. L. (Eds.), Climate Change 2014: Impacts, Adaptation, and Vulnerability. Part A: Global and Sectoral Aspects. Contribution of Working Group II to the Fifth Assessment Report of the Intergovernmental Panel on Climate Change. Cambridge University Press, Cambridge, United Kingdom and New York, NY, USA, pp. 1132. (2014).
Hamlington, B. D. et al. Understanding of contemporaryregional sea-level change and theimplications for the future. Rev. Geophys. 58, e2019RG0006. https://doi.org/10.1029/2019RG00067 (2020).
Article Google Scholar
Belmont, M. R. et al. An examination of the feasibility of linear deterministic sea wave prediction in multidirectional seas using wave profiling radar: Theory, simulation, and sea trials. J. Atmos. Oceanic Technol. 31, 1601–1614. https://doi.org/10.1175/JTECH-D-13-00170.1 (2014).
Article ADS Google Scholar
Klein, M. et al. On the deterministic prediction of water waves. Fluids 5(1), 9. https://doi.org/10.3390/fluids5010009 (2020).
Article ADS Google Scholar
Marco, B., Christian, F., Georg, U., Andrea, B. & Elisa, C. Modelling the barotropic sea level in the Mediterranean Sea using data assimilation. Ocean Sci. 19, 559–579. https://doi.org/10.5194/os-19-559-2023 (2023).
Article Google Scholar
O’Donncha, F., Zhang, Y., Chen, B. & James, S. C. Ensemble model aggregation using a computationally lightweight machine-learning model to forecast ocean waves. J. Mar. Syst. 199, 103206 (2019).
Article Google Scholar
Ayyad, M., Hajj, M. R. & Marsooli, R. Machine learning-based assessment of storm surge in the New York metropolitan area. Sci. Rep. 12, 19215. https://doi.org/10.1038/s41598-022-23627-6 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
French, J., Mawdsley, R., Fujiyama, T. & Achuthan, K. Combining machine learning with computational hydrodynamics for prediction of tidal surge inundation at estuarine ports. Procedia IUTAM 25, 28–35. https://doi.org/10.1016/j.piutam.2017.09.005 (2017).
Article Google Scholar
Han, M. et al. A convolutional neural network using surface data to predict subsurface temperatures in the Pacific Ocean. IEEE Access 7, 172816–172829. https://doi.org/10.1109/ACCESS.2019.2955957 (2019).
Article Google Scholar
Fang, W., Sha, Y. & Sheng, V. S. Survey on the application of artificial intelligence in ENSO forecasting. Mathematics 10(20), 3793. https://doi.org/10.3390/math10203793 (2022).
Article Google Scholar
Karpathy, A., Toderici, G., Shetty, S., Leung, T., Sukthankar, R., & Fei-Fei, L. Large-Scale video classification with convolutional neural networks. In 2014 IEEE Conference on Computer Vision and Pattern Recognition, 1725–1732. https://doi.org/10.1109/CVPR.2014.223 (2014).
Shelhamer, E., Long, J. & Darrell, T. Fully convolutional networks for semantic segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 39, 640–651. https://doi.org/10.1109/TPAMI.2016.2572683 (2017).
Article PubMed Google Scholar
Kohler, M., & Langer, S. Statistical theory for image classification using deep convolutional neural networks with cross-entropy loss. arXiv. https://doi.org/10.48550/arXiv.2011.13602 (2020).
Graves, A., Mohamed, A. R., & Hinton, G. Speech recognition with deep recurrent neural networks. In 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 6645–6649. IEEE. https://doi.org/10.1109/ICASSP.2013.6638947 (2013).
Lipton, Z. C., Berkowitz, J., & Elkan, C. A critical review of recurrent neural networks for sequence learning. arXiv preprint arXiv:1506.00019. https://doi.org/10.48550/arXiv.1506.00019 (2015).
Bruneau, N., Polton, J., Williams, J. & Holt, J. Estimation of global coastal sea level extremes using neural network. Environ. Res. Lett. 15, 074030. https://doi.org/10.1088/1748-9326/ab89d7 (2020).
Article ADS Google Scholar
Tur, R., Tas, E., Haghighi, A. T. & Mehr, A. D. Sea level prediction using machine learning. Water 13, 3566. https://doi.org/10.3390/w13243566 (2021).
Article Google Scholar
Guillou, N. & Chapalain, G. Machine learning methods applied to sea level predictions in the upper part of a tidal estuary. Oceanologia 63, 531–544. https://doi.org/10.1016/j.oceano.2021.07.003 (2021).
Article Google Scholar
Stanev, E. V., Le Traon, P. Y. & Peneva, E. L. Sea level variations and their dependency on meteorological and hydrological forcing: Analysis of altimeter and surface data for the black sea. J. Geophys. Res. Ocean 105(C7), 17203–17216 (2000).
Article ADS Google Scholar
Zubier, K. M. & Lina, S. E. Investigating the role of atmospheric variables on sea level variations in the eastern central red sea using an artificial neural network approach. Oceanologia 62(3), 267–290. https://doi.org/10.1016/j.oceano.2020.02.002 (2020).
Article Google Scholar
Shaila, A. et al. Seasonal & long-term sea-level variations & their forcing factors in the northern Bay of Bengal: A statistical analysis of temperature, salinity, wind stress curl, & regional climate index data. Dyn. Atmos. Oceans 95, 101239. https://doi.org/10.1016/j.dynatmoce.2021.101239 (2021).
Article Google Scholar
Timmermann, A., McGregor, S. & Jin, F. F. Wind effects on past & future regional sea level trends in the southern Indo-Pacific. J. Clim. 23, 4429–4437. https://doi.org/10.1175/2010JCLI3519.1 (2010).
Article ADS Google Scholar
Philander, S. G. Upwelling in the Gulf of Guinea. J. Mar. Res. 37(1), 1–22 (1979).
Google Scholar
Wiafe, G. & Nyadjro, E. S. Satellite observations of upwelling in the Gulf of Guinea. IEEE Geosci. Remote Sens. Lett. 12(5), 1066–1070. https://doi.org/10.1109/LGRS.2014.2379474 (2015).
Article ADS Google Scholar
Wang, B. Kelvin Wave. University of Hawaii, Honolulu, HI, USA. https://doi.org/10.1006/rwas.2002.0191.
Buckley, M. W. & Marshall, J. Observations, inferences, and mechanisms of the Atlantic Meridional Overturning Circulation: A review. Rev. Geophys. 54(1), 5–63. https://doi.org/10.1002/2015RG000493 (2016).
Article ADS Google Scholar
Suresh, I. et al. Sea level interannual variability along the west coast of India. Geophys. Res. Lett. 45, 12440–12448. https://doi.org/10.1029/2018GL080972 (2018).
Article ADS Google Scholar
Hughes, C. W. et al. Sea level and the role of coastal trapped waves in mediating the influence of the open ocean on the coast. Surv. Geophys. 40, 1467–1492. https://doi.org/10.1007/s10712-019-09535-x (2019).
Article ADS Google Scholar
Fedoseev, A. Geostrophic circulation of surface waters on the shelf of north-west Africa. Rapp. P.-V. Reun. Cons. Int. Explor. Mer. 159, 32–37 (1970).
Google Scholar
Djakoure, S., Penven, P., Bourles, B., Kone, V. & Veitch, J. Respective roles of the guinea current and local winds on the coastal upwelling in the Northern Gulf of Guinea. J. Phys. Oceanogr. https://doi.org/10.1175/JPO-D-16-0126.1 (2017).
Article Google Scholar
Dorothee, B., Elizabeth R., Arthur J. M., & Edward H. R. The South Equatorial System Current. Ocean Surface Currents. Retrieved from https://oceancurrents.rsmas.miami.edu/atlantic/south-equatorial.html (2004).
Collins, M., Sutherland, M., Bouwer, L., Cheong, S. M., Frolicher, T., & Jacot Des Combes, H. IPCC Special Report on The Ocean and Cryosphere in a Changing Climate. Chapter 6: Extremes, Abrupt Changes and Managing Risks. Cambridge University Press.
Moat, B. I. et al. Pending recovery in the strength of the meridional overturning circulation at 26°N. Ocean Sci. 16, 863–874. https://doi.org/10.5194/os-16-863-2020 (2020).
Article ADS Google Scholar
Jackson, L. C. et al. Understanding AMOC stability: The North Atlantic Hosing Model Intercomparison Project. Geosci. Model Dev. 16, 1975–1995. https://doi.org/10.5194/gmd-16-1975-2023 (2023).
Article ADS Google Scholar
Evadzi, P. I. K., Zorita, E. & Hünicke, B. West African sea level variability under a changing climate - what can we learn from the observational period?. J. Coastal Conserv. 23(4), 759–771 (2019).
Article Google Scholar
Longhurst, A. RA review of the oceanography of the Gulf of Guinea. Bulletin de l’Institut Fondamental d’Afrique Noire 24, 633–663 (1962).
Google Scholar
Nicholson, S. E. The West African Sahel: A review of recent studies on the rainfall regime & its interannual variability. ISRN Meteorol. 2013, 1–32. https://doi.org/10.1155/2013/453521 (2013).
Article Google Scholar
Okoloye, C., Aisiokuebo, N., Ukeje, J., Anuforom, A. & Nnodu, I. Rainfall variability and the recent climate extremes in Nigeria. J. Meteorol. Climatol. Sci. 11(1), 49–57 (2014).
Google Scholar
Benjamin, S. & Janicot, S. The West African monsoon dynamics, Part II: The “pre-onset” and the “onset” of the summer monsoon. J. Clim. 16, 3407–3427. https://doi.org/10.1175/1520-0442(2003)016%3c3407:TWAMDP%3e2.0.CO;2 (2003).
Article ADS Google Scholar
Madec, G., & the NEMO team. NEMO ocean engine. Note du Pôle de modélisation, Institut Pierre-Simon Laplace (IPSL), France, No 27, ISSN No 1288–1619 (2008).
Guinehut, S., Dhomps, A. L., Larnicol, G. & Le Traon, P. Y. High-resolution 3-D temperature and salinity fields derived from in situ and satellite observations. Ocean Sci. 8, 845–857. https://doi.org/10.5194/os-8-845-2012 (2012).
Article ADS Google Scholar
Jean-Michel, L. et al. Global 1/12° oceanic and sea ice GLORYS12 reanalysis. Front. Earth Sci. 9, 698876. https://doi.org/10.3389/feart.2021.698876 (2021).
Article Google Scholar
Gasparin, F. et al. On the control of spatial and temporal oceanic scales by existing and future observing systems: An observing system simulation experiment approach. Front. Mar. Sci. https://doi.org/10.3389/fmars.2023.1021650 (2023).
Article Google Scholar
Xie, S.-P., Kosaka, Y., Du, Y. & Hu, K. Indo-western Pacific ocean capacitor and coherent climate anomalies in post-ENSO summer: A review. Adv. Atmos. Sci. 33(4), 411–432. https://doi.org/10.1007/s00376-015-5192-6 (2016).
Article Google Scholar
Cai, C., Kwon, Y. O., Chen, Z. & Fratantoni, P. Mixed layer depth climatology over the northeast U.S. continental shelf (1993–2018). Continental Shelf Res. https://doi.org/10.1016/j.csr.2021.104611 (2021).
Article Google Scholar
Karnauskas, K. B. Whither warming in the Galápagos?. PLOS Clim. 1(9), e0000056. https://doi.org/10.1371/journal.pclm.0000056 (2022).
Article Google Scholar
Mondal, S., Lee, M. A., Chen, Y. K. & Wang, Y. C. Ensemble modeling of black pomfret (Parastromateus niger) habitat in the Taiwan Strait based on oceanographic variables. PeerJ https://doi.org/10.7717/peerj.14990 (2023).
Article PubMed PubMed Central Google Scholar
Legeais, J. et al. Copernicus Sea Level space observations: A basis for assessing mitigation and developing adaptation strategies to Sea level rise. Front. Mar. Sci. https://doi.org/10.3389/fmars.2021.704721 (2021).
Article Google Scholar
Taburet, G. et al. DUACS DT2018: 25 years of reprocessed sea level altimetry products. Ocean Sci. 15, 1207–1224. https://doi.org/10.5194/os-15-1207-2019 (2019).
Article ADS Google Scholar
Peltier, W. R. Global glacial isostasy & the surface of the ice-age Earth: the ICE-5G (VM2) model & GRACE. Annu. Rev. Earth Planet. Sci. 32, 111–149. https://doi.org/10.1146/annurev.earth.32.082503.144359 (2004).
Article ADS CAS Google Scholar
Bayoumy, M. & Nikolaos, S. Steric and atmospheric contributions to interannual sea level variability in the eastern Mediterranean Sea over 1993–2019. Oceanologia 64, 50–62. https://doi.org/10.1016/j.oceano.2021.09.001 (2022).
Article Google Scholar
Lee, K., Nam, S., Cho, Y. K., Jeong, K. Y. & Byun, D. S. Determination of long-term (1993–2019) sea level rise trends around the korean peninsula using ocean tide-corrected, multi-mission satellite altimetry data. Front. Mar. Sci. 9, 810549. https://doi.org/10.3389/fmars.2022.810549 (2022).
Article ADS Google Scholar
European Centre for Medium-range Weather Forecast (ECMWF). The ERA-Interim Reanalysis Dataset, Copernicus Climate Change Service (C3S) . Retrieved from https://www.ecmwf.int/en/forecasts/datasets/archive-datasets/reanalysis-datasets/era-interim (2011).
Antonov, J. I., Levitus, S. & Boyer, T. P. Steric sea level variations during 1957–1994. The importance of salinity. J. Geophys. Res. https://doi.org/10.1029/2001JC000964 (2002).
Article Google Scholar
MacIntosh, C. R., Merchant, C. J. & von Schuckmann, K. Uncertainties in steric sea level change estimation during the satellite altimeter era: Concepts and practices. Surv. Geophys. 38, 59–87. https://doi.org/10.1007/s10712-016-9387-x (2017).
Article ADS CAS PubMed Google Scholar
Levitus, S. et al. World ocean heat content and thermosteric sea level change (0–2000m), 1955–2010. Geophys. Res. Lett. https://doi.org/10.1029/2012GL051106 (2012).
Article Google Scholar
Roquet, F., Madec, G., McDougall, T. J. & Barker, P. M. Accurate polynomial expressions for seawater density using the TEOS-10 standard. Ocean Model. 90, 29–43. https://doi.org/10.1016/j.ocemod.2015.04.002 (2015).
Article ADS Google Scholar
Tomita, H., Kutsuwada, K., Kubota, M. & Hihara, T. Advances in the estimation of global surface net heat flux based on satellite observation: J-OFURO3 V11. Front. Mar. Sci. 8, 612361. https://doi.org/10.3389/fmars.2021.612361 (2021).
Article Google Scholar
Mann, H. B. Nonparametric tests against trend. Econom. J. Econom. Soc. 13, 245–259 (1945).
MathSciNet MATH Google Scholar
Kendall, M. G. Rank Correlation Methods (Griffin, 1975).
Google Scholar
Theil, H. A rank-invariant method of linear & polynomial regression analysis. Proc. R. Netherlands Acad. Arts Sci. 53, 386–392 (1950).
MathSciNet MATH Google Scholar
Sen, P. K. Estimates of the regression coefficient based on kendall’s tau. J. Am. Stat. Assoc. 63, 1379–1389. https://doi.org/10.1080/01621459.1968.10480934 (1968).
Article MathSciNet MATH Google Scholar
Iler, A. M., Inouye, D. W., Schmidt, N. M. & Høye, T. T. Detrending phenological time series improves climate–phenology analyses and reveals evidence of plasticity. Ecology 98, 647–655. https://doi.org/10.1002/ecy.1690 (2017).
Article PubMed Google Scholar
Lorenz, E. N. Empirical Orthogonal Functions and Statistical Weather Prediction. Statistical Forecasting Project Report No. 1, Department of Meteorology, Massachusetts Institute of Technology (1956).
Chi, Y. N. Time series modeling and forecasting of monthly mean sea level (1978–2020): SARIMA and multilayer perceptron neural network. Int. J. Data Sci. 3(1), 45–61. https://doi.org/10.18517/ijods.3.1.45-61.2022 (2022).
Article Google Scholar
Nair, V., & Hinton, G. E. Rectified linear units improve restricted boltzmann machines. In: Proceedings of the 27th International Conference on Machine Learning (ICML-10), 807 8014 (2010).
Kumar, V. et al. Reconstruction of local sea levels at South West Pacific Islands—A Multiple linear regression approach (1988–2014). J. Geophys. Res. Oceans 123, 1502–1518. https://doi.org/10.1002/2017JC013053 (2018).
Article ADS Google Scholar
Mohammad, P. & Goswami, A. A. Spatio-temporal assessment and prediction of surface urban heat island intensity using multiple linear regression techniques over Ahmedabad City, Gujarat. J. Indian Soc. Remote Sens. 49, 1091–1108. https://doi.org/10.1007/s12524-020-01299-x (2021).
Article Google Scholar
Moran, P. A. P. Notes on continuous stochastic phenomena. Biometrika 37(1/2), 17–23 (1950).
Article MathSciNet CAS PubMed MATH Google Scholar
Fox, J. Regression Diagnostics: An Introduction (Sage Publications, 1991).
Book Google Scholar
Hair, J. F., Black, W. C., Babin, B. J. & Anderson, R. E. Multivariate Data Analysis (Pearson, 2010).
Google Scholar
Breiman, L. Random forests. Mach. Learn. 45, 5–32. https://doi.org/10.1023/A:1010933404324 (2001).
Article MATH Google Scholar
Hughes, M. G., Glasby, T. M., Hanslow, D. J., West, G. J. & Wen, L. Random forest classification method for predicting intertidal wetland migration under sea level rise. Front. Environ. Sci. 10, 749950. https://doi.org/10.3389/fenvs.2022.749950 (2022).
Article Google Scholar
Bellinghausen, K., Hünicke, B., & Zorita, E. Short-term prediction of extreme sea-level at the Baltic Sea coast by Random Forests. Natural Hazards and Earth System Sciences Discussions [preprint]. https://doi.org/10.5194/nhess-2023-21 (2023).
Passaro, M. & Juhl, M. C. On the potential of mapping sea level anomalies from satellite altimetry with Random Forest Regression. Ocean Dyn. 73, 107–116. https://doi.org/10.1007/s10236-023-01540-4 (2023).
Article ADS Google Scholar
Friedman, J. H. Greedy function approximation: A gradient boosting machine. Ann. Stat. 29, 1189–1232 (2001).
Article MathSciNet MATH Google Scholar
Natekin, A. & Knoll, A. Gradient boosting machines, a tutorial. Front. Neurorobot. https://doi.org/10.3389/fnbot.2013.00021 (2013).
Article PubMed PubMed Central Google Scholar
Den Bieman, J. P., Wilms, J. M., Van den Boogaard, H. F. P. & Van Gent, M. R. A. Prediction of mean wave overtopping discharge using gradient boosting decision trees. Water 12(6), 1703. https://doi.org/10.3390/w12061703 (2020).
Article Google Scholar
Den Bieman, J. P., Van Gent, M. R. A. & Van den Boogaard, H. F. P. Wave overtopping predictions using an advanced machine learning technique. Coastal Eng. 166, 103830. https://doi.org/10.1016/j.coastaleng.2020.103830 (2021).
Article Google Scholar
Hochreiter, S. & Schmidhuber, J. Long short-term memory. Neural Comput. 9(8), 1735–1780. https://doi.org/10.1162/neco.1997.9.8.1735 (1997).
Article CAS PubMed Google Scholar
Song, T. et al. Prediction of significant wave height based on EEMD & deep learning. Front. Mar. Sci. 10, 1089357. https://doi.org/10.3389/fmars.2023.1089357 (2023).
Article Google Scholar
Miao, Y., Zhang, X., Li, Y., Zhang, L. & Zhang, D. Monthly extended ocean predictions based on a convolutional neural network via the transfer learning method. Front. Marine Sci. 9, 1073377. https://doi.org/10.3389/fmars.2022.1073377 (2023).
Article Google Scholar

Download references

Author information

Authors and Affiliations

College of Oceanic and Atmospheric Sciences, Ocean University of China, Qingdao, 266100, China
Akeem Shola Ayinde, Huaming Yu & Kejian Wu
Physical Oceanography Laboratory, Ocean University of China, Qingdao, 266100, China
Akeem Shola Ayinde, Huaming Yu & Kejian Wu
Department of Marine Meteorology and Climate, Nigerian Institute for Oceanography and Marine Research, PMB 12729, Victoria Island, Lagos, Nigeria
Akeem Shola Ayinde

Authors

Akeem Shola Ayinde
View author publications
You can also search for this author in PubMed Google Scholar
Huaming Yu
View author publications
You can also search for this author in PubMed Google Scholar
Kejian Wu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.Y. design the concept of the manuscript and write the introduction K.W. downloaded the data and write part of the methodology A.S harmonizes the manuscript by providing codes for data analysis, running the model, write the results and conclusion of the manuscript.

Corresponding authors

Correspondence to Akeem Shola Ayinde or Huaming Yu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ayinde, A.S., Yu, H. & Wu, K. Sea level variability and modeling in the Gulf of Guinea using supervised machine learning. Sci Rep 13, 21318 (2023). https://doi.org/10.1038/s41598-023-48624-1

Download citation

Received: 09 August 2023
Accepted: 28 November 2023
Published: 03 December 2023
DOI: https://doi.org/10.1038/s41598-023-48624-1

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.