Measurements and modeling of water levels, currents, density, and wave climate on a semi-enclosed tidal bay, Cádiz (southwest Spain)

. Estuarine dynamics are highly complex as a result of the temperature and salinity gradients, as well as the multiple interactions between atmospheric, maritime, and hydrological forcing agents. Given the environmental and socioeconomic importance of estuaries and their current and future threats due to human interventions and climate change, it is of vital importance to characterize these dynamics, monitor their evolution, and quantify the expected impacts derived from climate change. This paper presents a hybrid database combining data obtained in six ﬁeld surveys (in 2012, 2013, and 2015) and results from a physically based 3D numerical model for the Bay of Cádiz (southern Spain), a highly anthropized mesotidal estuary. The 3D dataset includes water levels, currents, density, and wave climate, allowing for an analysis of bay dynamics at different timescales ranging from intratidal processes to seasonal variabilities. The results offer an example of the potential uses of the dataset and include (1) an assessment of the spatial and seasonal variability of the estuarine dynamics and (2) an analysis of the effects of severe weather events. These


Introduction
An estuary is a partially enclosed coastal water body where freshwater from rivers and streams mixes with saltwater from the ocean (Alahmed et al., 2022).Although influenced by tides, estuaries are frequently protected from the full force of ocean waves, winds, and storms (Eryani and Nurhamidah, 2020).The sheltered waters of estuaries also support freshwater and saltwater marshes, swamps, sandy beaches, mud and sand flats, rocky shores, oyster reefs, mangrove forests, river deltas, tidal pools, and seagrass beds, which increase the diversity of the environment (Hopkinson et al., 2019;Hobohm et al., 2021).Moreover, wetland plants and soils also act as natural buffers between the land and ocean, absorbing flooded waters and dissipating storm surges (Asari et al., 2021).The protected coastal waters of estuaries also support important public infrastructure, serving as harbors and ports vital for shipping and transportation, which must Published by Copernicus Publications.
be carefully managed to ensure the sustainability and protection of this natural resource (Cisneros-Montemayor et al., 2021).
The complexity and the environmental and socioeconomic importance of the estuaries described above highlight the need for a detailed characterization of the dynamics of these coastal systems.This information will be even more important in upcoming years and decades considering the expected impacts of sea level rise due to global warming and climate change in these areas (Haasnoot et al., 2019).Currently, there are three main approaches for characterizing estuaries: (1) physical models, (2) field data, and (3) physically based numerical modeling.Physical models replicate coastal systems at a reduced scale so that the major dominant forces acting are reproduced (Reeve et al., 2018).Although these models are extensively applied in the design of major hydraulic-engineering works and greatly enhance our understanding of fluvial, estuarine, and coastal processes (Weisscher et al., 2020), the characteristics of estuarine dynamics complicate their use; complex geometries, a significant number of different forcings acting simultaneously, and water density gradients have to be considered.Furthermore, the conditions required to appropriately model the substrate, sediment, and biota are difficult to reproduce in the laboratory.
Field data are obtained through multiparameter and multiscale environmental monitoring at a limited number of discrete locations where appropriate scientific instrumentation can be deployed (Garel and Ferreira, 2015).With this approach, hydrodynamic variables and water properties such as water levels, currents, salinity, and/or water density can be measured.The usefulness of these data in characterizing estuarine dynamics relies primarily on the design of the deployment (type, number, location, and setup of the equipment); the length of the survey; and the representativeness of the meteorological, atmospheric, maritime, and hydrological conditions during the selected period of time.Despite providing the most realistic characterization of the estuarine dynamics currently available, field data have severe limitations due to the isolated locations of the environmental monitoring devices and the limited variables and conditions that can be measured.
Physically based numerical models overcome these limitations by resolving the conservation equations that are approximations of laws for water flow and/or sediment transport (e.g., Gomez and Church, 1989;Van Rijn, 2007;Baar et al., 2019).These equations are solved in a spatial domain that can cover complete estuarine areas with spatial resolutions up to tens of meters.The interest in such models lies not only in the transition from isolated to spatially distributed data but also in the analysis of conditions that did not occur during the observation period, including prospective conditions resulting from a changing climate (e.g., Yang et al., 2015;Del-Rosal-Salido et al., 2021), as these models enable full control of the initial and boundary conditions (Weisscher et al., 2020) in contrast to field data.The main limitation of numerical modeling comes from the significant number of model parameters (e.g., bed friction, viscosity, and diffusivity) that have to be adjusted to adequately reproduce the hydrodynamic and transport processes.This problem is solved by calibrating and testing these parameters using field data and exploring the sensitivity to changes in their values (van Maren and Cronin, 2016).Hence, field data and numerical modeling are complementary approaches that can be used simultaneously to analyze estuarine dynamics, to adequately reproduce the processes observed in nature, and in turn to allow prognostic and extreme-event analysis.
This paper presents a hybrid dataset developed to characterize the hydrodynamics, salinity, and temperature distributions in the mesotidal estuary of the Bay of Cádiz (southern Spain).This bay is a paradigmatic example of a complexgeometry, tidally dominated bay where different basins are connected through narrow channels and waves and where river discharges have limited influence.Similar estuaries can be found worldwide, such as San Francisco Bay (Schoellhamer et al., 2008), Jiaozhou Bay (Li et al., 2014), the St Lucia estuarine lake (Schoen et al., 2014), and the Sylt-Romo Bight (Purkiani et al., 2016).The dataset includes both field data and numerical-modeling results that were obtained for the study area during the last decade.In addition to analyzing the general hydrodynamics and transport processes in the bay (Zarzuelo et al., 2021), the dataset was used to study the impact of human interventions on estuarine dynamics (Zarzuelo et al., 2015) and the tidal energy potential in the area (Zarzuelo et al., 2018).With the description presented here and its open-source publication, the dataset aims to provide treated and formatted hydrodynamic data for engineers and researchers working on the environmental management of the Gulf and Bay of Cádiz, including aspects such as water quality (Jiménez-Arias et al., 2020;Besada et al., 2022), ecosystem functioning (Miró et al., 2020;Haro et al., 2022), or energy assessments (Zarzuelo et al., 2018;Legaz et al., 2020).These data may also be of interest to modelers; in particular, they can be used as a benchmark test case for numerical hydrodynamic models, as a reference for human intervention alternatives (e.g., piles, dikes, or breakwaters), or as a basis for modeling renewable energy conversion systems.In addition, the dataset may be of interest for scientists aiming to compare the behavior of different estuarine areas.

Area description
The Bay of Cádiz is a mesotidal, mixed-energy, and wellstratified estuary (Zarzuelo et al., 2015(Zarzuelo et al., , 2017(Zarzuelo et al., , 2020) ) in southern Spain, characterized by a temperate climate.The bay consists of three areas (A, B, and C in Fig. 1) and a zone of tidal flats and marshes, with a total extension of 140 km 2 .The outer basin (A), with an area of 70 km 2 , is characterized by mild slopes and depths ranging between 5-10 m.This area is the most exposed to wave action due to its opening to the Atlantic Ocean and is also connected with the estuaries of the San Pedro and Guadalete rivers.The inner basin (C) has an extension of 50 km 2 , with intertidal areas that give rise to an area of gentle slopes, which is reconnected to the Atlantic Ocean through the 17 km long tidal creeks of Carracas and Sancti-Petri (Zarzuelo et al., 2018(Zarzuelo et al., , 2020)).Finally, the Puntales Channel (B) connects the outer and inner basins with dimensions of 3.1 and 1.7 km in length and width, respectively.This latter area exhibits the greatest depths (up to 18 m) and corresponds to the zone where most recent human interventions are carried out.The bay has tidal ranges of approximately 2-4 m (neap and spring tides, respectively) and a moderate wave climate that propagates mainly from the Atlantic Ocean with a limited incursion inside the bay.The influence of freshwater discharges is weak since the mean discharge rates are approximately 20 m 3 s −1 compared to a tidal prism of 10 5 m 3 s −1 (Zarzuelo et al., 2015(Zarzuelo et al., , 2017)).With these characteristics, the Bay of Cádiz is considered to be an example of tidally dominated, shallow, semi-enclosed bays (Kitheka, 1997;Lee et al., 1997;Newton et al., 2014;Shang et al., 2019).
The estuary contains extensive areas of intertidal sand and mudflats that support a variety of characteristic benthic fauna depending on the nature of the substrate.In its inner basin and around the creeks of Carracas and Sancti-Petri, there are also extensive areas of mudflats.This space, dominated by swamps, marshes, and salt flats, is characterized by an astounding wealth of biodiversity, with a long historical tradition of salt mining.This area has undergone a major transformation due to progressive sedimentary infill conditioned by the important biosedimentary role of vegetation, which often controls the morphological evolution of these tidal environments (D'Alpaos, 2011).At the same time, a significant part of the emerged marshes within the Bay of Cádiz has been almost completely anthropized over the past 60 years (Gracia et al., 2017).In recent years, many of the structures of old salt marshes have disappeared as a result of the intense transformation that is being carried out in the area.
Since the twentieth century, human interventions such as ports, bridges, or salt marshes have been developed to improve the economic and social activities within the bay and its surroundings.The vast majority of these interventions are located in the Puntales Channel (Zarzuelo et al., 2015), interfering with the exchange of water, heat, and sediment between the outer and inner basins (Zarzuelo et al., 2021).

Methods
To analyze the dynamics of the Bay of Cádiz, two methods were used: (1) a combination of three bottom-mounted and three vessel-towed field surveys that provided real data with high temporal resolution and (2) a 13-month 3D numerical simulation that was calibrated and tested with the field data.
This latter simulation provides high-spatial-resolution data for a longer time period for which variabilities down to seasonal scales can be analyzed.The combination of both methods allows us to analyze the hydrodynamic and morphodynamic behavior of the Bay of Cádiz on a global scale, which was the main milestone of project no.P09-TEP-4630 (funded by the Regional Government of Andalusia) that financially supported the study to obtain this hybrid database.

Design of the field surveys
More than 20 instruments during six different surveys were used to measure water levels, currents, water density and conductivity, and wave conditions.The total length of the records considering all the surveys exceeds 8 months.The two types of field surveys designed, bottom mounted and vessel towed, are described within the next sections.

Bottom-mounted observations
As shown in Table 1, three different types of instruments were used in these surveys: acoustic Doppler current profilers (ADCPs), conductivity-temperature (CT) profilers, and tide gauges.The ADCPs are Nortek Aquadopp Profiler acoustic surface tracker (AST), which are acoustic Doppler current profilers with the added feature of acoustic surface tracking (AST).It emits sound pulses in three directions near the vertical and receives the echoes from suspended particles drifting with the current.The compass is calibrated directly on the ship using a track crane to avoid damaging the surrounding metal.It is then placed in the water, and the divers check that the x direction coincides with the east.The velocities obtained from different particles are then averaged within subdivisions of the water columns (cells).The technical specifications of the Nortek ADCP are as follows: acoustic frequencies of 0.6 MHz (one device), 1 MHz (two devices), and 2 MHz (three devices); velocity range of ±10 m s −1 ; accuracy (water velocity) at 1 % of measured value of ±0.5 cm s −1 ; and maximum ranges of 40, 25, and 10 m, respectively.The ADCPs measured surface pressure, bottom temperature, and velocities along the entire water column.From these measurements, the energy spectrum was assessed to determine the wave height, peak period, and wave direction.All acoustic signals were emitted from three beams inclined at 25 • from the vertical and equally spaced at 120 • .The ADCPs included a high-resolution pressure sensor, as well as compass tilt and temperature sensors.The distance from the base of the concrete block (tripod) to the top of the ADCPs was 0.3 m (0.8 m).The three current velocity components (east, north, and vertical) were measured in cells of varying thickness (hereafter referred to as multicell data) along the water column (Table 1).In addition, the depthintegrated velocities were measured in one cell (main cell data, hereafter), the vertical extent of which was automatically adjusted near the surface based on pressure records.https://doi.org/10.5194/essd-15-3095-2023 Earth Syst.Sci.Data, 15, 3095-3110, 2023 Both the main and multicell measurements started at a blanking distance above the instrument and thus at different distances above the bottom (see Table 1).The number of cells was always sufficient to cover the entire water column above the instrument.In addition, this distance varied over time due to changes in bed elevation, especially those caused by the burial and tilting of the mooring structure (Lobo et al., 2004;Morales et al., 2006;Garel and Ferreira, 2015).
The CTs are SBE 37-SMP pumped MicroCAT, with a high-accuracy conductivity and temperature (pressure optional) recorder with Serial (RS-232) interface.Technical specifications are as follows: power supply of 7.8 A h, resolutions of 0.00001 S m −1 (conductivity) and 0.0001 • C (temperature), accuracy (conductivity) of ± 0.0003 S m −1 (0.003 mS cm −1 ), accuracy (temperature) of ± 0.01 • C (35 to 45 • C), ±0.5 cm s −1 , and maximum ranges of 7 S m −1 (conductivity) and 45 • C (temperature).The conductivitytemperature (CT) sensors measured bottom temperature and conductivity (see Table 1), which are converted to salinity values.They recorded data every 15 min to match the ADCP data.The turbidity sensor (OBS-3+) is a submersible turbidity probe with side-facing optics.It uses OBS technology to measure suspended solids and turbidity.The technical specifications of the OBS-3+ are as follows: optical power of 2000 µW, infrared wavelength of 850 nm ± 5 nm, accuracy (turbidity) at 2 % of reading or 0.5 NTU (nephelometric turbidity unit), accuracy (concentration) at 2 % (4 %) of the reading or 1 mg L −1 (10 mg L −1 ) for mud (sand), and maximum ranges of 10 000 mg L −1 (mud) and 100 000 mg L −1 (sand).The OBS measured turbidity at the bottom; it was connected to an ADCP and measured with the same frequency, burst interval, and period, sharing its configuration with the ADCP as they were nested.Finally, the tide gauge model TD304 from SAIV A/S is a high-precision instrument for recording and/or monitoring tides or water levels (tide gauge).The technical specifications are as follows: response time of 0.1 s, resolution of 0.0001 dbar (m), accuracy (water level) of ± 0.01 % full scale (FS), and maximum range of 1000 m.The tide gauges only measure pressure, with 10 s bursts every 15 min (1 Hz).For more details on the configurations of each field survey, see Table 1.
Three field surveys using bottom-mounted instruments were carried out.These surveys were designed to have shortto mid-term time periods with real data that could (1) characterize the dynamics of the bay and (2) calibrate the numerical model considering a wide range of different tidal, wave, wind, and weather conditions.The three deployments were as follows: -Mounted field 2012 (22 December 2011 to 22 May 2012).Twelve instruments were anchored in eight positions (I1-I5, T1-T3) distributed along the bay and inside the creeks (see Table 1 and Fig. 1b).
The instruments included five ADCPs, one OBS, three CTs, and three tide gauges.Concrete blocks were used to secure the instruments to the seabed and avoid displacements.This initial survey was designed to characterize the dynamics of the three main areas of the bay (A, B, and C in Fig. 1b) and the tidal creeks.
The data were also used to assess the balance and water exchange between the basins and the creeks and to characterize the effects of extreme wind and wave events, which are common during the winter season, on the dynamics of the bay.
-Mounted-field 2013 (25 May to 28 August 2013).In this second survey, six instruments (three ADCPs and three CTs) were moored in three locations distributed along a cross-section of the Puntales Channel to characterize the water exchange between the outer basin and the channel itself (see Table 1 and zone B in Fig. 1b).In this case, metal tripods were used to secure the instruments on the seabed.After field 2012, it was observed that the main exchange section was the connection between the outer basin and the Puntales Channel.Therefore, this survey was designed with three locations evenly distributed along a section just at the entrance of the channel.
-Mounted-field 2015 (16-23 September 2015).Finally, since the results provided by the numerical model suggested that the bay behaves as an inverse estuary, four instruments (two ADCPs and two CTs) were anchored with tripods for stability in two locations to observe heat exchange differences and to characterize the water exchange between the Puntales Channel and the inner basin (Table 1 and Fig. 1b).

Vessel-towed observations
As a result of the first bottom-mounted survey (field 2012), important exchange flows were identified between the inner and outer basins through the Puntales Channel.To analyze these flows in detail, three vessel-towed continuous surveys were conducted using two different devices (ADP and CTD).
The ADP used for the vessel-towed field surveys is a Son-Tek YSI 1 MHz acoustic Doppler profiler with a maximum profiling range of 35 m.The velocity range is ±10 m s −1 ; the accuracy is ±1 % of the measured velocity, ±0.5 cm s −1 ; and the resolution is 0.1 cm s −1 .Underway velocity profiles were recorded with a boat-mounted ADP in 50 cm bins with 120 pings per ensemble during a complete tidal cycle.Navigation was performed with a global positioning system (GPS).In 2013, a total of 23 neap tide transects and 24 spring tide transects were sampled, whereas in 2015, a total of 25 neap tide transects were sampled.The transect data were organized in vertically and horizontally equal grids across the channel.
The CTDs used are the SBE 19plus V2 SeaCAT, which measures conductivity, temperature, and pressure at four scans per second (4 Hz) and provides high accuracy and resolution.Technical specifications are as follows: resolution of 0.00005 S m −1 (conductivity), 0.0001 • C (temperature), and 0.002 % of full scale range (pressure); accuracy (conductivity) of ± 0.0005 S m −1 ; accuracy (temperature) of ±0.005 • C; accuracy of ±0.1 % of full scale range (pressure); and maximum range of 9 S m −1 (conductivity), 35 • C (temperature), and 350 m (pressure).
The three vessel-towed continuous surveys were conducted in 2013 (2) and 2015 (1) during a tidal cycle at a cross-section of this channel: -In 2013, two 13 h long acoustic Doppler profiler (ADP) surveys were performed during neap (7 July, neap 2013) and spring (22 August, spring 2013) tides across the Puntales Channel (orange line in Fig. 1b) to capture the seasonal variability and the water exchange between the outer basin and the channel.In addition, a conductivitytemperature-depth instrument (CTD SeaBird SBE19+, four samples at 1 Hz) plus a Wetlabs fluorometer and OBS turbidometer mounted on a towed Guildline Mini-Bat undulator enabled rapid sampling at five extra locations across the transect.Towing at approximately 5 knots provided vertical profiles roughly spaced at 200 m every 2.2 h.Samples of water were also taken to calibrate the sediment suspension in every CTD profile.
-In 2015, one 13 h long survey was performed during neap (22 September, neap 2015) tides across the same transect (orange line in Fig. 1b) defined in 2013 to capture possible changes produced by the new bridge (seaward of orange line in Fig. 1b) built in the Puntales Channel between 2007 and 2015 and to assess the temporal variations between 2013 and 2015.

Post-processing data
Once the data had been exported from the instruments, postprocessing was used to remove errors.First, the data were https://doi.org/10.5194/essd-15-3095-2023Earth Syst.Sci.Data, 15, 3095-3110, 2023 checked for battery life.Then the compass (< 0.1 • ) was corrected by comparison with the heading, pitch, and roll (< 1 • ), which cause deviations in the directions of the variables.The instrumentation was also checked for any rotation that could distort the information.Finally, a comparison was made between the pressure data and the water column in order to remove the cells that remained above the free surface (or to eliminate the influence of atmospheric pressure).
Assessing the quality of acoustic Doppler measurements is essential to ensure reliable interpretation of velocity results.Errors caused by low signal strength, Doppler noise, signal aliasing, and other effects such as side-lobe interference can introduce inaccuracies into the data.Such errors can make the data inaccurate, leading to increased data vari-ance, mean bias, and altered energy spectrum results.All instruments were calibrated prior to configuration (beam calibration, compass calibration, pressure offset) according to the manufacturer's recommendations.Processing steps included removal of poor-quality data points using the manufacturer's Ocean Contour software.Data were corrected for magnetic declination, low correlation, side-lobe interference, and minimum-amplitude and maximum-amplitude spikes, removing a total of 12 % of the data, leaving 88 % of the validated ADCP data.Data points flagged as poor quality were then replaced with Not a Numbers (NaNs) in MATLAB (≈ 20 %).
The pressure signals were then corrected for variations in the atmospheric pressure.After sieving the data, the pressure Earth Syst.Sci.Data, 15, 3095-3110, 2023 https://doi.org/10.5194/essd-15-3095-2023 was converted to free surface elevation following Nielsen (1989).The spectra and the directional spectrum were calculated to obtain the significant wave height, peak period, and wave direction (O'brien, 1993).In this procedure, the nonlinear wave portion is low-pass filtered from the original bottom pressure data prior to estimating the surface wave values.The filter cutoff is determined for each wave burst after fitting a Pearson spectrum.
Temperature and salinity CTD data were post-processed using standard SeaBird software and MATLAB routines.At this stage, peaks were removed, 1 dbar averages were calculated, and the downcast profiles of temperature and salinity were corrected using regression analysis.Once the salinity is known, the density is calculated.Suspended sediment concentration (SSC) was derived from OBS measurements (e.g., Downing, 2006) via calibration with suspended matter concentrations gravimetrically determined from water samples.The OBS calibration was accomplished by cross-correlation between the instrument readings and the SSCs obtained from water samples.Calibration was carried out at the laboratory of the University of Granada.Sediment was collected from the seabed and mixed in a mixing tank.The sediment concentration was slowly increased to obtain the relationship between voltage and sediment concentration.
The vessel-towed ADP data were processed to correct for heading errors and then averaged over 1 min and 1 m depth intervals to provide higher spatial resolution.To mitigate the effects of side-lobe interference, the lower 10 % of the velocity profiles of the signatures was removed.The depth at each location was estimated from the signature data.As the instruments were surface mounted, the location of the seabed was approximated to the nearest cell location by locating the prominent rise in the return signal strength.The validity of the results was also assessed by the quality of the return signals.The consistency of the return signals within the sample volume over the sample period is measured by the percentage correlation.The amplitude is a measure of the return signal strength, while the signal-to-noise ratio (SNR) indicates the strength of the return signals compared to the instrument noise (Doppler noise).As a quality control measure, unusable vector data are identified in this study as having a correlation of < 70 % or an SNR of < 10 dB.For the signature data, a correlation threshold of < 50 % and an amplitude threshold of < 30 counts was applied to identify invalid data.The ADP time series were processed for outliers, and small gaps were filled by cubic-spline interpolation.Data sampled more frequently than once per hour were averaged to 1 h intervals.Currents were rotated into east-west and north-south basin components.Axis directions were determined by examining the principal axes of spatial variability and the orientation of the local bathymetry.

Numerical modeling
The 3D numerical modeling was performed with DELFT3D, a model developed for the coupled simulation of longwave and shortwave dynamics, sediment transport, and heat fluxes.Its subgrid approach for high-resolution bathymetric representation on structured grids (Lesser et al., 2004) makes it suitable for the simulation of hydrodynamics in complex estuaries such as the Bay of Cádiz.The model, fed with tidal harmonics, wind forcing, wave climate, and heat fluxes, provides water levels, currents, density, and wave climate with high spatial (both horizontal and vertical) and temporal resolution.

Model description
Long waves in Delft3D are modeled by the flow module.This module solves the Navier-Stokes equations for an incompressible fluid under the shallow water and Boussinesq assumptions in 2D (depth-averaged) or in 3D using a finite-difference scheme on staggered, curvilinear, and wellstructured grids to simulate the flow resulting from tidal and meteorological forcing.In 3D simulations, the vertical grid is defined using the σ coordinate method.The continuity equation takes into account the kinematic boundary conditions at the surface and bottom: where η is the water level, t is time; h = z b + η is the water depth, with z b being the bed level (positive below the mean sea level); u, v, and w are the velocity components along the x, y, and z coordinates, respectively; and Q represents the contributions per unit area of the sinks and sources of water mass.The momentum equations in x and y are where f is the Coriolis parameter; ρ 0 is the water density, P refers to the baroclinic pressure terms; F is the imbalance of the horizontal Reynolds stresses, with ν v being the vertical eddy viscosity; and M refers to the contributions due to shortwave stresses.Under the assumption of shallow water, the vertical-momentum equation is reduced to a hydrostaticpressure equation.sity.The conservative form of the advection-diffusion equation (Lesser et al., 2004) is as follows: where c is the concentration of the conservative constituents considered; S is the source and/or sink term per unit area; and D h and D v are the horizontal and vertical diffusivity coefficients, respectively.To solve this equation, Delft3D models use a cyclic method where viscosity and diffusivity must be prescribed as calibration parameters.The vertical eddy viscosity ν v and the diffusivity coefficients D h and D v are obtained using (1) the contribution of a user-defined background value used for calibration, which can be spatially constant or variable, and (2) a 3D turbulence contribution obtained from a turbulence closure model.Density, salinity, and temperature are related using the Su et al. ( 2020) equation-ofstate formulation.Heat exchange at the water surface is modeled considering the effects of solar (shortwave) and atmospheric (longwave) radiation, as well as heat loss due to back radiation, evaporation, and convection.Among the available heat exchange models within Delft3D, the ocean heat flux model (Gill and Adrian, 1982;Chanudet et al., 1989) was used as it was found to be the most balanced option considering the dimensions of the bay and the temporal scale of the analysis (1 year).The model takes into account relative humidity, air temperature, cloud cover, shortwave solar radiation, light attenuation (Secchi depth), and the Dalton and Stanton numbers, with the latter two being used as calibration parameters.
The shortwave propagation processes are solved in the model using the wave module.This module is based on the SWAN model (Booij et al., 1999;Ris et al., 1999), a spectral wave model that solves the action balance equation, taking into account wave-current interactions.It considers wave generation by wind and energy dissipation by whitecapping, wave breaking, and bottom friction.Both modules (flow and wave) are coupled online during the simulations: water levels and currents are considered for the wave propagation processes, while wave-induced forces are included in the momentum equations (Eqs. 2 and 3).

Model setup
For the hydrodynamic module, water level boundary conditions were defined as astronomical forcing using the 18 major tidal harmonics in the area (M2, S2, SA, Q1, O1, P1, K1, 2N2, MU2, N2, NU2, L2, T2, R2, K2, MN4, M4, and MS4) obtained from the Oregon State Tidal Prediction Model (Egbert and Erofeeva, 2002).For the transport boundary con-ditions, 3 h interval data of salinity and temperature from buoy 2342 (Puertos del Estado, Spanish Ministry of Public Works; Fig. 1c) were used.Despite their small influence, the average discharges of the two main rivers (Guadalete and San Pedro) were included as source terms, taking into account their temperature and salinity (data provided by the Regional Government of Andalusia).The ocean heat flux model was fed with spatially uniform daily averaged data of solar radiation, air temperature and humidity, and cloudiness (also obtained from the Regional Government of Andalusia).Finally, spatially uniform 1 h interval wind data from buoy 2342 were used to estimate the wind contribution to the momentum equation.For the model calibration, the background roughness was set to a uniform value (Chézy u = 80, v = 60) on the whole grid without differences between zones since Zarzuelo et al. (2018) observed that including the spatial variability of this coefficient did not improve the quality of the calibration.Spatially uniform values were used for the user-defined horizontal and vertical viscosity coefficients, and the k-epsilon model was used for the turbulence closure model, assuming a boundary layer type of flow.Still water and null heat fluxes were used for the initial conditions, with a spin-up interval of 1 month to warm up the model.
For the shortwave propagation module, spectral offshore boundary conditions were used with 3 h interval data from buoy 2342.Wind data from this buoy with the same interval were used as spatially uniform forcings to consider the energy exchange between wind and short waves.The wave spectrum definition was limited to 24 frequencies between 0.05 and 1 Hz with 36 specified directions (15 • steps).Physical processes such as whitecapping (Komen et al., 1984), depth-induced wave breaking (Battjes and Janssen, 1978), bottom friction (Hasselmann et al., 1973), and triads (Eldeberky and Battjes, 1996) were taken into account, but quadruplets were neglected.
The model domain covers the Atlantic Ocean from Rota to Sancti-Petri, including the Bay of Cádiz, the two freshwater rivers (San Pedro and Guadalete) and the tidal creeks (Carracas and Sancti-Petri) (Fig. 1a -orange line).The grid defined for the hydrodynamic simulations has a spatial resolution ranging from 10 × 10 m in the area of the creeks to 5×5 m at the offshore boundary with a total of 90 405 nodes.The vertical structure of the grid was implemented using a sigma-layer approach, for which the layer thickness is defined as a constant proportion of the total water depth.In this case, 10 layers with thicknesses of 2 %, 10 %, 10 %, 10 %, 10 %, 10 %, 10 %, 10 %, 10 %, and 2 % of the water depth were used.This number of layers was proven to be sufficient to capture vertical processes, as shown by Zarzuelo et al. (2018Zarzuelo et al. ( , 2021)), since stratification is not important in bay dynamics.The grid for the wave module was based on the hydrodynamic grid but with a lower resolution in creek areas where the role of waves is negligible (Zarzuelo et al., 2020), allowing a more efficient numerical scheme.
The offshore bathymetric data were provided by the Instituto Hidrográfico de la Marina (Spanish Ministry of Defense), while the detailed multibeam 2011 bathymetry of the bay was provided by the Bay of Cádiz Port Authority.The bathymetry of Sancti-Petri and Carracas creeks was corrected with nautical charts.Finally, the topography was obtained from the 2010 digital terrain model of the Regional Government of Andalusia with a resolution of 10 × 10 m 2 , combined in the intertidal zones with 2015 lidar data with 2 × 2 m 2 resolution.
The results of the numerical model were extracted in a total of 553 observation nodes distributed along the bay (Fig. 1b -empty circles).To capture the variability, the results were obtained by working only with history files (nodes) but defining a large number of observation points and a high temporal resolution of the output data.The variables included water level, currents, density, significant wave height, peak period, and wave direction.

Calibration and testing
The model calibration and testing were performed using the data from the mounted-field 2012 and 2013 surveys (field 2012 and field 2013, Table 1), respectively.Waves, currents (residual and instantaneous), water levels, temperature, and salinity were analyzed by means of correlation and skill coefficients.In the case of temperature, salinity, and instantaneous currents, these variables were tested for both the 3D and depth-averaged values.All measurements were checked visually and corrected for outliers and suspect data points whenever possible.The nine locations where field and numerical data were compared are shown in Fig. 2; full validation of the DELFT3D modeling approach is documented in Zarzuelo et al. (2015Zarzuelo et al. ( , 2020Zarzuelo et al. ( , 2021)).Figure 2 depicts the correlation and skill coefficients for water levels, east and north velocities, residual currents projected on the axis of the channel, temperature, salinity, and significant wave height.The skill coefficient score is calculated following the formulation proposed by Willmott (1981) to know in more detail the adjustment of the trend and the high and low peaks (Olabarrieta et al., 2011;Zarzuelo et al., 2015).
Figure 3 shows the agreement of the temperature and salinity, the two variables with the lowest correlations, between the measured and modeled results.It can be seen that the mean values are properly captured by the model, while the outliers reduce the agreement.The histograms show which are the most frequent values at the observation point for both the model and the observed data.In the case of temperature, the most common value is 22 • C for both the observed data and the model.However, the distribution is more symmetric and accentuated for the observed data, whereas it follows a less skewed distribution for the modeled data.In the case of salinity, the histogram and distribution follow a very similar trend in the observed and modeled data, with the most common value being 36.5-36.6 psu.

Data overview
The hybrid dataset obtained by applying the methodology described above has been used by the authors of this paper for different research applications, such as an analysis of the exchange flow between the main areas of the bay (Zarzuelo et al., 2022a), a characterization of seasonal dynamics and the influence of the seawater heat exchange on bay dynamics (Zarzuelo et al., 2020), and an assessment of the impacts of recent human interventions on the estuarine dynamics (Zarzuelo et al., 2022a).In this section, an analysis of the spatial, fortnightly, and seasonal variabilities is demonstrated as an example of the capabilities and potential of the hybrid dataset.In addition, storm surges recorded during the winter of 2013 (higher winds) were selected to analyze the relationship of wind intensity-direction with water density changes.

Data record
Representative stations where the numerical results were obtained with a high time resolution (10 min) were used to analyze the bay dynamics from intratidal to seasonal scales (Fig. 1).The length of the records for the different stations, together with the variables measured, is shown in Fig. 4. Some data gaps for the variables measured with the ADCPs (water levels, flow, and wave data) occurred at the I1, I2, and I3 locations.During these gaps, only density measured with the CT instruments was recorded.The data gaps were caused by malfunctions or maintenance of the instruments.

Spatial variability
Figure 5 depicts the results from the analysis of the spatial variability of the main tidal and wave characteristics along the bay.The amplitude (Fig. 5b) and phase (Fig. 5c) of the main tidal constituent (M2) were calculated via a harmonic analysis of the depth-averaged flow velocities obtained at each station of the transect defined in Fig. 5a using the numerical model for the 13-month simulation in 2012-2013 (Fig. 4). Figure 5d shows the maximum significant wave height obtained with the same simulation at every station.Regarding the flow velocities, for the M2 amplitude, the highest values (1.5 m s −1 ) are found where the Puntales Channel connects with the inner basin, decreasing to almost zero (25-32 km) at the connection between the Carracas and Santi Petri creeks.In addition, an increase of 0.5 m s −1 is observed just at the entrance of the Puntales Channel (10 km) and the tidal channels (21 km).These variations, related to the convergence and divergence processes of tidal wave propagation across the bay, are resolved by the spatial and temporal resolutions of the numerical-model results.
The tidal phase of the M2 constituent is related to the celerity of the tidal wave and how it changes during its propagation through the bay.The period of constituent M2 (12.42 h) is the time required for the phase to complete a 360   Thus, a phase of 200 • corresponds to a wave tide propagation time of 6.9 h.When the tidal wave enters the Puntales Channel, the phase increases dramatically by 100 • (3 h).Then, the phase remains constant from the inner basin to the entrance of the creeks, from which it decreases almost linearly (from kilometer 22 (green-yellow color -Fig.5c)).It is also interesting to highlight the change that occurs around kilometer 30 since, as observed in previous studies (Zarzuelo et al., 2019), there is a point in the interior of the channels where the wave slows down.Finally, Fig. 5d shows how wind waves dissipate when propagating from the two connections of the bay and the open ocean to its interior, where their contribution is negligible in the Puntales Channel, the inner basin, and the Carracas and Sancti-Petri creeks.From these results it can be concluded that the effect of the swell in the Bay of Cádiz is negligible, except in the outer part (Zarzuelo et al., 2020), and that the constriction serves as an amplifying element of the tide, which then dissipates into its interior (Zarzuelo et al., 2017).

Fortnightly variability
The hybrid dataset also provides insights into the fortnightly variability of the semidiurnal species along cross-sections of the Puntales Channel.This type of analysis is useful to assess the tidal prism or water exchange between the two basins connected by this channel.lyze the density profiles measured during the same surveys, it is worth noting the decrease in density (0.4 kg m −3 ) during the flood period, possibly coming from the seawater of the ocean (inverse estuary, as we can see in Fig. 7b).This analysis is relevant to understanding the change in density as the tide propagates.The density and the tide determine the location of the pycnocline.When the tide changes from flood to ebb, the location of the pycnocline is closer to the ocean, and as it changes from ebb to flood, it modifies the position of the pycnocline towards the inner basin.This is difficult to detect from field measurements, which only provide crosssectional information, but can be accurately determined from the model.

Seasonal variability
Of all the points obtained from the numerical model, five points were selected (Fig. 7a) to facilitate representation.The density and residual water level time series data are displayed in Fig. 7b and c, respectively (October 2012-October 2013), together with observed records of the CT and ADCP located at I3.At this yearly scale, seasonal temperature variations are expected.Pronounced and rapid density variations in summer are induced by the alternation of warm westward countercurrents that characterize ocean circulation.In addition, it is important to note that, in the winter months, the inner basin (point 4 -cyan line; Fig. 7b) is denser than the outer basin (point 1 -purple line; Fig. 7b) (from 1027 to 1029 kg m −3 ).This tendency changes in summer, when the inner basin becomes less dense (from 1026 to 1024 kg m −3 ).Subtidal variability in the numerical data is well evidenced at a fortnightly timescale (Fig. 7c).Larger density values are also clearly observed at spring tide for locations 1, 2, 3, and 4.

Extreme weather events
A wide range of weather conditions were captured during the measurements; in this section, we selected extreme wind events, as we observed in Zarzuelo et al. (2021) (winter period, Fig. 8), to analyze how the density pattern changes according to the intensity and direction of the wind inside the bay.The same transect of Figs. 5 and 7 is selected for the first 20 km that coincide with the entrance to the creeks.Two well-marked density patterns are identified during this winter period.The first pattern can be observed when the density remains practically constant throughout the bay, coinciding with wind velocities below 5 m s −1 .However, the second pattern corresponds to an increase in density of 2 kg m −3 between the outer and inner basins (orange rectangle, Fig. 8) that is observed when mean wind velocities are approximately 10 m s −1 and when the incoming directions are from the north (1, 2, 4) and west (3).
As seen in Fig. 7b, the estuary is an inverse one, with denser seawater during winter; winds from the north and northwest introduce denser water into the inner basin.However, for the rest of the directions of the wind, the bay is completely well mixed.In addition, changes in the density pattern may be related to other variables, such as water level (Fig. 8c) or increased precipitation and /or radiation (Fig. 8d).During neap tides, the tides help to mix the bay and create homogeneous longitudinal profiles.Spring tides, however, cause differences in density throughout the bay.In this case, it can be observed how the increase in rainfall causes an increase in the density of the water, which is more noticeable in the inner basin due to the resilience time.

Data availability
The data presented in this article are freely available at the Zenodo repository.See https://doi.org/10.5281/zenodo.7484187(Zarzuelo et al., 2022b).The datasets are published in NetCDF format (.nc) with the observed and modeled results.The observed data files (water level, current, density, and wave climate) are referred to with explicit file names and include extensive information (in the header) about the site, instruments, setup, and units.
The probe time series data are organized into five data files depending on the field survey:   The numerical data files (water level, water current, wave climate, and density) are organized into several data files from September 2012 to October 2013 to the points described in Fig. 1.

Conclusions
A unique and comprehensive dataset is presented, containing bathymetric, water level, current, density, and wave climate data from the Bay of Cádiz (southern Spain).The data have a high spatial and temporal resolution and capture a wide range https://doi.org/10.5194/essd-15-3095-2023 Earth Syst.Sci.Data, 15, 3095-3110, 2023 of climate conditions in the bay, including storm events.The high-frequency data are suitable for testing intrawave-scale models, where the spatial coverage allows for comparison with larger-scale wave-resolving models.This high-resolution dataset allows the analysis of intrawave processes in this complex environment, including the influence of tidal currents on wave transformation (de Wit et al., 2019).Different applications can be extracted due to the measured and modeled variables.As seen in Sect.4, application can provide the following: (1) an analysis of the spatial variability of the main tidal harmonic and the variability of the significant wave height along the bay, (2) the seasonal variability of the density along the bay, and (3) an analysis of how the density patterns vary according to wind events or according to the variability of the water level or other variables such as radiation or precipitation among others.
The dataset can evaluate other variables or applications as a result of the obtained parameters.For example, these data provide the opportunity to analyze the influence of wind, waves, and tidal flow on bed shear stresses, which are important for sediment transport.Moreover, the combination of ecological and physical data can be used to develop and verify (conceptual) models that describe interactions between biotic and abiotic processes.Ultimately, the dataset can be used to assess different scenarios such as future sea level rise and to support global warming mitigation and adaptation strategies.

Figure 1 .
Figure 1.Location (a-b) and description (c) of the monitoring system.The Bay of Cádiz (a) is located in the southern Iberian Peninsula.The model domain is indicated by the orange line (a).The locations (b) of the model data are shown as empty circles.The locations of the observed data stations are indicated by colored circles (color legend corresponds to the instrumentation used in each location): field survey 2012 (I1-I5, T1-T3), field survey 2013 (I3, I3a, I3b), and field survey 2015 (I3, I5) (see Table1for more details on sampling periods).Station (c) is equipped with a multiparametric probe (CT and CTD) and an acoustic Doppler current profiler (ADP).Salinity, temperature, wave, and wind data were obtained from buoy 2342 (Puertos del Estado, Spanish Ministry of Public Works).Average discharges of the Guadalete and San Pedro rivers and daily averaged data of solar radiation, air temperature and humidity, and cloudiness were provided by the Regional Government of Andalusia.Bathymetric data were provided by the Instituto Hidrográfico de la Marina (Spanish Ministry of Defense) and the Bay of Cádiz Port Authority.

Figure 2 .
Figure 2. (a) Correlation coefficient (R 2 ) and (b) skill coefficient (formulation proposed by Willmott, 1981) of the water level, east and north instantaneous velocity, residual velocity, temperature, salinity, and wave height for each station.The color indicates the degree of accuracy (green indicates excellent agreement, yellow-orange indicates good agreement, and red indicates poor agreement).

Figure 3 .
Figure 3. Test results of the temperature (a) and salinity (b) for location I3b (field survey 2013).The histograms and distributions of the observed data are shown at the top of the panels, and the histograms and distributions of the modeled data are shown at the right of the panels.
Figure 6 depicts the amplitude of the semidiurnal species (D2) for the horizontal velocity of the flow obtained during the neap 2013 and spring 2013 surveys.The values are more than twice as high during spring tides, with maximum values in the deepest zones (0.15 and 0.4 m s −1 for neap and spring tides, respectively).If we ana-

Figure 4 .
Figure 4. Extent of valid data recorded by the monitoring station and the numerical model during all field surveys.Vessel-towed observations are shown in purple (water level and velocity data from AQP).Bottom-mounted observations are shown in black (water level, velocity, density, and wave data from ADCP and CT), blue (density data from CT), and gray (water level data from tide gauge and ADCP (velocity sensor failure)).Numerical model data are shown in black (water level, velocity, density, and wave data).

Figure 5 .
Figure 5. Panel (a) represents the location of the points along the channel.The amplitude and phase of the tidal-current constituents of the M2 along-channel are represented in panels (b) and (c), respectively.Panel (d) shows the variation in the significant height wave along the channel.The dotted color indicates the distance.The asterisk corresponds to the location of the I3 instrument.

Figure 6 .
Figure 6.Amplitude of the tidal harmonic species D2 of the current during neap 2013 (a.1) and spring 2013 (a.2).The second rows show the density profile (P1-P5) during neap 2013 and spring 2013 at maximum flood (orange line) and ebb (blue line).The location of each profile is indicated by dashed white lines in (a.1) and (a.2).

Figure 7 .
Figure 7. Panel (a) represents the location of five points selected to show the seasonal variability of the density (b) and the residual water level (c).The data observed for I3 are plotted as a black line.

Figure 8 .-
Figure 8. Panel (a) corresponds to the density variability along extreme wind events (winter period), and panel (b) represents the wind velocity (blue line) and direction (orange line).The dashed white line corresponds to the location of instrument I3.The orange rectangle represents density changes throughout the bay.Panel (c) and panel (d) represent the water level and radiation (purple line) and precipitation (green line) recorded in the Bay of Cádiz, respectively.

Table 1 .
The mooring deployments for bottom-mounted surveys during the three field surveys of2012, 2013, and 2015.