Validation of Sentinel-5P TROPOMI tropospheric NO 2 products by comparison with NO 2 measurements from airborne imaging DOAS, ground-based stationary DOAS, and mobile car DOAS measurements during the S5P-VAL-DE-Ruhr campaign

. Airborne imaging differential optical absorption spectroscopy (DOAS), ground-based stationary DOAS, and car DOAS measurements were conducted during the S5P-VAL-DE-Ruhr campaign in September 2020. The campaign area is located in the Rhine-Ruhr region of North Rhine-Westphalia, western Germany, which is a pollution hotspot in Europe comprising urban and large industrial sources. The DOAS measurements are used to validate spaceborne NO 2 tropospheric vertical column density (VCD) data products from the Sentinel-5 Precursor (S5P) TROPOspheric Monitoring Instrument (TROPOMI). Seven ﬂights were performed with the airborne imaging DOAS instrument for measurements of atmospheric pollution (AirMAP), providing measurements that were used to create continuous maps of NO 2 in the layer below the aircraft. These ﬂights cover many S5P ground pixels within an area of 30 km × 35 km and were accompanied by ground-based stationary measurements and three mobile car DOAS instruments. Stationary measurements were conducted by two Pandora, two Zenith-DOAS, and two MAX-DOAS instruments. Ground-based stationary and car DOAS measurements are used to evaluate the AirMAP tropospheric NO 2 VCDs and show high Pearson correlation coefﬁcients of 0.88 and 0.89 and slopes of 0.90 ± 0.09 and 0.89 ± 0.02 for the stationary and car DOAS, respectively. Having a spatial resolution of about 100 m × 30 m, the AirMAP tropospheric NO 2 VCD


Introduction
The reactive nitrogen oxides, nitrogen monoxide (NO) and nitrogen dioxide (NO 2 ), collectively known as NO x (= NO + NO 2 ), are important tropospheric air pollutants and have a strong impact on the tropospheric chemistry.In addition to emissions from soils, natural biomass burning, and lightning, they are largely released into the troposphere by a variety of human activities.These include fossil fuel combustion processes of power plants, traffic, and industrial areas, as well as anthropogenic biomass burning.NO x is primarily emitted as NO, which reacts with ozone (O 3 ) and rapidly forms NO 2 .The NO x sources are spatially and temporally highly variable, and nitrogen compounds are reactive and short lived.As a result, the spatial and temporal variability of NO 2 is large, especially in regions characterized by a variety of NO x emission sources.NO x in the troposphere is toxic and impacts the chemical composition and environmental condition, e.g., through tropospheric ozone catalytic production cycles (Chameides and Walker, 1973;Fishman and Crutzen, 1978;Jacob et al., 1996) or its reaction with the hydroxyl radical, OH, the most important tropospheric daytime oxidizing agent.Accurate knowledge of the spatial and temporal distribution of NO 2 in the troposphere is therefore required to better understand tropospheric chemistry.
Atmospheric NO 2 is remotely observed on a variety of platforms, including ground-based stations; moving platforms such as cars, ships, or aircraft; and environmental satellites.Applying the DOAS (differential optical absorption spectroscopy) technique (Platt and Stutz, 2008) in the UV and visible spectral range, the absorption signature of NO 2 can be identified and column densities can be retrieved.
To ensure the accuracy of satellite data products for use in research, policy making, or other applications, each data product from satellite sensors needs to be validated and its accuracy determined.Validation measurements are needed in polluted and clean regions by independent instruments operating on different platforms.Measurements from groundbased sites provide continuous validation data from different locations for the trace gas products, retrieved from satellite instruments (e.g., Verhoelst et al., 2021).Measurements from mobile ground-based platforms like cars enable the observation of the spatial variability in addition to its temporal evolution.Thus, they are used for the comparison with satellite observations (Wagner et al., 2010;Constantin et al., 2013;Wu et al., 2013) and the validation of airborne remote sensing measurements (Meier et al., 2017;Tack et al., 2017;Merlaud et al., 2018).Airborne remote sensing measurements are an additional valuable source of validation data.Airborne mapping experiments have been performed in the recent years using different aircraft imaging DOAS instruments such as AMAXDOAS, APEX, AirMAP, SWING, SBI, GeoTASO, or GCAS (e.g., Heue et al., 2005;Popp et al., 2012;Schönhardt et al., 2015;Meier et al., 2017;Tack et al., 2019;Judd et al., 2020).The aircraft viewing geometry is similar to that of a satellite, but airborne measurements are able to measure at higher spatial resolution than the satellite sensors.Airborne observations are only available for short periods and are concentrated on the campaign region, but compared to measurements from ground-based sites offer the advan-tage that larger areas and full satellite ground pixels are observed in a relatively short period around the satellite overpass.Thus, spatiotemporal variations of trace gas data products become visible at sub-satellite ground pixel resolution.The combination of airborne imaging and ground-based stationary and mobile measurements enables the validation of satellite data products over a long period and at a high spatial resolution.
Focusing on TROPOMI, Verhoelst et al. (2021) have compared TROPOMI tropospheric NO 2 vertical column densities (VCDs) of offline (OFFL) V01.02-V01.03.02 to tropospheric NO 2 VCD data from a total of 19 MAX-DOAS ground stations.Depending on the level of pollution, the TROPOMI tropospheric NO 2 VCD data show a negative bias compared to the ground-based observations.Recent studies by Tack et al. (2021) and Judd et al. (2020), comparing airborne tropospheric NO 2 VCD data products to TROPOMI tropospheric NO 2 VCD data of V01.02 and V01.03.01, also show a significant underestimation of TROPOMI compared to the airborne observations.Modifications in the TROPOMI NO 2 retrieval led to V02.02, operational since 1 July 2021.The main changes influencing the tropospheric NO 2 VCD are that (1) cloud pressures are derived from the new FRESCO-wide algorithm, leading to lower cloud pressures and thus larger tropospheric NO 2 VCDs over polluted scenes with small cloud fractions, and (2) over cloud-free scenes a surface albedo correction leads to larger tropospheric NO 2 VCDs.On average, groundbased validation shows an improvement of the negative bias of the tropospheric NO 2 VCDs from −32 % to −23 % ( van Geffen et al., 2022b).
Different aspects that influence the tropospheric NO 2 VCD determination and possible reasons for the underestimation of the TROPOMI tropospheric NO 2 VCD data, compared to the validation data, are discussed in several studies (e.g., Judd et al., 2020;Tack et al., 2021;Verhoelst et al., 2021; van Geffen et al., 2022b;Douros et al., 2023).The limited knowledge of the NO 2 profiles and differences in the averaging kernels between instruments having different viewing geometries are identified as significant potential sources of disagreement between satellite and validation data.Similarly, inaccuracies in the knowledge of the aerosol load and aerosol vertical profile lead to both underestimations and overestimations of the tropospheric NO 2 VCD depending on the viewing geometry.In addition, knowledge about the surface reflectivity and cloud conditions and their treatment in satellite retrieval algorithms needs to be taken into account.
In the present study, results from a comprehensive field study conducted in North Rhine-Westphalia in September 2020 are presented.The campaign area is located in the west of Germany and includes the highly polluted Ruhr Area, a metropolitan region with large cities, industrial facilities, power plants, and arterial highways.Background areas with low pollution, as well as moderately polluted regions, are also observed, which increases the dynamic range of ob-served values.This campaign utilized the mapping capabilities of the Airborne imaging DOAS instrument for Measurements of Atmospheric Pollution (AirMAP) and includes a ground-based component for the evaluation of the AirMAP data set, comprised of three mobile car DOAS and six stationary DOAS devices.AirMAP is used for regional mapping of areas large enough to contain several TROPOMI pixels.Possible reasons for the bias of the TROPOMI tropospheric NO 2 VCD product are investigated by a systematic variation in the relevant input parameters in the satellite retrieval.
The field campaign site and setup are described in Sect. 2. The instruments and data sets are explained in Sect.3.After a thorough comparison of AirMAP to stationary DOAS (Sect.4) and car DOAS data (Sect.5), the campaign data set is used to evaluate TROPOMI tropospheric NO 2 products (Sect.6), including the operational OFFL V01.03.02 product active during the campaign phase and the reprocessed data PAL V02.03.01.Starting from these base versions, scientific products are developed that enable a dedicated assessment of the retrieval issues described above and the assumptions used about the NO 2 profile, clouds, and surface reflectivity.

The S5P-VAL-DE-Ruhr campaign
The objective of the Sentinel-5 Precursor (S5P)-VAL-DE-Ruhr campaign, an activity within the ESA QA4EO project, was to perform comprehensive field studies optimized for TROPOMI tropospheric NO 2 VCD validation including airborne, ground-based stationary and mobile car DOAS measurements.
The campaign activities took place in September 2020 in North Rhine-Westphalia, including the Ruhr area, a densely populated and strongly polluted urban agglomeration in the west of Germany.The Ruhr area itself has a population of 5 million.Together with the populated surroundings and metropolitan centers along the Rhine, the region is called Metropolitan area Rhine-Ruhr (MRR).It comprises a population of more than 10 million inhabitants, large power plants, energy-intensive industrial facilities and several large highways.NO 2 pollution in the MRR is clearly visible in TROPOMI maps of Europe showing widespread enhanced NO 2 amounts.Figure 1 shows the monthly average for September 2020 of the tropospheric NO 2 VCD using the TROPOMI PAL V02.03.01 product for central Europe (left) and a close-up of the S5P-VAL-DE-Ruhr campaign region (right).
A key contribution to the campaign is the airborne AirMAP instrument explained below in Sect.3.2.AirMAP was installed on a Cessna T207A aircraft that was based at an airport close to Dinslaken, North Rhine-Westphalia.Within the designated campaign area, three research flight areas were defined (see Fig. 1), in which AirMAP performed a total of seven flights on 7 consecutive days.the target area.The transfer flights between airport and target areas were used to overpass nearby stationary instruments.Flight schedules used the S5P overpass times to optimize the amount of data for validation.In general, it was planned to have the S5P overpass in the middle of the flight.On days where two overpasses per day occurred in the target area, the flight schedules were optimized towards the overpass time at the smaller viewing zenith angle (VZA) of TROPOMI.More details of the flights are given in Table 1.
The campaign delivered validation measurements by a mobile and a stationary component.In addition to the measurements made by AirMAP, the mobile component included three car DOAS devices.The stationary component comprised six ground-based remote sensing instruments of three different types, i.e., two Pandora instruments, two MAX-DOAS instruments, and two fixed Zenith-DOAS devices.All the instruments were placed at suitable locations within the selected research flight areas shown in Fig. 2. With this combination of measurements, a comprehensive comparison of the airborne measurements with different types of groundbased instruments is made possible.This provides a basis for the evaluation of the TROPOMI tropospheric NO 2 VCD products with the airborne data, which cover the satellite pixel areas well.The airborne imaging data link the groundbased observations with restricted spatial but good temporal coverage to satellite observations that have large swath widths at a single instance in time.

Instruments and data sets
During the S5P-VAL-DE-Ruhr campaign, tropospheric VCDs of NO 2 were retrieved from instruments mounted on satellite, airborne, car, and stationary ground-based platforms.All of these instruments are passive remote sensing spectrometers using the DOAS technique (Platt and Stutz, 2008) by analyzing visible and UV spectra of scattered sunlight.The instruments involved in the S5P-VAL-DE-Ruhr campaign activities are listed in Table 2.The data analysis was done independently by the operating institutes, and the DOAS fitting window was chosen based on the spectrometer's spectral wavelength range and from the groups experience with their instrument.

S5P TROPOMI
The Copernicus satellite S5P was launched into a Sunsynchronous orbit at 824 km in October 2017.S5P carries a single instrument, TROPOMI, which comprises a hyper-spectral spectrometer measuring radiation in the ultraviolet, visible, and near and shortwave infrared spectral regions (Veefkind et al., 2012).TROPOMI provides observations between 10:50 and 13:45 UTC over the campaign region, measuring the distribution of atmospheric columns from trace gases such as NO 2 , HCHO, CHOCHO, BrO, SO 2 , O 3 , CO, CH 4 , and aerosol and cloud properties.Thereby, TROPOMI extends a long record of satellite-based observations.With its good signal-to-noise ratio and a spatial resolution at nadir of 3.5 km × 5.5 km (initially 3.5 km × 7 km, changed on 6 August 2019), which is more than 10 times better than that of its predecessor, the Ozone Monitoring Instrument (OMI; Levelt et al., 2006), it is currently the best instrument for monitoring small-scale emission sources of NO x from space.

TROPOMI NO 2 operational OFFL V01.03.02 product
During the campaign activities in September 2020, the TROPOMI level-2 NO 2 OFFL V01.03.02 product was generated operationally.For the retrieval of NO 2 slant column densities (SCDs) the measured spectra are analyzed using the DOAS technique in the fitting window 405-465 nm.
The SCDs are separated into their stratospheric and tropospheric parts, using the TM5-MP global chemistry transport model.The tropospheric SCDs are converted into tropospheric VCDs by applying tropospheric air mass factors (AMFs), estimated using a look-up table of altitudedependent AMFs, the OMI Lambertian equivalent reflectivity (LER) climatology (Kleipool et al., 2008), NO 2 vertical profiles from the TM5 model, and cloud fraction and pressure information from the FRESCO-S algorithm (van Geffen et al., 2022a).
Validation by comparison with other observations has shown that NO 2 data versions V01.02-01.03are biased low by up to 50 % over highly polluted regions (e.g., Verhoelst et al., 2021).As discussed in several validation studies (see, e.g., Judd et al., 2020;Verhoelst et al., 2021;van Geffen et al., 2022b), this underestimation could be related to biases in the cloud pressure retrieval, to a too high cloud pressure from the FRESCO-S algorithm, in particular when the cloud fractions are low and/or during periods of high aerosol loading.Other stated factors that could contribute to the underestimation are (1) the low spatial resolution of the used a priori NO 2 profiles from the TM5 global chemistry transport model (e.g., Judd et al., 2020;Tack et al., 2021), (2) the use of the OMI LER climatology given on a grid of 0.5 • × 0.5 • for the AMF and cloud fraction retrieval in the NO 2 fit window, and (3) the GOME-2 LER climatology (0.25 • × 0.25 • ) measured at mid-morning used for the NIR-FRESCO cloud retrieval (van Geffen et al., 2022b).These LER climatologies are not optimal for TROPOMI because of TROPOMI's higher spatial resolution and the missing consideration of the viewing angle dependency in the LER products (Lorente et al., 2018;van Geffen et al., 2022b).In V02.04, operational since July 2022, a directionally dependent LER (DLER) climatology derived from TROPOMI observations given on a resolution of 0.125 • × 0.125 • is applied for AMF and cloud fraction re-trieval in the NO 2 fit window and to the NIR-FRESCO cloud retrieval (Eskes and Eichmann, 2022).Since V02.04 is not yet reprocessed and thus not available for the campaign period, it is not included and discussed in this study.

Scientific TROPOMI NO 2 V01.03.02 CAMS product
The scientific TROPOMI NO 2 V01.03.02 CAMS product is based on the operational OFFL V01.03.02 product.The original 1 • × 1 • TM5 a priori NO 2 profiles are replaced by the Copernicus Atmospheric Monitoring Service (CAMS) analyses.AMFs and tropospheric NO 2 VCDs were recalculated using the averaging kernels and other quantities available in the level-2 NO 2 files, following the approach described in the TROPOMI product user manual (Eskes et al., 2022).Between the surface and 3 km the CAMS European regional analyses with an improved resolution of 0.1 • × 0.1 • are used.For altitudes between 3 km and the tropopause the CAMS global analyses (0.4 • × 0.4 • ) are used.More detailed explanations can be found in Douros et al. (2023).

TROPOMI NO 2 PAL V02.03.01 product
Modifications in the TROPOMI NO 2 retrieval led to the OFFL V02.02 product, which was operationally produced from 1 July to 14 November 2021.To obtain a harmonized data set, a complete mission reprocessing was performed using the latest operational product OFFL V02.03.01 of 14 November 2021.The reprocessed data version available from 1 May 2018 to 14 November 2021 provided by the Product Algorithm Laboratory (PAL) is labeled as PAL V02.03.01.This provided the opportunity to compare the campaign data set to the OFFL V01.03.02 and the new PAL V02.03.01.The main change compared to the OFFL V01.03.02 impacting the tropospheric NO 2 VCD data is the use of the FRESCO-wide algorithm instead of the FRESCO-S algorithm, which was already introduced in V01.04 and was operational from 29 November 2020 to 1 July 2021.The FRESCO-wide algorithm provides lower and therefore more realistic cloud pressures (i.e., clouds are at higher altitudes), especially for scenes when cloud fractions are low.This change results in decreased tropospheric AMFs, which leads to higher tropospheric NO 2 VCDs ( van Geffen et al., 2022b).Another update that can have a significant impact is the correction of the surface albedo over cloud-free scenes by using the observed reflectance.This increases the tropospheric NO 2 VCDs by about 15 % over polluted regions in case the retrieved cloud fraction is zero (van Geffen et al., 2022b).For this study the effect is negligible since only 1 out of the 117 TROPOMI pixels analyzed here is observed as cloud free.van Geffen et al. (2022b) also describes the following other modifications, which have only a small or no impact on the tropospheric NO 2 VCD data.Level-1b v2.0 (ir)radiance spectra are updated in the new version and are increasing the NO 2 SCD of about 3 %, from which most of it ends up in a slightly increased stratospheric VCD.The improved level-1b v2.0 also leads to a small increase in completely cloud-free pixels and to slightly lower cloud pressures for pixels with a small cloud fraction, resulting in tropospheric NO 2 VCDs being about 5 % higher for these ground pixels.An introduced outlier removal is increasing the amount of good-quality retrievals over the South Atlantic Anomaly and over bright clouds where saturation can occur.The change to new spatially more highly resolved snow and ice information is increasing the amount of valid retrievals at high latitudes.On average, the new data version increased the tropospheric NO 2 VCDs by 10 % to 40 % compared to the V1.x data, depending on season and pollution.The largest increase is found in wintertime at midlatitudes and high latitudes.The first comparisons to ground-based measurements show an improvement of the negative bias of the TROPOMI tropospheric NO 2 VCDs from on average −32 % to −23 % ( van Geffen et al., 2022b).

Scientific TROPOMI NO 2 IUP V02.03.01 product
For the evaluation of the influence of auxiliary data on the TROPOMI NO 2 product, we developed a customized scientific product rebuilding the V02.03.01 data product, named IUP V02.03.01.The IUP V02.03.01 gives the possibility to change the a priori assumptions such as surface reflectance, which cannot be done using the averaging kernel approach used for V01.03.02 CAMS.
The a priori NO 2 vertical profile shapes for the operational TROPOMI NO 2 retrieval are taken from the TM5 model and have a resolution of 1 • × 1 • (∼ 100 km × 100 km), which is much coarser than the TROPOMI data (3.5 km × 5.5 km at nadir).In highly polluted regions, such as the campaign area, high spatial variability of NO 2 VCDs are observed.The NO 2 plumes from sources, such as power plants, industrial complexes, or cities, cannot be resolved in the model.To demonstrate the impact of more highly resolved a priori NO 2 vertical profiles, we recalculated AMFs and the tropospheric NO 2 VCDs with a lookup-table created with the radiative transfer model SCIATRAN (Rozanov et al., 2014) using a priori tropospheric profiles from the 0.1 • × 0.1 • CAMS regional analyses for altitudes between the surface and 3 km.For altitudes between 3 km and the tropopause, where horizontal variability is in general small, the TM5 model analyses are used.Two maps showing the NO 2 distribution of the CAMS regional and the TM5 analyses for the campaign region can be found in Appendix A (Fig. A1).In the following, this data version using the CAMS regional analyses is called IUP V02.03.01 REG.
The surface reflectivity information from the 5-year OMI LER climatology used for the operational TROPOMI AMF calculations has a resolution of 0.5 • × 0.5 • .After more than 3 years of TROPOMI data acquisition, a TROPOMI surface reflectivity database, estimated from 36 months of TROPOMI v1.0.0 level-1b data, provides LER data as a function of month, wavelength, latitude, and longitude and at a finer spatial resolution of 0.125 Tilstra, 2022).The recalculation of AMFs and tropospheric NO 2 VCDs using the regional CAMS NO 2 profiles and the TROPOMI LER results in the product named IUP V02.03.01 REG TROPOMI LER.The use of the TROPOMI LER in this data set is limited to the NO 2 AMFs and not extended to the cloud retrieval.
In addition to the traditional LER database, a DLER database has been generated using TROPOMI data.The DLER database is in addition a function of the TROPOMI viewing direction and provides generally higher values than the LER database, which does not take into account the directional dependence of the surface reflectance (Tilstra, 2022).Recalculating AMFs and tropospheric NO 2 VCDs with the regional CAMS NO 2 profiles and the TROPOMI DLER https://doi.org/10.The different TROPOMI NO 2 products with their most important differences are summarized in Table 3.

TROPOMI data set
In the present study, we evaluate the TROPOMI tropospheric NO 2 VCD product from 12 to 18 September 2020 of the two described data products OFFL V01.03.02 and PAL V02.03.01, as well the described scientific data products.
Each TROPOMI pixel has a quality assurance value (qa_value) indicating the quality of the processing and retrieval result.Following the recommendation by Eskes and Eichmann (2022), we only use observations with a qa_value above 0.75 for all used TROPOMI data products.This removes problematic retrievals and observations with cloud radiance fractions of more than 50 %.Since the campaign measurement days were mostly cloud free, the cloud radiance fraction retrieved in the TROPOMI NO 2 spectral window was on average 0.21 ± 0.10 with a maximum of 0.48, and thus all data can be used.
Large tropospheric NO 2 VCDs are observed in central Europe, e.g., over Paris, London, Milan, and Antwerp, with the largest values of 1.6 × 10 16 molec.cm −2 in the campaign region in North Rhine-Westphalia (see Fig. 1).The campaign area is clearly distinguished from surrounding rural areas, which have low tropospheric NO 2 VCDs below approximately 3 × 10 15 molec.cm −2 .

AirMAP
AirMAP, an airborne imaging spectrometer developed by the Institute of Environmental Physics in Bremen (IUP-Bremen), has been used in several campaigns for trace gas measurements and pollution mapping (Schönhardt et al., 2015;Meier et al., 2017;Tack et al., 2019;Merlaud et al., 2020).During the campaign, AirMAP was installed on a Cessna 207-Turbo, operated by the Freie Universität Berlin.AirMAP is a pushbroom imaging DOAS instrument with the ability to create spatially continuous and nearly gap-free measurements.The scattered sunlight from below the aircraft is collected with a wide-angle entrance optic resulting in an across track field of view of around 52 • .This leads to a swath width of approximately 3 km, about the same size as the flight altitude, during the campaign.With a sorted fiber bundle of 35 fibers, vertically stacked at the spectrometer entrance slit and orthogonally oriented to the flight direction, the radiation is coupled into the UV-Vis imaging grating spectrometer.The 400 g mm −1 grating, blazed at 400 nm, provides measurements in the 429-492 nm wavelength range, with a spectral resolution between 0.9 and 1.6 nm full width at half maximum.The spectrometer is temperature stabilized at 35 • C. The along-track resolution depends on the speed of the aircraft (around 60 m s −1 ) and the exposure time (0.5 s).At a flight altitude of 3300 m, this results in a typical ground scene having a footprint of around 100 m × 30 m.More details about AirMAP can be found in Schönhardt et al. (2015), Meier et al. (2017) and Tack et al. (2019).

AirMAP data retrieval
For the NO 2 retrieval, the DOAS method is applied to the measured spectra in a fitting window of 438-490 nm.The NO 2 differential SCDs (dSCDs) are retrieved relative to inflight-measured reference background spectra, which were measured over a region with small NO 2 concentrations during the same flight.The dSCD is converted into a tropospheric SCD (SCD trop ) by correcting for the amount of NO 2 in the reference background measurement (SCD ref ): (1) For the conversion to the desired tropospheric VCD (VCD trop ), the SCD trop is divided by the tropospheric air mass factor (AMF trop ): Since the AMF of the actual measurement (AMF trop ) and of the reference background measurement (AMF trop, ref ) are usually not the same, simply adding the VCD trop, ref would introduce additional uncertainties.To correct for the NO 2 in the reference spectrum (SCD ref ), we assume a tropospheric VCD of 1×10 15 molec.cm −2 over the reference background region, which is a typical value during summer in Europe (Popp et al., 2012;Huijnen et al., 2010).This assumption can be supported by the car DOAS measurements; see Sect.3.3.1.All measurements of the campaign were performed around noon close to the S5P overpass.The maximum difference between the time of the reference background and the actual measurement is of around 3 h, which is the total measurement time.We assume that the effect of the changing solar zenith angle (SZA) and the diurnal variation of the stratospheric NO 2 concentration are small (Schreier et al., 2019), and a stratospheric correction of the data is therefore not necessary.The AMF calculated using SCIATRAN estimates the relative light path length through the absorbing layer by accounting for the effects of sun and viewing geometry, surface reflectance, aerosols, and the NO 2 profile assuming cloud-free conditions.As only limited information about the NO 2 profile is available in the campaign area and the profile shape is expected to vary strongly within each flight region every day, we assume a typical urban NO 2 profile, which is based on an old WRF-Chem (Weather Research and Forecasting model coupled with Chemistry) run and scaled to a height of 1 km (see Fig. A2).This assumption is supported by typical boundary layer heights in the measurement area and time of approximately 1 km (ERA5 reanalysis; Hersbach et al., 2018).Input parameters related to aerosols (single-scattering albedo, asymmetry factor, and aerosol optical thickness) were extracted from the AERONET station FZJ-JOYCE at the Jülich research center (Löhnert et al., 2015), which is the only known source providing local ground-based aerosol information in the campaign area.During the campaign measurement days, the daily averages of aerosol optical thickness (AOT) at 440 nm measured at FZJ-JOYCE ranged between 0.235 and 0.398 with a mean value of 0.285.This information is spatially constrained, and the situation can differ during the flights in the Duisburg and Cologne area.A sensitivity study using AMFs for a range of AOTs between 0.003 and 0.6 for the AirMAP NO 2 VCD retrieval demonstrated that the influence on the AirMAP tropospheric NO 2 VCD data set is small (< 1 %, comparing AirMAP tropospheric NO 2 VCDs assuming AOTs of 0.003 and 0.6).TROPOMI and AirMAP tropospheric NO 2 VCD scatter plots for AOTs of 0.003, 0.3, and 0.6 can be found in Appendix A (Fig. A3).
Considering the mean AOT of 0.285 from the AERONET station and the results from the sensitivity study, the AirMAP data set was retrieved using an AOT of 0.3 for all measurement days.In following discussions we also consider the pre-operational TROPOMI AOT product ( de Graaf, 2022), which can provide a larger picture of the aerosol situation (see Fig. A4).In general it shows AOT values in the same range as investigated within the sensitivity study.
Bright surfaces enhance the relative contribution of light reflected from the surface to the signal received by the airborne instrument, increasing the sensitivity to NO 2 near the ground.Therefore, areas of high surface reflectance in the fitting window generally show larger dSCDs for the same amount of NO 2 .Thus, differences in the surface reflectivity must be accounted for in the AMF calculations.As far as we are aware, reflectance data that have a sufficient spatial resolution are not available for the region of our flight campaign.Therefore, we use the individual AirMAP-recorded intensities together with a method based on a reference area with a known surface reflectance taken from the ADAM database (a surface reflectance database for ESA's Earth observation mission; Prunet et al., 2013) and a look-up table of AirMAP radiances.Detailed information about the derivation of the surface reflectance and also about the general conversion from dSCDs to tropospheric NO 2 VCDs can be found in Meier et al. (2017).
The total uncertainty on the tropospheric NO 2 VCD comprises error sources of the dSCD retrieval, the estimation of the NO 2 in the reference background spectrum, and the AMF calculation.We follow the same approach for error estimation and thus the same assumptions as were made in Meier et al. (2017) and Tack et al. (2019).
The total uncertainty of the AirMAP tropospheric NO 2 VCD follows the error propagation of the three error sources https://doi.org/10. . (3) The error from the dSCD retrieval is estimated from the fit residual and is a direct output of the DOAS retrieval algorithm.Since no direct measurements of the NO 2 column in the reference ground scene exist, we assume a systematic error with an uncertainty of 100 % on the estimated value of 1 × 10 15 molec.cm −2 .The error resulting from the AMF determination depends in large part on the values of the uncertainty attributed to the surface reflectance, the accuracy of the NO 2 vertical profile, and the aerosol optical depth as a function of altitude and location.Following Meier et al. (2017), the total error on the AMF is estimated to be smaller than 26 %.Taking the mean dSCD value (1.2×10 16 molec.cm −2 ) and the mean dSCD error (2 × 10 15 molec.cm −2 ) as typical values, the total error of the tropospheric NO 2 VCD is ∼ 35 %.More details on error contributions can be found in Meier et al. (2017).The tropospheric NO 2 VCD over the Jülich flight area is smaller during the flight on Sunday 13 September than on Thursday 17 September, where several peaks in the NO 2 VCD up to 2.5 × 10 16 molec.cm −2 are visible.These peaks are caused by plumes of NO 2 coming from three large power plants, located in the Jülich research flight area, which are clearly visible in the maps of the AirMAP NO 2 VCD in Fig. 4. Two power plants are located in the northeast of the Jülich flight area, while one is located in the southwest.The plumes, which have enhanced tropospheric NO 2 VCDs compared to low background VCDs outside of the plume, are blown in the mean wind direction (shown in the top left corner of the maps) determined from ERA5 10 m wind data (Hersbach et al., 2018) for the flight area and in the middle of the flight time.Differences between the 2 measurement days over the Jülich flight area are related to wind conditions potentially enhanced by a weekend effect.On Sunday 13 September, there was a weak wind coming from the southwest blowing the plumes to the northeast, and thus two out of three plumes were mostly outside the flight area and cleaner air from a rural area was prevalent.On Thursday 17 September, a stronger wind coming from the opposite direction, the northeast, was blowing the plumes to the southwest.

AirMAP campaign data set
The three maps from flights over the Duisburg flight area show the strong NO x emissions from power plants and the industrial area in Duisburg with plumes oriented depending on wind direction.
The two AirMAP flights in the Cologne area show only slightly enhanced NO 2 amounts compared to the background tropospheric NO 2 VCD on both days.

Car DOAS instruments
During the S5P-VAL-DE-Ruhr campaign, mobile car DOAS measurements were performed by three institutions, the IUP-Bremen, the Max Planck Institute for Chemistry in Mainz (MPIC), and the Royal Belgian Institute for Space Aeronomy (BIRA).More information about the car DOAS instruments can be found in Schreier et al. (2019), Donner (2016), and Merlaud (2013).The measurement elevation angle was for the majority of measurements in zenith-sky with some off-zenith measurements.The off-zenith measurements are used in the estimation of the NO 2 SCD in the reference spectrum and the stratospheric NO 2 contribution for the BIRA and MPIC car DOAS measurements.The focus on zenithsky measurements during driving has the advantage of a stable viewing direction when the direction of travel changes.Variations from relative azimuth changes are avoided, and measurements cannot be blocked by buildings, which can be a large problem in cities.In addition, the highest horizontal resolution is achieved with this viewing geometry.

IUP car DOAS instrument and data retrieval
The IUP car DOAS instrument uses an experimental setup, which comprises an Avantes spectrometer and a light fiber with a fixed viewing direction to the zenith measuring scattered sunlight in the UV-Vis range (see also Schreier et al., 2019).Collected spectra are averaged over 10 s, which corresponds to traveled distances of around 80-300 m, depending on the driving speed.The DOAS method is applied to the measured spectra in a fitting window of 425-490 nm.The tropospheric NO 2 VCD from car DOAS zenith-sky measurements is determined in a similar manner to that used for the    et al., 2011;Merlaud, 2013;Schreier et al., 2019).Merlaud (2013) analyzed the AMF distribution for a large number of simulations, resulting in a mean of 1.33 ± 0.2 for measurements in 90 • viewing zenith angle.Since we only analyze data close to the AirMAP overpass, which was performing measurements around noon, the SZA does not vary much.
Following the mentioned studies, we assume an uncertainty of 20 % for the AMF.

MPIC car DOAS instrument and data retrieval
The MPIC car DOAS instrument uses an Avantes spectrometer with an active temperature stabilization and takes offaxis measurements at 22 • elevation in addition to the zenithsky measurements (see Donner, 2016).During the validation measurement period, only zenith-sky measurements were used to increase spatial and temporal coverage.The integration time was 30 s.Before and after the validation measurements, the elevation angles alternate between 22 • elevation and zenith-sky (90 • ).The combination of both angles allows the determination of the absorption in the reference spectrum SCD ref , as well as the absorption in the stratosphere.
The DOAS analysis is performed in a wavelength interval of 400-460 nm using a daily fixed reference background at 90 • elevation and at low SZA in a region with small NO 2 concentrations.NO 2 dSCDs retrieved from the DOAS analysis are converted to tropospheric NO 2 VCDs by using Eq.(4) (see also Wagner et al., 2010;Ibrahim et al., 2010).Radiative transfer model calculations for NO 2 box profiles of 500 or 1000 m and moderate aerosol loads provide on average tropospheric AMFs of 3 and 1.3 with an assumed uncertainty of 20 % for the 22 and 90 • elevation angle measurements, respectively (Shaiganfar et al., 2011;Merlaud, 2013).

BIRA car DOAS instrument and data retrieval
The BIRA car DOAS instrument consists of two Avantes spectrometers measuring simultaneously scattered light in 90 and 30 • elevation (see also Merlaud, 2013).Individual spectra are co-added, and the DOAS analysis is performed in a wavelength interval of 450-515 nm on spectra averaged every 30 s using a single pair of time-coincident low SZA zenith reference spectra for all measurement days.The measurements on both channels being simultaneous, the retrieval of tropospheric NO 2 VCDs follows the MAX-DOAS principle (see Eq. 5), using the differences in dSCDs and AMFs for two elevation angles.For the AMFs, a sun-positiondependent look-up table (LUT) is used.This LUT was calculated using DISORT and provides AMFs of 2.5 and 1.3 for the 30 and 90 • elevation angle measurements, respectively (Merlaud, 2013).An additional zenith-DOAS instrument was operated for SO 2 measurements, results are not shown in this study.

Car DOAS campaign data set
For the verification of the car measurements, regular colocations of the cars were used at selected meeting points and overlapping measurement routes.Figure A5 in Appendix A shows a scatter plot of the co-located car DOAS measurements, demonstrating a good agreement between the three instruments.In general, the car DOAS measurements were planned in a way that each car made measurements during a round trip of a large part of the research flight area.The routes were also chosen to pass by the ground-based measurement stations.The duration of the car measurements was typically around 4 h per day.This enabled measurements to be made during the complete AirMAP flight and the S5P overpass times to gather many closely co-located measurements.Several round trips, about three to four, were performed, depending on traffic conditions.In addition to spatial variations of NO 2 , temporal changes are also observed.
Figure 5 shows maps of car DOAS tropospheric NO 2 VCDs for the 7 d in the research flight areas around Cologne, Jülich, and Duisburg.Measurements are within ±1 h of the S5P overpass time given in the map title.As already seen in the AirMAP data, strong variability between the three target areas is observed.The highest amounts of NO 2 are visible around Duisburg with high spatial variability within the target area.The lowest amounts of NO 2 are found in the area around Cologne, which confirms the findings of the AirMAP measurements.The car DOAS measurements in the Jülich area show enhanced NO 2 values where the AirMAP measurements also see the plumes of the two power plants located in the northeast of the flight area.

Ground-based instruments
During the campaign period, six ground-based instruments, two Zenith-DOAS, two MAX-DOAS, and two Pandora instruments were measuring in the three target areas.The instrument locations are marked on the map of the TROPOMI tropospheric NO 2 VCDs in Fig. 1 and the flight overview map in Fig. 2.

Zenith-DOAS
Two Zenith-DOAS instruments were deployed and operated within the Ruhr area for several months.The instruments use an experimental setup, which comprises an Avantes spectrometer (290-550 nm) and a light fiber with a fixed viewing direction to the zenith measuring scattered sunlight in the UV-Vis spectral range (similar as in Schreier et al., 2019).One instrument is located at the Jülich research center next to the Pandora (Zenith-DOAS JUE) and the second at a local residence in Gelsenkirchen (Zenith-DOAS GEL), in the Duisburg research flight area.The tropospheric NO 2 VCDs are estimated from the dSCDs resulting from the DOAS fit using Eq. ( 4).For the reference background spectra in the DOAS fit, we use a fixed spectrum taken in summer on a clean day around noon.The amount of NO 2 in the reference background spectrum, SCD ref , is determined from the long time series using the lowest measured NO 2 .For the measurements made by the Zenith-DOAS GEL, this is a SCD ref of 1.7 × 10 16 molec.cm −2 .For the Zenith-DOAS JUE, the SCD ref is determined as 1.0 × 10 16 molec.cm −2 using the same approach.The SCDs ref given here include the stratospheric and tropospheric NO 2 in the reference background spectrum.Since the reference measurements were taken during summer, a relatively large part is stratospheric NO 2 .An uncertainty of 30 % for the SCD in the reference spectrum is assumed.The VCD strat is estimated from twilight Langley fits (e.g., Constantin et al., 2013) with an uncertainty of 2 × 10 14 molec.cm −2 , and the stratospheric AMFs are obtained from SCIATRAN calculations.For the tropospheric AMF we use the same value of 1.3 as for the car DOAS.Since we only analyze the measurements close to the AirMAP overpass, i.e., around noon, the SZA does not vary much and the influence on the AMF is small (see Sect. 3.3.1).

MAX-DOAS measurement truck
From 7 to 19 September 2020, the IUP Bremen measurement truck performed MAX-DOAS measurements in the harbor area of Duisburg close to the Rhine River (MAX-DOAS DUI).This MAX-DOAS instrument uses a UV spectrometer (282-412 nm) with a light fiber connected to a telescope on a pan-tilt head and was scanning in multiple elevation angles.The tropospheric NO 2 VCDs are estimated from the dSCD measurements in 30 • elevation angle with a sequential zenith-sky reference spectrum (interpolated from the zenithhttps://doi.org/10.5194/amt-16-1357-2023Atmos.Meas.Tech., 16, 1357-1389, 2023 sky measurements shortly before and after the off-axis measurement): Based on SCIATRAN AMF calculations for a wavelength of 350 nm, adjusted to the ground-based and AirMAP comparison times around noon regarding SZA and with typical albedo and AOT values found during the campaign measurement days, AMFs of 2.5 and 1.4 are used for elevation angles of 30 and 90 • , respectively.The total uncertainty of the tropospheric NO 2 VCD originates from uncertainties in the retrieved dSCD, which results mainly as the error of the DOAS fit, and uncertainties from the AMF for which we assume 20 %.

BIRA SkySpec MAX-DOAS
A further MAX-DOAS instrument was set up at the airport Schwarze Heide in Dinslaken (MAX-DOAS AIRPT) from 3 August to 29 September 2020.The instrument, deployed by BIRA, was an Airyx Compact SkySpec MAX-DOAS, based on an Avantes spectrometer (300-463 nm).A scanning prism in elevation direction can rotate 180 • , enabling elevation scan measurements in two azimuthal directions (Airyx GmbH, 2022;Kreher et al., 2020).At the airport, the instrument was scanning in azimuths of 132 and 312 • and at multiple elevation angles.In this study, only measurements inversion algorithm using dSCDs retrieved with the spectral fitting software QDOAS (Danckaert et al., 2017) using the FRM4DOAS settings and setup (Hendrick et al., 2016).The tropospheric NO 2 VCD error is calculated from the covariance smoothing error matrix, the covariance measurement noise error matrix and a systematic error as a fixed fraction of the VCD, based on the systematic uncertainty of the cross section, for NO 2 as 3 % (Vandaele et al., 1998).

Pandora
The Pandora instrument is a ground-based UV-Vis spectrometer that provides direct sun total column and sky scan MAX-DOAS tropospheric column observations, comprising an Avantes spectrometer (270-520 nm) (e.g., Herman et al., 2009;Kreher et al., 2020;Verhoelst et al., 2021).Two Pandora instruments are deployed and operated in the campaign area to provide long-term measurements.2014) approach (Cede et al., 2022).NO 2 values are given, together with the respective uncertainty (Cede et al., 2022), as tropospheric NO 2 VCD.The analyzed data are labeled with quality flags, which indicate whether the data quality is high, medium, or low and whether the data are quality assured and usable or not.Only data with a quality flag accounting for high and medium quality (assured as well as not assured) are used.
4 Evaluating airborne tropospheric NO 2 VCD with stationary ground-based data The data set of the stationary ground-based instruments, deployed at different sites in the three selected flight areas, is used to evaluate the AirMAP tropospheric NO 2 VCD.This, together with the mobile measurements, provides a basis for using the AirMAP data for the evaluation of the TROPOMI tropospheric NO 2 VCD.During the campaign, AirMAP overflights were conducted for all ground-based measurement stations.
A scatter plot of all coincident measurements is shown in Fig. 6.Each point is colored according to its instrument type and location.The shown AirMAP tropospheric NO 2 VCDs are averages of the measurements from an area of 500 m × 500 m around the ground-based measurement station.This is then assigned to the selected ground-based stationary measurements, which are averaged in time intervals of 20 min around the AirMAP overpass time.In total 25 coincident measurements were obtained by this procedure.Error bars of Fig. 6 represent the error in the tropospheric NO 2 VCD retrieval, averaged within the 500 m × 500 m grid boxes and 20 min time intervals.Fitting of the data was done with orthogonal distance regression, as for all following data shown in the present study.The AirMAP and ground-based tropospheric NO 2 VCDs are highly correlated (Pearson correlation coefficient r = 0.88) with a slope and standard deviation of 0.90 ± 0.09 and an offset of 1.16 ± 0.15 × 10 15 molec.cm −2 .Overall, the data show good agreement with a tendency toward slightly larger values from the ground-based instruments as compared to the airborne data.Part of the scatter and deviation may result from the different retrieval algorithms with different assumptions on radiative transfer, aerosols, and reference background spectra.Additionally, spatiotemporal variability of NO 2 is influencing the agreement of the comparison.Figure A6 in Appendix A shows the same information as Fig. 6, but the error bars represent the 10th and 90th percentile within the 500 m × 500 m grid boxes and 15 min time intervals to illustrate the spatiotemporal variability within the comparison criteria.

Evaluating airborne tropospheric NO 2 VCD with car DOAS data
The mobile car DOAS measurements performed by IUP, MPIC, and BIRA were synchronized to the AirMAP measurements.They were measuring during the complete flight in the same area as the AirMAP instrument to gather many closely co-located measurements between the instruments.The data are used, in addition to the stationary ground-based measurements, to evaluate the tropospheric NO 2 VCD maps retrieved from AirMAP.Compared to the stationary data, the car measurements have the advantage that they can cover larger and more diverse areas and thus potentially also a wider range of NO 2 values.As a result of having more opportunities to make near-simultaneous synchronized measurements, consequently, a larger number of co-located measurements can be compared.For the comparison, the car DOAS measurements are averaged in time intervals of 15 min and gridded in areas of 500 m × 500 m.The same grid is applied to the AirMAP measurements and a comparison of measurements in the same grid box and time interval is performed.
A scatter plot of all coincident car DOAS and AirMAP measurements fulfilling a time criterion of ±15 min is shown in Fig. 7.Each point is colored by the respective car DOAS instrument.In total, 572 pairs of coincident measurements are considered.Error bars of Fig. 7 represent the error in the tropospheric NO 2 VCD retrieval, averaged within the 500 m × 500 m grid boxes and 15 min time intervals.The comparison shows an offset of −1.29 ± 0.15 × 10 15 molec.cm −2 .This offset could be adjusted to be https://doi.org/10.5194/amt-16-1357-2023Atmos.Meas. Tech., 16, 1357-1389, 2023 closer to zero by increasing the estimated VCD trop, ref in the AirMAP retrieval by more than a factor of 2. However, the offset in the comparison of AirMAP and ground-based stationary data of 1.16 ± 0.15 × 10 15 molec.cm −2 is positive instead of negative, and a larger VCD trop, ref in the AirMAP retrieval would further increase this offset.Because of this reason and a lack of justification for a large difference between the VCD trop, ref for the car and AirMAP retrieval, we chose to leave the VCD trop, ref as it is.Nevertheless, it is clear that the validation of the offset has a large relative uncertainty as there may be offsets in the reference measurements.Aside from this, Fig. 7 shows a good correlation between the airborne and car DOAS instruments, with a correlation coefficient of r = 0.89.The orthogonal distance regression reveals a slope of 0.89 ± 0.02, i.e., close to unity.Considering tropospheric NO 2 VCD retrieval errors and that the data from the different instruments used for this comparison were analyzed independently by the different groups and only partly harmonized retrieval methods, with different assumptions about the radiative transfer, aerosols, and reference background spectra, the data show good agreement.Coincident measurements that are furthest from the 1 : 1 line are mostly cases where the time difference was at the outer edge of the time filter criterion and may therefore be caused by the rapid natural variability of NO 2 (see right plot in Fig. A7). Figure A7 shows the same information as Fig. 7 and an additional plot where points are color coded by time difference, but the error bars represent the 10th and 90th percentile within the 500 m × 500 m grid boxes and 15 min time intervals to illustrate the spatiotemporal variability.
6 Evaluating TROPOMI tropospheric NO 2 VCD with AirMAP tropospheric NO 2 VCD data The good agreement of the ground-based stationary and car DOAS data set with the AirMAP data gives confidence for using the AirMAP tropospheric NO 2 VCD data set to evaluate the TROPOMI products.Airborne observations are valuable for the evaluation of TROPOMI data, as a large number of satellite pixels are mapped in a relatively short time.
The AirMAP measurement time per flight is on the order of 3 h, with measurements over the target area planned to be taken at least ±1 h around the S5P overpass with the smallest VZA (see Fig. 3).In the comparison TROPOMI pixels are only considered when they are at least 75 % mapped by AirMAP pixels.AirMAP data are considered when they match the temporal coincidence criteria of ±30 min around the S5P overpass time.These spatially and temporally coincident criteria are following the suggestion by Judd et al. (2020).During the 7 flight days (for which TROPOMI data are only available on 6 d, due to ground segment anomalies), AirMAP measurements coincide with 117 TROPOMI pixels.For the comparison of the two data sets, the AirMAP measurements are averaged within the TROPOMI pixel.Fig-  3.
The horizontal error bars correspond to the 10th and 90th percentiles of all airborne measurements within the respective TROPOMI pixel.Vertical error bars represent the reported precision of the TROPOMI tropospheric NO 2 VCD.Error bars are shown only for these three examples to illustrate their magnitude and are not shown in the following plots for a better visibility of the data.An investigation of the different available TROPOMI NO 2 data versions compared to the AirMAP data with their different behavior (scatter, bias) gives further insight into the influence of different a priori assumptions made within each retrieval.
Figure 9a shows coincidences between the TROPOMI operational OFFL V01.03.02 data and the AirMAP data, with a high correlation coefficient of 0.86, a slope of 0.38 ± 0.02, an offset of 2.54 ± 0.15 × 10 15 molec.cm −2 , and a median relative difference of −9 % with an interquartile range of −28 %  to +16 %.All statistics of the comparisons between the different TROPOMI tropospheric NO 2 VCDs data versions and the AirMAP measurements are summarized in Table A1 in Appendix A. Figure A11 shows box-and-whisker plots summarizing the bias and spread of the difference between the TROPOMI versions and AirMAP tropospheric NO 2 VCDs.
The regression parameters and their standard errors are calculated for the plotted data points.Taking the uncertainties of the data points into account and considering the parameters of the orthogonal distance regression over the complete range of these uncertainties yields a standard deviation of 0.14 for the slope and 0.39×10 15 molec.cm −2 for the offset.
The slope of 0.38 is significantly lower than the 0.68 from comparisons of TROPOMI NO 2 OFFL V01.03.02 data and aircraft measurements in the New York City and Long Island Sound region reported by Judd et al. (2020) and the 0.82 from comparisons of TROPOMI and APEX measurements over Brussels and Antwerp reported by Tack et al. (2021).
The scientific TROPOMI data V01.03.02 CAMS based on the OFFL data V01.03.02 has the objective to investigate the influence of the NO 2 profile information by replacing the 1 • × 1 • TM5 NO 2 profiles with the spatially more highly resolved 0.1 • × 0.1 • CAMS-based profiles.The scatter plot comparing this TROPOMI data version with the AirMAP data is presented in Fig. 9b and shows a correlation coefficient of 0.86 and a slope of 0.41 ± 0.02.The median relative difference improves from −9 % to −5 %.The correlation has not changed compared to the original data version and the slope increased only slightly, demonstrating that the replacement of the NO 2 profile has only a small impact on this data set.In general, the replacement of the NO 2 profile increases the dynamical range of NO 2 VCDs, with the largest impact (5 %-30 %) in emission hot spots, but is dependent on the location and conditions (Douros et al., 2023).Tack et al. (2021) observed an increasing slope from 0.82 to 0.93 from the original data version to the version using the CAMS regional a priori over Belgium.Thus, the relative difference in slope between the original V01.03.02 and the V01.03.02 CAMS data is similar, i.e., 13 % in Tack et al. (2021) and 8 % in this study.
Since several validation activities already reported that the NO 2 data V01.02-01.03are biased low, a modified TROPOMI NO 2 retrieval led to the development of V02.03.01 and a complete mission reprocessing (see Sect. 3.1.3).The comparison of this TROPOMI product PAL V02.03.01 with the AirMAP data in Fig. 9c shows much more scatter, with a correlation coefficient that is significantly poorer than for the OFFL V01.03.02 product, changing from 0.86 to 0.76.The slope, however, increased by more than a factor of 2 from 0.38 ± 0.02 to 0.83 ± 0.06, demonstrating that the updates in the new TROPOMI NO 2 data version have a large impact on the analyzed data set from the Rhine-Ruhr region.Due to the large scatter and driven by the large number of measurements with tropospheric NO 2 VCDs of less than about 7 ± 0.15 × 10 15 molec.cm −2 , the PAL V02.03.01 product has a positive median relative difference of +20 % with an interquartile range of −14 % to +66 % (see Fig. A11).As described in Sect.3.1.3,the main change from V01.03 to V02.03.01 is the switch to the FRESCOwide product, which provides more realistic higher cloud altitudes for measurements with cloud fractions larger than zero.Only 1 out of the 117 TROPOMI pixels used in this study has a cloud fraction of zero.Higher cloud altitudes result in decreased tropospheric AMFs and therefore higher tropospheric NO 2 VCDs.With the update many of the 117 data points show increased TROPOMI VCDs and are now closer or even over the 1 : 1 line and thus increasing the slope and the median relative difference.However, there is a lower branch of data points (with low TROPOMI NO 2 , but large AirMAP NO 2 VCDs) that is not very affected by the modifications in the new data version and still matches the pattern of the OFFL V01.03.02 comparison (Fig. 9a).Comparisons of coincidences between the AirMAP and TROPOMI OFFL V01.03.02 and PAL V02.03.01 data, on the basis of single days, show different magnitudes of the described impact from the TROPOMI data version change (see Appendix Figs.A8 and A9).The addressed lower branch visible in the overall comparison of TROPOMI PAL V02.03.01 and AirMAP (Fig. 9c) is dominated by observations from 17 September and is linked to cloud pressures close to the surface even after the change from FRESCO-S to FRESCOwide (see Fig. A9, where points are color coded in terms of the surface and cloud pressure difference).In the OFFL V01.03.02 product, 110 out of 117 pixels and thus 97 % of the TROPOMI observations were found to have cloud heights very close to the surface (within 50 hPa), which is not realistic, especially not for such a large amount of observations.In the new PAL V02.03.01 product, the cloud retrieval yields a cloud height close to the surface for 28 out of 117 pixels, resulting in a better slope of the regression line.However, since some scenes remain problematic, it results in more scatter.Previous studies showed that for scenes with low clouds, i.e., close to the surface, a height that is even closer to the surface was retrieved by the original FRESCO implementation.Since the cloud algorithm does not discriminate between clouds and aerosols, this also holds for low aerosol layers.In many cases, FRESCO then retrieves the surface height, which is incorrect (Compernolle et al., 2021;van Geffen et al., 2022b).Observations during the flights and VIIRS images of the campaign measurement days revealed nearly perfect cloud-free conditions during the measurements over the target areas.Thus, the high cloud pressures are suspected to be caused by a higher aerosol load, which is identified as cloud.This assumption can be supported by the preoperational TROPOMI AOT product ( de Graaf, 2022).The daily maps depicted in Fig. A4 in Appendix A show a quite variable AOT over the region and between the different days, without any obvious correlation with the TROPOMI tropospheric NO 2 VCD.The highest AOT is found on 17 September spanning the pixels that show much lower tropospheric NO 2 VCDs than seen by AirMAP and cause the lower branch in the scatter plot.

Cloud effects
For TROPOMI the tropospheric NO 2 VCDs are corrected for cloud and aerosol effects by the AMFs accounting for cloud-contaminated pixels using a combination of a cloudy tropospheric AMF and a clear-sky tropospheric AMF (AMF trop, clr ).The determined cloud radiance fraction from the NO 2 window is on average 0.21 ± 0.10 with a maximum of 0.48.As mentioned before, based on observations during the measurement flights, VIIRS images and the TROPOMI AOT product these clouds detected by the cloud retrieval must be mostly aerosols, which are identified as clouds in the cloud correction.For nearly cloud-free observations, the cloud correction is more an aerosol correction (Boersma et al., 2011).To investigate the impact of the cloud correction on the TROPOMI tropospheric NO 2 VCDs, we calculated VCDs without this correction, VCD trop, nocc , using the following equation: Figure 10b shows the scatter plot between the TROPOMI PAL V02.03.01 tropospheric NO 2 VCD without cloud correction and the AirMAP tropospheric NO 2 VCD, having a high correlation of 0.85, a slope of 0.73 ± 0.04, and a median relative difference of +16 %.For comparison, Fig. 10a again shows the original PAL V02.03.01 tropospheric NO 2 VCD data with the original cloud correction, having a correlation of 0.76, a slope of 0.83 ± 0.06, and a median relative difference of +20 %.The data version without cloud correction does not show the discussed lower branch anymore, and the upper branch is reduced.Hence, the product without cloud correction has a much better correlation and illustrates that the two branches are caused by the cloud correction.
To investigate the effect of TROPOMI observations with cloud pressures close to the surface, we use an additional coincidence criterion separating the TROPOMI data into observations in which clouds and aerosols are respectively retrieved close to the surface and cases in which this is not the case.As in Judd et al. (2020), the criterion is looking for differences between the cloud pressure and the surface pressure ( CS).However, in contrast with Judd et al. (2020), data with CS > 50 hPa are kept, and the observations for which low clouds are retrieved are filtered out or replaced.The limit of 50 hPa is chosen based on the reported uncertainty of the cloud pressure retrieval (van Geffen et al., 2022a).For the 117 coincident pixels from the 6 measurement days, this criterion reduces the number of coincidences in the PAL V02.03.01 to 89.Thus, the cloud retrieval of PAL V02.03.01 yields a cloud height close to the surface for 23 % of the observations.In comparison, this is true for 53 % of the TROPOMI observations of V01.02 used in Judd et al. (2020) and for 97 % in V01.03.02 used in this study.Figure 10c shows the scatter plot of the TROPOMI PAL V02.03.01 vs. the co-located AirMAP NO 2 VCDs limited to pixels with surface and cloud pressure differences of CS > 50 hPa.Compared to the unfiltered PAL V02.03.01 product, the slope and correlation increased from 0.83 ± 0.06 to 0.96 ± 0.06 and 0.76 to 0.84.The median relative difference increased from +20 % to +29 %.As a next step we replaced the 28 observations with cloud pressures close to the surface with the VCDs without cloud correction.In this way, the number of coincidences is maintained.Figure 10d shows the result with a slope of 0.89 ± 0.05, a correlation of 0.84, and a median relative difference of +26 %.
The new TROPOMI data V02.03.01 provide a more realistic estimate of the cloud pressure for a large part of the measurements as compared to earlier data versions.However, for certain cases with a higher aerosol load, which is treated as a cloud in the cloud retrieval, the cloud pressures remain close to the surface and lead to negatively biased TROPOMI tropospheric NO 2 VCDs.Whether the cloud correction actually improves the NO 2 results in the presence of aerosols depends on the details of the vertical distributions of aerosols and NO 2 .In some cases, the results can be better if no cloud correction is made.To investigate this further, additional information about the vertical distributions of aerosols and NO 2 in the campaign area is needed.

NO 2 profile shape and surface reflectivity effects
To evaluate the influence of the auxiliary data, such as surface reflectivity or a priori NO 2 vertical profiles, on the TROPOMI NO 2 data, we developed a custom TROPOMI NO 2 product based on the retrieval of the PAL V02.03.01 product, named IUP V02.03.01, with the possibility to change auxiliary data used within the retrieval.
Figure 11a shows the comparison between the IUP V02.03.01 tropospheric NO 2 VCD and the AirMAP VCDs in dark blue.The PAL V02.03.01 data are shown in light blue (for details and regression statistics, see Fig. 9c).The correlation is 0.76, as in the PAL data comparison.The slope of 0.88 ± 0.06 is slightly higher than the 0.83 ± 0.06 and within the uncertainties.Since the agreement between the PAL V02.03.01 and the IUP V02.03.01 version is fairly good, we assume that the effects of changing auxiliary data would be similar for the PAL V02.03.01 product.
To demonstrate the impact of more highly resolved a priori NO 2 vertical profiles on the PAL V02.03.01 data, we recalculated AMFs and the tropospheric NO 2 VCDs using a priori tropospheric profiles from the regional 0.1 • × 0.1 • CAMS-Europe analyses for altitudes between the surface and 3 km as described in Sect. 3.1.4REG), while maintaining nearly the same correlation of 0.75 as compared to 0.76.With a relative difference in slope of 14 %, the change shows a slightly larger impact than the 8 % we found for changing the a priori NO 2 profile information from TM5 to CAMS-Europe for the OFFL V01.03.02 data set.Using the spatially more highly resolved profile information has the effect that the profile shape over source regions is improved in the sense that there is more NO 2 near the ground, which decreases the AMF and thus increases the tropospheric NO 2 VCD and compensates for the reduced sensitivity of TROPOMI to trace gases close to the surface.This has a larger effect in the case of the more realistic lower cloud pressures of the PAL V02.03.01 data.Observations for which the cloud pressure is still determined to be close to the surface, which are represented by the lower branch of points, are less affected by the change to the more highly resolved profiles.In combination with the improved cloud treatment, however, the improved NO 2 profiles reveal their positive impact.
Recalculating AMFs with the regional CAMS NO 2 profiles and the TROPOMI LER results in the IUP V02.03.01 REG LER product.Figure 11c compares the IUP V02.03.01 REG TROPOMI LER and AirMAP tropospheric NO 2 VCD, showing a slope of 1.02 ± 0.07 and a correlation of 0.74.Compared to the IUP V02.03.01 REG data (Fig. 11b), the slope increased slightly from 1.00 ± 0.07 to 1.02 ± 0.07, and the correlation hardly changed (from 0.75 to 0.74).The median relative difference changed from +31 % to +24 %.This comparison shows that replacing the OMI LER with the TROPOMI LER data only has a small impact on the TROPOMI NO 2 VCD retrieval for our data set.Differences between the OMI LER and TROPOMI LER are rather small in the campaign region and in the NO 2 fit window but can be larger in other regions, and a change would thus have a greater impact there.Recalculating AMFs with the regional CAMS NO 2 profiles and the TROPOMI DLER result in the IUP V02.03.01 REG DLER product, which is compared to the AirMAP data in Fig. 11d.The implementation of the DLER product leads to decreased TROPOMI NO 2 VCDs as compared to the products using OMI LER (Fig. 11b) or TROPOMI LER (Fig. 11c) and results in a slope of 0.95 ± 0.07 and a median relative difference of +21 % with a correlation of 0.75.Thus, the directional aspect of the surface reflectivity only plays a small role in the tropospheric NO 2 retrieval in the campaign region with nearly cloud-free conditions (mean cloud radiance fraction = 0.21 ± 0.10) during the measurement days.As for the comparison between OMI LER and TROPOMI LER, it should be pointed out that this result is specific to the area, month, and cloud conditions, as the reflectivity influences the cloud height retrieval and thus also the AMF.Larger differences could, for example, be expected for snowcovered surfaces with high reflectivity.Figure A10 in Ap-pendix A shows scatter plots of the TROPOMI tropospheric NO 2 VCD retrieved with TROPOMI LER and TROPOMI DLER for the 117 TROPOMI pixels used throughout the study and for larger areas up to one full orbit.All comparisons show only minor influences from the directional component.Since only TROPOMI observations made in September are compared, no larger snow-covered areas are expected, and a more detailed analysis including a different period and area would be needed to investigate possible larger differences.
All statistics of the comparisons between the different TROPOMI tropospheric NO 2 VCDs data versions and the AirMAP measurements are summarized in Table A1 and the box-and-whisker plots in Fig. A11  The ground-based stationary measurements conducted by different types of DOAS instruments (two Zenith-DOAS, two MAX-DOAS, two Pandora) deployed at different locations in the flight area provide independent, high-precision, and well-established data for the evaluation of the AirMAP retrievals.The AirMAP tropospheric NO 2 VCDs are highly correlated (r = 0.88) with the stationary ground-based VCDs with a slope of 0.90 ± 0.09.Due to limited overflight possibilities, the comparison is limited to in total 25 coincident measurements.
The car DOAS measurements have the advantage that they are mobile, can cover larger and more diverse areas, and can be better synchronized to the AirMAP measurements.They have a high temporal resolution and are coordinated in the AirMAP flight area to gather many co-located measurements.The evaluation of the AirMAP NO 2 VCD considers 572 coincident measurements that are highly correlated (r = 0.89) with a slope of 0.89 ± 0.02.
The combination of the two independent data sets to assess the AirMAP data gives confidence for using the AirMAP tropospheric NO 2 VCD data set to evaluate the TROPOMI products.Despite the fairly good spatial resolution of the TROPOMI measurements, the spatial variability within TROPOMI pixels can be large and cannot be fully captured by ground-based instruments.The AirMAP data, having a resolution of about 100 m × 30 m, create a link between the ground-based and TROPOMI measurements with a nadir resolution of 3.5 km × 5.5 km.Airborne measurements are more representative of the satellite measurements than point measurements as a large number of TROPOMI pixels can be fully mapped in a relatively short time.
For the comparison of TROPOMI and AirMAP tropospheric NO 2 VCDs, only TROPOMI pixels that are at least 75 % mapped by AirMAP and measurements that are less than ±30 min separated in time are used.This results in 117 TROPOMI pixels coinciding with AirMAP measurements during the six flights.Due to nearly cloud-free conditions during the measurement days, the cloud radiance fraction re-trieved in the TROPOMI NO 2 spectral window was on average 0.21 ± 0.10 with a maximum of 0.48, and thus all measurements were below the recommended filter criterion of 0.5.
We evaluate the TROPOMI tropospheric NO 2 VCD data from 12 to 18 September 2020 using the two data products OFFL V01.03.02 and PAL V02.03.01 and scientific data versions.One scientific version is based on the OFFL V01.03.02 with a replacement of the a priori NO 2 profiles from the TM5 model by the CAMS-Europe and CAMS-global product, and one scientific product reproduces the PAL V02.03.01 in which different a priori assumptions are replaced and their effects investigated.
The different TROPOMI and AirMAP data sets are correlated with correlation coefficients between 0.74 and 0.86, slopes of 0.38 ± 0.02 to 1.02 ± 0.07, and relative mean differences between −9 % and 31 %.The operational OFFL V01.03.02 and the scientific V01.03.02 CAMS product show a clear underestimation of TROPOMI compared to the AirMAP tropospheric NO 2 VCDs with a slope of 0.38 ± 0.02 and 0.41 ± 0.02 and median relative differences of −9 % and −5 %, respectively.Both products show a high correlation with a correlation coefficient of 0.86.
The updates implemented in the TROPOMI PAL V02.03.01 product increase the slope from 0.38 ± 0.02 to 0.83 ± 0.06 but result in much more scatter and reduce the correlation from 0.86 to only 0.76, demonstrating the large impact of the modifications on the analyzed data set.Due to the large scatter and driven by the large number of measurements with tropospheric NO 2 VCDs of less than about 7 ± 0.15×10 15 molec.cm −2 , the PAL V02.03.01 product has a median relative difference of +20 % with an interquartile range of −14 % to +66 %.The main change influencing the tropospheric NO 2 VCD is the switch from the FRESCO-S to the FRESCO-wide product, which results in more realistic higher cloud altitudes and therefore decreased tropospheric AMFs and higher tropospheric NO 2 VCDs.In the analyzed TROPOMI data set many of the data points are affected by the modifications and are thus closer or even over the 1 : 1 line and increase the slope and the median relative difference.However, there is a lower branch with low TROPOMI NO 2 VCDs but large AirMAP NO 2 VCDs that still shows cloud pressures close to the surface.The clearly decreased correlation is mainly caused by this separation of the data into two branches, with one branch around the 1 : 1 line and a second with low biased TROPOMI observations close to the distribution seen in OFFL V01.03.02.
We found that the TROPOMI observations on the lower branch are dominated by the observations from a single day and are linked to cloud pressures that are still close to the surface as in the OFFL V01.03.02 product, i.e., they are not affected much by the modifications.Due to nearly cloud-free conditions during the measurement flights, the high cloud pressures are suspected to be caused by a higher aerosol load (which is identified as cloud) and are not accounted for ade-quately in the cloud correction.This assumption is supported by the TROPOMI AOT product, which shows a high AOT for the pixels that show much lower tropospheric NO 2 VCDs than seen by AirMAP and cause the lower branch in the scatter plot.
Comparing TROPOMI PAL V02.03.01 VCDs without cloud correction with the AirMAP VCDs decreases the slope from 0.83 ± 0.06 to 0.73 ± 0.04 and the median relative difference from +20 % to +16 % but brings the two branches together, which improves the correlation from 0.76 to 0.84.This illustrates that the two branches are caused by the cloud correction.We introduced an additional criterion that filters TROPOMI observations with surface to cloud pressure differences of less than 50 hPa, i.e., clouds close to the surface, and either excluded these pixels or replaced them with the NO 2 VCDs without cloud correction.This increases the slope from 0.83 ± 0.06 to 0.96 ± 0.06 and 0.89 ± 0.05 and improves the correlation from 0.76 to 0.84, respectively.Thus, we saw that the PAL V02.03.01 NO 2 product provides a more realistic estimate of the cloud pressure for a large part of the measurements as compared to earlier data versions, but for certain cases with a higher aerosol load, cloud pressures remain close to the surface and lead to negatively biased TROPOMI tropospheric NO 2 VCDs and a larger scatter.Therefore, in some cases the results can be better if no cloud correction is made.
We developed a custom TROPOMI NO 2 product based on the retrieval of the PAL V02.03.01 but replacing the TM5 a priori NO 2 profiles with the spatially more highly resolved CAMS-Europe product for altitudes up to 3 km.This modification increases the slope from 0.88 ± 0.06 to 1.00 ± 0.07 with consistent correlation.
Replacing the OMI LER data with the more highly resolved TROPOMI LER or DLER data in the NO 2 fit window only has a small impact on the TROPOMI NO 2 VCDs of our data set and the comparison to the AirMAP data.The slope increases from 1.00 ± 0.07 to 1.02 ± 0.07 using the TROPOMI LER and decreases to 0.95 ± 0.07 using the TROPOMI DLER.The influence of the surface reflectivity on the VCD retrieval is rather small in the campaign region but can be larger in other regions or seasons, especially regarding snow-covered surfaces and different cloud conditions, as the reflectivity influences the cloud height retrieval and thus the AMF.A larger impact is expected when applying the TROPOMI DLER in the NIR-FRESCO cloud retrieval, affecting the NO 2 retrieval through adjusted cloud parameters.
In summary, a validation has been presented of the TROPOMI tropospheric NO 2 retrievals based on airborne mapping flights supported by ground-based stationary and car DOAS measurements.We found that the modifications in the cloud pressure retrieval in the TROPOMI PAL V02.03.01 data product lead to more realistic lower cloud pressures and thus larger tropospheric NO 2 VCDs for a large part of the analyzed observations compared to the OFFL V01.03.02 prod-uct.While this improves the slope, it significantly increases the scatter.The results can be improved for cases with high aerosol load and retrieved cloud pressures close to the surface if no cloud correction is made.Spatially more highly resolved a priori NO 2 profile information can further increase the tropospheric NO 2 VCDs, while the application of the TROPOMI LER and DLER had only small effects.Further validation activities on the TROPOMI PAL V02.03.01 data product using larger data sets in more regions with different pollution levels, surface reflectances, and aerosol and cloud conditions would help to evaluate the performance of the TROPOMI NO 2 product under different conditions and confirm the results found in this data set.After reprocessing of the new V02.04NO 2 retrieval, which has a consistent implementation of the TROPOMI DLER climatology in the NO 2 fit window and the NIR band for the cloud retrieval, comparisons to the campaign data set can investigate the impact of this modification.The presented validation strategy can be assigned to future validation activities for upcoming satellite missions such as GEMS, TEMPO,  https://doi.org/10.5194/amt-16-1357-2023Atmos.Meas.Tech., 16, 1357-1389, 2023            The article processing charges for this open-access publication were covered by the University of Bremen.
Review statement.This paper was edited by Robyn Schofield and reviewed by three anonymous referees.

Figure 1 .
Figure 1.S5P TROPOMI tropospheric NO 2 VCD taken from the PAL V02.03.01 product for the month of September 2020 in central Europe (a) and a close-up map of the campaign target area in North Rhine-Westphalia (b).The three research flight target areas and the ground-based measurement sites are shown.

Figure 2 .
Figure 2. Overview of the flight area of the Ruhr campaign, including exemplary flight patterns in the three target areas and locations of the stationary instruments in Jülich (blue), Cologne (red), Dinslaken airport (purple), Gelsenkirchen (cyan), and Duisburg (green).

Figure 3
Figure 3 shows a time series of tropospheric NO 2 VCDs measured by AirMAP for each of the 7 flight days of the campaign.The mean over the 35 viewing directions is shown in dark colors, and their standard deviation is shown in light colors.The colors red, blue, and green represent the respective research flight areas around Cologne, Jülich, and Duisburg.The S5P overpass times with respective VZA and the times of the AirMAP reference background measurement are marked by the vertical dashed lines.Two flights were performed in the research flight area around Cologne (red), two flights were performed in the Jülich area (blue), and three flights were performed in the Duisburg area (green).The first two flights shown in Fig. 3 are weekend days, a Saturday and a Sunday.The columns show strong variability between the three target areas and from day to day with the highest tropospheric NO 2 VCDs being ∼ 5 × 10 16 molec.cm −2 over the Duisburg area on Monday 14 September and Tuesday 15 September 2020 and much lower values for both flights in the Cologne area, having tropospheric NO 2 VCDs of up to 2.5×10 16 molec.cm −2 .Maps of the tropospheric NO 2 VCD for each flight are displayed in Fig. 4.The tropospheric NO 2 VCD over the Jülich flight area is smaller during the flight on Sunday 13 September than on Thursday 17 September, where several peaks in the NO 2 VCD up to 2.5 × 10 16 molec.cm −2 are visible.These peaks are caused by plumes of NO 2 coming from three large power plants, located in the Jülich research flight area, which are clearly visible in the maps of the AirMAP NO 2 VCD in Fig.4.Two power plants are located in the northeast of the Jülich flight area, while one is located in the southwest.The

Figure 3 .
Figure 3. Plots of the AirMAP time series of tropospheric NO 2 VCD (mean over the 35 viewing directions and standard deviation as a dark line and bright area, respectively) for the 7 flight days from Saturday 12 September-Friday 18 September 2020.These show strong variability from day to day (weekday vs. weekend) and between the three target areas (Cologne, Jülich, Duisburg).The dashed black vertical lines indicate S5P overpass times with their viewing zenith angle.The dashed colored vertical lines indicate the times of the AirMAP reference measurement.

Figure 4 .
Figure 4. Maps of VCD NO 2 from AirMAP flights from 12 to 18 September 2020.Two flights in the research flight area around Cologne (left column), two flights in the flight area around Jülich (second column), and three flights in the flight area around Duisburg (third and fourth column).The mean wind direction and speed in the flight area, determined from ERA5 10 m wind data for the middle of the flight, are given in the top left corner.

Fig
Fig.A5).Using a larger VCD trop, ref in the IUP car DOAS retrieval would increase the offset compared to the MPIC and BIRA car DOAS data.Since we used a fixed reference background measurement for all car DOAS measurement days, a stratospheric correction based on the Bremen 3D chemistry transport model (B3dCTM;Hilboll et al., 2013b), providing a daily diurnal cycle of the stratospheric NO 2 VCDs and scaled to TROPOMI stratospheric VCDs in the measurement area, is applied to the car DOAS data.Stratospheric AMFs are calculated with the radiative transfer model SCIATRAN(Rozanov et al., 2014) as a function of the SZA.For the conversion of tropospheric SCDs to tropospheric NO 2 VCDs, a constant tropospheric AMF of 1.3 was used.The AMF of 1.3 for an elevation angle of 90 • is closer to the true AMF (derived from radiative transfer simulations) than the geometric approximation for the tropospheric AMF of 1(Shaiganfar et al., 2011;Merlaud, 2013;Schreier et al., 2019).Merlaud (2013) analyzed the AMF distribution for a large number of simulations, resulting in a mean of 1.33 ± 0.2 for measurements in 90 • viewing zenith angle.Since we only analyze data close to the AirMAP overpass, which was performing measurements around noon, the SZA does not vary much.Following the mentioned studies, we assume an uncertainty of 20 % for the AMF.

Figure 5 .
Figure 5. Maps of tropospheric NO 2 VCDs from car DOAS measurements from 12 to 18 September 2020 in the research flight areas around Cologne, Jülich, and Duisburg.Measurements are within ±1 h of the S5P overpass time given in the title.
in a northwesterly direction (312 • ) are used for the analysis.The tropospheric NO 2 VCDs are retrieved by applying the Mexican MAX-DOAS Fit (MMF; Friedrich et al., 2019) They were installed in August 2019 and were still in operation in 2022.One Pandora is located at the Jülich research center (Pandora JUE) and a second is located in Cologne, district Deutz (Pandora COL).Locations are marked in Fig. 1.All data are processed as part of the Pandonia Global Network (2023).Tropospheric NO 2 VCDs are retrieved using coincident sky scan MAX-DOAS and direct-sun observations and are calculated based on the Spinei et al. (

Figure 6 .
Figure 6.Scatter plot of AirMAP data against the stationary ground-based NO 2 VCDs averaged over a time interval of 20 min closest to the AirMAP overpass data, which are averaged over a 500 m × 500 m area around the station site.Each point is colored according to its ground-based instrument type and location.Error bars represent the error in the tropospheric NO 2 VCD retrieval, averaged within the 500 m × 500 m grid boxes and 20 min time intervals.The 1 : 1 line is indicated by the dashed grey line.The solid black line represents the orthogonal distance regression.

Figure 7 .
Figure 7. Scatter plot between co-located car DOAS (±15 min window from the aircraft overpass) and AirMAP NO 2 VCDs using grid boxes of 500 m × 500 m and 15 min time intervals.The data points from BIRA, MPIC, and IUP car DOAS instruments are color coded in red, green, and orange, respectively.The 1 : 1 line is indicated by the dashed grey line.The thick solid black line represents the orthogonal distance regression.Error bars represent the error in the tropospheric NO 2 VCD retrieval, averaged within the 500 m × 500 m grid boxes and 15 min time intervals.

Figure 8 .
Figure 8. Daily maps of tropospheric NO 2 VCDs demonstrating how AirMAP data are matched to TROPOMI measurements.(top row) TROPOMI PAL V02.03.01 tropospheric NO 2 VCDs where qa_value > 0.75.(middle row) AirMAP tropospheric NO 2 VCDs with overlaid TROPOMI pixel outlines that fulfill the co-location criteria of a coverage of at least 75 % and AirMAP measurements ±30 min around the S5P overpass.(bottom row) AirMAP tropospheric NO 2 VCDs scaled to the TROPOMI pixel.

Figure 9 .
Figure 9. Scatter plots of TROPOMI NO 2 VCDs vs. co-located AirMAP NO 2 VCDs for different versions of TROPOMI data: (a) operational OFFL V01.03.02,(b) V01.03.02 based on the CAMS NO 2 profiles, and (c) PAL V02.03.01.AirMAP measurements are considered as colocated and gridded to the TROPOMI pixel if they are taken ±30 min around the S5P overpass and are covering at least 75 % of the TROPOMI pixel.The horizontal error bars represent the 10th and 90th percentiles of airborne measurements within the TROPOMI pixel.Vertical error bars show the reported precision of the TROPOMI tropospheric NO 2 VCD.Error bars on the TROPOMI measurements are shown to illustrate their magnitude and are not shown for all further plots for better visibility of the data.

Figure A2 .
Figure A2.NO 2 profile used in the SCIATRAN tropospheric AMF calculations.The profile is based on old WRF-Chem model runs and scaled to the typical boundary layer height during the measurement days around noon.

Figure A3 .
Figure A3.Scatter plots of TROPOMI PAL V02.03.01 tropospheric NO 2 VCDs vs. co-located AirMAP tropospheric NO 2 VCDs with an (a) AOT of 0.003, (b) AOT of 0.3, and (c) AOT of 0.6.AirMAP measurements are considered as co-located and gridded to the TROPOMI pixel if they are taken ±30 min around the S5P overpass and are covering at least 75 % of the TROPOMI pixel.

Figure A4 .
Figure A4.Daily maps of (top) TROPOMI AOT at 440 nm where the qa_value is > 0.5 and (bottom) TROPOMI PAL V02.03.01 tropospheric NO 2 VCDs where the qa_value is > 0.75.Black boxes represent TROPOMI pixel outlines that fulfill the co-location criteria of having an AirMAP coverage of at least 75 % and AirMAP measurements performed ±30 min around the S5P overpass (see also Fig. 8).

Figure A5 .
Figure A5.Scatter plot between co-located car DOAS measurements (±5 min time window) of MPIC and BIRA car DOAS data vs.IUP car DOAS tropospheric NO 2 VCDs averaged within 200 m × 200 m grid boxes and 5 min time intervals.The data points from the BIRA and MPIC car DOAS instrument are color coded in red and green, respectively.The thick solid black line represents the orthogonal distance regression.Error bars represent the error in the tropospheric NO 2 VCD retrieval, averaged within the 200 m × 200 m grid boxes and 5 min time intervals.

Figure A6 .
Figure A6.The same as Fig. 6 but with different error bars.Scatter plot showing the stationary ground-based NO 2 VCDs averaged in a time interval of 20 min closest to the AirMAP overpass data, which are averaged over a 500 m × 500 m box around the station site.Error bars represent the 10th and 90th percentile within the 500 m × 500 m grid boxes and 20 min time intervals.

Figure A7 .
Figure A7.Similar to Fig. 7 but with different error bars.Scatter plots showing co-located car DOAS (±15 min window from the aircraft overpass) and AirMAP NO 2 VCDs using grid boxes of 500 m × 500 m and 15 min time intervals.The data points from BIRA, MPIC, and IUP car DOAS instruments are color coded as red, green, and orange, respectively (a).The color coding in panel (b) shows the time difference between the AirMAP and car DOAS measurements.Error bars represent the 10th and 90th percentile within the 500 m × 500 m grid boxes and 15 min time intervals.

Figure A8 .
Figure A8.Daily scatter plots of TROPOMI operational OFFL V01.03.02 tropospheric NO 2 VCDs vs. co-located AirMAP tropospheric NO 2 VCDs for the 6 measurement days.Points are color coded to show the surface and cloud pressure difference.AirMAP measurements are considered as co-located and gridded to the TROPOMI pixel if they are taken ±30 min around the S5P overpass and are covering at least 75 % of the TROPOMI pixel.

Figure A10 .
Figure A10.Scatter plots of TROPOMI IUP V02.03.01 tropospheric NO 2 VCDs with TROPOMI LER and TROPOMI DLER for (a) the 117 TROPOMI pixels coinciding with the AirMAP measurements used throughout the study, (b) a larger orbit segment over western Europe on 13 September 2020, and (c) one full orbit including the campaign area on 13 September 2020.All data are quality and cloud filtered using the qa_value of 0.75.

Figure A11 .
Figure A11.Box-and-whisker plots summarizing the bias and spread of the difference between the different TROPOMI versions and AirMAP tropospheric NO 2 VCDs.The green line inside the box represents the median relative difference.Box bounds mark the 25th and 75th percentiles, while the whiskers represent the 5th and 95th percentiles.

Table 1 .
List of the aircraft activities, including S5P overpass information.All times are in UTC.On days with two S5P overpasses over the area, flights were arranged to coincide with the overpass at smaller VZA of TROPOMI.

Table 2 .
List of instruments included in S5P-VAL-DE-Ruhr campaign activities with location and observation geometry.Car DOAS instruments are operated by three different institutes: the Institute of Environmental Physics, University of Bremen (IUP); the Max Planck Institute for Chemistry in Mainz (MPIC); and the Royal Belgian Institute for Space Aeronomy (BIRA).

Table 3 .
TROPOMI NO 2 product versions with the most important differences between the analyzed products.