Interlaboratory Comparison of Branched GDGT Temperature and pH Proxies Using Soils and Lipid Extracts

Ratios of glycerol dialkyl glycerol tetraethers (GDGT), which are membrane lipids of bacteria and archaea, are at the base of several paleoenvironmental proxies. They are frequently applied to soils as well as lake‐ and marine sediments to generate records of past temperature and soil pH. To derive meaningful environmental information from these reconstructions, high analytical reproducibility is required. Based on submitted results by 39 laboratories from across the world, which employ a diverse range of analytical and quantification methods, we explored the reproducibility of brGDGT‐based proxies (MBT ′ 5ME , IR, and #rings tetra ) measured on four soil samples and four soil lipid extracts. Correct identification and integration of 5‐ and 6‐methyl brGDGTs is a prerequisite for the robust calculation of proxy values, but this can be challenging as indicated by the large inter‐interlaboratory variation. The exclusion of statistical outliers improves the reproducibility, where the remaining uncertainty translates into a temperature offset from median proxy values of 0.3–0.9°C and a pH offset of 0.05–0.3. There is no apparent systematic impact of the extraction method and sample preparation steps on the brGDGT ratios. Although reported GDGT concentrations are generally consistent within laboratories, they vary greatly between laboratories. This large variability in brGDGT quantification may relate to variations in ionization efficiency or specific mass spectrometer settings possibly impacting the response of brGDGTs masses relative to that of the internal standard used. While ratio values of GDGT are generally comparable, quantities can currently not be compared between laboratories.


Introduction
Glycerol dialkyl glycerol tetraethers (GDGTs) form the basis of several organic proxies used for the reconstruction of past environmental conditions.For example, archeal isoprenoid GDGTs (isoGDGTs) preserved in marine sediments are used as proxy for seawater temperature reconstructions, based on the degree of cyclization of these compounds, quantified in the TEX 86 lipid paleothermometer (Schouten et al., 2002).Similarly, the branched and isoprenoid tetraether (BIT) index, used to reconstruct soil-derived organic matter input to marine sediments (Hopmans et al., 2004), is based on the relative contribution of branched GDGTs (brGDGTs) produced by heterotrophic soil bacteria to crenarchaeol, an isoGDGT that is exclusively produced by Nitrososphaerota (formerly Crenarchaeota and Thaumarchaeota) (Sinninghe Damsté et al., 2002).Crenarchaeol is also used as a tracer for aerobic ammonium oxidation in (suboxic) terrestrial environments (i.e., Yang et al., 2019).In addition, the distribution of brGDGTs serves as a paleothermometer for terrestrial temperatures; it is based on their degree of methylation, which was found to increase with decreasing temperature in globally distributed surface soils (Weijers, Schouten, et al., 2007).This mechanism forms the basis of the MBT′ 5ME proxy (Equation 1), which only uses the brGDGTs with a methyl group on the fifth carbon position (5-methyl brGDGTs) in the alkyl chain (De Jonge, Hopmans, et al., 2014) in contrast to those with a methyl group on the sixth carbon position (i.e., 6methyl brGDGTs) or other carbon atoms (Ding et al., 2016).Although different calibrations exist to translate MBT′ 5ME into temperatures for soils (Dearing Crampton-Flood et al., 2020;De Jonge, Hopmans, et al., 2014), peats (Naafs et al., 2017), and lakes (Martínez-Sosa et al., 2021), the temperature dependence of the MBT′ 5ME has been shown to be near-universal (Raberg et al., 2022).The application of this proxy to different sedimentary archives-such as river fan sediments that receive substantial contributions from land, loess-paleosol sequences, ancient peats/lignites, or lake sediments-has led to an ever-expanding collection of continental temperature records that span a wide range of timescales (e.g., Inglis et al., 2017;Lauretano et al., 2021;Lu et al., 2019;O'Connor et al., 2023;Peterse et al., 2011;Sinninghe Damsté et al., 2012;Weijers, Schefuß, et al., 2007).Furthermore, both the degree of cyclization of the alkyl chains (Weijers, Schouten, et al., 2007) and the abundance of 6-methyl brGDGTs relative to that of the 5-methyl brGDGTs, used in the CBT′ ratio, are linked to pH in mineral soils (De Jonge, Hopmans, et al., 2014;Yang et al., 2015) and peats (Naafs et al., 2017).These differences in membrane structures can also be separately quantified using the #rings of tetra-, penta-, or hexamethylated brGDGTs (Sinninghe Damsté, 2016) or the isomer ratio (IR;De Jonge, Stadnitskaia, et al., 2014) for the relative abundance of 6-methyl brGDGT isomers.Notably, the existence of 6-methyl isomers, partially or fully co-eluting with their 5-methyl homologs using the original chromatographic method based on a single cyano column (Hopmans et al., 2004), was only recognized using improved chromatographic methods and after the isolation and structural identification of these compounds (Becker et al., 2013;De Jonge et al., 2013;Zech et al., 2012).Following the widespread presence of these compounds in a wide range of modern environments (e.g., De Jonge, Hopmans, et al., 2014), Hopmans et al. (2016) developed a liquid chromatography method that further improved Geochemistry, Geophysics, Geosystems 10.1029/2024GC011583 DE JONGE ET AL.
2 of 17 the separation of these isomers using two BEH HILIC UHPLC columns in tandem.This method of GDGT analysis is currently used by most organic geochemistry laboratories worldwide.
Analytical reproducibility, including both precision and accuracy, is crucial for the wide and comparable application of biomarker-based proxies to quantify key environmental parameters such as temperature in the geological past.A first study comparing the measurement and quantification of biomarkers between laboratories focused on alkenones in marine sediments (Rosell-Melé et al., 2001) that are linked to sea surface temperatures in their degree of unsaturation, quantified in the U k' 37 index (Brassell et al., 1986;Prahl & Wakeham, 1987).This intercomparison study paved the way for studies comparing other temperature proxies developed by different disciplines, such as Mg/Ca measurements of planktonic foraminifera (Rosenthal et al., 2004) or carbonate clumped isotope measurements (Bernasconi et al., 2021), but also those based on biomarkers.Specifically, the TEX 86 and the BIT indices have previously been assessed in two interlaboratory comparison studies (Schouten et al., 2009(Schouten et al., , 2013)).The first round robin study was performed in 2008, when 15 laboratories analyzed the polar GDGT-containing fraction of two sediment extracts and submitted their results of TEX 86 and BIT index values (Schouten et al., 2009).This study found that the repeatability, that is, the intralaboratory variation based on replicate runs of the same sample, of both proxies was small; whereas, the reproducibility, that is, the interlaboratory variation, could vary by up to 3-4°C for TEX 86 -derived sea surface temperature (SST) estimates but was especially larger for the BIT index, which could be up to 0.4 on a scale of 0-1.The offset in the BIT index was attributed to the relatively large difference in molecular weight between the isoprenoidal GDGT crenarchaeol (m/ z 1,292) and the brGDGTs (m/z's 1,022-1,050), and how they are detected by used mass spectrometers.Thus, the second round robin study from 2012 consisting of 35 participating laboratories included mixtures of isolated brGDGTs and crenarchaeol using known pre-weighed quantities in addition to homogenized and pre-extracted sediments (Schouten et al., 2013).The results from this second study showed an improved reproducibility of the TEX 86 index compared with the first round robin study, translating to SST differences between 1.3 and 3.0°C.However, the reproducibility of BIT index values for the mixtures with known molar ratios of crenarchaeol and brGDGTs was again poor and reported BIT index values generally overestimated the molar-based values again pointing toward a different mass-spectrometric response between these two classes of GDGTs.This finding resulted in the recommendation to use calibrated standards to optimize comparability among laboratories.
So far, the reproducibility of the brGDGT-based MBT′ 5ME , IR, and #rings tetra proxies has never been assessed.Therefore, we conducted an anonymized round-robin study to specifically target brGDGT-based proxies.A total of 39 laboratories based in 15 countries worldwide participated in this study.The participating labs analyzed brGDGTs and crenarchaeol in four different homogenized soils and four polar fractions of soil extracts.Since the repeatability of GDGT analysis is generally monitored internally and reported in the literature as part of the methods, here we focus on the reproducibility of the proxy data.In addition to reporting the established indices, participating laboratories were asked to quantify crenarchaeol and brGDGTs using either an internal or external standard.This quantification allowed us to determine whether possible differences in the quantification of GDGTs result from interlaboratory variations in extraction or purification or from which mass-spectrometric system was used.

Study Setup
A general invitation to join the round robin study was sent to organic geochemistry laboratories worldwide and advertised online in September 2022.A total of 48 laboratories from 16 countries responded to this invitation.Sample sets were sent out in November 2022 to all participants.The laboratories received four centrifuge tubes containing homogenized soils labeled A-D and four 4-mL vials containing polar fractions of pre-extracted soils labeled E-H prepared at Utrecht University, The Netherlands, and ETH Zürich, Switzerland.The participating laboratories were asked to analyze the samples using their routine HPLC-MS method, following their local extraction and sample preparation protocols.In addition to the peak areas of the 15 brGDGTs that are generally integrated for calculating ratios that are applied as climate proxies, all laboratories were asked to report the peak areas for crenarchaeol to enable determination reproducibility of the BIT index reproducibility, which was low in the previous round robin studies that focused on isoGDGTs in marine samples (Schouten et al., 2009(Schouten et al., , 2013)).Additionally, participating laboratories were asked to quantify crenarchaeol and the summed brGDGTs concentrations in the soil samples and soil extracts using either internal or external standards according to their protocols.
The participating laboratories did not know the origin of the soil samples and extracts to ensure an unbiased result.A dedicated email address was opened and monitored by a third person (T.Blattmann) to ensure anonymization of the results after submission.The results from 39 laboratories were received before the deadline and are reported here.

Sample Origin
Soil A "Canada, soil" was derived from the Black Creek Watershed on the Yukon coastal plain in the Western Canadian Arctic.The mean annual air temperature at the site is 10°C.The soil was collected from the Bg horizon of the active layer in August 2018, and has a total organic carbon (TOC) content of 48% (Speetjens et al., 2022).The soil pH (measured in deionized H 2 O, soil/water ratio of 1:2.5) was 4.9.The soil was freeze-dried and homogenized by hand using mortar and pestle, after which 100 g was extracted to yield Extract F. The remainder was divided into ∼0.45g subsamples in 15-mL falcon tubes (Sample A).
Soil B "Switzerland, soil" was collected in the Pfynwald nature reserve in Valais, Switzerland, representing the A horizon (10-20 cm depth).The mean annual temperature at the site is 10.7°C.The soil was collected in 2019 and stored frozen before air drying and sieving using a 4 mm sieve.The pH of the soil was measured potentiometrically in 0.01 M CaCl 2 with a solid-extractant ratio of 1:2, and reflects a slightly alkaline soil (pH = 7.3) with a TOC content of 3.3%.After homogenization by mortar and pestle, a total of wt.%.200 g was extracted to obtain Extract E. The remaining soil was divided into ∼4 g subsamples (Sample B).
Soil C "Rwanda, soil" was collected as part of the "TropSOC" project (DFG 387472333) in Mujabagiro (2.4645°S, 29.10346°E), Rwanda, from an altitude of 1,908 m in a forest, representing a soil depth of 10-20 cm (NPL-1 C2).The mean annual temperature at the site is 17.2°C (TropSOC database; Doetterl et al., 2021).pH was measured in 1M KCl, reflecting an acidic soil (pH = 3.2).The soil was sampled in 2018, air-dried, and stored at room temperature.After homogenization by mortar and pestle, approximately 200 g of soil was extracted to obtain Extract H. Subsamples (∼3.2 g) of remaining soil were distributed as Sample C.
Soil D "Brazil, soil" was collected from a site in the Amazon Basin (49.4524°W, 14.30296°S), Brazil, located within a Cerrado dry forest biome, and represents a dry forest topsoil (A horizon, 0-5 cm depth).The mean annual temperature at the site is 26°C, and the soil pH is 6.7 (Häggi et al., 2023).The soil was sampled in 2019, and transported at room temperature before freeze-drying and homogenization by mortar and pestle.Subsamples of ∼3.0 g were distributed as sample D.
Extract E "Switzerland, extract" is derived from Soil B. All extracts represent the same aliquot volume and thus brGDGT concentration.
Extract F "Canada, extract" is derived from Soil A. All extracts represent the same aliquot volume and thus brGDGT concentration.
Extract G "The Netherlands, extract" is obtained from a grassland soil (0-10 cm) at Utrecht Science Park, the campus where part of Utrecht University resides.The mean annual air temperature in Utrecht is 10.5°C.The soil has a TOC content of ∼8 wt.% and a pH (H 2 O) of 5.3.A large quantity of this soil was collected in 2017 to be used as a standard to monitor the performance of the HPLC-MS systems in the organic geochemistry laboratory at Utrecht University.A subsample (∼44 g) of the freeze-dried and homogenized soil was used to generate Extract G for this study.
Extract H "Rwanda, extract" is derived from soil C. All extracts represent the same aliquot volume and thus brGDGT concentration.

Preparation of GDGT Fractions
All freeze-dried and homogenized soils were extracted at Utrecht University, where batches of 40-60 g, depending on the soil type, were Soxhlet extracted using a 15:2 mixture (v/v) of dichloromethane (DCM): methanol (MeOH) for 24 hr.Solvents were subsequently removed using rotary evaporation.300-400 mg of total lipid extract per soil was fractionated using 60 g (23 × 1.6 cm column) of Al 2 O 3 as the stationary phase, eluting with four column-volumes of n-hexane:DCM (1:1) to obtain apolar fractions, followed by four column-volumes of DCM:MeOH (95:5) to yield the polar fractions containing the GDGTs.After rotary evaporation, the yields of the polar fractions were determined gravimetrically.Subsequently, a 1: 80 aliquot was taken to check for the distribution and concentrations of the branched GDGTs following the HPLC-MS method used at Utrecht University.Based on these results, approximately 80 aliquots were prepared for each of the Extracts E-H, transferred to 4 mL vials, and dried overnight.Between the extracts E-H, the dry weight varied between 0.6 and 1.5 mg.

GDGT Proxy Calculations and Quantification
The following ratios and indices were used to quantify the relative abundances of the GDGTs based on the peak areas reported by each of the laboratories.Roman numerals refer to molecular structures (Figure S1 in Supporting Information S1) as in De Jonge, Hopmans, et al. (2014), where the branched GDGTs with a methylation on the 6position are indicated with a prime symbol (′).
was used to quantify the degree of methylation of the 5-methyl branched GDGTs (De Jonge, Hopmans, et al., 2014).
was used to quantify the degree of methylation of the summed 5-methyl and 6-methyl brGDGTs (Peterse et al., 2012) to determine whether offsets in the MBT′ 5ME were caused by erroneous integration of 5-and 6methyl brGDGTs.
Finally, the BIT index was calculated following Hopmans et al. (2004), rewritten to explicitly include the 6methyl isomers (sensu De Jonge, Hopmans, et al., 2014): Quantities of crenarchaeol and GDGTs were reported by the participating laboratories in ng per g dry soil for Soils A-D, and ng per vial for Extracts E-H.

Metadata
Information on the HPLC column used, HPLC solvent systems/gradients, phase type of separation, mass spectrometer (MS) type and method, as well as integration settings, was provided by the participating laboratories (Table S1, http://hdl.handle.net/20.500.11850/666016).Information on HPLC-MS type is not reported uniformly.In addition, for Soils A-D, extraction methods, solvent mixtures, further sample preparation and purification steps, and any standards used for quantification are reported.The metadata used in the discussion and figures are reported in Table 1.In Table 1 and the following discussion HPLC columns used were grouped into 8 classes based on similar column chemistry.

Lipid Data
Participating laboratories provided lipid data as integrated peak areas and, when available, concentrations.No further quality control steps were performed by the two lead authors.This implies that we did not vet whether the reported areas comply with the detection limit reported by the individual laboratories.Ratios were calculated based on the areas reported; when no area is reported for a compound, the calculated fractional abundance value 0 is used for ratio calculation (Table S2, http://hdl.handle.net/20.500.11850/666016).Ratio values along with median and mean values, and standard deviations are calculated for each sample (A-H; Figure 1, Table 2).Samples with values larger or smaller than Q1 1.5 × IQR (where Q = quartile, IQR = interquartile range), or larger than Q3 + 1.5 × IQR, were considered outliers of the total data set (Tukey, 1977).Median values and standard deviations after exclusion of these outliers are reported in Figure 1 and Table S3, http://hdl.handle.net/20.500.11850/666016.Geochemistry, Geophysics, Geosystems To determine the impact of selecting 5-methyl and 6-methyl brGDGTs on the MBT′ 5ME and IR, we compared MBT′ with MBT′ 5ME values, plotted against IR values (Figure 2).To determine the impact of extraction and laboratory processing on GDGT ratios, the offset in ratio values (MBT′ 5ME , IR, #ringstetra) between soil-extract pairs (i.e., Sample A and Extract F, B and E, C and H) were calculated for each laboratory (Figure 3).An initial check of the concentration data revealed unrealistic values for the Soil samples (A-D) of one laboratory (3), with 115-1,800 times higher crenarchaeol or brGDGT concentrations reported.For the concentrations of the Extracts (E-H), three laboratories reported unrealistic values (2, 3, 42, for isoGDGT) and brGDGT concentrations (with up to 680 times higher GDGT concentration reported).These concentration offsets are probably due to an error reporting units, for instance, pg instead of ng, or reporting the concentration per ng extract, instead of the full extract.Although the calculation of concentrations is clearly still non-trivial, these offsets cannot be explained by differences in the analysis of GDGTs.Therefore, while all concentration values are reported in Table S2, http:// hdl.handle.net/20.500.11850/666016, the data from these few laboratories were excluded from further concentration analyses in this manuscript.In addition, concentrations that were reported to be 0 were excluded from the discussion.Concentration outliers were identified analogous to the ratios, that is, those values that are larger or smaller than Q1 1.5 × IQR, or larger than Q3 + 1.5 × IQR (Figure 4).As the range in concentration values excluding the outliers was still substantial (Table 3), scatterplots were used to compare the impact of MS type on crenarchaeol and brGDGT ionization in the Extracts (Figure 5), as well as the impact of extraction method on the concentration in soils (Figure 6).

Results and Discussion
The results from 39 laboratories were received and are reported as ratios, that is, MBT′ 5ME , IR, #rings tetra , to assess (a) the impact of chromatography (LC column and mobile phase), presented in Figure 1; and (b) the impact of the extraction method and workup procedures, presented in Figure 3.An overview of all ratio values can be found in Table S2, http://hdl.handle.net/20.500.11850/666016, and a summary of the statistics is presented in Table 2 and Table S3, http://hdl.handle.net/20.500.11850/666016.

Influence of HPLC Column and Mobile Phase on GDGT Proxies in Extracts
The HPLC methods used by the participating laboratories were overall comparable, yet differed in several aspects, summarized in Table 1.In short, most labs (n = 30) use two Waters Acquity UHPLC BEH HILIC Silica columns (with or without pre-column) in tandem as the stationary phase, and n-hexane:iso-propanol (IPA) as the mobile phase, following the method developed by Hopmans et al. (2016), except for one lab that uses hexane:IPA: chloroform as the mobile phase.Other columns used are Waters Acquity UHPLC BEH HILIC Amide columns (n = 1), ThermoFisher Scientific UHPLC Silica (1.7 μm, n = 1, 1.9 μm, n = 3), GL SCIENCES Inertsil SIL-100A (n = 1), Agilent HILIC Plus (n = 1), Phenomenex Luna CN (n = 1), and ACE UltraCore 2.5 Super PhenylHexyl (n = 1).Two of the three laboratories that use ThermoFisher Scientific UHPLC Silica 1.9 μm columns use n-hexane:ethyl acetate (EtOAc) as mobile phase, and the lab using an ACE UltraCore 2.5 Super PhenylHexyl column uses a MeOH:IPA:ammonia solution and is the only participating laboratory that uses reverse phase chromatography as opposed to normal phase used by all other laboratories.
The polar fractions, containing GDGTs (Extract E-H), were provided to all participating laboratories to assess the potential influences of different HPLC columns and settings on GDGT proxies, independent of extraction and  Geochemistry, Geophysics, Geosystems  1).However, the values for Extract E (Switzerland) are characterized by a ∼0.25-unit jump caused by eight laboratories (2,3,25,27,34,44,46,47) that reported substantially lower-than-average MBT′ 5ME values (Figure 1).The reproducibility for this extract is low (σ = 0.15) compared with the other extracts, which all have a reproducibility <0.03 (Figure 1).When translated into temperatures, this would result in an offset of 7.8°C.Five laboratories reported MBT′ 5ME values that were defined as outliers for more than one extract (18,27,42,46,47).Four of these labs followed HPLC methods that deviate from Hopmans et al. (2016), and used a different mobile phase (MeOH:IPA:ammonia solution or hexane:EtOAC), different columns (ThermoFisher Scientific UHPLC Silica 1.9 μm or Phenomenex Luna CN), or ran a reverse phase method with different mobile and stationary phases.Other laboratories (n = 5), however, that used similar alternative normal phase methods produced MBT′ 5ME index values that were not marked as outliers.Similarly, seven outliers were from laboratories that followed standard methods.This suggests that outliers are more likely derived from instrument-specific settings rather than related to the different mobile and stationary phases used.The degree of cyclization of brGDGTs is relatively insensitive in soils with pH < 5 (De Jonge, Hopmans, et al., 2014), which is reflected by the similarly low #rings tetra values of 0.04 and 0.06 for Extracts H and F, although the measured pH of the soils used to generate these extracts differs by 1.7 units.Extract G has a #rings tetra value of 0.28 and Extract E -from the highest pH soil-has a #rings tetra value of 0.36.The reproducibility was relatively high (σ < 0.04) for all Extracts, indicating that the detection and identification of brGDGTs with cyclopentane moieties are generally consistent between laboratories (Figure 1).The outliers, which mostly overestimate the degree of cyclization of brGDGTs in these extracts, do not differ in HPLC methodology.One laboratory (44) that reports deviating values for all extracts follows the standard method of Hopmans et al. (2016).The IR showed values between 0.04 and 0.09 for the three acidic soil extracts with a reproducibility σ < 0.05.These low IR values result from the low contributions of 6-methyl brGDGTs in these soils.Analogous to the #rings tetra , the similar IR values for the three acidic soils, despite the >2 pH range that they cover, fits with the trend in the global surface soil data set (including both mineral soils and peats) used by Dearing Crampton-Flood et al. (2020); IR reaches 0 at pH values of ∼4, indicating reduced sensitivity of the IR at the lower end of the pH range present in global soils.The alkaline soil from Switzerland (Extract E) had an IR of 0.79, reflecting the larger contribution of 6-methyl brGDGTs in this soil.However, the reproducibility for this extract was very low (σ = 0.26).This is caused by much lower IR values, up to 0.6 difference from the median value, reported by the same eight laboratories that also reported the lower-than-average MBT′ 5ME values for this extract.This suggests that the offset in both ratios may stem from problems with the detection, separation, and/or identification of the 5and 6-methyl brGDGT isomers.However, there are also laboratories that consistently report higher-than-median IR values for the three acidic soils (Rwanda, Canada, The Netherlands) but are not flagged as outliers for the MBT′ 5ME of these same extracts.

Influence of Peak Identification on GDGT Proxies in Extracts
To assess the influence of peak identification as a source of deviation on GDGT proxy values, we compared MBT′ 5ME index values, based on just the 5-methyl brGDGTs, with those of the MBT′, which includes both 5-and 6-methyl brGDGTs (Figure 2).The inclusion of 6-methyl isomers per definition results in lower values for the MBT′ compared to the MBT′ 5ME (Figure 2).The highest offsets are expected for soils with large contributions of 6-methyl brGDGTs, that is, with a high soil pH.Indeed, the most pronounced difference between these indices is in Extract E (Switzerland) with a pH of 7.3.Here, MBT′ values are up to 0.43 lower than MBT′ 5ME values for laboratories that also report high IR values.The offset between MBT′ and MBT′ 5ME is much smaller (<0.12) for the eight laboratories that had outliers for the MBT′ 5ME and IR in Section 3.1 (Figure 1).We postulate that these laboratories may have identified the 6-methyl brGDGT peaks as those of the 5-methyl brGDGTs in the chromatogram.This misidentification would result in a relative underestimation of the IR as well as the MBT′ 5ME .
Conversely, the two outliers for the IR in Extract H (Rwanda) are linked to the largest offsets between MBT′ and Geochemistry, Geophysics, Geosystems  2).In this case, the identification of (part of) the 5-methyl brGDGT peaks as belonging to the 6methyl isomers may have resulted in a relative overestimation of IR and MBT′ 5ME .In addition to misidentification, chromatographic issues resulting in reduced separation of 5-and 6-methyl isomers may contribute to the offsets observed here.However, the laboratories that report outlier values are not characterized by a common deviating HPLC method, suggesting that offsets are again introduced by lab-specific approaches to peak integration and/or instrument settings rather than by the use of different mobile or stationary phases.

Influence of Sample Workup Procedure on GDGT Proxies in Soils
The sample workup procedures of the participating laboratories differed substantially; the most commonly applied method was ultrasonic extraction (n = 16), followed by Evaluating the offsets between proxy values calculated based on brGDGTs in soils and matching extracts highlights that all extraction and separation methods used are able to successfully extract and purify brGDGTs.
Taking the example of the soil-extract pair from Switzerland (Sample B and Extract E), some of the laboratories with offset MBT′ 5ME and/or IR values for the Extracts (attributed to integration errors; see Section 3.1) report more accurate values for MBT′ 5ME and IR in soils that they extracted themselves (lab 2, 3, 27, 44).This suggests that if the offsets were indeed introduced by integration errors, these errors were not made consistently.By comparison, other laboratories report offset values for both the Samples and Extracts (labs 25, 34, 46, 47).The exclusion of eight laboratories (2,3,25,27,34,44,46,47), where integration issues may have caused the offset between extracts and soils highlights small offsets that can be linked to extraction and clean-up procedures.
Concerning extraction methods, of the 15 laboratories whose offsets represent an outlier (Figure 3), three employ ASE extraction, five employ microwave extraction, and seven employ ultrasonic extraction, representing 27%, 63% and 43% of the laboratories that use these respective extraction methods.However, of those laboratories that show offsets more often (for instance more than three occurrences [Figure 3: maximum occurrence of offsets = 9]), five out of seven laboratories use ultrasonic extraction.This offset might be caused in part by the fact that Soxhlet extraction, used to prepare the extracts, and ultrasonic extraction are performed at different temperatures, and potentially extract a slightly different pool of brGDGTs.However, a large fraction of laboratories that use ultrasonic extraction are clearly able to extract brGDGTs from the soil Samples (A-D) in a distribution that is comparable to that in the provided Extracts (E-H).Variability in the wavelength frequency used for sonication can potentially be the cause of this discrepancy.All in all, the impact of post-extraction clean-up steps seems minor compared to the extraction method, and these observations are not statistically meaningful as some treatment protocols are only used once.For instance, the only laboratory that uses an NH 2 column after ASE extraction (lab 21) produces a pronounced offset brGDGT distribution only in the Rwandan soil.In addition, the only laboratory that uses activated Cu after ultrasonic extraction reported offset values for the soils from Canada and Rwanda (lab 18).Similarly, the separation of commonly used silica or Al 2 O 3 columns can result in either accurate or offset GDGT distributions.Therefore, the diverse approaches used by the 39 participating laboratories all seem to be mostly valid.

Quantification of GDGTs
Of the 39 labs that submitted data, 30 used standards to enable quantification of GDGTs.The synthetic C 46 glycerol trialkyl glycerol tetraether (GTGT) is used as an internal standard by 28 of these laboratories, while one laboratory (47) uses an H-shaped tetraester as internal standard.Finally, one laboratory (48) uses archeol as an external standard for quantification.Due to the large differences in GDGT abundances, data from three laboratories that reported unrealistically high values for the provided extracts have been excluded from further analyses (see Section 2.5).Still, range differences in the absolute GDGT concentrations, up to orders of magnitude, are reported (Figure 4).The laboratory that uses an external standard for quantification nearly consistently reports the highest GDGT abundances.Still, several other laboratories that reported two or more outlier abundances (2,17,25,33,49) all used C 46 GTGT as an internal standard.Hence, the type of internal standard does not explain the large range in absolute GDGT abundances, yet the use of an external standard seems to introduce a large deviation in the quantification, albeit based on the data of one laboratory only.
Crenarchaeol or total brGDGT concentrations compared between extracts were strongly correlated (Figure 5).This means that laboratories are consistent in their quantification, that is, laboratories that measured high GDGT concentrations in one Extract generally also measured high concentrations in all extracts and vice versa.When evaluating the impact of mass spectrometry methods, the use of orbitrap mass spectrometry can result in offset quantifications between samples (Figure 5).However, this is based on the performance of only two laboratories, one of which uses an external standard, and the unique impact of the orbitrap mass spectrometry on quantification cannot be constrained.
Instead, we propose that the large range in GDGT abundances reported per Extract may be introduced by instrument-specific settings, such as mass selection values, tuning settings, APCI settings and MS type, which may affect, for example, the ionization efficiency and detection of different GDGTs.For individual instruments, changes in the relative response factor between the internal standard and crenarchaeol with time have previously been attributed to instrument drift and tuning events (Huguet et al., 2006).Instrument specific settings have also been invoked to explain large offsets in BIT index values in a previous round-robin study focusing on isoGDGTs, where it was assumed that the relative response of crenarchaeol (or isoGDGTs as a group) and brGDGTs differed among MS systems (Schouten et al., 2013).Specifically, Davtian et al. (2018) showed that the impact of using approximate or unstable mass selection m/z values using SIM on calculated BIT index values amounts up to 0.1 unit.In this round robin, different response factors of the internal standard can impact quantification.Its impact on the BIT index is difficult to observe as the soils that were selected for this round robin study are all dominated by brGDGTs, with BIT index values of Soils A-C close to 1.00, whereas only Soil D (Brazil) shows some scatter and a range (0.91-1.00;Table S2, http://hdl.handle.net/20.500.11850/666016).
Finally, we assessed the consistency of quantification between GDGTs in Extracts and Soils based on three sets of soil-extract pairs, that is, lipid extracts generated by us and the matching soil extracts prepared by the participating laboratories: Soil A and Extract F (Canada), Soil A and Extract F (Canada), Soil B and Extract H (Rwanda), Soil AC and Extract E (Switzerland).There is a general correlation between the GDGT abundances in Extracts and Soils (Figure 6), indicating consistent quantification within laboratories.Given this, and the knowledge that Soils and Extracts are analyzed on the same instrument with the same settings, offsets from the general dependency are likely introduced during sample extraction and workup, for instance incomplete extraction or sample loss, or adding inaccurate amounts of internal standard.Different extraction methods do not show different results (Figure 6), suggesting that all perform equally well.Consequently, the preparation of extract aliquots is the final remaining source of uncertainty.
In general, the large range in quantified values provides an important indication that combining quantitative data sets from different laboratories requires a more rigorous approach and serious evaluation of quantification methods, for example, by exchanging sample material and analyzing the same samples in multiple laboratories.

Conclusions and Recommendations
Our round-robin study based on four soils and the polar lipid fractions of four soil extracts showed good reproducibility of brGDGT-based proxies among the 39 participating laboratories, but also highlights several pitfalls.The observed offsets in proxy values are not related to specific LC-MS methods (e.g., stationary, mobile phase or mass spectrometer type).Similarly, offsets in proxy values between soils and matching extracts that are introduced during sample workup are not related to differences in workup procedure (e.g., extraction method, small column chromatography).Instead, some of the observed outliers in MBT′ 5ME values can be explained by misidentification of 5-and 6-methyl brGDGTs and/or errors in peak integration, as the laboratories with offset MBT′ 5ME values also have offset IR values.This is possibly caused by chromatography that separates brGDGTs insufficiently or human error when selecting peaks of 5-methyl and 6-methyl brGDGTs.We have included example chromatograms with integration from a laboratory that plots close to median values in all instances for comparison and future reference (Figure S2 in Supporting Information S1).Problems with selecting 5-and 6methyl brGDGTs can happen more readily with samples that are dominated by either isomer set or where additional peaks are observed in the m/z 1,050 and 1,036 traces.For these samples, comparison of retention times (potentially using aligning of well-defined brGDGT peaks to correct for retention time shift) with an internal standard that contains both isomer sets is strongly suggested.
The large spread in GDGT concentrations reported by the laboratories makes comparison of GDGT concentrations between laboratories challenging, although quantification within laboratories was generally consistent.Hence, accurate quantification of GDGTs remains the largest challenge in further improving the reproducibility of brGDGT analysis.As a first step to ensure comparable quantifications, errors in preparation of the internal standard from dry powder, adding known amounts of internal standard to the sample and finally calculation of the GDGT quantities need to be evaluated within laboratories.To ensure correct calculation of ratio values and quantities, we suggest that all laboratories shall use a standard mixture that includes all 15 brGDGTs as well as an internal (C 46 GTGT) standard to assess the performance of their instrument, including both the response factor, consistency of GDGT-based proxy values as well as compound quantification.Ideally, the community will introduce a "reference sample"-a standard mixture with established proxy values that can be used as a benchmark in laboratories worldwide, as is usual procedure used to standardize isotope studies, for example, NSB-18 for carbonates.A rigorous approach to quantification would require the development of a reasonable structural brGDGT analog that can be used in individual laboratories to determine and monitor the instrumentspecific response factor of brGDGTs.

Figure 1 .
Figure 1.Offset of selected brGDGT ratio values (MBT′ 5ME , IR, and #rings tetra ) from the median value per laboratory.Median value (x) and standard deviation (σ) are reported with (black) and without (gray) outliers.The gray band indicates the range of ratio offset values after the exclusion of outliers.Symbols refer to liquid chromatography (LC) columns used, colors reflect different mobile phases.

Figure 3 .
Figure 3. Offset of selected brGDGT ratio values (MBT′ 5ME , IR, and #rings tetra ) of samples provided as Soil and as Extract per laboratory.Standard deviations (σ) are reported with (black) and without (gray) outliers.The gray band indicates the range of values after the exclusion of outliers.Symbols refer to extraction methods and colors refer to subsequent processing steps used by the laboratories.

Figure 4 .
Figure 4. Concentrations of crenarchaeol and summed brGDGTs for all Soils and Extracts analyzed indicated by lab numbers.The upper limit of the interquartile range (IQR) is indicated by a horizontal line.Lower IQR limits are always below 0. When no horizontal line is plotted, all samples fall within the IQR.

Figure 6 .
Figure6.Comparison of reported concentrations of summed brGDGTs between samples that were provided as Soils and those provided as Extracts.Note that a direct comparison is not possible as a different normalization is used for Extracts and Soils (x and y-axes).Symbols refer to extraction methods.

Table 2
Statistics of Selected GDGT Ratios per Sample (MBT′ 5ME , MBT′, IR, CBT; #Ringstetra and BIT (Equations 1-6)), Specifically Median and Mean Value, As Well As the Standard Deviation (Stdev)The same summary data after removal of outliers can be found in TableS2, http://hdl.handle.net/20.500.11850/666016.Downloaded from https://agupubs.onlinelibrary.wiley.com/doi/10.1029/2024GC011583by Cochrane France, Wiley Online Library on [19/07/2024].See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions)on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License Downloaded from https://agupubs.onlinelibrary.wiley.com/doi/10.1029/2024GC011583by Cochrane France, Wiley Online Library on [19/07/2024].See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions)on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License separation techniques.The reported MBT′ 5ME values have a reasonably Gaussian distribution for most Extracts (Figure

Table 3
Summary Statistics of the Reported Concentrations (Summed Concentration of brGDGTs or Concentration of Crenarchaeol) After Exclusion of Outliers Figure 5. Scatterplots of the concentrations of crenarchaeol (left) or summed brGDGTs (right), comparing two extracts in each panel.Correlation line, Pearson r-value and p-value are reported for significant correlations ( p < 0.05).Symbols refer to the type of mass-spectrometer.
Downloaded from https://agupubs.onlinelibrary.wiley.com/doi/10.1029/2024GC011583by Cochrane France, Wiley Online Library on [19/07/2024].See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions)on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License MBT′ 5ME (Figure Accelerated Solvent Extraction (ASE) (n = 11), microwave assisted extraction (n = 8), Bligh and Dyer extraction (n = 1), EDGE automated solvent extraction (n = 1), and flow blending (n = 1).The vast majority of laboratories used a 9:1 mixture of DCM:MeOH for lipid extraction (n = 35), except for three laboratories that additionally used a buffer (phosphate and/or trichloroacetic acid) and one lab that used DCM:MeOH:hexane.The approaches used for further processing of the obtained lipid extract and separation into different fractions were very diverse, and consisted of no further treatment, hydrolysis, treatment with activated Cu, passing over a Na 2 SO 4 column, or column separation using Al 2 O 3 , SiO 2 , Si-NH 2 , or Florisil (TableS1, http://hdl.handle.net/20.500.11850/666016).