Introduction

Establishing healthy eating habits in early childhood is crucial as they are determinants of health and disease in adulthood [1]. However, the estimation of dietary intake in children is complex, especially when dietary intake has to be evaluated in large prospective cohorts. The food frequency questionnaire (FFQ) is a well-recognized and most frequently used method to assess food intake in large population studies evaluating associations between diet and health-related outcomes [2] even in cohorts of children [3, 4] and adolescents [5]. Unfortunately, dietary intake assessments using FFQs are usually subject to systematic and random errors [6, 7] affecting their accuracy and relative validity. However, after energy adjustments, these errors are generally reduced, making FFQ-derived data useful to suitably rank study participants according to dietary consumption. Furthermore, FFQs assess dietary intake without altering routine eating habits and are a relatively low-cost method. Nevertheless, FFQs are only valid for the population for which they were developed [3], as they avoid measurement errors and improve the accuracy of dietary estimations. Thus, in recent years, there have been several FFQ validation studies in specific populations, according to the country of residence, age, and cultural factors, which may influence food consumption [2].

In the last decade, several FFQs have been validated in children from different at different growth stages and cultural contexts including preschoolers from China [8], Malaysia [9], Greece [10], Australia [11], Bangladesh [12], and several European countries [13, 14]. In addition, validations have been performed in pre-adolescents or adolescents from China [15], Vietnam [16], Italy [17], and Brazil [18]. Other validations including a wide age range have also been conducted in South America [19], Denmark [20], and Europe [21].

To the best of our knowledge, in Spain, only three FFQs have been validated in healthy children aged 4–5 years [22], 4–7 years [23], and 7–9 years [24]. These validations have only been performed in children residing in two cities in Spain (Valencia and Navarra), while foods consumed largely vary across different regions of Spain. Additionally, in these FFQs, beverages and fluid items have not been exhaustively considered. Furthermore, nutrient intakes estimated from these Spanish FFQs did not always show an acceptable correlation with estimates using 24-h dietary recalls or dietary records and were validated for a very narrow age range and did not include children aged 3 or those age above 9 years [22,23,24].

Therefore, the objective of the present study was to assess the reproducibility and relative validity of a food and beverage frequency questionnaire (F&B-FQ) in children aged 3 to 11 years from several Spanish cities.

Materials and methods

Design and sample

A F&B-FQ named COME-Kids has been designed to assess the usual dietary and beverage intake. This questionnaire has been derived from a previously validated FFQ in Spain [25]. In order to assess its reproducibility and relative validity, a total of 210 healthy participants aged 3 to 11 years were recruited for the present study (see flowchart in Supplementary material). The objective of the study required a sample size between 100 and 200 participants, in accordance with the recommendation of Willett and Lenhart (1998) [6].

Participants aged 3 to 6 years were recruited from two studies: the CORALS (Children Obesity Risk Assessment Longitudinal Study, https://corals.es) [26] and the MELI-POP (Mediterranean Lifestyle in Pediatric Obesity Prevention) study control group. Participants aged 7 to 11 years old were recruited from the same schools participating in the CORALS study (more information in Supplementary material). Participants for the present study were recruited between March 2019 and November 2019.

The research protocol of this study was approved by all the ethic committees of the centers involved in the study (CEIC Córdoba: Acta 284/Ref. 4155), CEIC Navarra (2019/18), CEIm Institut d’Investigació Sanitària Pere Virgili (CEIm-IISPV 051/2019), CEIC Santiago-Lugo (2019/131), CEIm Hospital Universitario Dr. Peset (CEIm: 9/19), and CEIC Aragón (09/2019), which was conducted following the standards of the Declaration of Helsinki.

All families or caregivers who received a detailed description of the study were asked to fill in and sign the informed consent forms for participation and were given the chance to withdraw the children from the study at any point.

COME-Kids food and beverage frequency questionnaire

The COME-Kids F&B-FQ is an adapted, semi-quantitative questionnaire with a total of 125 items. The questionnaire was filled in approximately 20 min in a face-to-face interview by trained registered dietitians. For children aged 3 to 9 years, parents or caregivers were asked how often, on average, the participant had consumed the specified serving of each item during the last year. Children aged 10 or above were personally interviewed because they can provide reliable information. Nevertheless, their parents confirmed or helped them provide additional details of the recipes of dishes consumed.

The questionnaire allows for the collection of detailed information on the variety and quantity of food and beverages habitually consumed by children in the last year. COME-Kids F&B-FQ included an item that assesses the frequency of eating in fast food restaurants and 3 open items to document usual food or dietary supplements consumed beyond other items included in the questionnaire (see more details in Supplementary material).

The COME-Kids F&B-FQ was developed in an optically readable form format (Supplementary material), which was scanned using Evaldara® software. The software automatically exports the indicated consumption frequency for the 125 items, which was incorporated into the e-Diet Base URV software [27]. It computes nutrient and food group intake estimates by multiplying the frequency of consumption of each item by the nutrient content of the portion specified in the F&B-FQ. The total energy and nutrient intakes were estimated according to the Spanish CESNID [28] database, the Veggie Base [29] and CELIAC Base [30] for some special food or beverages, or from nutrient facts labels from the food packages. To minimize misreporting, participants who reported energy intake below the 5th percentile or above the 95th percentile from F&B-FQ3 and 3d-DR were excluded from the present analysis [31].

Reproducibility and relative validity

To explore the COME-Kids F&B-FQ reproducibility, the parents or caregivers answered the F&B-FQ twice (F&B-FQ1 and F&B-FQ2) over 15 days (± 1 week) period about the children’s usual diet in the past year.

To evaluate the relative validity, a 3-day dietary record (3d-DR) including two weekdays and one holiday (non-consecutive) was used as the reference method. Three 3d-DRs were collected at baseline, 6 months, and 1-year follow-up. Then, the mean daily intake of 9 days was computed. To conduct the validation, a third questionnaire (F&B-FQ3) was administered at a 1-year follow-up, which was compared with the mean daily intake computed from three questionnaires of 3d-DR collected over the last year in order to capture the seasonally and intraindividual variation without causing fatigue or lowering motivation to fill out the information. Figure 1 shows the design of the COME-Kids F&B-FQ reproducibility and relative validity assessments. The study procedure is described in the Supplementary material.

Fig. 1
figure 1

Design of the COME-Kids F&B-FQ reproducibility and relative validity assessments. 3d-DR, 3-day dietary records; F&B-FQ, food and beverage frequency questionnaire

Statistical analysis

Quantitative variables were expressed as median (interquartile range) and qualitative variables as a percentage (n). Imputation was required only for missing anthropometric data. To impute the missing data of height (n = 5) and weight (n = 6), the mean value for weight or height by age according to the growth curves of Hernández et al. was used [32].

To assess the reproducibility and relative validity of the F&B-FQ for food groups and nutrients, the Pearson (r) and intra-class (ICC) correlation coefficients were estimated and adjusted by dietary energy intake using the residual method [6]. Additionally, we log-transformed the data (log10) to improve the normality for nutrient and food group consumption. The final analysis used both log transformation and energy adjustment. Guidelines for the statistical interpretation of the ICC are based on values 0.81–1.00 (almost perfect), 0.61–0.80 (substantial), 0.41–0.60 (moderate), 0.21–0.40 (slight), and < 0.21 (poor). The r is interpreted as follows: “0” = no association, “1” = positive linear association, and “ −1” = negative linear association [33].

Paired t-tests were used to compare the means between F&B-FQ1 and F&B-FQ2, as well as between F&B-FQ3 and the mean intake of 9 days calculated from the three 3d-DR.

Food groups and nutrients from the FFQs and 3d-DR were categorized in quintiles. We then evaluated the agreement as those placed in the same quintile (kappa index) and, additionally, as those placed in the same or adjacent quintiles. We examined Bland–Altman plots for each of the food groups and nutrients to explore the direction of bias across levels of intake and determine the agreement between the F&B-FFQ and 3d-DR.

All statistical analyses were performed using Stata 14 software program (StataCorp), and statistical significance was set at a two-tailed p-value < 0.05.

Results

Of the 210 participants who agreed to participate in the study, 195 children (105 boys, 90 girls) completed the study protocol. A total of 28 participants were excluded as they did not complete the three F&B-FFQ and 3d-DR (Supplementary material). The baseline characteristics of the participants included in the reproducibility and validation studies are shown in Table 1.

Table 1 Baseline characteristics of children who participated in the COME-Kids food frequency questionnaire reproducibility and validation studies

Reproducibility

The mean daily food groups consumption, r and ICC, percentage of agreement, and grossly misclassified between F&B-FQ1 and F&B-FQ2 are shown in Table 2. Compared to the F&B-FQ1, the consumption of cereals, dairy products, meat or meat products, potatoes, sweetened beverages, and vegetables reported in the F&B-FQ2 was significantly lower. The overall r and ICC were 0.65 and 0.64, respectively. Similar ICC was found after adjusting by energy. The unadjusted r between the first two F&B-FQs for the food groups ranged from 0.40 to 0.99, and the ICC ranged from 0.39 to 0.99. For log-transformed energy-adjusted intakes, the mean r and ICC were 0.72 and 0.70, respectively. The mean percentage of agreement for all food groups between the first two F&B-FQs showed that 86% of intakes were classified in the same or adjacent quintiles. The kappa index had an average of 63% for all food groups. The percentage of misclassified intakes ranged from 0 (for fish and seafood, vegetables, nuts, legumes, sweets, chocolate, and sugars) to 7.7% (sugar-free beverages), with an average of < 2%The mean daily nutrient intake coefficients (Pearson and intra-class), percentage of agreement, and grossly misclassified between F&B-FQ1 and F&B-FQ2 are shown in Table 3. The consumption of protein, calcium, phosphorus, magnesium, potassium, thiamine, riboflavin, niacin, vitamin B6, and cobalamin in the F&B-FQ2 was significantly lower than in F&B-FQ1. The overall mean r and ICC were 0.63 and 0.62, respectively. However, after energy adjustments, slight increases in the coefficients were observed (r = 0.66 and ICC = 0.65). For log-transformed intakes, overall r and ICC slightly increased to r = 0.71 and ICC = 0.70 after energy adjustments. The unadjusted r ranged from 0.52 (potassium) to 0.77 (vitamin A) and the ICC from 0.50 (zinc) to 0.77 (monounsaturated fatty acids (MUFA)). The percentage of agreement for nutrient intakes classified in the same or adjacent quintile was 85%. Approximately 52% (kappa index) of nutrients were classified in the same quintile. The average percentage of misclassified nutrients was 1.4%, ranging from 0.5 (fiber, folic acid, and vitamin C) to 2.6% (calcium).

Table 2 Assessment of the reproducibility for food group intakes: mean intakes, Pearson and intra-class correlation coefficients, and percentage of agreement between the mean of daily intakes from the first two COME-Kids F&B-FQs (n = 195)
Table 3 Assessment of reproducibility for energy and nutrient intakes: mean intakes, Pearson and intra-class correlation coefficients, and percentage of agreement between the mean of daily intakes from the first two COME-Kids F&B-FQ (n = 195)

Relative validity

Table 4 shows the average intake of daily food groups, r, ICC, percentage of agreement, and items grossly misclassified between the F&B-FQ3 and the average of the 3d-DR.

Table 4 Assessment of relative validity for food group intakes: mean intakes, Pearson and intra-class correlation coefficients, and percentage of agreement between the mean of daily consumption of food groups from the COME-Kids F&B-FQ at 1-year of follow-up and the average of the three 3d-DR

Compared to the F&B-FQ3, the 3d-DR showed a significantly lower consumption of several food groups (dairy products, potatoes, fresh and other fruits, nuts, oils and fats, snacks, water, juices, sugar-free beverages and coffee, tea, and infusions) and higher consumption of other groups (meat and meat products, fish and seafood, vegetables, legumes, cereals, precooked food, sauces, and seasonings).

The average coefficients of relative validity for food groups were r = 0.36 and ICC = 0.30. When the estimates were adjusted for energy intake, the results were r = 0.37 and ICC = 0.30. Unadjusted r ranged between 0.07 (potatoes) and 0.81 (plant-based beverages) and ICC between 0.05 (potatoes) and 0.81 (plant-based beverages). For the correlations of log-transformed and energy-adjusted intakes, we observed the average r = 0.40 and ICC = 0.33. The correlations for food groups ranged between r = 0.002 (other fruits) and 0.76 (fresh fruits) and ICC between 0.001 (other fruits) and 0.75 (fresh fruits). The mean concordance was 65% for classifying food consumption in the same or adjacent quintiles. Overall, 37% of food consumption was classified in the same quintile. The average percentage of misclassified food groups was 6.2% and ranged from 0.6 to 20%.

Table 5 shows the average daily energy and nutrient intake, r, ICC, the percentage of agreement, and items grossly misclassified between the F&B-FQ3 and the average of the 3d-DRs. The means of the dietary energy and intake of most nutrients from the 3d-DRs were significantly lower than those from F&B-FQ3.

Table 5 Assessment of relative validity for energy and nutrient intakes: mean intakes, Pearson and intra-class correlation coefficients, and percentage of agreement between the mean of daily intakes from the COME-Kids F&B-FQ at 1-year of follow-up and the average of the three 3-day DR (n = 167)

The average r and ICC of the relative validity for nutrients were 0.29 and 0.24, respectively. Results after adjustment for energy were r = 0.34 and ICC = 0.27. Energy-adjusted r ranged from 0.11 (riboflavin) to 0.52 (saturated fatty acid (SFA)) and for ICC from 0.07 (cholesterol) to 0.46 (folic acid). The average log-transformed values for r and ICC after energy adjustments were 0.37 and 0.30, respectively. Approximately 64% of daily energy and nutrient intakes were classified in the same or adjacent quintiles. Approximately 27% of these intakes were classified in the same quintile. The average percentage of misclassified nutrients was 4.2% and ranged from 1.8 (fiber) to 9.6% (thiamin).

Figure 2 shows the relationship between the average and differences between the daily intakes of dietary energy and several nutrients and the mean of the F&B-FQ3 and the average of the three 3d-DR (Bland–Altman plots). The analysis confirmed that discrepancies between F&B-FQ3 and 3d-DR were equally likely in both directions. Only protein intake was underestimated by an average of 9 g/day. Mean daily fat (11 g/day) and calcium (9 mg/day) intakes were overestimated by the F&B-FQ3 in relation to the 3d-DR. Differences in the intake estimates between the two instruments were not related to the level of consumption. Bland–Altman plots for all food groups and nutrients are shown in the Supplementary material. Based on the average discrepancy (bias) and the limits of agreement, the Bland–Altman plots showed a good agreement between COME-Kids F&B-FQ and 3d-DR in cereals, eggs, fish or seafood intake, fresh fruit, meat or meat products, nuts, pastries and cakes intake, sauces and seasonings, sweets, chocolate and sugars, and vegetable intake. The data from these food groups exhibited relatively narrowed limits of agreement and a centered mean difference line (close to zero), indicating minimal systematic differences between the two questionnaires. However, legumes, juice and other fruits, oils and fats, potatoes, precooked food, snacks, and water exhibited considerable variability in their differences, suggesting a potential bias due to the wide dispersion of the data and the limits of agreement.

Fig. 2
figure 2

Bland–Altman plots showing the relationship between the mean and the differences in the daily intake of A energy, B carbohydrates, C protein, D total fat, E calcium, and F vitamin C with the mean of the COME-Kids F&B-FQ3 and the average of the three 3d-DR. Lines are the mean difference (—), and the lower and upper 95% show the limits of agreement (- - -). F&B-FQ, food and beverage frequency questionnaire; F&B-FQ3 was determined at 1 year of follow-up; 3d-DR average, average of the 3-day dietary records

The intake differences between COME-Kids F&B-FQ and 3d-DR show a good agreement in cereals, fish or seafood, fresh fruits, meat or meat products, pastries and cakes, sauces and seasonings, and vegetables (< 1 portion/day) and eggs, nuts, and sweets (< 1 g/day). However, poor agreement was observed in the dairy products (almost a half portion/day), juices (due to added sugar), legumes, potatoes, snacks and precooked foods (high spread and low bias), oils and fats, and water (> 1 portion/day).

Discussion

This validation study supported the use of the 125-item COME-Kids F&B-FQ as an acceptable method for dietary assessment in preschool and school children aged 3–11 years from several Spanish cities. Furthermore, it showed an acceptable reproducibility for most of the food groups and nutrients and a relative validity in comparison with the three 3d-DRs.

In general, epidemiological evidence has rarely considered the intake of water and beverages, and to date, no FFQ has been validated for this purpose in children. It is the first FFQ including a beverage section, which showed acceptable levels of reproducibility and relative validity in relation to fluid intake. The importance of estimating fluid intake, specifically in children, is increasingly recognized as hydration status affects memory and cognitive performance [34, 35].

Additionally, with rapid changes in the food market [36], it is important to consider novel (whole grain cereals, lactose-free milk, isotonic and energy beverages) and emerging plant-based beverages (rice, oat, almond, and soy drinks) items which are becoming more popular. Regarding plant-based beverages, COME-Kids F&B-FQ showed the highest coefficients of reproducibility and agreement and a moderate relative validity.

In addition, especially among children prone to consume high amounts of added sugar, it is imperative that a distinction between foods with and without added sugar (i.e., dairy, beverages, and breakfast cereals) has to be made. Therefore, COME-Kids F&B-FQ, which included an extended number of food items, is more likely to assess Spanish children’s diet accurately.

The delicate desired balance between having an exhaustive estimation and reducing respondent burden guided the number of items in FFQ. The number of food items in previous validated FFQs for children ranged from 17 [11] to 183 [20] items; meanwhile, Spanish-validated FFQs ranged from 105 [22] to 138 [23] items for preschool children, and about 46 items were specific for school-aged children [24]. In this sense, COME-Kids F&B-FQ which included 125 items is adequately comprehensive without being extremely limited or extensive. Given the results of the study, it is highly likely that COME-Kids F&B-FQ achieved the right balance between being sufficiently complete and avoiding respondent fatigue to ensure reliability.

The present validation study is in line with previous studies, in which four weighed 3d-DR [23], three 24-h dietary recall (24-h recall) [22, 24], or 7-day estimated dietary records [37] were also used as a reference. Even though it has been well recognized that the 3d-DR method may lead to misreporting [38], it is important to highlight that the relative validity agreement across the same or adjacent quintiles was in an acceptable range for food groups and nutrient intakes, indicating that gross misclassification was very unlikely.

Regarding correlation coefficients to assess the reproducibility of food group consumption in children, the literature is scarce [20, 22,23,24, 39]. Taking into consideration other European studies, our correlation coefficients are higher than those obtained in Danish children [20] and Flemish preschoolers [39]. Moreover, those corresponding to the intake of dairy products, eggs, vegetables, potatoes, fruit, legumes, oils and fats, sweets, chocolates, and sugars were higher than those observed in previous Spanish studies in preschool children that were log-transformed and adjusted by energy [22,23,24]. The higher reproducibility of dietary estimates from COME-Kids F&B-FQ could be due to the sampling of participants from several provinces of Spain and the items in this FFQ better accounting for regional variations in gastronomic culture. Regarding the correlation coefficients to assess the reproducibility of nutrient intake in children, the positive correlations indicated moderate to high consistency between COME-Kids F&B-FQs. Thus, similar to other efforts to validate FFQs developed in previous European projects [37] and in Asian preschoolers [9], our study showed the relevance of conducting local adaptations to improve the assessments.

It is noted that there were significant discrepancies between COME-Kids F&B-FQ and 3d-DR for estimated intakes of a few food groups. Consumption of dairy, snacks, and potatoes was overreported, while juices, legumes, and precooked food were under-reported in the COME-Kids F&B-FQ in comparison to 3d-DR. It is likely that the intakes of more frequently consumed foods are overestimated and those of less frequently consumed foods underestimated by the parents/caregivers while reporting using an FFQ. This discrepancy may be avoided in the 3d-DR where the parent/caregiver records the actual intake. In contrast, oils and fats and water tend to be overlooked when parents/caregivers are capturing dietary data using 3d-DR. However, when these are itemized in the FFQ, they tend to not forget and report their consumption. Hence, the COME-Kids F&B-FQ provides an adequate estimation of energy and macronutrients and nutrients of concern such as fat, salt, and sugar intakes.

This study has several strengths that deserve to be mentioned for efforts made to improve the quality of dietary data and enhance our ability to collect complex information. First, we have standardized and automated data collection and transcription procedures using the e-Diet Base software. Second, newer techniques, such as optical scanning, and the use of food pictures helped dietitians to ensure the accuracy of reporting by parents/caregivers, simplify their reporting, and minimized the load in the fieldwork of researchers. More importantly, our target population consumed several beverages including water, plant-based drinks, juices, and sugar-free and sugar-sweetened beverages. The currently available FFQs for measuring dietary intakes in Spanish children do not include many of these items. Most current FFQs are restricted to capturing sugar-sweetened beverages or orange juice. The COME-Kids F&B-FQ, therefore, provides a better estimate of beverage intake in this target population. In fact, it is as good as 3d-DR for estimating sugar-free and sugar-sweetened beverages and plant-based drinks and better captures water intakes. Given the health implications of beverage consumption, specifically in this age group [22,23,24], therefore, COME-Kids F&B-FQ has filled a much-needed method gap in the area.

Nevertheless, some limitations have to be acknowledged. First, although the interviewers and the parents/caregivers were well-trained, some sources of measurement error in assessing dietary intakes cannot be ruled out, and for the moment, the estimation of intakes could not be validated by biochemical biomarkers. However, it is important to highlight that studies using biomarkers as the only reference method [40] or combined with 24-h recall or dietary records [18] did not find better correlations than those observed in the present study. Furthermore, some authors consider that the use of biomarkers may not be always an appropriate reference method for comparison given that blood concentrations depend on absorption and metabolism and may not always reflect dietary intake [40]. Secondly, in this study, some visits (10.3%) occurred during the confinement period due to COVID-19 pandemic. Data from these visits were collected using video conferences or telephone calls. However, sensitivity analyses stratifying according to the type of visit (in-person and by video conference/calls) showed similar results, indicating that the visit type may have no impact on the results.

Conclusion

The COME-Kids F&B-FQ is a valid tool to assess food and nutrient intake in children aged between 3 and 11 years. The intakes of most food groups and nutrients estimated using this FFQ were correctly classified, and misclassification was unlikely. Therefore, the COME-Kids F&B-FQ could be a useful tool to help improve dietary assessment in pediatric epidemiological studies in Spain.