Development and evaluation of a concise food list for use in a web-based 24-h dietary recall tool

Foodbook24 is a self-administered web-based 24-h dietary recall tool developed to assess food and nutrient intakes of Irish adults. This paper describes the first step undertaken in developing Foodbook24 which was to limit participant burden by establishing a concise list of food items for inclusion in the tool. The aim of the present study was to evaluate whether using a concise food list (as opposed to an extensive list) with generic composition data would influence the estimates of nutrient intakes in a nationally representative sample of Irish adults. A 2319-item food list generated from the Irish National Adult Nutrition Survey (NANS) (2008–2010) (n 1500) was used as the basis for a shortened food list for integration into the tool. Foods similar in nutritional composition were recoded with a generic type food code to produce a concise list of 751 food codes. The concise food list was applied to the NANS food consumption dataset and intake estimates of thirty-five nutrients were compared with estimates derived using the original extensive list. Small differences in nutrient intakes (<6 %) with limited effect size (Cohen's d < 0·1) were observed between estimates from both food lists. The concise food list showed strong positive correlations (rs 0·9–1·0, n 1500, P < 0·001) and a high level of agreement with the extensive list (80–97 % of nutrient intakes classified into the same tertile; >90% of intakes similarly categorised according to dietary reference values). This indicates that a concise food list is suitable for use in a web-based 24-h dietary recall tool for Irish adults.

and participant fatigue can also be an issue (2) . Due to the absence of an interviewer, the process and user interface must be simple and user-friendly. Behind the user interface, the foods selected by the participant must be fully auto-coded and linked to a comprehensive food composition database, which in turn calculates nutrient intake from the food item (6,(11)(12)(13) .
Foodbook24 is a self-administered web-based 24-h dietary recall tool developed to assess food and nutrient intakes in a population of Irish adults. The purpose and scope of this tool are to allow researchers to simply and quickly estimate the daily intake of foods, and macro-and micronutrients in Irish adult populations. It is therefore important to limit participant burden as far as possible by limiting the number of food items available for the participant to choose. A more extensive food list would be necessary to conduct exposure assessment of bioactive and non-nutritional compounds, food chemicals and food additives or conduct risk-benefit analysis of fortification strategies. This paper describes the first step undertaken in developing Foodbook24 which was to establish a concise list of food items for inclusion in the tool. The aim of the present study was to evaluate whether using a concise list of foods (as opposed to an extensive list), together with the associated generic food composition data, would influence the estimates of nutrient intake in Irish adults, using data from a nationally representative nutrition survey of adults in Ireland as a case study. Further studies examining the validity and acceptability of Foodbook24 are currently in progress.

Study design and study population
The National Adult Nutrition Survey (NANS) (14) food consumption dataset was used as the source of food items and composition, to be integrated into the Foodbook24 tool. The NANS, a nationally representative, cross-sectional nutrition survey, was conducted by the Irish Universities Nutrition Alliance between 2008 and 2010 with the aim of establishing a dataset of habitual food and beverage consumption of adults aged 18 to 90 years living in Ireland. Detailed health and lifestyle, eating behaviour and anthropometric data were also collected.

Dietary assessment methodology in the National Adult Nutrition Survey
A 4-d semi-weighed food record was completed by 1500 adults and the dietary intake data recorded by participants were converted to nutrient intakes using the Weighed Intake Software Package (WISP; Tinuviel Software), which encompasses UK food composition tables (15)(16)(17)(18)(19)(20)(21)(22)(23)(24) and The Irish Food Composition Database (IFCDB) (25) . The IFCDB has been consistently updated during each Irish national nutrition survey to reflect the most recent composition data for fortified foods, nutritional supplements, composite dishes and Irish brands consumed which were not adequately characterised by UK food composition tables. The accuracy of food composition, as well as consumption, was aided by asking participants to retain food packaging during the survey period. It is worth noting that if a NANS participant reported consuming a homemade composite dish, the recipe was obtained from the participant and a new food code and accompanying food composition value were generated, if the dish was not adequately characterised by existing food codes. At its most disaggregated level, the NANS dataset consists of 133 068 rows of food intake data, comprised of 7653 branded food items which were described by 2552 food codes, of which 233 were nutritional supplements.

Derivation of the concise food list for Foodbook24
The Foodbook24 user interface aims to provide the participant with a finite number of food item options without requiring brand information, that are linked to food composition data, allowing the automatic calculation of nutrient intakes and thus reducing the time required for data entry and data analysis by the researcher. With the overall aim of reducing participant burden, the objective of the present study was to develop a concise list of food codes available for selection by the Foodbook24 user and evaluate the impact of using this concise list on the estimates of nutrient intakes in the Irish population.
The 2319 food codes used in the NANS were initially evaluated at a food-group level and the food codes that described similar food items/food types were combined and the nutritional composition of the most frequently used food code within a food type was selected for inclusion in the Foodbook24 tool. An excerpt of these recoding exercises is available in Supplementary Table S1. Ten different brands of plain/flavoured full-fat yoghurt were reported during the 4-d recording period by participants of the NANS. The frequency of consumption was examined and ranged from one to 396 eating occasions during the survey period. The nutritional composition of the most commonly consumed yoghurt (in this case the yoghurt consumed 396 times) was brought forward for inclusion in the concise food list (Supplementary  Table S1).
A small number of food items (n 113) were excluded from this recoding exercise and the original food code and associated nutritional composition were retained. Of these 113 food items, forty-eight items were recorded as an ingredient of a recipe or composite dish, e.g. cornflour; sixty-two items were unusual food items that were consumed infrequently (fifty-seven of which were consumed less than ten times by the total sample); three food codes were unique to a study participant or were not similar enough to be recoded or aggregated with other food items.

Statistical analysis
To test the suitability of this concise food list for use in the Foodbook24 tool, we evaluated the level of agreement between estimates of nutrient intake using the concise food list (n 751 food codes) together with generic food composition data with the extensive complete food list used in the NANS (n 2319 food codes), when applied to the NANS food consumption dataset. The distributions of nutrient intakes were assessed for normality using Kolmogorov-Smirnov tests and the natural log and square root transformations were applied, where appropriate. All statistical analyses were performed in SPSS © version 21 (IBM). An α of 0·01 was selected to compensate for a possible increased risk of type 1 errors due to the use of multiple tests.
We evaluated the level of agreement between estimates of intake derived from the concise food list and extensive food list composition sources in a number of ways.
Association between estimates. Estimates of nutrient intake from food sources only were calculated using both the extensive and concise list of food codes and a paired-samples t test, or a Wilcoxon signed-rank test where appropriate, were performed to identify significant differences in estimates of intake. Cohen's d (26) was used to examine the effect size of the differences between both the extensive food list and the concise food list estimates. The percentage difference between estimates of intake from food sources was calculated by subtracting the mean intake estimate using the old (extensive list) from the new (concise list) mean estimate of intake and dividing this value by the mean old (extensive list) estimate and multiplying by 100.
Cross-classification analysis. Cross-classification analysis quantified the level of agreement between categorisation of estimates (from food sources only) into thirds (tertiles) of the intake distribution. The percentage of participants categorised into the same tertile of intake was calculated and the level of agreement quantified using Cohen's κ statistic. For macronutrients, the percentages of participants meeting the recommendations for protein (27) , carbohydrate (28) , total fat (29) , saturated fat (30) and dietary fibre (28) proposed by the European Food Safety Authority (EFSA) and the UK Department of Health were identified. Using the cut-point method proposed by Carriquiry (31) and described by the US Institute of Medicine (32) , the proportion of NANS participants with micronutrient intakes (from all sources) below the estimated average requirement (EAR) (30,33,34) and above the tolerable upper intake level (UL) (35)(36)(37) were calculated and the level of agreement was analysed using Cohen's κ statistic. Given the large impact of nutritional supplements on micronutrient intakes (38) and on estimates of intake relative to dietary reference values (39) , Foodbook24 will approach supplement use in an open-ended manner, allowing the user to enter the brand of nutritional supplement they use into the tool. Thus, for the purpose of this study, we did not reduce the number of nutritional supplement food codes. As misreporting of food and energy intake and particularly under-reporting is a known issue in all methods of dietary assessment, under-reporters of energy intake (29·3 %), identified as having an energy intake: BMR (40) ratio of <1·1 (41) , were excluded from the EAR analysis.

Results
Using the criteria described above, the number of food codes selected for use in the concise food list was reduced from 2319 to 751 food codes, representing a reduction of 68 % ( Table 1). The largest reductions in the number of food items were made for 'butter, spreading fats and oils', 'eggs and egg dishes' and 'bread and rolls' (78, 77 and 75 %, respectively). The number of food items in all other food groups was reduced by 53-73 %.
Small differences in estimates of mean nutrient intakes from 'food sources only' were observed when the concise food list was applied to the NANS food consumption database (Table 2), where the magnitude of the difference in energy, protein, carbohydrate, starch, sugar, non-milk sugars, total fat, polyunsaturated fat and alcohol intakes was <1 % at a population level. The largest differences were observed for saturated fat (+2·0 %) and monounsaturated fat (−2·3 %). The percentage difference between estimates of dietary fibre was −1·4 %. Differences in estimates of micronutrient intakes were <2 % for retinol, carotene, vitamin A, niacin, folate, folic acid, biotin, pantothenate, vitamin C, Ca, Mg, P, Zn and Cu. Larger differences between estimates were observed for vitamin D The practical significance of the difference between estimates of intake, as defined by Cohen's convention for effect size, was small for all nutrients, where all values were <0·1.
When mean estimates of nutrient intake were calculated from all sources (including supplements) (Supplementary  Table S2), the differences in mean daily intake using the extensive and concise food lists were even smaller (<4 %) than those observed from food sources only. However, as with the estimates from food sources only, the practical significance of the difference between estimates of intake from all sources (including supplements) was small. The association between estimates of nutrient intake (from food sources only) was assessed by calculating the percentage of participants who were categorised into the same third of the intake distribution based on the extensive (n 2319) and concise (n 751) food lists ( Table 3). The majority of participants were classified into the same third of the nutrient intake distribution when using the concise and extensive food lists, ranging from 80 % for vitamin E to 97 % for alcohol.
The proportion of adults with adequate protein intakes was 93 and 92 % using both the concise and the extensive food lists, respectively, and 98 % of participants were classified similarly by the two methods (κ 0·883; P < 0·001). Using both lists, the proportions of participants classified as meeting dietary reference values for macronutrients were similar (carbohydrate, 52 % meeting dietary reference value where 91 % of participants were categorised similarly (κ 0·813; P < 0·001); and for total fat, 59 % meeting dietary reference value, where 90 % were similarly classified (κ 0·787; P < 0·001)). Though a lower proportion of participants met the UK Department of Health recommendation for saturated fat using the concise food list (15 %) than the extensive food list (18 %), the level of agreement between estimates was still relatively good (92 % similarly classified; κ 0·720; P < 0·001). The proportions of participants with intakes of dietary fibre below the adequate intake of 25 g/d were similar using both lists (81 and 82 %, respectively; 96 % of participants similarly classified; κ 0·875; P < 0·001) ( Table 4).
There was a high level of agreement between the proportion of individuals identified with intakes below the EAR and above the UL using both the concise and extensive food lists, where 96-100 % of participants were classified into the   same category of nutrient adequacy (κ 0·755-1·000) ( Table 5). With respect to the proportion of individuals with intakes above the UL, there were minimal differences between the estimates derived from the extensive or concise food lists ( Table 6). In terms of classification of individuals into categories of excessive intake, >99·7 % of individuals were classified into the same category of excessive intake using the concise food list.

Discussion
The aim of the present study was to describe one of the initial stages in the development of the Foodbook24 tool: the establishment of a concise food list and evaluation of the impact of using this concise food list on estimates of energy and nutrient intake in a nationally representative reference sample of Irish adults. This is an essential first component in the development of the Foodbook24 tool given the critical role that food composition datasets play in obtaining a reliable estimate of nutrient intake (42) . The results of these analyses indicate that the recalculation of energy and nutrient intake using the concise food list resulted in small differences in summary estimates of intake for energy, macronutrients and micronutrients in this reference sample of adults. However, some notable exceptions exist, which will be discussed below.
The present study has demonstrated that even with a considerable reduction in the number of food codes (68 %), the differences between population estimates of intake derived by the concise food list in comparison with the extensive food list were small (less than 6 %). While the differences in estimates of intake were statistically significant for macronutrients such as saturated and monounsaturated fats, dietary fibre, and for a number of micronutrients also, the practical significance of these differences, as defined by Cohen's convention for effect size, was not practically meaningful.
There was a strong positive association between the estimates of daily nutrient intake from food sources and all sources derived using the two food lists. We observed a high proportion of participants classified into the same category of the nutrient intake distribution using the concise food list, which indicated a high level of agreement between the estimates derived from both lists. For energy, protein, carbohydrate, sugars and dietary fibre, over 90 % of participants were classified into the same third of intake. However, we observed a lower degree of cross-classification for estimates of total fat, saturated, monounsaturated and polyunsaturated fats (82-88 % of participants were classified into the same tertile).
It is important to note that in the NANS a systematic approach to assign accurate and reliable fatty acid composition data from multiple sources, i.e. published food tables, papers, packaging information and industry data (43) , was utilised. The observed differences in absolute intakes of saturated and Table 3. Association between estimates of nutrient intake from food sources only using the extensive (n 2319) and concise (n 751) food lists Nutrient Proportion in the same tertile (%) Cohen's κ P Table 4. Percentage of participants meeting dietary reference values for macronutrients (27)(28)(29)(30) and dietary fibre (28) Total fat (%E ≥ 20 and ≤35) %E, percentage total energy intake.
monounsaturated fat in the present study are attributable to loss of subtle fatty acid composition detail as a result of using a generic approach to fatty acid composition data. Though fatty acid intake data show the greatest level of discrepancy, the ability of the concise list to characterise saturated intakes relative to dietary reference values is still relatively good at a population level (saturated fat ≤10 % total energy: 18 v. 15 %, κ 0·72). Similar to differences in absolute intakes, we found that the lower degree of cross-classification observed for fats was attributable to changes in fatty acid composition data, particularly for meat and meat products (saturated and polyunsaturated fat intake), and butters, spreading fats and oils (monounsaturated and polyunsaturated fat intake). The recoding of food items such as soups and stir-in sauces had a modest impact on estimates of total fat, saturated and monounsaturated fat intake. Should the Foodbook24 tool be used for epidemiological research with a particular focus on dietary fat intake, we would advise that the above food groups be disaggregated to some extent to capture the subtle but important differences in the monounsaturated and polyunsaturated fat content of certain food groups.
Similarly, estimates of dietary fibre intakes in the NANS were obtained using a systematic approach to food composition data, ensuring that the most accurate and reliable information from published sources and the food industry are incorporated into dietary survey databases (44) . While a small difference between estimates was observed (−1·4 %), the practical significance of the difference was minimal, and the level of cross-classification was quite good for absolute intakes (91 % classified into the same third of the intake distribution; κ 0·86) and for the proportion of adults meeting the EFSA reference value for dietary fibre (19 and 18 % of adults with intake ≥25 g/d).
An important function of collecting dietary intake data in large samples of the population is to evaluate the prevalence of nutrient adequacy and risk for excessive intakes. With respect to absolute intakes of micronutrients, we found that 80-91 and 84-93 % of participants were classified into the same third of vitamin and mineral intake distributions (from food sources), respectively. Application of the concise food list to the NANS database did not significantly alter the proportion of the population with intakes below the EAR or above the UL, with over 96 and 99 % of individuals classified into the same categories of nutrient adequacy and risk of excessive intake, respectively. As the concise short list does not include brand-level information we advise interpreting data on micronutrient adequacy and potential for excessive intake with some level of caution.
While it is well recognised that the quality and completeness of food composition data have a major influence on the reliability of estimates of nutrient intake, few studies, including dietary assessment method development studies, have presented data on the development of food lists or evaluation of their performance. Extensive food lists and associated food composition databases (often including brand information) are incorporated into a number of existing online dietary assessment tools (4,6,8,13,45) . However, in the present study a shortened, more concise, food list was developed to limit participant burden by reducing the number of foods that the participant has to choose from. To the best of our knowledge, there are a limited number of studies that have evaluated the impact of reducing the number of food codes and using generic food composition data. Freese et al. (46) recently developed a short food list (n 246) for use in a 24-h recall tool which forms part of a blended strategy to estimate usual nutrient intakes. Freese et al. (46) identified food items consumed in the German national nutrition surveys, NVS I and NVS II, which accounted for 75 % of the variation of intakes of twentyseven nutrients using a stepwise linear regression approach, supplemented with additional checks for completeness.
The results of the present study show that a concise food list developed for use in the Foodbook24 tool shows good performance in assessing population food and nutrient intakes in Irish adults. However, there are some limitations on the ability of the concise list to estimate intakes of fatty acids and some micronutrients at an individual level. It is worth noting again that the intention of the shortened food list was to limit  Table 5. Classification of participants identified with intakes below the estimated average requirement (EAR) (30,33,34) (excluding under-reporters (40,41) ) from all sources including nutritional supplements participant burden by reducing the number of foods the participant has to choose from. The concise food list developed has a number of key strengths. The food list items were derived from nationally representative food consumption data and are therefore reflective of foods that are actually consumed by the Irish adult population. The NANS food consumption dataset is very detailed, where each food item has been recorded and described to brand level. In the development of the concise food list, the number of food and beverage items was considerably reduced. As with all food composition data, it would be important that the concise food list is monitored and updated to reflect any changes in food consumption patterns and to incorporate new foods on the market.

Conclusion
Our evaluation of the concise food list developed and applied to the NANS food consumption dataset as a case study has shown a high level of agreement with a more extensive list with respect to estimating nutrient intakes and categorising nutrient intakes according to dietary reference values. This indicates that a concise food list is suitable for estimating food and nutrient intakes of Irish adults and can be incorporated into a web-based 24-h dietary recall tool, provided the purpose of the analysis fits within the recognised scope and coverage of the concise list.

Supplementary material
The supplementary material for this article can be found at https://doi.org/10.1017/jns.2017.49