Multidimensional Analysis of Food Consumption Reveals a Unique Dietary Profile Associated with Overweight and Obesity in Adolescents

There is a significant increase in overweight and obesity in adolescents worldwide. Here, we performed a cross-sectional study to examine the potential association between food consumption profiles and overweight in a large number of adolescents from Brazil. Sampling by clusters and conglomerates was carried out in students of public schools in Salvador, Brazil, between June and December 2009, and 1496 adolescents were evaluated. Socio-epidemiological data, anthropometric status and food consumption were recorded. Multivariate analyses, such as hierarchical clustering and correlation networks, were used to perform a detailed description of food consumption profiles. There were differences in age and anthropometric status related to sex. Four clusters of food groups were identified based on the intake profile in the study population. No disparities in food intake were observed in individuals stratified by sex or anthropometric status. Furthermore, network analysis revealed that overweight and obesity were hallmarked by a selectivity in the ingestion of food groups that resulted in the appearance of inverse correlations of consumption, which were not present in eutrophic adolescents. Thus, overweight and obesity are associated with preferential choices of ingestion of specific food groups. Such knowledge may serve as a basis for future targeted nutritional interventions in adolescents.


Introduction
Adolescence is a transition period between childhood and adulthood, in which social and psychological changes occur, influencing social behavior [1]. The intense transformations experienced in this age range increase the risk of developing adverse physical and psychosocial health problems. For example, during the teenage period, an individual is more likely to express higher concerns related to their physical appearance [1], often resulting in initiation of inadequate diets that can be deleterious to health.

23 were randomly selected and then three classes per school were chosen; in each class, an average of 30 students were selected and interviewed. Thus, 1561 students were evaluated and, after reviewing the questionnaires, 81 adolescents were not included in the study because they met exclusion criteria, namely age greater than 18 years, presence of a physical problem, gestation or lactation. Thus, the sample consisted of 1496 students. The delineation of study sample selection is shown in Figure 1.

Information on the economic conditions of families was provided by the parents. Stratification of the economic status followed the criteria specified by the Brazilian Federal Government (Critério de Classificação Econômica, Brazil), which established that a monthly average family wage below US$75.00 is considered a poor economic status, whereas a value above US$145.00 denotes a relatively good economic status.
Other data, such as age, sex, pubertal development following criteria published previously [11,12] and food consumption were self-reported by the students and recorded in appropriate standardized and previously validated questionnaires. Weight and height were measured. The data was captured in standardized report forms by a panel of trained nutrition technicians.

Dietary Evaluation
The food intake was assessed using a semi-quantitative food frequency questionnaire (FFQ) consisting of 97 food items. The questionnaire was constructed to fit the reality of the students of public schools in Salvador and later validated by our group [13]. The FFQ was applied directly to the students, who reported on their food consumption outside and inside the household. The frequencies of consumption of these food items were provided by the adolescents, with the following response options: Never/rare; 1 to 3 times a month; 1 time per week; 2 to 4 times a week; ≥4 times per week. In addition, the number of times a teenager consumed these food items was investigated. After the data collection, using food composition and nutrition tables, we standardized the consumed quantities of each food and/or preparation to units of weight (g) and/or volume (mL), which were used to calculate the daily consumption of the foods registered with the FFQ. Using this approach, data collected to measure consumption of food in a month (total month consumption) were converted to daily consumption (divided by the number of days in a given month) as previously described [13]. Industrialized foods and/or preparations that were not included in the tables were searched for on the internet, directly on the manufacturer's website or through recipes. Thus, it was possible to obtain a proxy of the daily total food consumption in grams by calculations based on weekly and monthly consumption.
Nutrients 2019, 11, 1946
For the statistical analysis of food consumption, the 97 food items that composed the FFQ were grouped according to the similarity in nutritional composition and food habits of the population of the Northeast of Brazil resulting in 14 previously defined food groups: Sugar and sweets, typical Brazilian dishes, sweetened beverages, fast food, oils, milk and dairy, meat, processed meat products, rice and cereals, roots, beans and legumes, vegetables, fruits and coffee. The approach used for creating these groups is described in Table 1.

Milk and dairy
Whole milk powder or liquid, skimmed milk powder or liquid, fermented milk, yogurt (whole, diet or light), chocolate ready, yellow cheese, white cheese, cream cheese, creamy curd (whole or light).

Meat
Bovine (fried or cooked), chicken with or without skin (fried or cooked), cooked or fried fish, seafood, viscera, chicken egg (fried or cooked), dehydrated meat (Jerky beef).

Beans and legumes
Beans, peanuts, nuts and walnuts.

Coffee
Coffee and tea.

a Dishes made of beans and shrimp; abará is steamed, whereas acarajé is deep-fried in palm oil.
b Creamy paste prepared with bread, shrimp, coconut milk, finely ground peanuts and palm oil.
c Made from okra, onion, shrimp, palm oil and toasted nuts (peanut and/or cashew).
d Stew of beans with beef and pork.
e Dish made from a cow's flat white stomach lining.
f Cattleman's Beans.
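The conversion from monthly totals to daily intake and the aggregation of FFQ items into food groups, as described in the Dietary Evaluation section, can be illustrated with a minimal sketch. The item-to-group mapping below covers only a few items listed in Table 1, the function names are hypothetical, and the monthly totals are invented for illustration; this is not the authors' actual processing pipeline.

```python
import calendar

# Partial, illustrative mapping of FFQ items to food groups (subset of Table 1).
FOOD_GROUPS = {
    "yogurt": "Milk and dairy",
    "white cheese": "Milk and dairy",
    "chicken egg": "Meat",
    "beans": "Beans and legumes",
    "coffee": "Coffee",
}


def daily_intake(monthly_total, year, month):
    """Divide a monthly total (g or mL) by the number of days in that month."""
    days_in_month = calendar.monthrange(year, month)[1]
    return monthly_total / days_in_month


def group_daily_intake(monthly_by_item, year, month):
    """Sum the daily intake of individual FFQ items within each food group."""
    totals = {}
    for item, monthly_total in monthly_by_item.items():
        group = FOOD_GROUPS[item]
        totals[group] = totals.get(group, 0.0) + daily_intake(monthly_total, year, month)
    return totals


# Invented monthly totals (g or mL) for one hypothetical student in June 2009.
example = {"yogurt": 1200.0, "white cheese": 300.0, "beans": 3000.0, "coffee": 1500.0}
print(group_daily_intake(example, 2009, 6))
```

Keeping the mapping in a single dictionary makes the grouping rule explicit and easy to audit against Table 1.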

Assessment of Anthropometric Status
Participants were weighed on a portable digital scale (Master Balancas, Goiania, Brazil), and their height was measured using a Leicester Height Measure portable stadiometer (Seca, Hamburg, Germany). The weight of the uniform (100 g) was subtracted during the analysis. WHO reference tables (2007) [14] with percentile values of the body mass index (BMI = weight [kg]/height² [m²]) were used to assess the anthropometric status according to age and sex. In addition, the 2006 WHO criteria were used to categorize the anthropometric status: underweight (<3rd percentile), normal weight (≥3rd percentile and <85th percentile, the reference category), overweight (≥85th percentile and <97th percentile), or obese (≥97th percentile).
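The BMI calculation and the percentile cut-offs above can be expressed as a short sketch. It assumes the age- and sex-specific BMI percentile has already been looked up in the WHO reference tables (that lookup is not implemented here), and the function names are hypothetical.

```python
UNIFORM_WEIGHT_KG = 0.1  # weight of the school uniform (100 g), subtracted before analysis


def bmi(measured_weight_kg, height_m):
    """Body mass index in kg/m^2 after subtracting the uniform weight."""
    return (measured_weight_kg - UNIFORM_WEIGHT_KG) / height_m ** 2


def anthropometric_status(bmi_percentile):
    """Categorize a BMI-for-age percentile (0-100) using the cut-offs in the text."""
    if bmi_percentile < 3:
        return "underweight"
    if bmi_percentile < 85:
        return "normal weight"
    if bmi_percentile < 97:
        return "overweight"
    return "obese"


# Invented example: 50.1 kg measured in uniform, 1.60 m tall, 90th percentile.
print(round(bmi(50.1, 1.60), 2), anthropometric_status(90))
```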

Statistical Analysis
Descriptive statistics were performed to characterize the study population. Continuous variables were tested for Gaussian distribution using the D'Agostino-Pearson test. No variables exhibited normal distribution. Thus, medians and interquartile ranges (IQR) were used as measures of central tendency and dispersion. All comparisons were pre-specified. The non-parametric Mann-Whitney U test was used to compare distributions of continuous variables between two analytical groups whereas the Kruskal-Wallis test with Dunn's multiple comparisons post-test was used to compare more than two groups. Categorical variables presented as percentage were compared using the Pearson's chi-square test. Furthermore, we performed a number of additional analyses employed for Big Data and Systems Biology to provide novel insights in data visualization and interpretation.
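As an illustration of the descriptive and non-parametric procedures listed above, the following sketch applies them to invented, right-skewed data using SciPy. The authors used GraphPad Prism, JMP and R; SciPy is substituted here only for illustration, all values are simulated, and Dunn's post-test is omitted because it is not part of SciPy.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)

# Invented, right-skewed intake values (g/day) for three hypothetical groups.
group_a = rng.lognormal(mean=5.0, sigma=0.5, size=60)
group_b = rng.lognormal(mean=5.1, sigma=0.5, size=60)
group_c = rng.lognormal(mean=5.2, sigma=0.5, size=60)

# D'Agostino-Pearson test for departure from a Gaussian distribution.
_, p_normal = stats.normaltest(group_a)

# Median and interquartile range as measures of central tendency and dispersion.
median_a = np.median(group_a)
q1_a, q3_a = np.percentile(group_a, [25, 75])

# Two groups: Mann-Whitney U test. More than two groups: Kruskal-Wallis test.
_, p_two_groups = stats.mannwhitneyu(group_a, group_b, alternative="two-sided")
_, p_three_groups = stats.kruskal(group_a, group_b, group_c)

# Categorical data: Pearson's chi-square test on an invented 2 x 2 table of counts.
counts = np.array([[60, 540], [50, 750]])
chi2, p_chi2, dof, expected = stats.chi2_contingency(counts)

print(p_normal, median_a, (q1_a, q3_a), p_two_groups, p_three_groups, p_chi2)
```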
With the initial objective of evaluating the simultaneous consumption of food groups in the sample ("Which food groups presented similar consumption profile in the study population?"), a bi-directional, unsupervised, hierarchical cluster analysis was performed using Ward's methodology (grouping both individuals and food consumption in grams) with bootstrap. For this analysis, a heat map was constructed with total consumption values in grams for each food group. Clustering was based on normalized consumption data in z-scores of the mean overall consumption of each food group. This analysis results in identification of clusters of individuals based on the similarity of consumption of the various food groups in grams. In this approach, dendrograms represent Euclidean distances and infer similarity [3]. A constellation plot was used to perform two-dimensional visualization of the clusters created by the hierarchical analysis [15]. In this analysis, distance also infers similarity in overall consumption of food groups.
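A minimal sketch of two-way hierarchical clustering with Ward's method on z-scored intake data is given below. It uses SciPy rather than the software employed by the authors, omits the bootstrap step and the constellation plot, and runs on an invented intake matrix with two clearly separated consumption levels.

```python
import numpy as np
from scipy.cluster.hierarchy import fcluster, linkage
from scipy.stats import zscore

rng = np.random.default_rng(1)

# Invented intake matrix in grams: 30 individuals (rows) x 4 food groups (columns).
low_consumers = rng.normal(loc=100.0, scale=5.0, size=(15, 4))
high_consumers = rng.normal(loc=300.0, scale=5.0, size=(15, 4))
intake = np.vstack([low_consumers, high_consumers])

# Normalize each food group (column) to z-scores before clustering.
z = zscore(intake, axis=0)

# Ward linkage on Euclidean distances, once for individuals and once for food groups.
individual_linkage = linkage(z, method="ward", metric="euclidean")
food_group_linkage = linkage(z.T, method="ward", metric="euclidean")

# Cut the dendrogram of individuals into two clusters.
individual_cluster = fcluster(individual_linkage, t=2, criterion="maxclust")
print(individual_cluster)
```

The two linkage matrices correspond to the row and column dendrograms of a two-way heat map; plotting is left out to keep the sketch short.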
Correlations between dietary intake values were assessed using the Spearman test. Spearman correlation matrices were constructed for each subgroup of individuals. The matrices were submitted to 100X bootstrap [16,17]. Bootstrapping was used to estimate more realistically the distribution of the correlations per group of individuals, accounting for multiple measures/comparisons using random sampling methods. This approach is widely used in multidimensional analyses to increase the accuracy of the statistical findings. Only statistically significant correlations (p < 0.05) with absolute Spearman rank coefficient (rho [r]) values > 0.5 (pre-specified as strong in the present study) and that remained significant in at least 50% of the bootstraps were included in the network analyses. The density of connections was calculated on each bootstrap according to the following formula: L/(N × [N−1]/2), where L represents the number of statistically significant correlations (p < 0.05) and N is the number of nodes (parameters). The network density, therefore, infers the number of significant correlations in relation to the total possible number of correlations in the matrix [16,17]. These values of network density were compared between groups using the Kruskal-Wallis test with Dunn's multiple-comparison post-test. In the correlation network analyses, the identification and characterization of nodes were performed by comparing the number of statistically significant correlations for each food group in the different subgroups of individuals. Heat maps were constructed to optimize visualization of different patterns of food group consumption and nodal relationships in the networks.
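The network construction described above can be sketched as follows. Several details are assumptions made for illustration: individuals are resampled with replacement, L is counted after applying both the p-value and the rho filters, and the data are simulated. The function names are hypothetical and this is not the authors' implementation.

```python
import itertools

import numpy as np
from scipy.stats import spearmanr


def significant_edges(data, alpha=0.05, min_abs_rho=0.5):
    """Return {(i, j): rho} for column pairs with p < alpha and |rho| > min_abs_rho."""
    edges = {}
    for i, j in itertools.combinations(range(data.shape[1]), 2):
        rho, p_value = spearmanr(data[:, i], data[:, j])
        if p_value < alpha and abs(rho) > min_abs_rho:
            edges[(i, j)] = rho
    return edges


def network_density(n_edges, n_nodes):
    """Density = L / (N x [N - 1] / 2)."""
    return n_edges / (n_nodes * (n_nodes - 1) / 2)


def bootstrap_network(data, n_boot=100, keep_fraction=0.5, seed=0):
    """Keep edges present in at least keep_fraction of bootstraps; return per-bootstrap densities."""
    rng = np.random.default_rng(seed)
    n_rows, n_nodes = data.shape
    counts, densities = {}, []
    for _ in range(n_boot):
        sample = data[rng.integers(0, n_rows, size=n_rows)]  # resample individuals
        edges = significant_edges(sample)
        densities.append(network_density(len(edges), n_nodes))
        for pair in edges:
            counts[pair] = counts.get(pair, 0) + 1
    stable = sorted(pair for pair, c in counts.items() if c >= keep_fraction * n_boot)
    return stable, densities


# Simulated data: columns 0 and 1 move together, column 2 moves inversely, column 3 is noise.
rng = np.random.default_rng(42)
base = rng.normal(size=200)
data = np.column_stack([
    base,
    base + 0.1 * rng.normal(size=200),
    -base + 0.1 * rng.normal(size=200),
    rng.normal(size=200),
])
stable_edges, densities = bootstrap_network(data)
print(stable_edges, np.median(densities))
```

The sign of rho for each retained edge distinguishes positive from inverse relationships, which is the qualitative feature examined in the network analyses.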
Study power was calculated using JMP 13.0 (SAS, Cary, NC, USA). The predicted sample size per group required to achieve a study power of 90% with an alpha value of 5%, and to find at least a 1.5-fold variation in overall consumption of at least one food group between the distinct anthropometric strata, was 50, much lower than the total number of individuals recruited. Differences with p values lower than 5% after adjustment for multiple comparisons using the Holm-Bonferroni method were considered statistically significant. Statistical analyses were performed using GraphPad Prism 8.0 (GraphPad Software, La Jolla, CA, USA), JMP 13.0 and R 3.5.0 (R Foundation, Vienna, Austria).
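For readers unfamiliar with the Holm-Bonferroni adjustment mentioned above, a minimal step-down sketch is shown below. The function name and the p-values are invented for illustration, and the conventional "less than or equal to" comparison is used.

```python
def holm_bonferroni(p_values, alpha=0.05):
    """Return a list of booleans, True where the null hypothesis is rejected."""
    m = len(p_values)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    reject = [False] * m
    for rank, idx in enumerate(order):
        # The k-th smallest p-value (rank = k - 1) is compared with alpha / (m - k + 1).
        if p_values[idx] <= alpha / (m - rank):
            reject[idx] = True
        else:
            break  # step-down procedure: stop at the first non-rejection
    return reject


# Invented p-values from four hypothetical comparisons.
print(holm_bonferroni([0.01, 0.04, 0.03, 0.005]))
```

In this example the two smallest p-values pass their thresholds (0.0125 and about 0.0167), while 0.03 exceeds 0.025, so the procedure stops and the remaining comparisons are not declared significant.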

Ethics Statement
The research project was approved by the Ethics and Research Committee of the Institute of Collective Health of the Federal University of Bahia (protocol no. 002-08CEP/ISC). Written informed consent was obtained from all participants or their legally responsible guardians, and all clinical investigations were conducted according to the principles expressed in the Declaration of Helsinki.

Characteristics of Participants
The characteristics of the study participants are depicted in Table 2. Among the 1496 students enrolled, 642 were males (42.9%) and 854 females (57.1%). The median age was 14.3 years (interquartile range: 13.1-15.5). Female participants were on average younger than male individuals (p < 0.0001; Table 2). Approximately half of the study population reported a poor family socioeconomic status, with no difference between the subgroups stratified by sex (p = 0.3956). Regarding anthropometric status, the majority of adolescents were classified as eutrophic (n = 1155, 77.2%), whereas 8.8% (n = 132) were overweight and 5.9% (n = 89) were obese (Table 2). Female adolescents exhibited higher median BMI values than male participants (p = 0.006, Table 2). In addition, the overall frequency of the distinct anthropometric statuses was different between male and female participants (chi-square p-value: 0.005, Table 2). Male individuals were more frequently underweight than females (10.4% vs. 6.2%, respectively). With regard to pubertal development, the majority of the participants were in the post-pubertal stage (n = 1040, 69.5%), with a higher frequency of the most advanced stage of sexual maturation in females and of the pre-pubertal stage among boys (p < 0.001; Table 2).

Evaluation of Food Consumption Profiles
The dietary items mentioned by the participants in their records were grouped into 14 food groups as described in Methods and depicted in Table 1. In order to initially understand the simultaneous consumption of different food groups, unsupervised cluster analysis was performed, in which the food groups were sorted according to similarity of food consumption in grams in the study participants (Figure 2A). Using this approach, it was possible to identify four clusters of food groups. Individuals who reported consuming more beverages also consumed more sugar and sweets (Figure 2A). Typical Brazilian dishes, meat, fast food and milk and dairy formed a large group of foods with a similar consumption profile. Rice and cereals, fruits, vegetables and roots formed the third group, and oils, processed meat products, beans and other legumes and coffee completed the fourth group of food consumption (Figure 2A). Of note, the overall profile of food consumption was not able to distinguish male from female study participants (Figure 2A), indicating that sex did not impact the total intake of the different food groups evaluated. The hierarchical cluster analysis was also used to simultaneously test whether individuals presenting with different anthropometric statuses would be grouped separately based on the overall food consumption profile. Interestingly, this statistical approach revealed that adolescents with divergent anthropometric statuses did not exhibit a distinct dietary profile when all the food groups were considered (Figure 2A). It was possible to detect three major groups of study participants based on the overall food consumption profile. In one smaller group (n = 300, which represented approximately 20% of the study participants), the overall consumption of the different food groups was high, whereas two other groups displayed relatively medium (n = 569, 38%) or low consumption (n = 627, 42%). Again, no preferential grouping related to sex or anthropometric status was observed between the three main clusters of food consumption (Figure 2A). A constellation plot, which is used to display a two-dimensional visualization of the clusters, demonstrated that the group of individuals who exhibited a high consumption profile was more divergent than the other two groups of adolescents, who had medium or low consumption (Figure 2B).

Figure 2. Analysis of the consumption of dietary groups using hierarchical cluster. The total consumption in grams obtained for each food group was calculated. (A) Two-way hierarchical cluster analysis (Ward's method, unsupervised, with 100X bootstrap), in which the dendrograms represent Euclidean distance, was used as an approach to identify the similarity profile of the consumption of distinct food groups. Using this approach, it was possible to identify four clusters of food groups that exhibited similar patterns of consumption in the general population. Three main subgroups of participants based on overall food consumption were observed. (B) Constellation plot of the hierarchical clusters shows similarities between subgroups of study participants stratified by overall food intake.

Furthermore, we directly compared the individual food groups in the entire population and ranked them based on total consumption. We found that sweetened beverages composed the most consumed food group among all 1496 individuals, followed by rice and cereals, fruits, sugar and sweets and fast food (Figure 3). The least representative groups were vegetables, oils, roots and processed meat products. There were no differences in individual food group consumption between male and female participants (Figure 3, left panel). We further performed additional univariate analyses adjusted for multiple comparisons to try to delineate the factors which were associated with the different anthropometric statuses (Table 3). We found that underweight individuals presented more frequently with pre-pubertal or pubertal development, whereas post-pubertal adolescents were more common in the groups of overweight or obese individuals (chi-square p < 0.001). Intriguingly, analyses revealed that total consumption of each food group (measured in grams) was not different between the distinct groups of anthropometric status (Figure 3, right panel and Table 3).

Network Analyses of Food Consumption
Analyses of dietary group intake had so far failed to reveal substantial, absolute quantitative differences between subgroups of adolescents stratified according to sex or anthropometric status. No single food group provides all the nutrients required for good health; thus, it is essential to eat a variety of foods to meet all vitamin and nutrient requirements. A balanced consumption of the different food groups is described as ideal to promote health. The next step was to examine the correlation profiles between the ingestion of the different food groups using network analysis [16,17]. This statistical approach makes it possible to evaluate whether augmented consumption of a given food group is accompanied by preferential increases or decreases in consumption of other groups, indicating dietary preferences of subgroups of individuals. We first tested direct correlations between the consumption of different food groups and BMI values and found no statistically significant relationship (Table 4), arguing that the consumption of a given individual food group was not associated with variation in BMI in the study population. When the study participants were grouped according to the anthropometric status, it was observed that the great majority of the correlations were positive, indicating that increased consumption of a given food group was related to greater consumption of other groups (Figure 4A). Indeed, among eutrophic individuals, only positive correlations among consumption of foods were observed. A similar pattern of correlations was found in the group of underweight individuals (Figure 4A). Importantly, the number of negative correlations between food groups increased as the anthropometric status moved towards obesity. In addition, among underweight individuals, a lower number (n = 2) of correlations involving coffee intake was found (Figure 4A).
When examining the overweight group, more peculiarities were found. A number of negative correlations were observed between ingestion of food groups (Figure 4A). Thus, the higher the intake of typical Brazilian dishes, the lower the intake of rice and cereals and processed meat. Increased intake of sweetened beverages was negatively associated with intake of oils as well as with beans and legumes. In addition, lower intake of vegetables correlated with increased consumption of oils and fast food (Figure 4A). Among individuals with obesity, once again several negative correlations between the consumption of food groups were observed, demonstrating that specifically in this category there is preferential intake of some food groups in relation to others. For example, the consumption of vegetables was inversely correlated with the consumption of fast food, processed meat, rice and cereals, coffee and milk and dairy. Consumption of beans and legumes was also inversely correlated with the intake of milk and dairy products, sugar and sweets, sweetened beverages and coffee (Figure 4A).

Further analysis of the network densities in each of the 100 bootstraps performed in the correlation matrices built from the individuals with different anthropometric statuses revealed a reduction in the number of correlations between the consumption of the food groups in non-eutrophic conditions (Figure 4B). The lowest average network density was found among obese individuals (Figure 4B). These results indicate that there are quantitative (in numbers of statistically significant correlations) and qualitative (positive vs. negative correlations) changes in dietary profiles in conditions such as overweight and obesity.

Finally, a node analysis was performed to identify which food groups had their consumption most related to the consumption of other foods. It was noted that among the eutrophic study participants, all food groups contributed in a similar way to the number of significant correlations. In underweight adolescents, the number of correlations changed and there was a slightly greater predominance of fruits, rice and cereals and sugar and sweets, and a great reduction of the importance of the consumption of coffee in the correlation matrices. In overweight participants, the consumption of meat and fruit formed the most relevant nodes. These groups were also among the most relevant in individuals with obesity, together with the consumption of milk and dairy (Figure 4C). Thus, the analysis of correlation networks was able to characterize quantitative and qualitative relations of consumption of the different food groups, highlighting differences related to the anthropometric status.

Discussion
In the present study, we performed novel multidimensional analyses adapted from the Big Data and Systems Biology fields to delineate the dietary patterns associated with overweight and obesity in a large number of adolescents from the public school system in Brazil. The results presented here add to the current knowledge in the field as they highlight that adolescents with overweight or obesity, instead of exhibiting higher consumption of individual food groups, present an unbalanced dietary profile hallmarked by relatively selective food consumption.
The findings presented here revealed several differences between male and female study participants. Female subjects were on average younger and had higher BMI values when compared to those of the opposite sex. Boys had a higher frequency of thinness/underweight. In addition, differences were observed in the reported stage of sexual maturation, with females presenting more frequently as post-pubertal, while males were more frequently pre-pubertal. There was no difference in economic condition between male and female participants. Thus, sexual maturation stage could have a higher influence on the anthropometric status than socioeconomic status. This influence was also observed in previous studies [18][19][20][21].
The first exploratory description of dietary intake was performed here by hierarchical cluster analysis. The similarity profile for each food group was identified, based on food intake in grams, and four clusters of food groups were identified. Thus, the amount of sweetened beverages ingested was similar to that of sugar and sweets (group 1). Other food groups that presented similarity in the amount of consumption were typical Brazilian dishes, meat, fast food and milk and dairy (group 2). Rice and cereals, fruits, vegetables and roots formed the third group of consumption, whereas oils, processed meat products, beans and legumes and coffee formed the fourth. These results suggest that the formation of food groups with similarity of consumption reflects a specific dietary profile. The formation of food habits is complex and depends on biological and economic factors as well as on food supply and availability [22]. Teenagers' dietary habits are strongly determined by the environment in which they live, the influence of parents and friends, the food preparations, as well as their flavor. The taste of food is one of the factors that most interferes in the food choices of adolescents; that is, adolescents who have a preference for foods with inadequate nutritional value may increase their consumption of such foods [22,23].
Although cluster analysis is a robust methodology for the identification of dietary patterns [24], few studies have used hierarchical grouping analysis and/or k-means for this purpose, particularly in adolescent populations [25][26][27][28]. Other studies evaluated eating patterns of adolescents using factor analysis [29][30][31][32]. In the area of nutritional epidemiology, there is a growing interest in innovative methods that bring more inferential information, with advantages over conventional methods, to better understand the complexity of eating practices. Traditionally, the evaluation of the food pattern occurs through factor analysis or principal components. This approach has the advantage of resulting in linear scores with adequate statistical power; however, such scores are abstract and it is difficult to understand what they really mean [33]. Other methods include post-reduction regressions [34], Gaussian models [33] and, more recently, hierarchical grouping analysis [23].
The present study used clustering analysis because this method has the advantage of providing a clear description of exactly what groups of individuals are consuming [5,24,35,36] because the individual belongs to a single cluster, which is useful to help in the implementation of nutritional interventions. One of the few limitations of this method is the low power to detect associations with clinical outcomes when the sample size is small [3], however this is not the case of the present study. In addition, the FFQ is not 100% reliable as it depends on memory and it does not record consumption of food groups not listed in the questionnaire, leading to potential sub-notification of food intake. Regardless, the results presented here are robust because they result from a mixture of powerful statistical analyzes that, together, define the details of the food consumption of the adolescents related to the anthropometric status.
Although the present study detected differences in anthropometric profile and other characteristics between male and female participants, no differences were found in the total consumption of the different food groups. This means that, on average, boys and girls presented similar food group consumption. Additional analyses of mean consumption by anthropometric status failed to identify statistically significant differences. Thus, in the study participants, the average consumption of food groups does not seem to have been influenced by sex or anthropometric status. Additional studies using similar analyses in populations of different age groups are needed to clarify whether the absence of dissimilarity in mean consumption among the various subgroups evaluated here depends on age range or on another characteristic not examined in the present study.
The various complementary analyses outlined a detailed food consumption profile, from the point of view of both the individual and the food groups in the study population. We used multidimensional statistical techniques to evaluate the balance of the intake of the different food groups in order to infer the quality of the diet. The Spearman correlation networks were able to define particularities regarding the relationships between differential food consumption. Similar approaches based on correlations have already been published in nutrition [37], but not in the form of statistical interaction networks, which are more common in transcriptomic analyses [38] and immunology [16,17]. The present study is therefore innovative in using such techniques to visualize consumption profiles in pre-specified groups of individuals. These methodologies were able to highlight correlations that have not been identified with other statistical approaches. Regarding anthropometric status, the network analysis revealed at least three main results: (i) a large number of statistically significant correlations were observed among the different food groups in all categories, indicating that the individual's diet is a coordinated process that implies the simultaneous consumption of several food groups; (ii) in eutrophy and underweight, the great majority of the correlations were positive, indicating that the higher the consumption of a certain food group, the greater the consumption of other groups in general; (iii) in overweight and obesity, several negative correlations appeared between the ingestion of food groups, indicating that there is preferential intake of some foods in relation to others. Intriguingly, in the univariate analyses presented in Table 3 and Figure 3, individuals with different anthropometric status could not be distinguished based on the total consumption of each food group when examined separately.
Conversely, among individuals with overweight or obesity, several negative correlations between the consumption of food groups were observed in the network analysis. This finding indicates that, rather than being simply hallmarked by increased consumption in grams of one or more food groups, weight gain is associated with a preferential choice in the ingestion of food groups that is not present in eutrophy. Thus, although the average amount of food intake in those who are overweight or obese does not differ from that of eutrophic adolescents, food selection potentially implies a nutritional imbalance that results in weight gain.
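The contrast between positive and inverse correlations described above can be sketched as follows. This is an illustration only, not the analysis code of the study: the helper names `correlation_edges` and `count_signs`, the significance threshold of 0.05, the absence of any multiple-testing correction and the synthetic data are all assumptions.

```python
import numpy as np
from scipy.stats import spearmanr


def correlation_edges(intake, names, alpha=0.05):
    """Return (group_a, group_b, rho) for significant Spearman correlations.

    intake: array of shape (n_individuals, n_food_groups).
    alpha:  assumed significance threshold; no multiple-testing
            correction is applied in this sketch.
    """
    intake = np.asarray(intake, dtype=float)
    rho, p = spearmanr(intake)  # columns are treated as variables
    if np.ndim(rho) == 0:
        # SciPy returns scalars when there are exactly two columns
        rho = np.array([[1.0, rho], [rho, 1.0]])
        p = np.array([[0.0, p], [p, 0.0]])
    edges = []
    for i in range(len(names)):
        for j in range(i + 1, len(names)):
            if p[i, j] < alpha:
                edges.append((names[i], names[j], float(rho[i, j])))
    return edges


def count_signs(edges):
    """Count positive and negative edges in a correlation network."""
    positive = sum(1 for _, _, r in edges if r > 0)
    negative = sum(1 for _, _, r in edges if r < 0)
    return positive, negative


if __name__ == "__main__":
    # Synthetic stratum (hypothetical): one food group is eaten
    # in place of the other two, producing inverse correlations
    rng = np.random.default_rng(1)
    x = rng.normal(size=300)
    demo = np.column_stack([
        x + rng.normal(scale=0.1, size=300),
        x + rng.normal(scale=0.1, size=300),
        -x + rng.normal(scale=0.1, size=300),
    ])
    edges = correlation_edges(demo, ["fast food", "sweets", "fruits"])
    print(count_signs(edges))
```

Applying the same two functions separately to each anthropometric stratum and comparing the counts of negative edges would reproduce, in spirit, the comparison between eutrophic and overweight or obese adolescents discussed here.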

Conclusions
In summary, our study revealed that the presence of overweight or obesity is associated with the preferential choice of ingestion of specific food groups, resulting in the appearance of inverse correlations of consumption, which are not present in eutrophy and underweight. Such knowledge may serve as a basis for future investments in the field of nutritional epidemiology in Brazil.