Introduction

Colorectal cancer is the third most frequent incident cancer and the second cause of cancer death worldwide [1]. Besides well-described inherited genetic predisposition syndromes (e.g., familial adenomatous polyposis and Lynch syndrome), which, however, are responsible for a minority of cases [2], colorectal cancer is associated with several modifiable risk factors [3], including excess adiposity [4], physical inactivity [5] and sedentary lifestyle [6], and cigarette smoking [7], with, therefore, potential for primary prevention [8]. Diet also plays an important role in the etiology of this neoplasm. According to the World Cancer Research Fund (WCRF) and the American Institute for Cancer Research (AICR), wholegrains, food rich in dietary fiber and dairy products decrease the risk of colorectal cancer, while red and processed meat and high amounts of alcohol increase the risk [9].

The large intestine is colonized by a diverse community of microorganisms which constitutes the gut microbiota. The physiological gut microbiota has numerous functions, including gastrointestinal immune stimulation, protection against pathogens, production of essential nutrients such as vitamins, management of bioactive foods and chemical components, and modulation of gastrointestinal epithelial cell proliferation and differentiation [10]. The shift to a dysbiotic microbiota condition has been associated with the development of selected intestinal (e.g., inflammatory bowel disease) and extra-intestinal conditions (e.g., neurologic, respiratory, metabolic, hepatic, and cardiovascular diseases) [11], as well as colorectal cancer [12, 13].

Diet is one of the main drivers in shaping the gut microbiota [14, 15], potentially contributing to disease susceptibility, with habitual diets appearing to have more durable impact on the gut microbiota than short-term transient dietary strategies [16].

While it is well recognized that dietary fiber as a whole has a favorable role on gastrointestinal health, specific fiber types such as inulin-type fructans (ITFs), fructooligosaccharides (FOSs) (e.g., nystose and kestose) and galactooligosaccharides (GOSs) (e.g., raffinose and stachyose) are considered prebiotics, defined as “substrates selectively used by host microorganisms conferring health benefits” [17, 18]. These compounds bypass digestion in the small intestine and are available for bacterial fermentation in the colon; as such, they have the potential to modify the composition and metabolic activity of the gut microbiota [14, 19]. Prebiotics stimulate the growth of presumably beneficial colonic bacteria, mostly, but not exclusively, Bifidobacterium and Lactobacillus species [17, 18]. Along this line, in a recent systematic review and meta-analysis, dietary interventions involving prebiotic fibers have been associated with high abundance of Bifidobacteria and Lactobacilli [20]. Bacterial fermentation of fiber in the colonic lumen produces short chain fatty acids (SCFAs) [21], which are crucial for intestinal health [22]. In particular, the SCFA butyrate has anticarcinogenic and anti-inflammatory properties and favorable effects on colorectal carcinogenesis in animal models [23]. Fiber may also protect against colorectal cancer by increasing stool bulk, thus reducing the transit time through the bowel and the contact of carcinogens in the feces with the colonic mucosa, and by binding bile acids that produce carcinogenic metabolites [24,25,26,27].

Prebiotic fibers occur naturally in certain foods, including selected legumes, grains, fruits and vegetables, and fiber-rich foods, such as whole grains, have been reported to have a prebiotic-like effect on the gut microbiota [28]. However, still limited published data exist on the content of prebiotic fibers in foods, which derive exclusively from studies conducted outside Europe using heterogeneous methodologies [29,30,31,32,33,34].

While several investigations indicate a favorable role of high-fiber diets on colorectal cancer risk [35], whether, and which, fiber fractions with prebiotic activity contribute to this health benefit is yet to be clarified.

Within the PrebiotiCa (The role of prebiotics in the prevention of cancer, an integrated network of Italian case–control studies) study, we quantified in laboratory analyses selected prebiotic fibers (i.e., ITFs, nystose, kestose, 1F-β-fructofuranosylnystose, raffinose and stachyose) in a wide range of foods [36] and applied such estimates to dietary information of subjects participating in an Italian multicentric case–control study to derive subjects’ prebiotic intake. The aim of this analysis was to assess whether a diet rich in fibers with prebiotic activity may reduce colorectal cancer risk.

Methods

Study population

We derived data from an Italian case–control study on colorectal cancer conducted in six Italian areas (i.e., Milan, Genoa, Pordenone/Gorizia, Forlì, Latina, and Naples) in the period 1992–1996 [37]. The study included 1953 histologically confirmed colorectal cancer cases diagnosed no longer than 1 year before the interview (1225 colon cancers and 728 rectal cancers, 1125 men and 828 women, median age 62, range 19–74 years), with no previous cancer diagnosis, and 4154 controls (median age 58, range 19–74 years). Controls were subjects with no history of cancer admitted to the same hospitals as cases for acute, non-neoplastic conditions unrelated to tobacco, alcohol, hormonal or digestive tract diseases and to long-term modifications of diet; 21% were admitted for traumas, 26% for other orthopedic disorders, 24% for surgical conditions, 18% for eye diseases, and 11% for other illnesses.

The study protocol was approved by the local ethical committees, and all participants signed an informed consent. Less than 5% of either cases or controls approached refused to participate.

Data collection

Trained interviewers administered face-to-face a structured questionnaire to cases and controls collecting data on socio-demographic characteristics, anthropometric measures, physical activity, lifetime smoking and alcohol-drinking habits, personal medical history, and family history of cancer.

An interviewer-administered food frequency questionnaire (FFQ) was used to assess study participants’ usual diet during the 2 years prior to cancer diagnosis (for cases) or hospital admission (for controls). The FFQ included the average weekly consumption of 78 foods, food groups or complex recipes; intakes lower than once a week, but at least once a month, were coded as 0.5 per week. Additional questions aimed at assessing fat intake and general dietary habits. From these data, the intakes of selected nutrients, food components, and total energy were estimated using an Italian food composition database [38] using a standardized methodology [39]; data from laboratory analysis, coducted specifically for the present project, were used for estimating subjects’ prebiotic intake. The FFQ was tested for reproducibility [37, 40] and validity [38]. As for reproducibility, correlation coefficients between intakes estimated by two FFQ were 0.67 for fiber, and between 0.6 and 0.7 for most of the FFQ items in the “bread, cereals and first courses” category, between 0.5 and 0.6 for most of the vegetables, root vegetables, tubers roots and legumes, and around 0.6–0.7 for various fruits. In the validation study, the correlation coefficient between the intakes estimated from the FFQ and from two 7-day diaries was 0.58 of fibers, around 0.60–0.65 for energy, available carbohydrates, sugar and starch, around 0.50 for total, animal and vegetable proteins, animal fats and saturated fatty acids.

Quantification of prebiotic fibers in foods

The methodology used for the quantification of prebiotic fibers was described in details [36]. In brief, FOSs (i.e., nystose [glucose–3*fructose], kestose [glucose–2*fructose] and 1F-β-fructofuranosylnystose [glucose–4*fructose]) and GOSs (raffinose [galactose–glucose–fructose] and stachyose [2*galactose–glucose–fructose]) were determined in 78 food sources; ITFs in 7. Food sampling and analysis were conducted at the laboratory of Neotron SpA in Modena, which has a certified laboratory for food analysis. The food products investigated included 15 types of fruits, 32 varieties of vegetables, root vegetables and tubers, 9 types of dried or fresh legumes, and 22 cereals and cereals-based products (both wholegrain and refined products). Most of these were included in the FFQ used in the present case–control study (as a specific FFQ item, as a food of an item including mixed foods, or as a food ingredient of an item consisting of complex recipes). The 78 samples (unique samples) analyzed in this study were collected from supermarkets located in Modena from 17 May to 24 June 2021.

ITFs were determined using an internal analytical method based on AOAC 997.08 procedure. Freeze-dried samples were extracted in hot water (T equal to 85 °C) with mild agitation and the pH checked immediately (pH equal to 6.5–8.0, at 85 °C) (extract A0). A portion of extracted A0 was first hydrolyzed with a sufficient amount of amyloglucosidase solution, taking into account amount of starch and maltodextrins present (extract A1), and second hydrolyzed with a sufficient amount of inulinase solution, taking into account amount of fructans present, and enzyme concentration (Fructozyme) (extract A2). Extract A0, A1 and A2 were injected into a high-performance anion-exchange chromatography coupled to pulsed amperometric detection (HPAE-PAD), previous addition of 2.0 g of glucoheptose internal standard solution in order to determine the following sugars: glucose, fructose, sucrose, maltitol and galactose, and calculate ITFs content using a specific formula. ITFs were determined in fresh onion, garlic, banana, leek, Jerusalem artichoke, artichoke and shallot. The analysis was performed based on a limit of detection (LOD) of the methodology equal to less than 0.005. ITFs content ranged from 25.1 g/100 g in garlic to 1 g/100 g in onion and leek.

FOSs and GOSs in fresh samples were determined according to Manali Aggrawal and Jeff Rohrer method (Thermo Scientific, Application Note 1149: Profiling Fructosyloligosaccharides (FOS)-containing samples by HPAE-PAD. Sunnyvale, CA, 2015). One gram of homogenized sample was extracted with 200 mL of sodium hydroxide 0.0025 M and then analyzed using HPAE-PAD method. The LOD was between < 0.002 and < 0.010. The following molecules were quantified: raffinose (GOS), stachyose (GOS), nystose (FOS), kestose (FOS) and 1F-fructofuranosylnystose (FOS). Total FOSs was calculated as the sum of nystose, kestose and 1F-β-fructofuranosylnystose.

The food with the highest content of FOSs was Jerusalem artichoke (4.45 g/100 g), with other foods containing less than 1 g/100 g; FOSs were represented principally as kestose. Pulses, excluding green beans, had the highest content of GOSs, with a mean content of 1.17 ± 0.87 g/100 g. In particular, raffinose was particularly abundant in dried peas (0.498 g/100 g) and chickpeas (0.463 g/100 g) and stachyose in dried beans (1.905 g/100 g) and peas (1.814 g/100 g) [36].

Data analysis

We derived the odds ratios (OR) of colorectal cancer with the corresponding 95% confidence intervals (CI) according to quintiles (computed among controls) of fiber prebiotic intake by multiple logistic regression models, adjusted for age, sex, study center, years of education (< 7, 7–11, > 11), body mass index (BMI, in quintiles), occupational physical activity (low, medium, high), smoking habits (never, former, current of < 15, current of ≥ 15 cigarettes/day), alcohol intake (in quartiles), age at menopause and use of hormone replacement therapy, diabetes, aspirin use, family history of colorectal cancer, and total energy intake (in quintiles). In a sensitivity analysis, we included in the model additional terms for total fiber intake when the association estimated from the main model was significant. Tests for trends across quintiles were performed by including the examined variable as ordinal. In addition, prebiotics were entered into the models as continuous variables, with a measurement unit equal to the difference between the upper cutpoints of the 4th (i.e., the 80th percentile) and the 1st quintile (i.e., the 20th percentile). We also assessed the associations using “calorie-adjusted” prebiotic intakes, calculated according to the residual method [41]. Stratification for age, sex and BMI and separate analysis by colorectal cancer subsites were performed for prebiotic fibers showing significant association with colorectal cancer. Heterogeneity across strata was evaluated by testing the significance of the product term between the exposure variable in continuous and the dichotomous stratification factor. Heterogeneity across subsites (colon and rectum) was tested using the Wald test.

All the analyses were performed using the SAS software, version 9.4 (SAS Institute, Inc., Cary, NC, USA).

Results

Table 1 gives the distribution of colon and rectal cancer cases and controls by sex, age, and other selected factors. Both colon and rectal cancer cases reported more frequently family history of intestinal cancer than controls. Colon, but not rectal, cancer cases were more educated than controls and reported more frequently a low level of physical activity.

Table 1 Distribution of 1225 cases of colon cancer, 728 cases of rectal cancer and 4154 controls, according to sex, age and other factors. Italy, 1992–1996

Among controls, median daily intakes (mg) were 798 for ITFs, 167 for kestose, 16 for nystose, 2 for 1F-β-fructofuranosylnystose, 94 for raffinose, and 180 for stachyose.

The intake of prebiotic fibers was positively correlated with total fiber intake. Pearson correlation coefficients were 0.31 with total ITFs, 0.49 with kestose, 0.36 with nystose, 0.70 with 1F-β-fructofuranosylnystose, 0.70 with total FOSs, 0.72 with raffinose and 0.45 with stachyose. Kestose was the largest contributor to total FOSs intake (89.7%); nystose (7.9%) and 1F-β-fructofuranosylnystose (2.4%) accounted for a small fraction of total FOSs intake.

Supplementary Table 1 shows the distribution of potential confounders according to quintiles of prebiotic intake among controls. Controls with higher prebiotic intake were less frequently women, had higher total energy intake, reported less frequently history of diabetes, and, with the exception of stachyose, tended to be younger. Moreover, subjects with higher intake of ITFs tended to have a lower level of physical activity and to be more frequently current smokers; those with higher intake of raffinose and total FOSs tended to be more frequently alcohol drinkers. Women with higher ITFs, raffinose or total FOSs intakes were less frequently in menopause.

Table 2 provides the OR of colorectal cancer, and the corresponding 95% CI, according to prebiotic intake. Inverse associations were observed with the intakes of GOSs. The continuous OR were 0.85 (95% CI 0.76–0.96) for raffinose and 0.81 (95% CI 0.74–0.89) for stachyose. The OR for the highest versus the lowest quintile were 0.73 (95% CI 0.58–0.92) for raffinose and 0.64 (95% CI 0.53–0.77) for stachyose, with significant trends of decreasing risk across quintiles. After further adjustment for total fiber intake was performed, the association with raffinose intake attenuated (OR for Q5 vs. Q1: 0.80, 95% CI 0.62–1.04, p for trend across quintiles: 0.067) while that with stachyose intake remained virtually unchanged (OR for Q5 vs. Q1: 0.66, 95% CI, 0.54–0.80, p for trend across quintiles: < 0.001). Similar OR were obtained when using “calorie-adjusted” intakes.

Table 2 Odds ratios (OR) of colorectal cancer and corresponding 95% confidence intervals (CI) according to the intake of selected prebiotic fibers. Italy, 1992–1996

Total ITFs and compounds of the FOSs family were not associated with colorectal cancer.

In subgroup analyses on raffinose and stachyose intakes (Fig. 1), slight variations in the strength of the (inverse) associations were observed, without significant heterogeneity between strata. When colorectal cancer subsites where analyzed separately, the association with stachyose intake was stronger for colon (continuous OR = 0.74, 95% CI 0.66–0.83) than rectal cancer (continuous OR = 0.89, 95% CI 0.79–1.02).

Fig. 1
figure 1

Odds ratios (OR)a of colorectal cancer and corresponding 95% confidence intervals (CI) according to raffinose and stachyose intake in strata of age, sex, body mass index and colorectal cancer subsites. Italy, 1992–1996. aOR for an increase of intake equal to the difference between the upper cutpoints of the 4th (i.e., the 80th percentile) and the 1st quintiles (i.e., the 20th percentile), adjusted for age, sex, study center, education, body mass index (BMI), physical activity, smoking habits, alcohol intake, age at menopause and use of hormone replacement therapy, diabetes, regular aspirin use, family history of intestinal cancer, and total energy intake, unless the variable was the stratification factor

Discussion

Within the PrebiotiCa study, we determined the intake of six fiber fractions with prebiotic activity in subjects participating in an Italian case–control study by means of novel analytical food composition data and evaluated whether high dietary intakes of such prebiotic fibers influence the risk of colorectal cancer. Our results suggest an inverse association with dietary GOSs intake (particularly for colon cancer), but no association with total ITFs and prebiotic fibers of the FOSs family. Although the association with raffinose intake was attenuated after adjustment for total fiber, the association with stachyose intake remained virtually unchanged, suggesting that part of the protection afforded by fiber consumption on colorectal cancer may be through prebiotic effects. The observed variations in the strength of the GOSs-colorectal cancer association among subgroups of the population may be attributed to chance, in the absence of significant heterogeneity across strata.

In animal studies, ITFs showed anticarcinogenic properties [42]. Besides the extremely different setting, which limits any direct comparison with animal research, our study relies with low intakes of ITFs. Indeed, these compounds were quantified in a limited number of foods, seldom consumed and in low amounts. Whether a favorable effect of higher intakes of ITFs exists cannot, therefore, be excluded.

While a wealth of data is available on the role of fiber on colorectal cancer risk [20] and on its potential to modify the gut microbiota composition [15, 16], epidemiological studies did not evaluate the association with naturally occurring dietary prebiotics. Prebiotic fiber supplement use was not associated with a reduced risk of colorectal cancer in a cohort study of post-menopausal women [43]. Nutritional guidelines for cancer prevention by the WCRF and the AICR are not encouraging the use of dietary supplementation, but instead recommend that nutritional needs be met primarily through food consumption [44].

In laboratory analyses, GOSs were abundant in legumes. In particular, raffinose was abundant in dried peas and chickpeas (~ 0.46–0.5 g/100 g); stachyose in dried beans, peas, and chickpeas (~ 1.6–1.9 g/100 g) followed by lentils, fresh peas and fresh chickpeas (~ 0.27–0.41 g/100 g). While stachyose was detected in significant amounts in legumes only, other foods containing raffinose included whole meal flour (0.3 g/100 g) and selected wholegrain-based products (e.g., wholemeal biscuits, wholemeal pasta) (~ 0.2/100 g), as well as barley (0.22 g/100 g); raffinose was detected also in white wheat flour and wheat products, but in lower amounts (0.04 g/100 g). As for dietary FOSs, kestose was abundant in shallot, garlic, whole wheat pasta, wholemeal biscuits, banana and barley (range, ~ 0.54–0.15 g/100 g); low or undetectable amounts of kestose were found in legumes. Nystose and 1F-β-fructofuranosylnystose were detected in small concentrations in a few foods (e.g., shallot, garlic and barley) [36].

Accounting for amount consumed, the largest contributors of raffinose and stachyose in our population were cereal-based products and legumes (for stachyose, mainly legumes), followed by vegetables and fruit, which provided limited contributions; bananas were the most important source of ITFs, accounting alone for almost 60% of the intake; as for FOSs, kestose came mainly from cereals and fruit in a similar proportion. The few food products containing nystose and 1F-β-fructofuranosylnystose were consumed in small amounts and low daily intakes of these prebiotic fibers were estimated in our database, with limited variation across subjects.

Supporting our results, the intakes of wholegrain products and fiber from grains have been favorably associated with the risk of colorectal cancer in various studies [45,46,47]. An association with grain fiber intake was observed even in studies in which no association with total fiber was detected [48,49,50]. The effect of legume intakes on colorectal cancer intake was less clear. According to a meta-analysis published in 2015 and based on 14 cohort studies, higher legumes consumption was associated with a significant 10% reduced risk of colorectal cancer [51]. A 2018 meta-analysis based on a partially overlapping set of studies of the 2015 meta-analysis (for a total 14 cohort studies) found, however, no association with legumes for high versus low intake and no dose–response relation [47]. The intake of fiber from legumes was associated with a non-significant 16% decreased risk of colorectal cancer in a meta-analysis of 6 studies published in 2019 [45]. The intake of fruit and fruit fiber is not appreciably associated with colorectal cancer risk [35, 45, 50, 52].

Some limitations of the study are inherent in the study design. With reference to possible selection bias, our study was not population-based, but cases were identified in the major teaching and general hospitals of the area under surveillance, and the participation was almost complete. As for controls, these were from comparable catchment areas as cases and their participation rate was high and similar to that of cases (> 95%); further, we excluded patients admitted for chronic conditions or for diseases related to diet modifications or known risk factors for colorectal cancer. The similar interview setting of cases and controls provides reassurance against potential information bias. Recall bias and measurement error in dietary assessment using an FFQ is difficult to avoid in a case–control study. However, the FFQ gave satisfactory results when tested for reproducibility and validity of fiber intake and other food components [38, 40]. With regard to confounding, we were able to consider a number of possible confounding factors in the analysis. The intake of selected prebiotics, namely ITFs and raffinose, appeared associated with some risk factors for colorectal cancer (e.g., low physical activity, current smoking and alcohol intake); such potential confounders were taken into account in multiple adjusted models. Any residual confounding may have led, if anything, to underestimation of the association with raffinose intake or masking an association with ITFs. Allowance for total fiber intake may be considered an over-adjustment, and is provided as a sensitivity analysis. OR without allowance for total fiber give more valid estimates. Another limitation relates to the application of results from food content analyses conducted in 2021 to dietary intakes collected in the 1990s’. However, no prior data on the prebiotic content of Italian foods were available and those from other countries were limited and scattered across studies using different methodologies. Linking dietary data to food composition data collected at different time points, when contemporary data are not available, is a common approach in nutritional studies [53].

Accurate estimation of dietary prebiotic intake may be challenging due to the lack of published food composition data and heterogeneity in methodologies. Furthermore, the definition of ITFs is not universally agreed. For the purpose of our study, we quantified through laboratory analysis the amount of ITFs, nystose, kestose, 1F-β-fructofuranosylnystose, raffinose and stachyose in individual foods (e.g., flour, specific legumes, fruits and vegetables) as well as in recipes (e.g., bread, pasta, biscuits) and applied such determinations to the self-reported weekly frequency of consumption of foods, groups of foods, or recipes obtained from the FFQ. When a single item included different foods or recipes, these were assigned a relative proportion according to nationally representative data. The FFQ used for the present study was not specifically designed to measure the intake of prebiotic fibers and, although the main sources of prebiotics were addressed, it did not include items on certain foods rich in prebiotic fibers such as rye products, spelt, Jerusalem artichoke, breakfast cereal products, oats and soya beans [29,30,31,32,33,34], which, however, are infrequently consumed in Italy. In addition, the FFQ distinguished between wholegrain and non-wholegrain only for bread. In any case, possible misclassification of prebiotic intake should not be unbalanced between cases and controls.

Total ITFs was only quantified in 6 foods assessed in our FFQ, with garlic having by far the highest content. The FFQ collected information on usual garlic consumption in three categories: nonuse or low use, intermediate use, and high use. In our main analysis, we estimated subject’s garlic intake based on a standard amount of garlic in each recipe. When, in an additional sensitivity analysis, we weighted subject’s garlic intake based on the reported qualitative indicator of consumption (i.e., multiplying the intake by 0.2 for nonuse or low use and by 1.8 for high use), we obtained a slightly lower mean total ITFs intake, and similar OR for the association between ITFs intake and colorectal cancer (data not shown).

In conclusion, the present study suggests an inverse association between selected prebiotic fibers, i.e., those of the GOSs family, and colorectal cancer risk.