Introduction

Sugar, a common component of diet, is known to cause dental caries1, and has been thought to have adverse health effects, particularly on obesity and type 2 diabetes (T2D) for decades2. Whether sugar causes cardiovascular disease (CVD) was heavily debated in the context of whether dietary sugar or fat contributed to the rise in coronary heart disease (CHD) in the mid to late twentieth century in many Western countries3,4. Despite some initial concern about the role of protein5, dietary guidelines since the 1980s have largely focused on promoting a low-fat, high-carbohydrate diet to lower blood cholesterol and prevent CVD5. However, with the upward trend in obesity and T2D6 the role of sugar is being reappraised. In 2015, the World Health Organization (WHO) recommended limiting free sugar intake to less than 10% of total energy intake7.

Observationally, systematic reviews and meta-analyses of prospective cohort studies suggest associations of sugar intake, mostly in the form of sugar sweetened beverages (SSB), with higher risks of CHD8, T2D9, weight gain10,11, hypertension12, and elevated triglycerides, low-density lipoprotein cholesterol (LDL-C) and blood glucose13. However, observational studies, largely from Western settings, may be biased by confounding by socioeconomic position and lifestyle14. In contrast, our previous findings from a birth cohort of Hong Kong Chinese children, where adiposity is not strongly confounded by socio-economic position15, suggested no association of SSB consumption with adiposity16. Randomized controlled trials (RCT) of the effect of sugar intake on CVD or T2D are lacking. Meta-analyses of RCTs on adiposity tend to show a positive effect of sugar or SSB consumption on weight gain in trials of ad libitum diets10,11, but not in trials of isoenergetic exchange of sugar with other carbohydrates10,17. Most of the trials were small and short-term10,11,17, with a meta-analysis possibly biased by publication bias and heterogeneity10. Evidence from RCTs on the effects of sugar intake on blood pressure, lipids, or glycaemic traits is limited and mixed. Some meta-analyses of RCTs have found sugar intake raised blood pressure and lipids18 but others showed no effect on blood pressure19,20 or lipids20,21 when sugars were substituted isoenergetically for other carbohydrates. Adding fructose-containing sugars to the diet appears to have a harmful effect on fasting insulin but not when substituting sugar for other macronutrients with differences by food sources22.

Observationally it is challenging to disentangle the role of a single dietary component, such as sugar, on CVD, while long-term RCTs have not been conducted. In this situation, Mendelian randomization provides an alternative means of examining causal effects by using genetic instruments predicting the exposures to identify effects of single items without confounding. Here, we used two-sample Mendelian randomization23 to estimate the effects of blood sucrose24, which may reflect sucrose intake because blood sucrose is mainly of dietary origin and metabolized very slowly25, on CHD and its risk factors, including T2D, adiposity, blood pressure, lipids, and glycaemic traits using the largest available genome-wide association studies (GWAS). We similarly used blood fructose (one of the two monosaccharides that form sucrose)26 as a validation exposure, which we expected to have directionally similar estimates as sucrose. We did not consider blood glucose as an exposure because blood glucose reflects a number of physiological processes as well as dietary intake of sugar and starch. As sugar is known to cause dental caries1, we used it as a positive control outcome.

Results

We obtained 9 and 2 uncorrelated (r2 < 0.001) single nucleotide polymorphisms (SNP) strongly (P < 5 × 10–6) predicting blood sucrose and fructose, respectively (Supplementary Table 1). Of the 9 SNPs for blood sucrose, only 2, 3 and 8 were available for lipids in the Global Lipids Genetics Consortium (GLGC), glycaemic traits, and blood pressure, respectively, while all were available for other outcomes. We used rs3821975 as a proxy for rs4693441 (r2 = 1.0) for lipids in GLGC and glycaemic traits, but the other SNPs had no proxy, giving 3, 4, and 8 SNPs for lipids in GLGC, glycaemic traits, and blood pressure, and 9 SNPs for all other outcomes. Both of the 2 SNPs for blood fructose were available for all outcomes. The 9 and 2 SNPs explained 6.2% and 0.6% of the variance in blood sucrose and fructose, respectively. The mean F-statistic was 15.7 and 23.6, respectively. None of the SNPs was related to key confounders in the UK Biobank or had known pleiotropic effects according to Ensembl or PhenoScanner.

Genetically higher blood sucrose was positively associated with dental caries, but this association did not remain after accounting for multiple testing (Table 1). Genetically higher blood sucrose was suggestively positively associated with decayed and filled tooth surfaces divided by tooth surfaces (DFSS) but not with decayed, missing and filled tooth surfaces (DMFS). A directionally similar estimate with a much wider confidence interval (CI) was found for blood fructose with dental caries (Supplementary Table 2). Genetically instrumented blood sucrose was not significantly associated with CHD but the direction was positive, as it was for fructose (Table 1 and Supplementary Table 2). Genetically instrumented blood sucrose or fructose were not associated with T2D (Table 1 and Supplementary Table 2), adiposity, blood pressure, lipids, or glycaemic traits (Table 2 and Supplementary Table 3) using any method, with estimates in an inverse direction for adiposity, blood pressure and lipids.

Table 1 Associations of genetically predicted blood sucrose (per log10 transformed median-normalized SD increase) with dental caries [from the UK Biobank (ncase = 2110, ncontrol = 359,084)], DFSS and DMFS [from the GLIDE consortium, n =  ~ 26,792], CHD [from a meta-analysis in the CARDIoGRAMplusC4D consortium (ncase =  ~ 76,014, ncontrol =  ~ 264,785)] and T2D [from a meta-analysis in the DIAGRAM Consortium (ncase = 74,124, ncontrol = 824,006)].
Table 2 Associations of genetically predicted blood sucrose (per log10 transformed median-normalized SD increase) with adiposity [BMI from a meta-analysis of the GIANT Consortium and the UK Biobank (n =  ~ 806,834), WC and BFP from the UK Biobank (n =  ~ 361,194)], blood pressure [from a meta-analysis of the UK Biobank and ICBP (n =  ~ 757,601)], lipids [from the UK Biobank (n =  ~ 361,194) and GLGC (n =  ~ 188,578)], and glycaemic traits [from the MAGIC Consortium (n =  ~ 46,186)].

Discussion

Our Mendelian randomization study suggests a positive association of genetically higher blood sucrose with dental caries, but this association did not withstand allowing for multiple testing. We found no significant association of genetically predicted sucrose with CHD, but a small effect of sucrose on CHD cannot be excluded. Genetically predicted blood sucrose was not associated with risk of T2D, adiposity, blood pressure, lipids, or glycaemic traits. No associations were found for blood fructose.

Our finding of no significant association of blood sucrose with risk of CHD is inconsistent with some meta-analyses of observational studies suggesting a positive association of SSB with CHD8 or myocardial infarction27 and an observational study suggesting a positive association of sucrose intake with coronary events28. However, the observed associations in Western populations may be confounded by socioeconomic position, diet and lifestyle marked by SSB consumption8,27. In contrast, a Japanese cohort of 39,786 participants followed up for 18 years suggested no association of soft drink consumption with CHD29. Nevertheless, our finding of a non-significant positive association might be owing to inadequate power. Further Mendelian Randomization studies with stronger instruments for sugar intake or biomarkers would be very helpful about the specific role of sugar. More broadly genetic evidence seems to be emerging suggesting that protein rather than fat or sugar is the key dietary element affecting health30.

No association was found of genetically predicted blood sucrose or fructose with T2D, which is consistent with a meta-analysis of observational studies of sugar or fructose intake31 but not with others suggesting positive associations of SSB with T2D9,32. However, the studies of SSB also found positive associations of artificially sweetened beverages with T2D9,32, implying that the associations might be due to confounding rather than to sugar itself. The null association with adiposity is concordant with our previous finding of SSB with adiposity in Hong Kong Chinese children16, but inconsistent with positive associations of sugar or SSB intake with weight gain in observational studies in Western populations10,11. The finding is more consistent with meta-analyses of RCTs showing no effect of substituting sugar for other carbohydrates on weight10,17,20. Sucrose or fructose intake, particularly in the form of SSB, may cause adiposity by adding to extra energy intake but inducing less satiety33 thus promoting positive energy balance. However, an experimental study in mice found no difference in adiposity by sucrose intake but did find differences by fat intake34, suggesting that fat rather than sucrose drives over-consumption. Sensitivity analyses suggested a possible inverse association of blood sucrose with BMI, which might be chance or because sucrose might be at the expense of a lower fat intake. The lack of associations with blood pressure or lipids is consistent with meta-analyses of trials substituting sugar for complex carbohydrates on systolic blood pressure (SBP) and diastolic blood pressure (DBP)20 and of trials substituting fructose for glucose on triglyceride35, but inconsistent with another showing positive effects of sugar intake on blood pressure and all lipids with or without energy control18. However, our MR findings differ from observed positive associations of SSB with hypertension12, triglycerides and LDL-C13. The lack of associations with glycaemic traits is consistent with a meta-analysis of RCTs showing no effect of sugar intake on fasting insulin when substituting sugar for other macronutrients or subtracting sugar from the diet22. Nevertheless, most of the trials had small sample sizes and short duration18,20,22,35, making the findings less certain.

Sugar may cause CVD through several mechanisms. First, sugar may promote CVD through its effect on weight gain. Observational studies have suggested a weakened association of SSB with CHD after adjusting for body mass index (BMI) and energy intake8. However, while sugar at high doses in hypercaloric trials did increase weight17, sugar at usual intake especially as part of an isocaloric diet showed no effect on weight gain10,17. Second, sugar may cause CVD through metabolic effects of fructose. Unlike glucose, fructose can bypass phosphofructokinase and be metabolised to substrates for lipid synthesis in unregulated glycolysis36, thus leading to increased de novo lipogenesis, dyslipidaemia, and insulin resistance after excess intake36. Chronic fructose intake may activate gene expression of lipogenic enzymes to further enhance lipogenesis36. Unregulated fructose metabolism can also raise uric acid concentration, which has been linked to endothelial dysfunction, hypertension, and increased risk of CVD33. It is also possible that sugar intake might affect glucagon or insulin which may cause CHD37,38. However, we did not find any evidence of these pathways, which need validation using stronger instruments predicting sugar.

Our study has some limitations. First, we used genetic variants predicting sucrose and fructose from metabolomics studies using mass spectrometry24,26. Blood sucrose and fructose may not be perfect biomarkers of sugar intake. However, blood sucrose is mainly of dietary origin and metabolized very slowly25, so blood sucrose reflects sucrose intake. High fructose corn syrup (42% or 55% fructose, glucose and water) is another commonly used sweetener other than sucrose, which may be captured by blood fructose. Many other factors, including intestinal permeability and liver metabolism39, may also influence blood levels of sucrose and fructose, but are unlikely to confound the genetic associations. Second, we used the genetic variants associated with blood sucrose and fructose at less than genome-wide significance (P < 5 × 10–6) and the variance of sucrose and fructose explained by the genetic variants were small (6.2% and 0.6%). Weak instrument bias could attenuate the causal estimates towards the null23. However, the SNPs had a mean F-statistic > 10. Weak instrument bias is expected to be small if the p-value is below 1 × 10–540. Third, confounding by population stratification is possible. However, the genetic associations concern people of European ancestry generated with genomic control. The GWAS of blood sucrose suggested no inflation or deflation due to population stratification24. The genetic variants were not related to key confounders in the UK Biobank. Fourth, genetic predictors of sucrose and fructose may have pleiotropic effects. However, we found no unknown pleiotropy from MR-Egger and MR-PRESSO and no known pleiotropy in Ensembl or PhennoScanner. All methods gave similar results. Fifth, our study was mainly limited to people of European ancestry and might not apply to other populations. However, causes would be expected to be consistent41, although might not always be relevant42. Sixth, the instruments predicting blood sucrose were obtained from primarily women, which may not be applicable to men. However, there is no evidence that genetic predisposition to blood sucrose or fructose differs by sex or has different effects by sex. The estimates for blood sucrose or fructose with dental caries, BMI and blood pressure in women were similar to those in both sexes. Seventh, we could not check whether the associations varied by age, or for non-linear effects. Eighth, canalization, i.e., compensatory mechanisms driving the association of genetic variants with sucrose or fructose, may result in different associations in Mendelian randomization than would be observed from interventions. However, whether that exists is unknown. Finally, although the estimates were close to the null, we cannot rule out a small effect for CHD, which should be investigated further. We only have 80% power to detect an OR of about 1.05 for sucrose on CHD. However, the estimates for adiposity, blood pressure and lipids were in the opposite direction to those expected.

Conclusions

Our study found some evidence of a detrimental effect of blood sucrose on dental caries. However, our study suggests no effect of blood sucrose on T2D, adiposity, blood pressure, lipids, or glycaemic traits, whilst it remains possible that there could be a small effect of sucrose on CHD risk.

Methods

Genetic instruments for blood sucrose and fructose

Genetic variants, i.e., SNP, strongly (P < 5 × 10–6) predicting serum sucrose (log10 transformed median-normalized values) were extracted from an existing GWAS of serum metabolites adjusted for sex and age, obtained from non-targeted metabolomics using four independent ultra-high-performance liquid chromatography-tandem mass spectrometry24, in 1439 individuals (median age 58 years, 96.6% women) of European ancestry24. These associations were not adjusted for principal components because there was no evidence of inflation or deflation due to population stratification24. SNPs strongly (P < 5 × 10–6) predicting blood fructose (log10 transformed normalized) were extracted from an existing GWAS of blood metabolites adjusted for age, sex, batch and genomic control26, obtained using liquid-phase chromatography, gas chromatography separation and tandem mass spectrometry26 in 7824 individuals (mean age 55 years, 83.5% women) of European ancestry26. SNPs were discarded if they were non-biallelic or had a minor allele frequency less than 0.01.

Genetic associations with dental caries, CHD, and T2D

Genetic associations with dental caries (ncase = 2110, ncontrol = 359,084) were extracted from genetic summary statistics from the UK Biobank in ~ 361,194 white British individuals (mean age 57.5 years)43 provided by Neale Lab (http://www.nealelab.is/uk-biobank/). Dental caries was defined as hospital inpatient admission due to International Classification of Diseases (ICD) 10 K02. These genetic associations were obtained from linear regresstion adjusted for age, age2, sex, interactions of sex with age and age2, and 20 principal components. The linear regression estimates were transformed to log odds ratios (OR) using an approximation44. Genetic associations with DFSS (inverse normal rank transformed) and DMFS (standard deviation of residuals) were taken from summary statistics from the Gene-Lifestyle Interactions in Dental Endpoints (GLIDE) consortium in up to 26,792 individuals of European ancestry adjusted for age, sex, and study-specific covariates45. Genetic associations with CHD were taken from summary statistics of the Coronary Artery DIsease Genome wide Replication and Meta-analysis plus Coronary Artery Disease (CARDIoGRAMplusC4D) consortium from a meta-analysis (ncase =  ~ 76,014, ncontrol =  ~ 264,785) of CARDIoGRAMplusC4D 1000 Genomes, the Myocardial Infarction Genetics and CARDIoGRAM Exome, the UK Biobank SOFT CAD GWAS, and two small case–control studies in individuals of mainly European ancestry (77% in CARDIoGRAMplusC4D 1000 Genomes and 94% in the UK Biobank SOFT CAD GWAS)46 with genetic control. CHD cases included fatal or nonfatal myocardial infarction, percutaneous transluminal coronary angioplasty, coronary artery bypass grafting, chronic ischemic heart disease and angina determined from clinical diagnosis, medical records and self-report of medication usage. Controls were those who were not cases. Genetic associations with T2D were obtained from summary statistics from the DIAbetes Genetics Replication And Meta-analysis (DIAGRAM) consortium from a meta-analysis of case (n = 74,124)-control (n = 824,006) studies47 in individuals of European ancestry adjusted for study-specific covariates and population structure but not BMI.

Genetic associations with adiposity, blood pressure, lipids, and glycaemic traits

Genetic associatons with inverse-normally transformed BMI were obtained from summary statitsics from a meta-analysis of the Genetic Investigation of ANthropometric Traits (GIANT) Consortium and the UK Biobank in up to 806,834 individuals of European ancestry adjusted for age, age2, assessment centre, and sex48. Genetic associations with inverse-rank normalized waist circumference (WC), body fat percentage (BFP), high-density lipoprotein cholesterol (HDL-C), LDL-C, and triglyceride were obtained from UK Biobank summary statitsics43 provided by Neale Lab (http://www.nealelab.is/uk-biobank/) in ~ 361,194 white British individuals adjusted for age, age2, and 20 principal components, sex and interactions of sex with age and age2. Genetic associations with inverse-normally transformed HDL-C, LDL-C, and triglyceride were also taken from GLGC in ~ 188,578 individuals of European ancestry adjusted for age, age2, and sex49. Genetic associations with SBP and DBP were obtained from a meta-analysis of the UK Biobank and the International Consortium of Blood Pressure-Genome Wide Association Studies (ICBP) in ~ 757,601 individuals of European ancestry adjusted for sex, age, age2, BMI, and genotyping chips50. Genetic associations with glycaemic traits, including indices of β-cell function (HOMA-B) and insulin resistance (HOMA-IR), and fasting insulin (all naturally log transformed), were taken from a meta-analysis of GWAS including the UK Biobank in up to 46,186 individuals of European ancestry adjusted for age, sex and study specific covariates with genomic control51 provided by the Meta-Analyses of Glucose and Insulin-related traits Consortium (MAGIC).

Statistical analysis

We obtained uncorrelated SNPs (r2 ≤ 0.001) using the “ld_clump” function of “ieugwasr” R package. For SNPs not available for an outcome, we used proxies (r2 ≥ 0.8) identified from LDlink (https://analysistools.nci.nih.gov/LDlink/). We aligned genetic variants for exposure and outcome on effect allele as none were palindromic. We estimated the F-statistic for each SNP using an approximation52, where a F-statistic of 10 or above indicated a lower risk of weak instrument bias. We checked whether any of the SNPs were associated with key confounders (i.e., Townsend deprivation index, education, smoking, alcohol drinking, and moderate physical activity) in the UK Biobank at Bonferroni corrected significance (0.05 divided by the number of SNPs times the number of confounders) (http://www.nealelab.is/uk-biobank). We checked for known pleiotropic effects using two curated genotype-to-phenotype cross-references, Ensembl 99 (http://www.ensembl.org/index.html, accessed April 15, 2020), and PhenoScanner (http://www.phenoscanner.medschl.cam.ac.uk, accessed April 15, 2020).

We obtained Mendelian randomization estimates by meta-analysing SNP-specific Wald estimates (ratio of SNP on outcome to SNP on exposure)23 using inverse variance weighting (IVW) with multiplicative random effects which allows for heterogeneity but assumes balanced pleiotropy. We also used a weighted median which is valid as long as < 50% of the weight are contributed by invalid SNPs53, and MR-Egger which is valid as long as the instrument strength independent of direct effect assumption (i.e., the size of the pleiotropic effects is independent of their strength as instruments) holds52. MR-Egger provides a check on the validity of the IVW estimate, with a significant p-value (< 0.05) for the intercept indicating that the IVW estimate may be invalid52. We also used the Mendelian randomization pleiotropy residual sum and outlier (MR-PRESSO) test which can correct for potential horizontal pleiotropy (i.e., the variant has an effect on disease outside of its effect on the exposure)54.

We estimated the expected effect size of the association of blood sucrose with CHD for 80% power. At an R2 (the proportion of variance in blood sucrose explained by the genetic variants) of 6.2%, we have 80% power to detect an OR of 1.05 for sucrose on CHD. Here, we are testing a hypothesis, the effect on CHD with related secondary outcomes (CHD risk factors) and a control outcome (dental caries), for comprehensiveness we considered a p-value of 0.05 and a Bonferroni-corrected p-value of 0.05/7 = 0.007 (given 7 outcomes including CHD, 5 CHD risk factors and dental caries) as significant. All statistical analyses were performed using R version 3.6.3 (The R Foundation for Statistical Computing, Vienna, Austria)55 and the “MendelianRandomization” and “MRPRESSO” R packages. This study requires no ethical approval because only publicly available summary data was used.