Introduction

C-reactive protein (CRP), which is primarily synthesized and secreted in the liver, is present in atherosclerotic lesions but not in normal blood vessel walls. This observation suggests that CRP may play a role in atherogenesis. Thus, the circulating high-sensitivity CRP (hsCRP) concentration is a powerful biomarker that can be used to predict age-related cardioembolic risk.1 Previous studies reported that high CRP concentrations in plasma (>3 mg l−1) were associated with increased incidence and mortality because of cardiovascular disease (CVD) and metabolic syndromes in elderly populations.2, 3 Lifestyle-related risk factors, such as cigarette smoking and obesity, that modify the autonomic and inflammatory responses in the vascular system have been shown to be associated with both elevated CRP and the risk of CVD in general populations.4

Population-based studies have previously reported statistically significant associations between CRP levels and single-nucleotide polymorphism (SNP) markers located in the following genes: interleukin 6 receptor (IL6R, 1q21); C-reactive protein (CRP, 1q23); glucokinase (hexokinase 4) regulator (GCKR, 2p23); interleukin 6 (IL6, 7p21); cytochrome P450, family 17, subfamily A, polypeptide 1 gene (CYP17A1, 10q24.3); hepatocyte nuclear factor 1 homeobox A (HNF1A, 12q24.2); apolipoprotein C-1 (APOC1, 19q13.2); and apolipoprotein E (APOE, 19q13.2).5, 6, 7, 8, 9, 10, 11 Liu et al.12 found that the CYP17A1 gene variant (rs1004467) was associated with both plasma hsCRP level and risk of hypertension in Chinese patients.

Dehghan et al.13 identified 18 loci (for example, CRP, LEPR, GCKR, PABPC4, BCL7B, APOC1, HNF1A and so on) associated with CRP levels in a meta-analysis combining the results of 15 European studies. However, only the HNF1A gene variant has consistently achieved genome-wide significance in four subsequent genome-wide association studies (GWASs) composed of 10 112 Japanese patients obtained from Biobank Japan (IL6, HNF1A),14 1709 unrelated Filipino women (CRP, HNF1A),15 8722 Koreans (HNF1A)16 and 6692 Asians living in Singapore (CRP, HNF1A, APOE), respectively.17 Among the previous studies published on CRP, eight publications have reported genetic variants associated with CRP concentrations in Asian populations.6, 10, 11, 12, 14, 15, 16, 17 As far as we know, no meta-analysis of Asian studies has been conducted evaluating the effect of genetic variants on CRP concentrations.

Initially, we evaluated the effects of 12 SNPs, located in or near 7 candidate genes (that is, IL6R, CRP, GCKR, IL6, CYP17A1, HNF1A and APOE), and one intergenic locus (12q23.2), on plasma hsCRP concentration in a community-based cohort study composed of 748 Koreans. Subsequently, we performed a meta-analysis comprising 9 studies involving a total of 36 752 Asians to validate the effects of these genetic variants on hsCRP levels. Finally, we developed weighted genetic risk score (wGRS) models that combined the effects of the risk variants to predict high-risk CRP levels as well as increased risks of hypertension and CVD in Koreans.

Methods

Genetic association study in a Korean cohort

Study population and data collection

The population used for the genetic association study consisted of 748 elderly participants with an average age of 68 years. These participants are part of the Hallym Aging Study (HAS), a community-based cohort in Chuncheon city in the Republic of Korea. In 2004, we collected disease status, self-reported demographic and lifestyle data as well as clinical data after informed consent was obtained from each participant at the Hallym University Chuncheon Sacred Heart Hospital. Both systolic blood pressure and diastolic blood pressure were calculated as the averages of three measurements in sitting position. We measured the plasma hsCRP concentrations using a latex-agglutination turbidimetric assay (Pureauto S CRP latex, Daiichi Pure Chemicals, Tokyo, Japan). The plasma concentrations of total cholesterol and high-density lipoprotein cholesterol (HDL-C) were measured by an automatic analyzer (Hitachi 7600-210, Hitachi Medical, Hitachi, Japan). The total cholesterol to HDL-C ratio (T/HDL) was calculated by dividing total cholesterol by HDL-C. The study protocols have been approved by the Institutional Review Board of Hallym University (HIRB-2007-001). Detailed information on the HAS is available elsewhere.18

Genotype data analysis

The candidate SNPs for CRP levels were selected by reviewing the literature and public databases, such as dbSNP (www.ncbi.nlm.nih.gov/projects/SNP/), HuGE Navigator (http://hugenavigator.net/HuGENavigator/home.do)19 and SNPinfo (http://snpinfo.niehs.nih.gov/cgi-bin/snpinfo/snpfunc.cgi). We selected two to five tag SNPs located between 2 kb 5′-upstream and 3′-downstream for each of six candidate genes (that is, CRP, IL6R, GCKR, CYP17A1, HNF1A and APOE) and two additional intergenic loci: rs2097677, located in the 33.9 kb 5′-upstream of the IL6 gene (7p21), and rs10778213, located within a gene desert region (12q23.2). A total of 29 SNPs were genotyped for the 775 Koreans with data on plasma hsCRP levels using the GoldenGate Assay (Illumina, San Diego, CA, USA). Two individuals and four SNPs with genotyping call rates of <95% were excluded from subsequent analyses. In addition, we excluded 25 outliers with hsCRP levels >10 mg l−1 that indicates an acute inflammatory disease. We performed preliminary tests to evaluate the Hardy–Weinberg equilibrium, minor allele frequency and pairwise linkage disequilibrium (LD) using Haploview v.4.2.20

Because the hsCRP concentrations of the 748 individuals were not normally distributed, the values were transformed into a natural logarithmic scale (ln hsCRP) for subsequent analyses. We performed single marker analyses to test for associations between each of 25 SNPs and ln hsCRP under an additive model, after adjustments for the six covariates, age, sex, history of hypertension, smoking, body mass index and T/HDL using the STATA software package, v.11.2 (Stata, College Station, TX, USA). Further details can be found elsewhere.21

Meta-analysis of Asian populations

Systematic review of studies

We systematically searched for all published studies of CRP levels in humans, including GWASs, using the Human Genome Epidemiology (HuGE) Literature Finder, a comprehensive search engine of the HuGE Navigator for publications of genetic association studies that is continuously updated from PubMed.19 We collected a total of 352 articles, including 12 GWASs, using the keyword ‘C-reactive protein’ and using study category filtering (that is, ‘genetic association’ OR ‘gene-gene interaction’ OR ‘gene-environment interaction’) and gene filtering for seven candidate genes (that is, ‘CRP’ OR ‘IL6R’ OR ‘GCKR’ OR ‘IL6’ OR ‘CYP17A1’ OR ‘HNF1A’ OR ‘APOE’). We excluded animal studies, case–control studies and studies in non-Asian populations including Indo-Europeans. We also excluded studies that did not provide information on their estimated β-coefficient (β) or genotype quality (that is, genotyping success rate, minor allele frequency or Hardy–Weinberg equilibrium). A total of 8 publications consisting 11 study populations were selected from the literature review (Supplementary Figure S1). The GWAS and the replication of a Japanese study14 and three different GWASs by Dorajoo et al.17 were analyzed separately in the subsequent analysis. Table 1 provides detailed information regarding the nine Asian studies, including the current study (that is, HAS), analyzed in the current meta-analysis.

Table 1 Characteristics of eligible studies included in the Asian meta-analysis on CRP levels

SNP selection

We initially selected 51 SNP markers from 7 candidate genes and 1 intergenic SNP, rs10778213, in this review. We evaluated a pairwise LD of these SNPs using the LD TAG SNP Selection (TagSNP) of the SNPinfo Web Server that interfaces with dbSNP (http://snpinfo.niehs.nih.gov/snpinfo/snptag.htm) and excluded SNPs in complete LD with each other (r2=1) because they provide redundant information. We analyzed a total of 12 SNPs that were studied in at least 2 of 11 studies included in the current meta-analysis.

Statistical analysis

We combined the genotype data for 748 Koreans and 36 004 Asians. Using the genome-wide association meta-analysis (GWAMA) software package developed by Magi and Morris22 (http://www.well.ox.ac.uk/gwama/), we estimated the combined βs and P-values under both fixed- and random-effect inverse variance models with an assumption of an additive genetic model for the 12 SNPs. We performed the Cochran Q test using the I2 statistic to evaluate heterogeneity across studies. For each SNP, we summarized the β with 95% confidence interval for each study and for the combined study in a forest plot. We generated funnel plots for detecting publication bias and heterogeneity across studies. To correct for multiple testing of the 12 SNP markers, we applied a significance threshold of P<0.004 based on a Bonferroni correction.

Functional analysis in silico

A total of nine SNP markers were evaluated to predict their biological functions, such as nonsynonymous SNPs, stop codons, transcription factor binding sites, splicing regulators and miRNA binding sites, in Asian populations using the online SNP Function Prediction (FuncPred) tool (http://snpinfo.niehs.nih.gov/snpinfo/snpfunc.htm). We further evaluated the functional impact of five nonsynonymous SNPs (that is, rs2228145, rs1260326, rs1169288, rs2464196 and rs429358) using the web-based program PolyPhen-2 (http://genetics.bwh.harvard.edu/pph2).

Genetic risk assessments for high hsCRP level, hypertension and CVD in Koreans

We calculated wGRS by multiplying the number of risk alleles (0, 1 and 2) for each SNP by the exponentiated β coefficent for that SNP. Although only three SNPs remained statistically significant after Bonferroni correction in the HAS, eight SNPs achieved the Bonferroni-adjusted significance level in the current meta-analysis, and the rs1004467 (P=0.017) is a well-known risk variant for hypertension in Asian populations.12 Thus, we summed the wGRS across nine SNPs (that is, rs2228145, rs7553007, rs1260326, rs2097677, rs1004467, rs1169288, rs7310409, rs2464196 and rs429358), categorized it into five groups and evaluated association of wGRS with hsCRP levels using a generalized linear model adjusted for the six covariates listed above. We further compared the wGRS under the assumption of an additive genetic model (that is, additive-wGRS) with the wGRS that combined the effect of each SNP estimated under the best-fitting genetic model (additive, dominant or recessive) (that is, model-specific wGRS) for all three phenotypes (Table 3 and Supplementary Table S4).

To assess genetic risks of hypertension and CVD, we created wGRS models composed of four CRP-associated SNPs (that is, rs2228145, rs2097677, rs1004467 and rs7310409) that were also associated with either hypertension or CVD. Hypertensive cases included 455 participants with systolic blood pressure/diastolic blood pressure of ≥140/90 mm Hg or having a history of hypertension. Cases of CVD were defined as 110 participants who had a history of CHD or stroke. We selected a common control group of 101 subjects with systolic blood pressure <120 mm Hg, diastolic blood pressure <80 mm Hg and no history of hypertension and CVD. We summed the wGRS across four SNPs and categorized the wGRS into three groups. We performed multiple logistic regression analyses between the wGRS and either hypertension or CVD, after adjusting for age, sex, smoking, drinking and body mass. Furthermore, we compared this model-specific wGRS with the wGRS after adjustment for the covariates specific to hsCRP (that is, age, sex, smoking, body mass index and T/HDL).

Results

Our meta-analysis consisted of 36 752 individuals obtained from 11 Asian study populations and a Korean cohort. The two Japanese studies by Okada et al.14 consisted of a GWAS and a replication study obtained from the BioBank Japan Project. The two Filipino studies were obtained from the same cohort, the Cebu Longitudinal Health and Nutrition Survey (CLHNS).11, 15 Dorajoo et al.17 performed three independent GWASs of Chinese, Malay and Asian-Indian populations. Table 1 summarizes the population characteristics and study designs of the 12 studies. We found significant difference across studies; age, gender ratio and other covariate adjustment in each study. The mean age of subjects in most of the studies was >45 years, except for the two Filipino studies (mean age was 21.5 and 38.2 years, respectively).6, 11 The gender ratio was 29 to 79% women, with Wu et al.15 including only women. Nine studies, including the Korean cohort, recruited subjects from the general population;10, 11, 12, 15, 16, 17 however, three studies consisted of patients only.6, 14 Eight studies, including the Korean cohort, excluded outliers with CRP levels >10 mg l−1 before analysis;6, 11, 15, 16, 17 however, no exclusion criteria for CRP level were observed in the other four studies.10, 12, 14 All studies log-transformed the CRP levels because of the highly skewed distribution before analysis. Most studies adjusted for age, sex and additional covariates in their statistical analyses, although the identity of these covariates was not identical between studies.

We summarized the results of the previous association studies for 12 SNPs on CRP levels using an additive model (Supplementary Table S1). The results of the generalized linear model for 12 SNPs, after adjustments for 6 covariates, using an additive model in 748 Koreans are presented in Table 2. Seven SNPs exhibited significant associations with plasma CRP levels, and three of these SNPs, rs2228145 (IL6R), rs7310409 (HNF1A) and rs429358 (APOE), remained significant following a Bonferroni correction (P<0.004). The Asian meta-analysis, including the Korean population, identified 9 out of 12 SNPs that were significantly associated with CRP levels under the additive model. Eight SNPs in the meta-analysis were significant following a Bonferroni correction. Two SNPs, rs7310409 (HNF1A, 12q24.2) and rs429358 (APOE, 19q13.2), exhibited the lowest P-value (β=0.06, P=3.4 × 10−23) and the highest effect size (β=0.27, P=7.1 × 10−8), respectively. Six SNPs revealed statistically significant heterogeneity (P<0.05). Therefore, we performed a subgroup analysis and these SNPs demonstrated strong evidence for an association with CRP levels in the homogenous subgroup analysis (that is, rs2228145, P=6.7 × 10−8; rs7553007, P=3.4 × 10−17; rs2097677, P=2.5 × 10−11; rs1169288, P=3.9 × 10−4; rs7310409, P=3.4 × 10−23; rs2464196, P=3.6 × 10−12). Two SNPs, rs2097677 (IL6) and rs1004467 (CYP17A1), that were insignificant in the Korean study became statistically significant in the meta-analysis, but not rs1004467, after Bonferroni correction. We presented both forest plots and funnel plots of the rs7310409 (Figures 1a and 1b) and the other eight SNPs (Supplementary Figures S2–S9). Genetic heterogeneity and/or publication bias was suspected in the two Filipino studies (Figure 1b).11, 15

Table 2 Associations of 12 SNPs with CRP levels in the HAS (748 Koreans) and in the Asian meta-analysis
Figure 1
figure 1

Forest (a) and funnel (b) plots for rs7310409 (HNF1A) in the Asian meta-analysis using an inverse variance model. The first author and the Korean cohort (Hallym Aging Study (HAS)) is represented in (a) and (b). Okada 1 and Okada 2 by Okada et al.14 show the results of a genome-wide association study (GWAS) and a replication study, respectively. Dorajoo 1–3 by Dorajoo et al.17 show the results of 2179 Chinese, 2275 Malay and 2238 Asian-Indian GWASs, respectively. The horizontal line indicates the 95% confidence interval (CI); the shaded square indicates the weight of each study. The estimates of effect size from the fixed-effect (black diamond) or random-effect (gray diamond) inverse variance models are shown in (a). The divergent lines are the pseudo 95% CIs and the central line indicates the pooled β in (b).

We identified possible biological functions for six out of the nine SNP markers implicated by the Asian meta-analysis as being associated with CRP levels. The rs2228145 (Asp358Ala) SNP, which is located in exon 9 of the IL6R gene, is an alternative exonic splicing enhancer that regulates the expression of IL6R. The rs1260326 (Leu446Pro, GCKR) was predicted to be potentially damaging to the base sequence. Three other nonsynonymous SNPs, rs1169288 (Ile27Leu, HNF1A), rs2464196 (Ser487Asn, HNF1A) and rs429358 (Cys130Arg, APOE), result in amino acid substitutions. An intronic SNP rs1004467 (CYP17A1) is located at a microRNA-binding site (Supplementary Table S2).

We tested for the combined effect of the nine SNPs (Figure 2a) using a wGRS model with adjustments for six covariates on hsCRP level in 748 Koreans (Figure 2b). Initially, we evaluated the combined effect of seven SNPs that showed significant associations in the Korean study; however, the mean hsCRP concentration of the highest additive-wGRS group calculated using seven SNPs (11.01–14.85) was lower (1.77 mg l−1) than that of the highest additive-wGRS group calculated using the nine SNPs suggested in the Asian meta-analysis (15.01–17.94, 2.55 mg l−1). As expected, the hsCRP concentration linearly increased with increasing wGRS in Koreans. The group of subjects with the highest additive-wGRS had an 2.5-fold higher mean CRP level compared with the subjects with the lowest additive-wGRS (3.39–6.00, 1.03 mg l−1; Figure 2). The additive-wGRS model appears to fit the CRP data better than the model-specific wGRS (Table 3 and Supplementary Table S3).

Figure 2
figure 2

Mean C-reactive protein (CRP) concentrations according to the number of risk allele (0, 1 and 2) for nine single-nucleotide polymorphisms (SNPs) (a) and the weighted genetic risk score (wGRS) for the combined effects of those nine SNPs (b). Bars indicate the number of individuals by risk allele score (left vertical axis). Dots connected by a solid line indicate the mean CRP levels according to each risk allele score and a short dash represents the standard error of the mean CRP levels (right vertical axis). In (b), the X axis is categorized by the range of wGRSs: 3.39–6.00, 6.01–9.00, 9.01–12.00, 12.01–15.00 and 15.01–17.94.

Table 3 Associations of model-specific weighted genetic risk score groups with hsCRP level, hypertension and CVD in 748 Koreans

Three SNPs, rs2228145 (IL6R), rs2097677 (IL6) and rs7310409 (HNF1A), out of 12 SNPs were associated with CVD, whereas two SNPs, rs1004467 and rs11191548 (CYP17A1), were associated with hypertension at P<0.05 (Supplementary Table S3 and Figure 3a). The inclusion of SNPs not being associated with the disease weakened the predictability of wGRS model. The wGRS composed of 4 SNPs associated with either hypertension or CVD have higher predictability than the wGRS of 9 SNPs selected for CRP (that is, 57.9% vs. 56.1% for hypertension; 65.7% vs. 65.1% for CVD, data not shown). Thus, we showed the combined effects of four SNPs in the wGRS models for two diseases (Figure 3a). The model-specific wGRS fits the data better than the additive-wGRS model for both hypertension and CVD (Table 3 and Supplementary Table S4). Subjects in the highest wGRS group (10.01–12.52) had a 2.1-fold and a 9.6-fold increased risks of hypertension (P=0.008) and CVD (P=3.2 × 10−6), respectively, compared with those in the lowest wGRS group (1.45–8.00; Figure 3b and Table 3). Associations between the wGRS models and either hypertension or CVD were significantly determined by different sets of covariates; and the risk of disease significantly increased along with the increase in both wGRS models (Table 3 and Supplementary Table S5).

Figure 3
figure 3

Proportion of cases with hypertension or coronary heart disease (CVD) according to the number of risk allele (0, 1 and 2) for four single-nucleotide polymorphisms (SNPs) (a) and the weighted genetic risk score (wGRS) for the combined effects of those four SNPs (b). Bars indicate the number of individuals by risk allele score (left vertical axis). Dots connected by a solid line indicate the proportion of cases according to each risk allele score (right vertical axis). In (b), the X axis is categorized by the range of wGRS: 1.45–8.00, 8.01–10.00 and 10.01–12.52. Gray bars and white bars indicate the percentage of case subjects and control subjects in the range of wGRS. Dots connected by a solid line indicate odds ratios (ORs) by wGRS groups.

Discussion

We evaluated the effects of 12 SNPs of 7 candidate genes and 1 intergenic locus previously indicated as being associated with CRP levels in a large-scale meta-analysis of 36 752 Asians. We showed that the alleles of SNPs from five genes, IL6R, CRP, GCKR, HNF1A and APOE, increased the CRP levels in both Asian and European populations, even though the minor allele was not always identical between ethnicities.6, 7, 8, 9, 10, 11, 13, 14, 15, 16, 17 A previous European meta-analysis on CRP levels did not report two genetic variants, CYP17A1 (rs1004467) and IL6 (rs2097677).13 Although the minor C allele of rs10778213, which is located within a gene desert region (12q23.2), was strongly associated with low CRP levels in 6345 white women from the Women’s Genome Health Study (P=1.2 × 10−10), it was not associated with CRP levels in Asian populations.7, 11, 15, 16, 17 Individuals with high wGRS appeared to be at considerably higher risk than those with any one risk allele. The combined effect of four SNPs, rs2228145 (IL6R), rs2097677 (IL6), rs7310409 (HNF1A) and the rs1004467 (CYP17A1), highly increased the risk of both hypertension and CVD in the current study.

Both IL6R and CRP gene polymorphisms were shown to be strongly associated with CRP levels through a pathway associated with the innate immune response.13, 23 The 358Ala allele of rs2228145 (in strong LD with rs4129267 and rs4537545) located in the IL6R gene was significantly associated with reduced CRP levels and reduced risk of CHD in two recent studies;24, 25 the protective effect was consistently observed in the current study (odds ratio=0.32, P=0.01). The SNP rs1205 (in complete LD with rs7553007), located within the 3′ flanking sequence of the CRP gene, was associated with regulating CRP expression. However, none of the CRP gene variants showed a consistent association with CHD risk in both the European study24 and the current Korean study. The rs1004467 (CYP17A1), which exhibited a suggestive association with increased CRP levels in our meta-analysis (P=0.017), was associated with blood pressure and hypertension in 3210 unrelated elderly Chinese Hans.12 To date, there is no published report on a potential relationship between the gene variants of IL6 and HNF1A and CVD. Although the IL6 gene variant (rs2097677) has not been identified in either European GWAS7, 8 or meta-analysis,13 a Japanese study suggested its association with CRP levels.14 Common variations of GCKR, HNF1A and APOE genes are related to abnormal metabolism and metabolic pathway regulation.7 An activating glucokinase mutation regulated by the GCKR gene causes maturity onset diabetes of the young type 2 (MODY2) and leads to persistent hyperinsulinemic hypoglycemia. In addition, the Leu446 allele of the GCKR coding variant (rs1260326) is associated with increased triglycerides and CRP levels, but lower fasting glucose, in the general population.26 The HNF1A gene consists of 10 exons spanning an 23-kb region and encodes a transcription factor regulating the expressions of liver-related genes, including CRP gene that is associated with CRP level, lipid profiles, glucose metabolism and inflammation response. This gene variant therefore plays a crucial role in controlling CRP level and developing CHD.27 Rare mutations of HNF1A polymorphisms cause MODY3 and are associated with the metabolic pathway for insulin secretion.28 APOE polymorphisms are associated with cholesterol release from blood vessels and the anti-inflammatory system.29 The rs429358 (Cys130Arg) variant of the APOE gene is also known to increase the risk of Alzheimer disease in multiethnic groups.30

We observed considerable differences not only in age, gender ratio and covariate adjustment, but also in the allele frequencies of SNPs among studies of five different Asian populations: Japanese, Korean, Chinese, Malay, Asian-Indian, Filipino and Taiwanese. However, the associations of the six SNPs; rs2228145, rs7553007, rs2097677, rs1169288, rs7310409, and rs2464196 that revealed the presence of a substantial heterogeneity, remained significant in homogenous subgroup analyses.

In conclusion, we highlight the major functions of the genetic variants that are crucial to immune response and abnormal metabolic pathways related to chronic inflammation. These findings warrant further investigation to validate the effects of SNPs associated with elevated CRP levels on the increased risk for developing CVD in a large-scale Asian population-based study. If the wGRS models developed in the current study are validated, they can serve as proxy measures for the increased genetic risk of hypertension and atherosclerotic CVD.