Background

Breast cancer is a heterogeneous disease [1]. According to the 2013 St Gallen Internation Expect Consensus, the expert panel defined and adopted four major breast cancer intrinsic subtypes determined by the expression of estrogen receptor (ER), progesterone receptor (PR), human epidermal growth factor receptor-2 (HER2), and Ki-67 [2]. These four breast cancer subtypes (hormone receptor-positive tumors: luminal A and luminal B; and hormone receptor negative tumors: HER2 overexpressing and triple negative) demonstrate different distributions of age of onset, prognosis, and ethnicity [3,4,5].

Epidemiologic studies have shown that established breast cancer risk factors may have differential effects on breast cancer subtypes, suggesting that these subtypes may derive from distinct etiologic pathways [6, 7]. For example, reproductive-related factors such as more parities and earlier age at first birth were associated with reduced risks of luminal breast cancer subtypes, but they may even increase the risk of triple negative breast cancer [8]. However, most of the existing evidences regarding etiologic heterogeneity by breast cancer subtypes are based on Western studies [9, 10]. A recent case-only analysis conducted in Malaysia showed different distributions of parity, breastfeeding, and obesity across different subtypes [11]. Another case-control study from China found that overweight and breastfeeding were associated with increased risk of all subtypes of breast cancer [12]; however, this type of analyses, especially involving appropriate controls and case subtypes, in Asian populations are limited.

During the past two decades, incidence rates of breast cancer have increasing rapidly in most Asian countries [13], which may be attributed to the adoption of a Westernized lifestyle such as increasing obesity and changing reproductive patterns [14]. The recent generations of Hong Kong women are experiencing epidemiological transitions on reproductive-related risk factors, and these may have influenced the distribution of sex-hormone receptor-specific breast cancer. Age at menarche among Chinese women has shifted to an earlier age [15], while the median age at first birth has delayed substantially from 25.1 years in 1981 to 31.6 years in 2017 [16]. The prevalence of obesity is still lower in Hong Kong compared with most Western countries, but it has been increasing over the years [17]. In contrast to the protective effect of obesity among premenopausal women seen in Western populations, studies have shown that obesity is associated with increased breast cancer risk regardless of menopausal status in Asian women [18]. The heterogeneity of breast cancer risk associated with obesity by tumor subtypes among Asian breast cancer cases remains unclear.

In this study, we evaluated the associations between established breast cancer risk factors and breast cancer subtypes in a breast cancer case-control study conducted in Hong Kong.

Methods

Study Population and Design

We conducted a hospital-based case-control study in three public hospitals in Hong Kong between November 2011 and January 2018. We defined cases as Chinese women aged 20–84 years old who were newly diagnosed with primary breast cancer (International Classification of Disease, Tenth Revision, code C50) within 3 months prior to the interview. Cases with prior-surgery treatment on breasts were excluded. Each control was matched to a case using frequency matching in a 5-year age group, and controls were recruited from the same hospital where the cases were identified. Controls had a broad spectrum of diagnoses that were unrelated to breast cancer conditions (i.e., diseases of the digestive system, diseases of the circulatory system, diseases of the genitourinary system, endocrine, nutritional and metabolic diseases, and diseases of the nervous system). We excluded cases and controls if they had prior history of physician-diagnosed cancer in any site. A total number of 1156 cases and 1013 controls were recruited in this study with a response rate of 88.4 and 89.5%, respectively. This study was approved by the Joint Chinese University of Hong Kong-New Territories East Cluster Clinical Research Ethics Committee and the Ethics Committee of Kowloon West Cluster, and all participants signed written informed consent forms before the fieldwork prior to the interviews.

Trained interviewers performed a face-to-face interview using standardized questionnaires to obtain information on socio-demographic characteristics, tobacco smoking, alcohol drinking and dietary habits, reproductive-related factors, anthropometric risk factors, history of chronic diseases, history of breast examination and surgery, physical activity, and occupational history including shift work. Risk factors focused in this analysis included body mass index (BMI) and reproductive-related factors including age at menarche, menopausal status, age at first birth, number of children (parity), and overall duration of breastfeeding. In addition, each participant was asked to report whether any of her first degree relatives had a history of cancer diagnosed by physicians.

We extracted clinical data on tumor characteristics for breast cancer cases (such as ER, PR, HER2, Ki67, and histological grade) from hospital medical records and pathology reports. Cases with ER or PR Allred score of 3/8 or above or H score of 51/300 or above were defined as positive status [19]. Cases with HER2 results of 0 and 1+ on immunohistochemistry (IHC) testing were defined as HER2-negative (HER2−) and results of 3+ were identified as HER2-positive (HER2+). IHC result of 2+ was defined as HER2− if subsequent fluorescence in situ hybridization (FISH) or chromogenic in situ hybridization (CISH) was negative. Given the majority (80%) of patients with known FISH/CISH results of HER2 2+ were HER−, we defined IHC result of 2+ with missing FISH or CISH data as HER2−.

We classified cases into four intrinsic subtypes according to the suggestion by the 2013 St Gallen International Expert Consensus [2], including luminal A [ER+ and PR+ and HER2− and (Ki-67 < 20% or histological grade I/II)], luminal B [“ER+ and HER2− and (PR− or Ki-67 > 20% or histological grade III)” or “ER+ and HER2+”], HER2 overexpressing (ER− and PR− and HER2+), and triple negative (ER− and PR− and HER2−).

Statistical Analysis

We used one-way ANOVA and chi-square test to compare differences in categorical and continuous variables among breast cancer subtypes. For case-only analysis, polytomous logistic regression model was used to estimate the adjusted odds ratios (AORs) and 95% confidence intervals (95%CIs) for associations between BMI, reproductive-related factors, and breast cancer subtypes. As the outcome endpoints for case-only analysis are not binary variable, we used a single multinomial model rather than a sequence of binary choices in the data analysis using the luminal A as the reference to look into whether different breast cancer subtypes shared similar risk factors of reproductive-related factors, with the mutual adjustment of factors including age, family history of breast cancer, age at menarche, age at first birth/parity, menopausal status, breastfeeding frequency, and BMI. We combined overweight (BMI 25.0–29.9) and obesity (BMI ≥ 30) categories due to the low frequency of obesity in this study population. For case-control analysis, we used unconditional multivariate logistic regression model to estimate the risk for each subtype compared to controls for BMI and reproductive-related factors with the adjustment of the same set of risk factors as mentioned above. We further conducted a stratified analysis according to menopausal status for parity and BMI since these factors have previously shown effect modification by menopausal status. Due to small sample size in the stratified analysis, and also considering the correlation of hormone status with menopausal status, parity, and BMI [20], we combined luminal A and luminal B into a luminal group (i.e., hormone receptor-positive tumors), and combined HER2 overexpressing and triple negative into a non-luminal group (i.e., hormone receptor-negative tumors). Sensitivity analysis was conducted by removing control patients with ovarian and related diseases (International Classification of Disease, Tenth Revision, code N00-N99), and endocrine, nutritional, and metabolic diseases (International Classification of Disease, Tenth Revision, code E00-E99) and diseases of the circadian system (International Classification of Disease, Tenth Revision, code I00-I99) from our analysis to evaluate the effect of ovarian diseases on reproductive-related factors, and effect of metabolic diseases and circadian systems disorders on BMI. All statistical analyses were performed using SPSS 24.0 for Windows (SPSS, Chicago, IL, USA), and a two-sided p value less than 0.05 was considered statistically significant.

Results

A total of 923 (80% from 1156) breast cancer cases with known status of ER, PR, HER2, and Ki67 or histological grade status were included in this report. The most prevalent subtype was luminal B (393, 43%), followed by luminal A (294, 31%), triple negative (137, 15%), and HER2 overexpressing (99, 11%). The distribution of selected risk factors of breast cancer among breast cancer subtypes are summarized in Table 1. HER2 overexpressing and triple negative breast cancer cases were more likely to be younger than luminal A and luminal B patients. Family history of breast cancer and other examined factors did not vary significantly by subtypes. Those with missing BMI value were relatively older, later age at menarche, and more likely to have missing value in other reproductive variables (Supplementary Table 1).

Table 1 Distribution of selected breast cancer risk factors among 955 breast cancer cases in Hong Konga

Table 2 shows the associations between examined risk factors and breast cancer subtypes in case-case comparisons. Compared with luminal A patients, luminal B (AOR = 1.76, 95% CI = 1.07–2.88), HER2 overexpressing (AOR = 3.40, 95% CI = 1.22–7.39), and triple negative (AOR = 2.39, 95% CI = 1.18–4.82) patients were more likely to be postmenopausal. Triple negative patients were more likely were younger at first child birth (AOR = 2.61, 95% CI = 1.18–4.82, aged < 26 years old compared to nulliparity) than luminal A patients. In general, breastfeeding was less common among HER2 overexpressing and triple negative breast cancer women compared with luminal A women, but the differences did not reach the statistical significance (Table 2).

Table 2 Case-case odds ratios (ORs) and 95% confidence intervals (95% CIs) for associations between selected breast cancer risk factors and subtypes among Chines breast cancer patients in Hong Kong a

Table 3 summarizes the results from case-control comparisons showing overall and subtype-specific breast cancer risks associated with selected breast cancer risk factors. Compared to controls, luminal A cases (AOR = 0.37, 95% CI = 0.23–0.58) were significantly less likely to be postmenopausal. Having multiple children and having children at younger ages were protective for luminal A (AOR = 0.62, 95% CI = 0.39–1.08, comparing ≥ 3 children to nulliparity; AOR = 0.61, 95% CI = 0.38–0.99, comparing < 26 years to nulliparity) and luminal B patients (AOR = 0.56, 95% CI = 0.35–0.90, comparing ≥ 3 children to nulliparity; AOR = 0.59, 95% CI = 0.39–0.88, comparing < 26 years to nulliparity), but not for HER2 overexpressing (AOR = 1.01, 95% CI = 0.43–2.40, comparing ≥ 3 children to nulliparity; AOR = 1.02, 95% CI = 0.47–2.22, comparing < 26 years to nulliparity) or triple negative patients (AOR = 1.38, 95% CI = 0.65–2.93, comparing ≥ 3 children to nulliparity; AOR = 1.60, 95% CI = 0.83–3.08, comparing < 26 years to nulliparity). Overweight/obesity was associated with increased risk for all subtypes except for HER2 overexpressing women (AOR = 1.38, 95% CI = 0.83–2.28). The association for breastfeeding was not significant for any subtype. Similar results were obtained in the sensitivity analysis removing 59 control patients with ovarian and related diseases from our model (Supplementary Table 2).

Table 3 Case-control odds ratios (ORs) and 95% confidence intervals (95% CIs) for overall and breast cancer subtypes specific risks among Hong Kong Chinese womena

We further conducted a stratified analysis by menopausal status for parity and BMI (Table 4). Parity was associated with a reduced risk in postmenopausal luminal patients (AOR = 0.36, 95% CI = 0.22–0.60, comparing ≥ 3 children to nulliparity) but an increased risk for premenopausal non-luminal women (AOR = 4.10, 95% CI = 1.49–11.28, comparing ≥ 3 children to nulliparity). Overweight or obesity was associated with increased risk for all subtypes except for premenopausal luminal patients.

Table 4 Case-control odds ratios (ORs) and 95% confidence intervals (95% CIs) for breast cancer subtypes specific risks associated with body mass index and parity among Hong Kong Chinese women, stratified by menopausal statusa

Discussion

This study is the first to demonstrate that there were notable variations in the effects of several reproductive-related factors (age of first birth, breastfeeding, parity) on specific breast cancer subtypes among Chinese women in Hong Kong. Consistent with findings from previous studies based primarily on Western populations, we found that parity and younger age at first childbirth were protective for luminal breast cancer only [21]. In contrast to what was seen in Western women, overweight or obesity was not associated with reduced breast cancer risk among premenopausal women, regardless of the subtype. Given the increased risk associated with obesity for all breast cancer subtypes among both pre- or post-menopausal women except for premenopausal luminal patients, our results highlight the importance of weight control in breast cancer prevention in Asian women.

Early age at first birth and parity are established reproductive-related protective factors for breast cancer. Consistent with previous studies [10, 22, 23], we found that the similar differential effect of parity and age at first birth by subtypes and menopausal status. Also in line with findings from previous studies especially conducted in African American women [24], we found that long-term breastfeeding was less prevalent in HER2 overexpressing and triple negative patients compared with luminal A patients and breastfeeding had a stronger protection for these two subtypes, although the differences did not reach the statistical significance probably due to small sample size.

Consistent with prior study from Southwest China, higher BMI was associated with increased breast cancer risk regardless of subtypes, which appears to be unique in Asian populations [12]. Although the association between high BMI and breast cancer risk is stronger for postmenopausal women, the similar strength of association was found for premenopausal women with non-luminal subtypes. However, a previous study using data from a series of breast cancer case-control studies conducted in the USA also found that breast cancer risk factor associations may vary by method of detection [25]. For example, they showed that the association between obesity and increased breast cancer risk among postmenopausal women was stronger for mammography-detected than for woman/clinician-detected breast cancer. Among premenopausal women, there was a positive association between obesity and risk of mammography-detected cancer but an inverse association between obesity and risk of woman/clinician-detected cancer [25]. There is no population-based screening program in Hong Kong and cancer in the majority of breast cancer cases in our study was woman/clinician-detected. Thus, the variation of detection methods is unlikely to have the significant influence on the heterogeneous associations by subtype we observed.

In this study, we found that luminal patients were more likely to be premenopausal compared with controls or other breast cancer subtypes. Several previous studies conducted in China found similar patterns [22, 26]. This association is likely driven by the birth cohort effect, represented by increasing exposures to Westernized lifestyle-associated risk factors among younger women and these risk factors are known to have stronger associations with luminal breast cancers. In contrast, older women who had experienced menopause were more likely to have the traditional lifestyles and therefore had lower risk for developing luminal breast cancer. Alternatively, Asian women may have distinct hormone-associated etiology, as recently suggested by Lin et al. in a study comparing age-specific probabilities of breast tumor subtypes between Asian and USA women using data from eight hospital-based cancer registries in East Asian countries and the Surveillance, Epidemiology, and End Results (SEER) Program [27].

Our study demonstrated varied associations between obesity, specific reproductive-related factors, and breast cancer subtypes using case-case and case-control analyses, which had been less investigated in Asian populations. Results of our study thus contributed new evidence to the current body of literature regarding etiologic heterogeneity by breast cancer subtypes in Asian populations. All participants in this study were recruited from three large public hospitals, with a high response rate (88.4%). Test-retest reliability was conducted among 158 cases and 153 controls at least 1 month after the initial interview, showing an overall Kappa consistency rate of 83% for the items included in the test-retest reliability study. However, several limitations should be noted. Inconsistent with previous study [26], the proportion of luminal B was higher than luminal A in this study. If we used the same classification, the proportion of luminal A (ER+ and/or PR+, HER2−) will be higher than luminal B (ER+ and/or PR+, HER2+). In our study, we classified our cases into different intrinsic subtypes according to the suggestion by the 2013 St Gallen International Expert Consensus Definition in order to make the classification more precise, in which another study using St Gallen classification have a similar proportion of luminal A and luminal B [11]. We classified HER2 IHC 2+ patients with missing FISH/CISH data as HER2 negative, which might have led to a misclassification in HER2 status. However, we confirmed that around 83% of HER2 IHC 2+ cases were HER2− in the database of one of our collaboration hospitals, suggesting that our major findings are unlikely to be driven by the misclassification. Similar to any other case-control studies, potential recall bias may not be totally avoided in our study, but we tried to minimize it by only recruiting recently diagnosed incident breast cancer patients who were introduced this study as a “general women health study.” We are aware that the missing data of BMI may be a concern, and thus we demonstrated the distribution of selected characteristics by the availability of BMI (Supplementary Table 1). We found that women with missing BMI tend to be older and have a higher missing rate in reproductive-related factors. Furthermore, the selection of hospital-based controls may not represent the general population. We only used age and recruitment hospitals as matching variables when recruiting control patients (for example, we did not collect their lipid profiles or insulin sensitivity for matching), which may have led to biased estimates. To address this question, we recruited controls from the same hospitals with a variety of disease spectrum that were not related to any type of breast diseases and exposures of interest. Previous hospital-based case-control studies provided evidence for a similar magnitude of risk estimates obtained from hospital-based and population-based controls especially for controls with a variety of disease types [28]. We also conducted sensitivity analysis by excluding controls patients of ovarian and related diseases, metabolic diseases, and circadian system diseases that were related to the exposure of interest, and the associations between reproductive-related factors and BMI and overall breast cancer remained unchanged after the restriction (Supplementary Table 2); we also performed sensitivity analysis by only excluding controls patients of ovarian and related diseases and circadian system diseases (i.e., including control patients with endocrine, nutritional, and metabolic diseases) and results remained consistent (Supplementary Table 2). In addition, relative risks for those established risk factors in our study were similar to what were previously reported in most other studies, suggesting that the bias of control selection may not pose a major impact on our results. Finally, we did not correct for multiple comparisons when testing multiple risk factors and subtypes of breast cancer. Nevertheless, the majority of statistical tests were based on prior hypotheses supported by previous studies.

Conclusions

In conclusion, this study demonstrated heterogeneous associations of established breast cancer risk factors with different subtypes of breast cancer in an Asian population. These results, which warrant future investigations in large population-based studies in additional Asian populations, may have important implications on breast cancer subtype-specific risk prediction and prevention.