Variable expression quantitative trait loci analysis of breast cancer risk variants

Wiggins, George A. R.; Black, Michael A.; Dunbier, Anita; Merriman, Tony R.; Pearson, John F.; Walker, Logan C.

doi:10.1038/s41598-021-86690-5

Article
Open access
Published: 30 March 2021

George A. R. Wiggins¹,
Michael A. Black²,
Anita Dunbier²,
Tony R. Merriman²,
John F. Pearson^1,3^na1 &
…
Logan C. Walker¹^na1

Scientific Reports volume 11, Article number: 7192 (2021) Cite this article

Abstract

Genome wide association studies (GWAS) have identified more than 180 variants associated with breast cancer risk; however, the underlying functional mechanisms and biological pathways which confer disease susceptibility remain largely unknown. As gene expression traits are under genetic regulation, we hypothesise that differences in gene expression variability may identify causal breast cancer susceptibility genes. We performed variable expression quantitative trait loci (veQTL) analysis using tissue-specific expression data from the Genotype-Tissue Expression (GTEx) Common Fund Project. veQTL analysis identified 70 associations (p < 5 × 10^–8) involving 60 genes and 27 breast cancer risk variants, including 55 veQTL that were observed in breast tissue only. Pathway analysis of genes associated with breast-specific veQTL revealed an enrichment of four genes (CYP11B1, CYP17A1, HSD3B2 and STAR) involved in the C21-steroidal biosynthesis pathway that converts cholesterol to breast-related hormones (e.g. oestrogen). Each of these four genes was significantly more variable in individuals homozygous (A/A) for the rs11075995 breast cancer risk allele located in the FTO gene, which encodes an RNA demethylase. The A/A genotype was also found to be associated with reduced expression of FTO, suggesting that an epi-transcriptomic mechanism may underlie the dysregulation of genes involved in hormonal biosynthesis, leading to an increased risk of breast cancer. These findings provide evidence that genetic variants govern high levels of expression variance in breast tissue, thus building a more comprehensive insight into the underlying biology of breast cancer risk loci.

Introduction

Genome wide association studies (GWAS) in breast cancer have identified more than 180 common risk variants^1,2,3; however, the causal genes and biological mechanisms which confer disease susceptibility remain largely unknown. Risk variants are often located in non-coding regions, making it difficult to determine pathogenic pathways. Approximately 700 potential gene targets of breast cancer risk variants have been identified using analytical methods that employ genomic data from chromatin interactions, enhancer–promoter correlations, transcription factor binding, topologically associated domains and gene expression^1,3.

Gene expression traits are under genetic regulation, and the heritability of expression differences between genotypes has been extensively described⁴. For example, identification of expression quantitative trait loci (eQTL) has been a key approach for investigating tissue-specific effects of breast cancer risk variants under the hypothesis that non-breast tissue may be involved in breast cancer risk⁵. Gene expression patterns are often explored assuming genetic control of mean expression level; however, the variability of gene expression is also genetically controlled^6,7,8,9. Just as differences in expression means have been associated with genotype, so too can differences in expression variability be associated with genotype.

Gene expression variability has been described in a wide range of organisms including prokaryotes¹⁰, yeast^6,7 and complex multicellular organisms^11,12,13. Furthermore, gene expression variability has been shown to be important in early human development¹⁴, schizophrenia¹⁵ and cancer subtypes^12,13. The effects of genetic variation on gene expression variability have recently been described in human-derived lymphoblastoid cell lines from HapMap individuals⁸ and in the TwinsUK cohort^16,17.

Breast cancer risk variants associated with eQTL, based on mean gene expression, have been investigated in both breast tissue (tumour and normal) and non-breast tissue^5,18,19. However, the mechanisms underlying breast cancer risk for the majority of variants remain to be uncovered. Here, we demonstrate variable expression quantitative trait loci (veQTL) analysis as a method for testing the association of variants with gene expression variability. We performed veQTL analysis on 181 variants that have previously been associated with breast cancer risk and identified 60 new candidate genes and pathways associated with 27 breast cancer risk variants.

Methods

Data acquisition and processing

Genotype and expression data were acquired through the database of Genotypes and Phenotypes (dbGaP) and the Genotype-Tissue Expression (GTEx) Common Fund Project (release version phs000424.v7.p2) under the project title “Identification of variable expression quantitative trait loci that are associated with cancer risk”. Datasets from breast, ovarian, lung and kidney tissue used in this study were obtained under dbGaP approval number 17463.

Genotype data from 635 individuals acquired through GTEx were converted to chromosome-specific matrices, where the genotypes were coded by minor allele count. For tissue-specific analysis, only genotypes from individuals with expression data in a given tissue (e.g. breast, kidney, ovary and lung) were retained. Variants were filtered so that only bi-allelic variants with two or more genotype groups (AA, Aa, aa), each observed in at least 10 subjects, were retained.
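The genotype filter described above can be sketched in a few lines of Python. This is a minimal illustration rather than the authors' R code: the function names are hypothetical, genotypes are assumed to be coded as minor allele counts (0, 1, 2) with `None` for missing calls, and the reading that each retained genotype group needs at least 10 subjects is an assumption based on the wording of the Methods.

```python
from collections import Counter

MIN_GROUP_SIZE = 10  # threshold stated in the Methods


def usable_genotype_groups(minor_allele_counts, min_group_size=MIN_GROUP_SIZE):
    """Return the genotype codes carried by at least `min_group_size` subjects."""
    counts = Counter(g for g in minor_allele_counts if g is not None)
    return sorted(code for code, n in counts.items() if n >= min_group_size)


def passes_filter(minor_allele_counts, min_group_size=MIN_GROUP_SIZE):
    """Retain a variant when two or more genotype groups are large enough (assumed rule)."""
    return len(usable_genotype_groups(minor_allele_counts, min_group_size)) >= 2


# Toy example: 40 major homozygotes, 15 heterozygotes, 3 minor homozygotes.
toy_genotypes = [0] * 40 + [1] * 15 + [2] * 3
print(usable_genotype_groups(toy_genotypes))  # [0, 1]
print(passes_filter(toy_genotypes))           # True
```

In this toy case the variant is retained, but only the two larger genotype groups would enter a downstream test.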

Normalised Reads Per Kilobase of transcript, per Million mapped reads (RPKM) values for 56,203 unique Ensembl (https://www.ensembl.org/) gene IDs were split into tissue-specific datasets. For each dataset, only transcripts with RPKM > 0.1 in at least 10 samples were retained. Where a subject had multiple samples from the same tissue, these were collapsed by averaging the RPKM values. Linear regression models with age and sex as covariates were used to correct the expression data.
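As a rough sketch of the two expression pre-processing steps above (the low-expression filter and the covariate correction), the following Python uses only the standard library. The function names are hypothetical, sex is assumed to be coded 0/1, and the ordinary least squares fit via the normal equations is one simple way to obtain residuals; it is not taken from the authors' pipeline.

```python
def keep_transcript(rpkm_values, min_rpkm=0.1, min_samples=10):
    """Keep a transcript when RPKM exceeds `min_rpkm` in at least `min_samples` samples."""
    return sum(v > min_rpkm for v in rpkm_values) >= min_samples


def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (m[i][n] - sum(m[i][j] * x[j] for j in range(i + 1, n))) / m[i][i]
    return x


def covariate_residuals(expression, age, sex):
    """Residuals of expression ~ intercept + age + sex, fitted by least squares."""
    design = [[1.0, float(a), float(s)] for a, s in zip(age, sex)]
    xtx = [[sum(row[i] * row[j] for row in design) for j in range(3)] for i in range(3)]
    xty = [sum(row[i] * y for row, y in zip(design, expression)) for i in range(3)]
    beta = solve_linear_system(xtx, xty)
    return [y - sum(b * x for b, x in zip(beta, row)) for row, y in zip(design, expression)]
```

The residuals returned by `covariate_residuals` play the role of the age- and sex-corrected expression values that are passed on to the variance test.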

veQTL and eQTL analysis

Tissue-specific veQTL were mapped for breast cancer risk variants that passed the filtering criteria (Supplementary Table S1). For each gene, veQTL were mapped by testing for equal variance among individuals of different genotypes using the Brown–Forsythe method²⁰. Compared to other analysis of variance methods, the Brown–Forsythe method is more robust to non-normality in terms of type I error^21,22. A custom R script (https://github.com/jfpuoc/veQTL) was used to calculate Brown–Forsythe test statistics (W, Eq. 1) for each variant against all transcripts. For a response variable y observed in k genotype groups, each value is transformed to its absolute deviation from the group median, Z_ij = |y_ij – ỹ_i.|, where y_ij is the value for sample j in group i and ỹ_i. is the median of group i; W is then defined by:

$$ W = \frac{(N - k)\sum_{i = 1}^{k} N_{i}(Z_{i.} - Z_{..})^{2}}{(k - 1)\sum_{i = 1}^{k}\sum_{j = 1}^{N_{i}}(Z_{ij} - Z_{i.})^{2}} $$

(1)

where N is the number of samples, k is the number of different genotypes (2 or 3), N_i is the number of samples in group i, Z_i. is the mean of the absolute deviations from the median for group i and Z_.. is the mean of the absolute deviations of all samples from their respective group medians. The resulting W statistic follows the F-distribution²⁰ with degrees of freedom df1 = k – 1 and df2 = N – k.
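To make the statistic concrete, here is a small standard-library Python sketch that evaluates W for expression values grouped by genotype. It is an illustration of the formula as defined above, not the authors' R script; the function name and the toy numbers are made up for the example.

```python
from statistics import mean, median


def brown_forsythe_w(groups):
    """Brown-Forsythe W for a list of groups (one list of values per genotype)."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    # Z_ij: absolute deviation of each value from its own group median
    z = [[abs(y - median(g)) for y in g] for g in groups]
    z_group_means = [mean(zi) for zi in z]                  # Z_i.
    z_grand_mean = sum(sum(zi) for zi in z) / n_total       # Z_..
    between = sum(len(zi) * (zm - z_grand_mean) ** 2
                  for zi, zm in zip(z, z_group_means))
    within = sum((zij - zm) ** 2
                 for zi, zm in zip(z, z_group_means) for zij in zi)
    return (n_total - k) * between / ((k - 1) * within)


# Toy data: the second group is twice as spread out as the first.
tight = [1, 2, 3, 4, 5]
wide = [1, 3, 5, 7, 9]
print(round(brown_forsythe_w([tight, wide]), 4))  # 2.0571
```

For the toy data the between-group term is 3.6 and the within-group term is 14.0, so W = 8 × 3.6 / 14 ≈ 2.06. A p value would then be taken from the upper tail of the F-distribution with k – 1 and N – k degrees of freedom; in Python, `scipy.stats.levene` with `center='median'` performs the same test and could be used as a cross-check if SciPy is available. The sketch does not guard against a zero within-group term, which would occur if every deviation in every group were identical.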

veQTL analysis was performed using the residuals of the linear model correcting for age and sex, and the genotypes that met the filtering criteria. In instances where two genotypes were each observed in at least 10 samples and the third genotype was observed in fewer than 10 samples, the test statistic was computed only between the groups with at least 10 samples.

Tissue-specific eQTL analysis was performed in the same four tissue datasets used for veQTL. The ultra-rapid MatrixEQTL package in R was used to calculate p values for variant-gene pairs using a linear regression model and correcting for age and sex as covariates²³.

We limited proposed breast cancer susceptibility genes to those for which: (i) gene expression variability was significantly (p < 5.0 × 10^–8) associated with a breast cancer risk variant, (ii) the significant veQTL association was observed only in breast tissue and (iii) the gene was associated only with a change in expression variability (i.e. veQTL) and not with a change in mean expression (i.e. eQTL).
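The three criteria amount to a set difference over variant-gene pairs, sketched below in Python. The data layout (dictionaries mapping a `(variant, gene)` tuple to a p value), the function name and the placeholder identifiers are hypothetical, and applying criteria (ii) and (iii) at the level of the same variant-gene pair is an assumption about how the filter was implemented.

```python
GENOME_WIDE_P = 5e-8  # significance threshold stated in the Methods


def candidate_pairs(breast_veqtl, other_tissue_veqtl, breast_eqtl, threshold=GENOME_WIDE_P):
    """Return breast-specific, variability-only variant-gene pairs.

    breast_veqtl, breast_eqtl: {(variant, gene): p value}
    other_tissue_veqtl: {tissue name: {(variant, gene): p value}}
    """
    def significant(p_values):
        return {pair for pair, p in p_values.items() if p < threshold}

    breast_hits = significant(breast_veqtl)                       # criterion (i)
    seen_elsewhere = set().union(
        *(significant(p) for p in other_tissue_veqtl.values()))   # criterion (ii)
    mean_shift = significant(breast_eqtl)                         # criterion (iii)
    return breast_hits - seen_elsewhere - mean_shift


# Placeholder identifiers and p values, for illustration only.
breast = {("rsA", "GENE_1"): 1e-9, ("rsA", "GENE_2"): 1e-10,
          ("rsB", "GENE_3"): 1e-12, ("rsC", "GENE_4"): 0.01}
others = {"lung": {("rsA", "GENE_2"): 1e-9}, "ovary": {}}
eqtl = {("rsB", "GENE_3"): 1e-20}
print(candidate_pairs(breast, others, eqtl))  # {('rsA', 'GENE_1')}
```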

Pathway enrichment analysis

Genes identified with altered expression by either veQTL or eQTL analysis were annotated using their Entrez identifiers. Pathway analysis was performed using the R packages clusterProfiler and DOSE^24,25. Each candidate gene list was compared to the background transcriptome for over-representation of genes in pathways annotated by GO terms.
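Over-representation analysis of this kind is usually based on a hypergeometric tail probability. The sketch below shows that calculation in plain Python for a single pathway; it is a simplified stand-in for what clusterProfiler does (it omits multiple-testing correction and gene-set size filters), and the function name and toy counts are hypothetical.

```python
from math import comb


def enrichment_p(n_background, n_in_pathway, n_candidates, n_overlap):
    """P(overlap >= n_overlap) when n_candidates genes are drawn at random
    from n_background genes, of which n_in_pathway belong to the pathway."""
    upper = min(n_in_pathway, n_candidates)
    total = comb(n_background, n_candidates)
    tail = sum(comb(n_in_pathway, x) * comb(n_background - n_in_pathway, n_candidates - x)
               for x in range(n_overlap, upper + 1))
    return tail / total


# Toy counts: 10 background genes, 4 in the pathway, 3 candidates, 2 of them in the pathway.
print(round(enrichment_p(10, 4, 3, 2), 4))  # 0.3333
```

In the toy case the tail is (6 × 6 + 4 × 1) / 120 = 1/3. With a background of tens of thousands of transcripts and a short candidate list, even a handful of shared genes in a small pathway gives a very small p value.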

Results

Identification of veQTLs and eQTLs

The GTEx dataset comprises 635 genotyped samples, of which tissue samples from normal breast (n = 255), lung (n = 387), kidney (n = 41) and ovary (n = 123) were used. A major proportion of breast cancer risk variants are predicted to alter expression of cancer susceptibility gene(s) in breast tissue. To identify veQTL that specifically increase risk in breast tissue, even if the genes in the veQTL are expressed in multiple tissues, we only considered veQTL that were uniquely identified in breast tissue (i.e. breast-specific veQTL). This assumption would, however, miss breast cancer susceptibility genes whose expression variability is also altered in other tissues, where it is tolerated, but not in breast.

RNA-sequencing and genotype data were split into tissue-specific datasets and filtered to remove low frequency genotypes and genes with low expression. After pre-processing 33,059, 29,522, 25,026 and 35,137 transcripts were retained for the breast, ovary, kidney and lung, respectively.

Large genome-wide association studies (GWAS) have identified variants associated with breast cancer risk or subtype-specific breast cancer risk. In total we identified 181 breast cancer risk variants in the literature (Supplementary Table S1), of which 152, 148, 106 and 152 were retained after filtering out non-biallelic variants and genotypes with few minor alleles (see Methods) for the breast, ovary, kidney and lung datasets, respectively (Fig. 1, Supplementary Table S1).

We tested for associations between breast cancer risk variants and gene expression variability, correcting for sex and age, in four tissues. These analyses identified significant (p < 5 × 10^–8) veQTL associations with breast cancer risk variants in the breast (70), ovary (9) and lung (109) (Table 1, Supplementary Tables S2–S4). No significant associations were observed in the kidney analysis. By comparison, the numbers of observed eQTL in breast (155), ovary (19) and lung (123) were greater; similarly, there were no significant kidney eQTL. The majority of veQTL and eQTL associations were trans and acted over distances greater than 1 Mb or between chromosomes. Only 2/70, 5/109 and 2/9 significant associations were cis-veQTL (± 1 Mb) in the breast, lung and ovary, respectively. A greater proportion of eQTL than veQTL were observed in cis, with approximately 5% of veQTL and 13% of eQTL acting in cis (Table 1).
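A minimal Python sketch of the cis/trans split follows. The function name and coordinates are hypothetical, and measuring the 1 Mb window from the gene boundaries (rather than, say, the transcription start site) is an assumption, since the text only states the ± 1 Mb threshold. The last lines pool the cis-veQTL counts quoted above to show where the figure of approximately 5% comes from.

```python
CIS_WINDOW_BP = 1_000_000  # +/- 1 Mb, as stated in the text


def classify_pair(variant_chrom, variant_pos, gene_chrom, gene_start, gene_end,
                  window=CIS_WINDOW_BP):
    """Label a variant-gene pair 'cis' if the variant lies within `window` bp
    of the gene on the same chromosome, otherwise 'trans'."""
    if variant_chrom != gene_chrom:
        return "trans"
    if gene_start - window <= variant_pos <= gene_end + window:
        return "cis"
    return "trans"


# Made-up coordinates for illustration.
print(classify_pair("16", 5_000_000, "16", 5_400_000, 5_450_000))  # cis
print(classify_pair("16", 5_000_000, "8", 5_400_000, 5_450_000))   # trans

# Pooled cis-veQTL proportion from the counts reported above: (cis, total) per tissue.
cis_counts = {"breast": (2, 70), "lung": (5, 109), "ovary": (2, 9)}
pooled = (sum(c for c, _ in cis_counts.values())
          / sum(n for _, n in cis_counts.values()))
print(round(100 * pooled, 1))  # 4.8
```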

Table 1 Significant veQTL and eQTL breast cancer variants and associated genes for each tissue.

Classes of veQTLs

By assessing expression values associated with each genotype across the four different tissues, we observed three classes of veQTL (Fig. 2). Class I resembled a homozygous recessive phenotype, where the presence of two minor alleles was associated with altered gene expression variability. Class II showed a dominant phenotype where the dosage of the minor allele correlated with the change in expression variability. Class III resembled a heterozygous phenotype where the presence of two different alleles altered gene expression variability. Significant breast veQTL were largely Class I homozygous recessive (56%; Fig. 2), while the majority (9/11) of Class II veQTL were also eQTL. In total, 21 veQTL (30%) were also eQTL. Seven breast cancer risk variants that had significant veQTL could not be classified because no sample was homozygous for the minor allele. However, for all seven variants, gene expression variability was greater in heterozygous samples, thus ruling out a Class I veQTL.

Comparison of veQTL and eQTL

To estimate biases in dataset-specific veQTL analysis, quantile–quantile plots (q–q plots) were generated and genomic inflation factors estimated for each tissue (Fig. 3a). No substantial genomic inflation (λ < 1.1) was observed for the veQTL analysis in the breast, lung or ovary (λ ranged from 1.00 to 1.05). However, a larger genomic inflation factor of 1.15 was observed for kidney tissue, implying a small underlying bias in the analysis (Fig. 3a).
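The text does not say how λ was computed. One common convention, assumed here, converts each p value to the corresponding 1-degree-of-freedom chi-square statistic and divides the observed median by the expected median (about 0.455). The sketch below implements that convention with the Python standard library; the function name is hypothetical.

```python
from statistics import NormalDist, median


def genomic_inflation(p_values):
    """Genomic inflation factor: median observed chi-square (1 df) statistic
    divided by its expected median under the null. Requires 0 < p <= 1."""
    nd = NormalDist()
    # A two-sided p value maps to a 1-df chi-square statistic as z(p / 2) ** 2.
    chi2 = [nd.inv_cdf(p / 2) ** 2 for p in p_values]
    expected_median = nd.inv_cdf(0.25) ** 2  # roughly 0.455
    return median(chi2) / expected_median


# Evenly spaced p values mimic the null: lambda is close to 1.
null_like = [(i + 0.5) / 1000 for i in range(1000)]
print(round(genomic_inflation(null_like), 2))  # 1.0
```

Values noticeably above 1, such as the 1.15 reported for kidney, indicate that the bulk of the test statistics are larger than expected under the null rather than just the extreme tail.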

Tissue-specific p value distributions were similar between veQTL and eQTL analyses (Fig. 3b). Three tissues (breast, lung and ovary) displayed an anti-conservative distribution with a greater number of p values tending towards zero. For the larger lung and breast datasets, there was a greater number of p values near zero compared to ovary tissue, suggesting a greater number of tests that reject the null hypothesis of no difference in expression variability between groups. Examination of the kidney dataset demonstrated a uniform distribution of p values, highlighting the limited effect of the selected variants in both veQTL and eQTL analysis. Variant-gene pairs were ranked according to eQTL significance, and the rank correlation of p values between eQTL and veQTL analysis was calculated for each tissue-specific dataset. Correlations ranged from 0.052 in the kidney to 0.183 in the lung, suggesting that the variant-gene ranks differ between veQTL and eQTL analysis and that veQTL analysis identified a novel set of genes associated with risk variants (Fig. 3c).
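The comparison of rankings can be illustrated with a rank correlation of paired p values. The text does not name the coefficient used; the sketch below assumes Spearman's correlation with average ranks for ties, and the function names and toy p values are hypothetical.

```python
def average_ranks(values):
    """Rank values from smallest to largest, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        rank = (i + j) / 2 + 1  # average of the 1-based positions i+1 .. j+1
        for idx in order[i:j + 1]:
            ranks[idx] = rank
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the two rank vectors."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = sum(rx) / len(rx), sum(ry) / len(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5


# Toy p values for the same four variant-gene pairs in two analyses.
eqtl_p = [0.01, 0.20, 0.50, 0.90]
veqtl_p = [0.80, 0.02, 0.40, 0.30]
print(round(spearman(eqtl_p, veqtl_p), 2))  # -0.4
```

A coefficient near zero, as reported for all four tissues, means that knowing a pair's eQTL rank says little about its veQTL rank.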

Identification of potential target genes of breast cancer risk variants

The majority of breast cancer variants have no known associations with other traits; however, 25 variants have previously been associated with a phenotype other than breast cancer risk (www.gwascentral.org, Supplementary Table S5). Two variants (rs11571833 and rs17879961) have previously been associated with lung cancer, while rs10069690 and rs74911261 have been associated with ovarian and kidney cancers, respectively. Interestingly, none of these variants was significantly associated with differential variability of any gene in the tissue matching its reported cancer. However, rs10069690 did have significant associations with differential variability in gene expression in both the lung and breast analyses. As the majority of the variants only show evidence for breast cancer risk, we eliminated any veQTL that was observed in a non-breast tissue (Fig. 4). Fifty-five of the 70 significant breast veQTL were observed in breast tissue only. Pathway enrichment analysis of the candidate genes associated with these breast-specific veQTL revealed enrichment of hormonal biosynthetic process and collagen fibril organisation pathways (Fig. 4). The enrichment of the hormonal pathways listed in Fig. 4 was driven by four genes (CYP11B1, CYP17A1, HSD3B2 and STAR), all of which were associated with the risk variant rs11075995. By comparison, the 88 veQTL that were significant in lung tissue were not significantly enriched for any pathway (data not shown).

rs11075995 alters expression of genes involved in C21 steroid synthesis

The minor allele (A) of rs11075995, which is associated with ER negative breast cancer risk, was found to be associated with increased variability in expression of four genes by veQTL analysis (Fig. 5). To connect the veQTL signals with the breast cancer risk association, we compared the GWAS signals generated by Michailidou et al.³ in the largest meta-analysis of breast cancer risk to date with the veQTL signals generated using the GTEx data. Regional plots at the rs11075995 locus for ER negative breast cancer risk associations or trans-veQTL with candidate genes were visually examined to determine likely causal variants (Fig. 5). Two signals associated with ER negative breast cancer risk were identified, one of which was the lead variant rs11075995 (Fig. 5a). The same variant (rs11075995) produced the strongest signal for variable expression of all four candidate genes involved in the C21-steroidal pathway (Fig. 5b).

The candidate genes (CYP11B1, CYP17A1, HSD3B2 and STAR) associated with rs11075995 are all involved in the conversion of cholesterol to hormones via the C21 steroidal biosynthesis pathway (Fig. 6). STAR is involved in the transport of free cholesterol into the mitochondria, where it is converted to pregnenolone. The remaining three candidate genes all code for enzymes that catalyse the conversion of multiple molecules and act in several pathways which produce different hormones (Fig. 6).

The rs11075995 SNP is located in the second intron of the FTO gene (Fig. 7), which encodes an Fe²⁺/2-oxoglutarate-dependent oxidative RNA demethylase important in the demethylation of RNA methyladenosine (m6A)²⁶. Variants in this locus are associated with increased body mass index (BMI), and the mechanism of action has been linked to expression changes of the neighbouring gene IRX3 in the human brain, in particular the hypothalamus^27,28. Furthermore, there is conflicting evidence for the association of rs11075995 with breast cancer risk. Recent studies identified a loss of breast cancer risk association after adjusting for BMI²⁹. However, Garcia-Closas and colleagues tested the association with ER negative breast cancer risk after adjusting for BMI and observed no change³⁰. We therefore explored the effects of rs11075995 on the expression of both FTO and IRX3 in breast tissue. Neither FTO nor IRX3 had significant breast eQTL or veQTL associations with rs11075995. However, FTO (p = 0.05), and not IRX3 (p = 0.29), had decreased expression in the homozygous minor allele individuals in breast tissue (Fig. 7b, Supplementary Fig. S1).

Ethics approval and consent to participate

This research was approved by the University of Otago Ethics Committee.

Discussion

Tissue-specific veQTL datasets were generated for breast cancer variants in four normal tissue datasets acquired from GTEx. To predict candidate genes involved in breast cancer risk, significant (p < 5 × 10^–8) veQTL unique to breast tissue were considered. This approach identified 60 candidate genes that were associated with 27 variants. The majority of significant veQTL were class I and displayed a homozygous recessive-like phenotype (Fig. 2). Furthermore, veQTL analysis identified distinctly different genes compared to eQTL analysis (Fig. 3). However, 30% of breast veQTL were also eQTL, including most class II veQTL, highlighting a small subset of genes that had changes in both mean expression and variability associated with minor allele dosage.

Pathway analysis of the 60 candidate genes found several hormonal biosynthetic pathways enriched, along with monocyte chemotaxis and collagen fibril organisation (Fig. 4). The enrichment of the hormonal biosynthetic pathways was driven by the presence of four genes (CYP11B1, CYP17A1, HSD3B2 and STAR), all of which were variable in association with the risk allele of rs11075995. Furthermore, rs11075995 produced the strongest signal for variable expression of all four candidate genes and was the most likely causal variant (Fig. 5).

Breast cancer development has been associated with exposure to steroid hormones³¹. These hormones are typically synthesised in non-breast tissues (e.g. ovary and adrenal gland) and are secreted into the circulation to act on distant tissues (e.g. breast). The activation of local hormone biosynthesis, associated with the risk allele of rs11075995, through the metabolism of cholesterol to pregnenolone may lead to greater exposure and/or hormone imbalance in breast tissues, which may drive tumourigenesis. Local steroidogenesis, and ultimately production of androgens, has been observed in androgen-independent advanced prostate cancers³². In prostate cancer, the local production of androgens may explain the development of resistance to hormonal treatment in late-stage disease.

Summary statistics of GWAS signals obtained through GWAS central (www.gwascentral.org) identified significant associations of rs11075995 with overall and ER negative breast cancer risk and with body mass index (Supplementary Table S5). No other trait was reported to be associated at p < 0.001 with rs11075995. BMI is a known dose-dependent risk factor for developing breast cancer in post-menopausal women³³. Interestingly, breast cancer risk association studies that have adjusted for BMI have demonstrated a dependence for variants at the rs11075995 locus on BMI status²⁹. However, an independent relationship was described for ER negative breast cancer risk and BMI for rs11075995³⁰, suggesting that variants in the same locus may have disease-specific risk profiles.

The variant rs11075995 is located in intron 2 of the FTO gene. Interestingly, we observed a marginally significant decrease in FTO expression (p = 0.05) in breast tissue of individuals homozygous for the rs11075995 risk allele. FTO is involved in demethylation of RNA adenosine (m6A). Methylated adenosines are post-transcriptional modifications that signal RNAs for processing, including degradation and splicing³⁴. The four genes associated with rs11075995 all have the m6A target site (GGACU). RNA variability may occur due to dysregulation of these pathways (mRNA degradation and splicing) in response to decreased FTO expression.

Variants in introns 1 and 2 of FTO have been strongly associated with obesity and changes in BMI^27,35; however, these variants act on the expression of the neighbouring gene IRX3 in the hypothalamus region of the brain²⁸. Iroquois homeobox protein 3 (IRX3) is a highly conserved transcription factor typically expressed during neural development³⁶. The role of IRX3 in obesity is yet to be fully elucidated, with conflicting reports of the effect of Irx3 deficiency on body mass. Smemo et al. described a 30% reduction in body weight of Irx3-deficient mice²⁸. In contrast, the partial depletion of Irx3 through a lentiviral system resulted in mice with greater body mass³⁷.

Intriguingly, both IRX3 and FTO are highly expressed in the hypothalamus, a region of the brain important to hormonal regulation^28,37. It is unknown whether risk variants, for either BMI or breast cancer, directly disrupt the regulation of hormonal control in the hypothalamus. Furthermore, it is unclear what effect IRX3 expression would have on breast cancer risk and whether any effect would be independent of the risk attributed to obesity alone. A better understanding of the downstream transcriptional targets of IRX3 may identify pro-tumourigenic pathways.

Our results are based on data from individuals of largely white, European-derived ancestry (GTEx is 85% white); hence, limited generalisability to more diverse ancestries is a limitation of this study. However, these results are consistent with the hypothesis that different variants in the FTO locus may be associated with tissue-specific hormonal control and subsequently different pathologies. Consequently, we would expect differences in the regulation of C21 hormones in breast tissue for the different rs11075995 genotypes. Furthermore, candidate genes identified through veQTL analysis require functional validation. A major challenge with assessment of intra-sample gene expression variability is the limitation of single-point ‘grind and bind’ approaches. However, approaches such as RNA in situ hybridisation and single cell RNA-sequencing do provide the ability to detect expression variability. It is also important to determine the mechanism of variability, which may be driven by interaction of genotypes with exposures or by epistasis.

Conclusions

In summary, breast cancer risk variants are associated with variable expression of candidate breast cancer susceptibility genes. These included genes involved in hormonal biosynthetic pathways that are associated with a single variant (rs11075995). To our knowledge, this is the first time gene expression variability has been used to identify candidate cancer susceptibility genes.

Data availability

The data that support the findings of this study are available from GTEx and dbGaP but restrictions apply to the availability of these data, which were used under license for the current study, and so are not publicly available. Data are however available from the authors upon reasonable request and with permission of GTEx and dbGaP.

Abbreviations

eQTL: Expression quantitative trait loci
veQTL: Variable expression quantitative trait loci

References

Michailidou, K. et al. Genome-wide association analysis of more than 120,000 individuals identifies 15 new susceptibility loci for breast cancer. Nat. Genet. 47(4), 373–80 (2015).
Article CAS Google Scholar
Milne, R. L. et al. Identification of ten variants associated with risk of estrogen-receptor-negative breast cancer. Nat. Genet. 49(12), 1767–78 (2017).
Article CAS Google Scholar
Michailidou, K. et al. Association analysis identifies 65 new breast cancer risk loci. Nature 551(7678), 92–4 (2017).
Article ADS Google Scholar
Petretto, E. et al. Heritability and tissue specificity of expression quantitative trait loci. PLoS Genet. 2(10), e172. https://doi.org/10.1371/journal.pgen.0020172 (2006).
Article CAS PubMed PubMed Central Google Scholar
Ferreira, M. A. et al. Genome-wide association and transcriptome studies identify target genes and risk loci for breast cancer. Nat. Commun. 10(1), 1741 (2019).
Article ADS Google Scholar
Raser, J. M. & O’Shea, E. K. Control of stochasticity in eukaryotic gene expression. Science (80-) 304(5678), 1811–4 (2004).
Article ADS CAS Google Scholar
Blake, W. J., Kaern, M., Cantor, C. R. & Collins, J. J. Noise in eukaryotic gene expression. Nature 422(6932), 633–7 (2003).
Article ADS CAS Google Scholar
Hulse, A. M. & Cai, J. J. Genetic variants contribute to gene expression variability in humans. Genetics 193(1), 95–108 (2013).
Article CAS Google Scholar
Spielman, R. S. et al. Common genetic variants account for differences in gene expression among ethnic groups. Nat. Genet. 39(2), 226–31. https://doi.org/10.1038/ng1955 (2007).
Article CAS PubMed PubMed Central Google Scholar
Elowitz, M. B., Levine, A. J., Siggia, E. D. & Swain, P. S. Stochastic gene expression in a single cell. Science 297(5584), 1183–6 (2002).
Article ADS CAS Google Scholar
Hasegawa, Y. et al. Variability of gene expression identifies transcriptional regulators of early human embrynic development. PLOS Genet. 11(8), e1005428. https://doi.org/10.1371/journal.pgen.1005428 (2015).
Article CAS PubMed PubMed Central Google Scholar
Bueno, R., & Mar, J. C. Changes in gene expression variability reveal a stable synthetic lethal interaction network in BRCA2-ovarian cancers. Methods. 2017. Available from: http://linkinghub.elsevier.com/retrieve/pii/S1046202317300695.
Ecker, S., Pancaldi, V., Rico, D. & Valencia, A. Higher gene expression variability in the more aggressive subtype of chronic lymphocytic leukemia. Genome Med. 7(1), 8 (2015).
Article Google Scholar
Hasegawa, Y. et al. Variability of gene expression identifies transcriptional regulators of early human embryonic development. PLOS Genet. 11(8), e1005428. https://doi.org/10.1371/journal.pgen.1005428 (2015).
Article CAS PubMed PubMed Central Google Scholar
Zhang, F., Shugart, Y. Y., Yue, W., Cheng, Z., Wang, G., Zhou, Z., et al. Increased variability of genomic transcription in schizophrenia. Sci Rep. 2015;5.
Brown, A. A. et al. Genetic interactions affecting human gene expression identified by variance association mapping. Elife 3, e01381 (2014).
Article Google Scholar
Wang, G., Yang, E., Brinkmeyer-Langford, C. L. & Cai, J. J. Additive, epistatic, and environmental effects through the lens of expression variability QTL in a twin cohort. Genetics 196(2), 413–25 (2014).
Article CAS Google Scholar
Li, Q. et al. Integrative eQTL-based analyses reveal the biology of breast cancer risk loci. Cell 152(3), 633–41 (2013).
Article CAS Google Scholar
Guo, X. et al. A comprehensive cis-eqtl analysis revealed target genes in breast cancer susceptibility loci identified in genome-wide association studies. Am. J. Hum. Genet. 102(5), 890–903 (2018).
Article CAS Google Scholar
Brown, M. B. & Forsythe, A. B. Robust tests for the equality of variances. J. Am. Stat. Assoc. 69(346), 364–367 (1974).
Article Google Scholar
Struchalin, M. V., Dehghan, A., Witteman, J. C. M., van Duijn, C. & Aulchenko, Y. S. Variance heterogeneity analysis for detection of potentially interacting genetic loci: Method and its limitations. BMC Genet. 11(1), 92. https://doi.org/10.1186/1471-2156-11-92 (2010).
Article PubMed PubMed Central Google Scholar
Marderstein, A. R. et al. Leveraging phenotypic variability to identify genetic interactions in human phenotypes. Am. J. Hum. Genet. 108(1), 49–67 (2021).
Article CAS Google Scholar
Shabalin, A. A. Matrix eQTL: ultra fast eQTL analysis via large matrix operations. Bioinformatics 28(10), 1353–8 (2012).
Article CAS Google Scholar
Yu, G., Wang, L. G., Han, Y. & He, Q. Y. ClusterProfiler: An R package for comparing biological themes among gene clusters. Omi A J. Integr. Biol. 16(5), 284–287 (2012).
Article CAS Google Scholar
Yu, G., Wang, L. G., Yan, G. R. & He, Q. Y. DOSE: An R/Bioconductor package for disease ontology semantic and enrichment analysis. Bioinformatics 31(4), 608–609 (2015).
Article CAS Google Scholar
Han, Z. et al. Crystal structure of the FTO protein reveals basis for its substrate specificity. Nature 464(7292), 1205–9 (2010).
Article ADS CAS Google Scholar
Frayling, T. M. et al. A common variant in the FTO gene is associated with body mass index and predisposes to childhood and adult obesity. Science (80-) 316(5826), 889–94 (2007).
Article ADS CAS Google Scholar
Smemo, S. et al. Obesity-associated variants within FTO form long-range functional connections with IRX3. Nature 507(7492), 371–375 (2014).
Article ADS CAS Google Scholar
Kang, Y., Liu, F. & Liu, Y. Is FTO gene variant related to cancer risk independently of adiposity? An updated meta-analysis of 129,467 cases and 290,633 controls. Oncotarget 8(31), 50987–50996 (2017).
Article Google Scholar
Garcia-Closas, M. et al. Genome-wide association studies identify four ER negative-specific breast cancer risk loci. Nat. Genet. 45(4), 392–398 (2013).
Article CAS Google Scholar
Key, T. J., Verkasalo, P. K. & Banks, E. Epidemiology of Breast Cancer. Vol. 2, Lancet Oncology 133–40 (Elsevier, 2001).
Google Scholar
Dillard, P. R., Lin, M. F. & Khan, S. A. Androgen-independent prostate cancer cells acquire the complete steroidogenic potential of synthesizing testosterone from cholesterol. Mol. Cell Endocrinol. 295(1–2), 115–120 (2008).
Article CAS Google Scholar
Liu, K. et al. Association between body mass index and breast cancer risk: Evidence based on a dose–response meta-analysis. Cancer Manag. Res. 10, 143–151 (2018).
Article Google Scholar
Mauer, J. et al. FTO controls reversible m6Am RNA methylation during snRNA biogenesis. Nat. Chem. Biol. 15(4), 340–7 (2019).
Wood, A. R. et al. Variants in the FTO and CDKAL1 loci have recessive effects on risk of obesity and type 2 diabetes, respectively. Diabetologia 59(6), 1214–1221. https://doi.org/10.1007/s00125-016-3908-5 (2016).
Cohen, D. R., Cheng, C. W., Cheng, S. H. & Hui, C. C. Expression of two novel mouse Iroquois homeobox genes during neurogenesis. Mech. Dev. 91(1–2), 317–321 (2000).
de Araujo, T. M. et al. The partial inhibition of hypothalamic IRX3 exacerbates obesity. EBioMedicine 39, 448–460 (2019).

Author information

These authors contributed equally: John F. Pearson and Logan C. Walker.

Authors and Affiliations

Department of Pathology and Biomedical Science, University of Otago, Christchurch, New Zealand
George A. R. Wiggins, John F. Pearson & Logan C. Walker
Department of Biochemistry, University of Otago, Dunedin, New Zealand
Michael A. Black, Anita Dunbier & Tony R. Merriman
Biostatistics and Computational Biology Unit, University of Otago, Christchurch, New Zealand
John F. Pearson

Contributions

G.A.R.W., J.P. and L.C.W. conceived of the study. G.A.R.W., M.A.B., A.D., J.P. and L.C.W. designed and coordinated the study. G.A.R.W. performed the bioinformatics and statistical analyses. T.R.M. provided the resources from GTEx. G.A.R.W. drafted the manuscript. All authors have read, contributed to and approved the final manuscript.

Corresponding author

Correspondence to Logan C. Walker.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Wiggins, G.A.R., Black, M.A., Dunbier, A. et al. Variable expression quantitative trait loci analysis of breast cancer risk variants. Sci Rep 11, 7192 (2021). https://doi.org/10.1038/s41598-021-86690-5

Received: 29 October 2020
Accepted: 12 March 2021
Published: 30 March 2021
DOI: https://doi.org/10.1038/s41598-021-86690-5
