Gene-set meta-analysis of lung cancer identifies pathway related to systemic lupus erythematosus

  • Albert Rosenberger,

    arosenb@gwdg.de

    Affiliation Department of Genetic Epidemiology, University Medical Center, Georg-August-University Göttingen, Göttingen, Germany

  • Melanie Sohns,

    Affiliation Department of Genetic Epidemiology, University Medical Center, Georg-August-University Göttingen, Göttingen, Germany

  • Stefanie Friedrichs,

    Affiliation Department of Genetic Epidemiology, University Medical Center, Georg-August-University Göttingen, Göttingen, Germany

  • Rayjean J. Hung,

    Affiliations Lunenfeld-Tanenbaum Research Institute of Mount Sinai Hospital, Toronto, Canada, Dalla Lana School of Public Health, University of Toronto, Toronto, Canada

  • Gord Fehringer,

    Affiliation Lunenfeld-Tanenbaum Research Institute of Mount Sinai Hospital, Toronto, Canada

  • John McLaughlin,

    Affiliation Public Health Ontario, Toronto, Canada

  • Christopher I. Amos,

    Affiliation Department of Biomedical Data Science, Geisel School of Medicine at Dartmouth, Hanover, New Hampshire, United States of America

  • Paul Brennan,

    Affiliation International Agency for Research on Cancer, Lyon, France

  • Angela Risch,

    Affiliation Division of Molecular Biology, University Salzburg, Salzburg, Austria

  • Irene Brüske,

    Affiliation Institute of Epidemiology I, Helmholtz Center Munich, Munich, Germany

  • Neil E. Caporaso,

    Affiliation Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, Maryland, United States of America

  • Maria Teresa Landi,

    Affiliation Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, Maryland, United States of America

  • David C. Christiani,

    Affiliation Harvard University School of Public Health, Boston, Massachusetts, United States of America

  • Yongyue Wei,

    Affiliation Harvard University School of Public Health, Boston, Massachusetts, United States of America

  • Heike Bickeböller

    Affiliation Department of Genetic Epidemiology, University Medical Center, Georg-August-University Göttingen, Göttingen, Germany

Abstract

Introduction

Gene-set analysis (GSA) is an approach using the results of single-marker genome-wide association studies when investigating pathways as a whole with respect to the genetic basis of a disease.

Methods

We performed a meta-analysis of seven GSAs for lung cancer, applying the method META-GSA. Overall, the information taken from 11,365 cases and 22,505 controls from within the TRICL/ILCCO consortia was used to investigate a total of 234 pathways from the Kyoto Encyclopedia of Genes and Genomes (KEGG) database.

Results

META-GSA reveals the systemic lupus erythematosus KEGG pathway hsa05322, driven by the gene region 6p21-22, as also implicated in lung cancer (p = 0.0306). This gene region is known to be associated with squamous cell lung carcinoma. The most important genes driving the significance of this pathway belong to the genomic areas HIST1-H4L, -1BN, -2BN, -H2AK, -H4K and C2/C4A/C4B. Within these areas, the markers most significantly associated with LC are rs13194781 (located within HIST1H2BN) and rs1270942 (located between C2 and C4A).

Conclusions

We have discovered a pathway currently marked as specific to systemic lupus erythematosus as being significantly implicated in lung cancer. The gene region 6p21-22 in this pathway appears to be more extensively associated with lung cancer than previously assumed. Given wide-stretched linkage disequilibrium to the area APOM/BAG6/MSH5, there is currently simply not enough information or evidence to conclude whether the potential pleiotropy of lung cancer and systemic lupus erythematosus is spurious, biological, or mediated. Further research into this pathway and gene region will be necessary.

Introduction

Since the beginning of the 20th century, lung cancer (LC) occurrence has been increasing rapidly and has become the most common cancer in males. It is the main cause of cancer-related death worldwide [1] and tobacco smoke is its major risk factor. The risk of developing LC in current smokers is 7.6 to 9.3 times higher than that of never smokers [2]. However, around every fourth LC case is not attributable to smoking [3]. A five-fold increased risk of developing early-onset LC in the presence of a family history of early-onset LC in any first-degree relative has also been observed [4, 5]. This and other evidence has led to the general acceptance that a genetic component in early-onset LC development exists. However, an increased risk of developing LC has also been observed in patients with other diseases, such as COPD, pneumonia, tuberculosis, or the autoimmune disorder systemic lupus erythematosus (SLE) [6, 7]. In patients with SLE, a relative risk (RR) of developing LC of 1.68 (95%-CI: 1.33-2.13) was observed [6]. In spite of multiform clinical manifestations and outcomes, it is generally accepted that genetics plays a role in SLE [8]. In light of the results of this investigation, we will discuss a shared genetic susceptibility as a possible connection between SLE and LC.

Genome-wide association studies (GWASs) have revealed that genomic variations at, e.g., 5p15.33, 6p21-22 and 15q25 influence LC risk in European populations [9–16]. Further weakly associated single markers in at least 12 genes have been found given their known role within certain molecular mechanisms [17–21]. Since associated genes are elements of respective pathways, one may assume that nicotine dependency [14], inflammation [16, 22], or DNA repair [23], among others, play a role in an individual’s susceptibility to developing LC.

The usual approach to identifying such molecular mechanisms with GWAS is first to investigate single-marker associations, then to allocate these markers to genes, and finally the genes to pathways. In doing so, the marginal effect of a single marker, the sample size, or both need to be large, because a low genome-wide level of significance of 1 × 10−7 or smaller is needed owing to multiple testing. Gene-set analysis (GSA) strategies were proposed as complementary approaches in the investigation of the genetic basis of a disease using GWAS results [24–26], by seeking to identify sets of genes (GS) with sufficient enrichment of marker-specific significance for an association with a phenotype.

GSA approaches provide no effect estimates of the association, but only p-values (pGS). To pool the pGS-values of several GSAs, it is important to take into account the concordance across studies of all single-marker-association point estimates related to every gene in a considered gene set [27]. However, one only needs to correct for multiple testing using the lower number of GSs being investigated instead of the larger number of genotyped markers. Once a GS has been found to be significantly associated, a search may be conducted for the genes that drive its significance and for the hosted markers which are concordant across studies based on their observed associations.
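The reduction in the multiple-testing burden can be illustrated with a rough worked example. It assumes a nominal level of 0.05, a plain Bonferroni correction, and a hypothetical count of 500,000 genotyped markers; these values are chosen only for illustration and are not the thresholds applied in this analysis.

```latex
\[
\alpha_{\text{marker}} = \frac{0.05}{500\,000} = 1 \times 10^{-7},
\qquad
\alpha_{\text{GS}} = \frac{0.05}{234} \approx 2.1 \times 10^{-4}.
\]
```

Under these assumptions, the per-test threshold for 234 gene sets is roughly three orders of magnitude less stringent than the per-marker threshold.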

Here we aimed to identify pathways taken from the Kyoto Encyclopedia of Genes and Genomes (KEGG) database [28] as being associated with LC. KEGG provides a collection of manually drawn pathway maps representing up-to-date knowledge on molecular interaction and reaction networks. This includes pathways for metabolism (e.g. nicotinate and nicotinamide metabolism), for genetic information processing (e.g. DNA repair), for environmental information processing (e.g. Wnt signaling), for cellular processes (e.g. cell cycle), for organismal systems (e.g. circadian rhythm) and, last but not least, for human diseases (e.g. LC or SLE) [29]. We refrained from restricting the KEGG collection, because pathways that are potentially involved in the etiology of LC (examples are given above in brackets) are contained in every one of the above-mentioned categories.

Our subsequent goal was to determine the driving genes in the pathways identified in the first step. To this end, we combined the results of seven LC GWASs from the Transdisciplinary Research in Cancer of the Lung / International Lung Cancer Consortium (TRICL / ILCCO) in a meta-analysis.

Materials and methods

Description of studies

The meta-analysis was based on summary data from seven previously reported LC GWASs from TRICL / ILCCO (Fig 1). We included 11,365 LC cases and 22,505 controls of European descent in the analysis. An overview as well as study name abbreviations are given in Table 1. Details and references are provided in Supplement S1 File.

Table 1. Characteristics of lung cancer GWASs of the International Lung Cancer Consortium (ILCCO).

https://doi.org/10.1371/journal.pone.0173339.t001

Strategy and methods

In the original GWASs, a log-additive mode of inheritance was fitted for each marker, adjusting for age, sex, smoking status, study center (if applicable), and the first three principal components to account for hidden genomic structure. The results of marker-by-marker association testing were used as input information for the GSAs.

For this meta-analysis, we set up a two-phase seamless design consisting of a screening phase and a replication phase. In the screening phase, the results of MDACC, TORONTO, GLC, and CE were combined, because GSA of these studies had previously been performed for 234 KEGG pathways [30, 31]. In the replication phase, the results of the remaining studies NCI, deCODE, and HARVARD were combined to investigate only those pathways whose findings in the screening phase proved promising. If necessary, GSA was performed using the program ALIGATOR [32]. The method META-GSA [27] was performed to pool GSA results (p-values pGS,s) at each stage. The aim of META-GSA is to increase statistical evidence by pooling the p-values pGS,s of GSAs, also taking into account the concordance of the signs of single-marker-association point estimates and related p-values of all markers (pm,s) assigned to genes contained in the GS [27]. The core element of this approach is a directed p-value (PDR), combining significance and direction of single markers and LD to other markers. Necessary estimates of LD were based on the genotype data of GLC, with imputation of missing markers based on the 1000-Genome Project [33], the 1000-Genome Pilot 1-Panel or the HapMap3-Panel as available using the SNAP online tool [34].

The SNP-to-gene annotation (StG) for humans of the ENSEMBL database [35] was used. Markers in LD of r2 ≥ 0.8 with any marker inside a gene were additionally assigned to that gene [36]. All genes were then annotated to 234 gene sets from the KEGG database (gene-to-pathway annotation (GtP)).
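The two annotation steps described above can be sketched as follows. This is a minimal illustration, not the pipeline used in the study: the input dictionaries, the function names `annotate_markers` and `genes_to_pathways`, and all marker, gene and pathway identifiers in the usage note are hypothetical. Only the r2 ≥ 0.8 cut-off is taken from the text.

```python
from collections import defaultdict

R2_THRESHOLD = 0.8  # LD cut-off stated in the text


def annotate_markers(inside_gene, ld_r2, r2_threshold=R2_THRESHOLD):
    """Build a gene -> set-of-markers map (SNP-to-gene annotation).

    inside_gene: dict marker -> set of genes the marker lies in
                 (hypothetical stand-in for a database annotation).
    ld_r2:       dict (marker_a, marker_b) -> pairwise r^2 (hypothetical input).
    """
    gene_markers = defaultdict(set)
    # Step 1: markers located inside a gene belong to that gene.
    for marker, genes in inside_gene.items():
        for gene in genes:
            gene_markers[gene].add(marker)
    # Step 2: a marker in sufficient LD with a marker inside a gene
    # is additionally assigned to that gene. LD is symmetric, so both
    # members of a pair are tried as the proxy.
    for (a, b), r2 in ld_r2.items():
        if r2 < r2_threshold:
            continue
        for proxy, anchor in ((a, b), (b, a)):
            for gene in inside_gene.get(anchor, ()):
                gene_markers[gene].add(proxy)
    return dict(gene_markers)


def genes_to_pathways(gene_markers, pathway_genes):
    """Collect, per pathway, all markers of its genes (gene-to-pathway step)."""
    return {
        pathway: set().union(*(gene_markers.get(g, set()) for g in genes))
        for pathway, genes in pathway_genes.items()
    }
```

With made-up inputs such as `inside_gene = {"rsA": {"GENE1"}, "rsB": {"GENE2"}}` and `ld_r2 = {("rsC", "rsA"): 0.9, ("rsC", "rsB"): 0.5}`, the marker `rsC` is assigned to `GENE1` only, because its LD with `rsB` falls below the cut-off.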

Both phases can be considered as the first and the second stage of a seamless, adaptive study with interim selection of gene sets (“drop-loser design” [37]). The investigation of every KEGG pathway with a pooled pscr. > β1 = 1/234 in the screening phase was stopped early for futility. The significance, combining screening and replication phase, was assessed according to the “method based on the sum of p-values” (MSP) [37, 38], which yields the p-value pGS. This pGS needs to be corrected for multiple testing by taking into account the total number of 234 pathways. Due to pathway overlap, we estimated the number of independent tests teff according to the lowest slope method (LSM) [39], considering all pscr.-values of the screening phase. Applying a Bonferroni-like correction then yields the final p-value pGS,corr. = min(1, teff × pGS). Furthermore, META-GSA was also applied to all seven studies and all pathways surviving the screening phase to take into account the concordance of single-marker-association point estimates across all considered studies at the same time.
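The two-stage combination and the subsequent correction can be sketched in code. The sketch assumes a design with a futility bound only (no early stop for efficacy) and stage-wise p-values that are independent and uniformly distributed under the null hypothesis. Under these assumptions, the combined p-value is the probability of passing the screening phase and observing a sum of p-values no larger than the observed one. The function names and all numeric inputs are hypothetical, and the result may differ from the exact equation used in the analysis.

```python
def msp_two_stage_p(p_screen, p_repl, beta1):
    """Combined p-value for a two-stage sum-of-p-values design.

    Assumes futility stopping only: a gene set continues to the
    replication stage if p_screen <= beta1. Returns None when the
    gene set is dropped at the screening stage.
    """
    if p_screen > beta1:
        return None  # stopped early for futility
    t = p_screen + p_repl  # test statistic: sum of stage-wise p-values
    hi = min(beta1, t)
    # P(p1 <= beta1 and p1 + p2 <= t) for independent uniform p1, p2:
    # integrate min(1, t - x) over x in [0, hi]; the integrand is
    # capped at 1 for x < t - 1.
    c = min(max(t - 1.0, 0.0), hi)
    return c + t * (hi - c) - 0.5 * (hi ** 2 - c ** 2)


def bonferroni_like(p_gs, t_eff):
    """Correct a gene-set p-value for t_eff effective tests."""
    return min(1.0, t_eff * p_gs)


if __name__ == "__main__":
    beta1 = 1 / 234            # futility bound from the text
    p = msp_two_stage_p(0.002, 0.3, beta1)   # hypothetical stage-wise p-values
    print(p, bonferroni_like(p, t_eff=100))  # t_eff = 100 is hypothetical
```

For a sum t larger than β1 and at most 1, the expression reduces to t·β1 − β1²/2; for t ≤ β1 it reduces to t²/2.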

The next step was to identify the main genes driving the significance of gene sets (denoted as pGS–driving genes). To this end, we contrasted the mean of PDRs across studies for each gene (mean PDR, as a measure of concordance) with pooled p-values regarding the gene-level statistics (pgene, as a measure of significance, calculated according to Fisher’s χ2-method). To judge these findings adequately, we also calculated the mean PDR for the known LC-related genes CLPTM1L, TERT, CHRNB4, CHRNA3, CHRNA5, MSH5, BAG6, RAD52 and CDKN2B. Within these genes we looked for markers with a large mean of PDRs across studies.
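Fisher’s method for pooling p-values, as named above, can be sketched as follows. The statistic −2 Σ ln(p) follows a χ2 distribution with 2k degrees of freedom under the null hypothesis when the k p-values are independent. The function name `fisher_pooled_p` and the example inputs are hypothetical; the closed-form tail probability for even degrees of freedom is used so that no external library is required.

```python
import math


def fisher_pooled_p(p_values):
    """Pool independent p-values with Fisher's chi-square method.

    X = -2 * sum(ln p_i) is chi-square distributed with 2k degrees of
    freedom under the null. For even degrees of freedom the upper tail is
    P(X > x) = exp(-x/2) * sum_{i=0}^{k-1} (x/2)^i / i!.
    """
    k = len(p_values)
    if k == 0:
        raise ValueError("at least one p-value is required")
    if any(not (0.0 < p <= 1.0) for p in p_values):
        raise ValueError("p-values must lie in (0, 1]")
    half_stat = -sum(math.log(p) for p in p_values)  # equals X / 2
    term = 1.0   # (x/2)^i / i! for i = 0
    total = 1.0
    for i in range(1, k):
        term *= half_stat / i
        total += term
    return min(1.0, math.exp(-half_stat) * total)


if __name__ == "__main__":
    # Hypothetical gene-level p-values from three studies.
    print(fisher_pooled_p([0.04, 0.20, 0.01]))
```

A single p-value is returned unchanged, which serves as a quick sanity check of the implementation.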

Finally, we performed a sub-group meta-analysis for the one identified KEGG pathway according to histological subtype (AdenoLC, SqCLC, SCLC and LCLC), sex, age (older or younger than 50 years), and smoking behavior (current, former, ever and never smokers).

During this investigation the region 6p21-22 became of interest. The correlation of marker genotypes with gene expression (eQTL) in this region was previously measured in non-neoplastic pulmonary parenchymal samples taken some distance from the primary tumor in LC patients [40]. We used the estimated correlation between every SNP located between 31.6 Mb and 32.2 Mb (all within 6p21-22) and the expression of the genes APOM, BAG6, MSH5 (reported as relevant in LC), C2, C4B, SKIV2L, STK19 (closely located to genes driving the significance in this META-GSA application) and TNXB (reported as relevant for SLE), in total 5,572 estimated correlations. Estimating teff = 5309 independent tests (by LSM) yields a global threshold for significance of 1 × 10−7.

Results

Association of pathways: Screening and replication phase

Only three of the 234 pathways investigated revealed a p-value lower than the futility threshold and were selected for the replication phase: hsa05322: systemic lupus erythematosus (SLE), hsa00790: folate biosynthesis and hsa04940: type I diabetes mellitus (Table 2). Only for the SLE pathway were we able to achieve a low p-value when combining screening and replication phase and correcting for multiple testing (pGS,corr = 0.0615). Combining all seven studies in a single META-GSA, in order to take the concordance of single-marker-association point estimates of all studies into account adequately, yielded a pGS-value of 0.0306 for this SLE pathway. This indicates sufficient enrichment and satisfactory concordance of marker-specific significance for an association with LC.

Genes driving significance

Four genes of the SLE pathway (HIST1-H4L, -1BN, -H2AK, -H4K) and their close neighbor HIST1H2BN stand out by concordance of marker-specific association (mean PDR) across studies and a gene-level pgene-value lower than 0.01 (Table 3). All five genes belong to the histone cluster 1 and are closely located within 41 kb of each other on 6p22.1. Weaker concordance was observed for two further, less significant genes (pgene-value < 0.05): C4A (mean PDR = -0.41) and C2 (mean PDR = 0.33).

Table 3. Significance and concordance of selected genes of interest.

https://doi.org/10.1371/journal.pone.0173339.t003

Markers driving significance

The markers rs13194781, rs1270942 and rs389884 are those with the largest mean PDR values (all > 0.7) and the strongest associations with LC (in terms of OR). For rs13194781, which is located within HIST1H2BN (ENSEMBL definition), an OR of 1.23 (p = 0.0032) was estimated. The markers rs1270942 and rs389884 are perfect proxies for each other according to the 1000-Genome Pilot 1-Panel [33]. They are closely located upstream of C2 and downstream of C4A, respectively. There is no LD with the first marker rs13194781 (Table 4).

Table 4. Markers with <0.5 in genes of interest on 6p21-22.

https://doi.org/10.1371/journal.pone.0173339.t004

Subgroup meta-analysis

We found more evidence for an association of the SLE pathway with AdenoLC (pGS = 0.0030) than for any other histotype. We also found the association to be significant in women (pGS = 0.0112) but not in men (pGS = 0.1453) and in older cases (pGS = 0.0002) but not in younger cases (pGS = 0.0588). No significant association was observed when stratifying according to smoking behavior (Table 5). Significance within the considered subgroups is driven by the same pGS-driving genes of the region 6p22.1–22.2 as in the total sample (C2 and the genes of the histone 1 cluster). Also, most of the moderately concordant genes that drive significance of hsa05322 in at least one of the considered subgroups are histone-coding genes.

Table 5. Subgroup analysis for hsa05322: histological subtypes, sex, age, smoking.

https://doi.org/10.1371/journal.pone.0173339.t005

SNP ⨯ eQTL correlation

Both aforementioned SNPs belonging to C2/C4A, rs1270942 and rs389884, are significantly correlated with the expression of the gene APOM (p<10−13), which is located about 500 kb away (Fig 2). However, the expression pattern in this region is puzzling, since other markers within C2 (rs537160, rs622871, rs630379) are also correlated with the gene expression in non-neoplastic samples of LC patients of the neighboring gene C4B (not part of the investigated KEGG pathway, although related to SLE). It is also remarkable that the correlation of SNPs belonging to C2/C4A with the expression of C2 is less significant (p ~10−3) than with the expression of SKIV2L (p ~10−5), which is not related to SLE.

Fig 2. Association and correlation with gene expression in the chromosome 6p21-22 region.

LC—lung cancer, SLE—systemic lupus erythematosus; correlation to gene expression: pooled p-values as reported by Nguyen et al., 2014 [40]; association with LC: pooled p-values as reported by Timofeeva et al. 2012 [13].

https://doi.org/10.1371/journal.pone.0173339.g002

Discussion

We could demonstrate an accumulation of genomic association with LC in the KEGG pathway hsa05322, which comprises genes related to SLE. This suggests some cross-phenotype (CP) association with LC and SLE. The significance was higher in the subgroup of AdenoLC patients than within other histological subtypes and in women compared to men. This fits our expectations, given that women, who predominantly develop AdenoLC, are more often affected by SLE than men [41], who predominantly develop smoking-related SqCLC [1, 42].

All pGS–driving genes identified in this meta-analysis are located within or next to the major histocompatibility complex (MHC) on chromosome 6p21-22 (Fig 2), albeit in two separate areas, about 3000 kb apart. The first area comprises the genes of histone cluster 1: HIST1-H4L, -1BN, -2BN, -H2AK, -H4K (the strongest associated marker is rs13194781; OR = 1.23, p = 0.0032). It is well known that a variety of histone-related modifications are related to cancer, to SLE, or to both [8, 43]. They play a role e.g. in DNA repair, cell cycle or gene expression [8, 44], which by themselves are associated with LC or SLE, respectively [23, 45]. Interestingly enough, we detected associations with LC of the DNA signature of histone-coding genes, rather than with respect to some kind of epigenetic outcome.

The second area comprises the genes C2, C4A, and C4B (the strongest associated markers are rs1270942 and rs389884; OR = 1.27, p = 0.009). It is well established that reduced gene expression of C2 and C4A can predispose to SLE [46]. These two genes, and perhaps also C4B, are involved in the clearance of apoptotic bodies [8]. This is in turn crucially important for controlling inflammation, which plays a role in the development of LC [3].

However, the identification of disease-relevant genes in the MHC region (6p21–6p22) and far beyond is complicated owing to the strong and extensive LD across both common and rare haplotypes [47]. Hence any observed CP association will probably tag plenty of genes. An association of the gene area APOM/BAG6/MSH5 in the MHC region with LC has previously been reported, which is strongest for SqCLC and AdenoLC [9, 13]. The strongest association with SqCLC in this area was previously reported for the marker rs3117582 (located within BAG6 and APOM; OR = 1.3, p = 4.5×10−10), which was also found to be associated with SLE (OR = 2.2, p = 4.2×10−21) [48]. This marker is about 220 kb away from, but in strong LD with, the newly identified markers rs1270942 and rs389884 (located close to C2; Table 4 and Fig 2). More importantly, a highly significant correlation between markers of the area C2/C4A/C4B and the expression of the gene APOM in non-neoplastic samples taken from LC patients was also recently reported [40] (Fig 2). APOM is involved in lipid transport and is linked with high-density lipoprotein cholesterol in the pathogenesis of emphysema, which in turn is considered to be associated with LC [49, 50]. But other explanations of the observed associations have been given, too; for instance, a connection to embryonic lethality with defects in the development of the lung (related to the function of BAG6) or deficits in mismatch excision repair (related to the function of MSH5) [13]. Moreover, the association of MSH5 with SLE was reported as not shared with other autoimmune/inflammatory diseases [51].

Apart from all this, some remarks about the applied method need to be made. The whole approach is an intensive investigation of p-values, which, in the context of this project, are indicators of evidence for or against the rejection of a null hypothesis of no genetic association. We used the program ALIGATOR to perform GSA, which circumvents bias due to uneven counts of markers per gene as well as genes per gene set [32]. Choosing another algorithm would probably lead to different results [31]. In addition, a p-value can be used to justify the existence of an association; however, it is not solely determined by the strength of the observed effect, but also by factors like sample size, the statistical model used and the test procedure applied. Hence we can present the significance of our findings but are unable to estimate the part of LC risk that can be attributed to the identified genes or gene sets.

Conclusion

We were able to identify CP risk factors by first pooling results of gene set analyses and looking afterwards for those genes driving the significance of discovered gene sets. In doing so, we have discovered a pathway that is currently marked as specific to SLE as being significantly implicated in LC. The gene region 6p21-22 in this pathway appears to be more extensively associated with lung cancer than previously assumed. Given wide-stretched linkage disequilibrium to the area APOM/BAG6/MSH5, there is currently simply not enough information or evidence to conclude whether the potential pleiotropy of LC and SLE is spurious, biological, or mediated. Further research into this pathway and gene region will be necessary.

Supporting information

S3 File. Meta-analysis on Genetic Association Studies Checklist | PLOS ONE.

https://doi.org/10.1371/journal.pone.0173339.s003

(DOCX)

Acknowledgments

This study was conducted under the auspices of the TRICL Research Team and the ILCCO network. We would like to thank all the participants and clinicians who took part in the original studies. Furthermore, we would like to thank all the researchers who made their original data available. We would specifically like to thank Yohan Bossé from Université Laval, Quebec, for providing us with the summary data of his mRNA experiments.

Author Contributions

  1. Conceptualization: A. Rosenberger.
  2. Formal analysis: A. Rosenberger SF MS GF.
  3. Funding acquisition: RH JM CA PB A. Risch IB NC ML DC HB.
  4. Investigation: RH JM CA PB A. Risch IB NC ML DC HB.
  5. Methodology: A. Rosenberger.
  6. Project administration: CA RH HB.
  7. Resources: RH JM CA PB A. Risch IB NC ML DC YW HB.
  8. Writing – original draft: A. Rosenberger HB RH NC.
  9. Writing – review & editing: A. Rosenberger.

References

  1. Jemal A, Bray F, Center MM, Ferlay J, Ward E, Forman D. Global cancer statistics. CA Cancer J Clin. 2011;61(2):69–90. pmid:21296855
  2. Lee PN, Forey BA, Coombs KJ. Systematic review with meta-analysis of the epidemiological evidence in the 1900s relating smoking to lung cancer. BMC Cancer. 2012;12:385. PubMed Central PMCID: PMC3505152. pmid:22943444
  3. Sun S, Schiller JH, Gazdar AF. Lung cancer in never smokers - a different disease. Nature reviews Cancer. 2007;7(10):778–90. pmid:17882278
  4. Cassidy A, Myles JP, Duffy SW, Liloglou T, Field JK. Family history and risk of lung cancer: age-at-diagnosis in cases and first-degree relatives. British journal of cancer. 2006;95(9):1288–90. PubMed Central PMCID: PMC2360569. pmid:17003779
  5. Kreuzer M, Kreienbrock L, Gerken M, Heinrich J, Bruske-Hohlfeld I, Muller KM, et al. Risk factors for lung cancer in young adults. Am J Epidemiol. 1998;147(11):1028–37. pmid:9620046
  6. Ni J, Qiu LJ, Hu LF, Cen H, Zhang M, Wen PF, et al. Lung, liver, prostate, bladder malignancies risk in systemic lupus erythematosus: evidence from a meta-analysis. Lupus. 2014;23(3):284–92. pmid:24429300
  7. Brenner DR, Boffetta P, Duell EJ, Bickeboller H, Rosenberger A, McCormack V, et al. Previous lung diseases and lung cancer risk: a pooled analysis from the International Lung Cancer Consortium. Am J Epidemiol. 2012;176(7):573–85. PubMed Central PMCID: PMC3530374. pmid:22986146
  8. Costa-Reis P, Sullivan KE. Genetics and epigenetics of systemic lupus erythematosus. Current rheumatology reports. 2013;15(9):369. pmid:23943494
  9. Wang Y, Broderick P, Webb E, Wu X, Vijayakrishnan J, Matakidou A, et al. Common 5p15.33 and 6p21.33 variants influence lung cancer risk. Nat Genet. 2008;40(12):1407–9.
  10. Hung RJ, McKay JD, Gaborieau V, Boffetta P, Hashibe M, Zaridze D, et al. A susceptibility locus for lung cancer maps to nicotinic acetylcholine receptor subunit genes on 15q25. Nature. 2008;452(7187):633–7. pmid:18385738
  11. Amos CI, Wu X, Broderick P, Gorlov IP, Gu J, Eisen T, et al. Genome-wide association scan of tag SNPs identifies a susceptibility locus for lung cancer at 15q25.1. Nat Genet. 2008;40(5):616–22. pmid:18385676
  12. Truong T, Hung RJ, Amos CI, Wu X, Bickeboller H, Rosenberger A, et al. Replication of lung cancer susceptibility loci at chromosomes 15q25, 5p15, and 6p21: a pooled analysis from the International Lung Cancer Consortium. J Natl Cancer Inst. 2010;102(13):959–71.
  13. Timofeeva MN, Hung RJ, Rafnar T, Christiani DC, Field JK, Bickeboller H, et al. Influence of common genetic variation on lung cancer risk: meta-analysis of 14 900 cases and 29 485 controls. Human molecular genetics. 2012;21(22):4980–95. Epub 2012/08/18. PubMed Central PMCID: PMC3607485. pmid:22899653
  14. Brennan P, Hainaut P, Boffetta P. Genetics of lung-cancer susceptibility. The lancet oncology. 2011;12(4):399–408. Epub 2010/10/19. pmid:20951091
  15. Wang Y, McKay JD, Rafnar T, Wang Z, Timofeeva MN, Broderick P, et al. Rare variants of large effect in BRCA2 and CHEK2 affect risk of lung cancer. Nat Genet. 2014;46(7):736–41. PubMed Central PMCID: PMC4074058. pmid:24880342
  16. Fehringer G, Liu G, Pintilie M, Sykes J, Cheng D, Liu N, et al. Association of the 15q25 and 5p15 lung cancer susceptibility regions with gene expression in lung tumor tissue. Cancer epidemiology, biomarkers & prevention: a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology. 2012;21(7):1097–104. Epub 2012/04/28.
  17. Timofeeva M, Kropp S, Sauter W, Beckmann L, Rosenberger A, Illig T, et al. Genetic polymorphisms of MPO, GSTT1, GSTM1, GSTP1, EPHX1 and NQO1 as risk factors of early-onset lung cancer. Int J Cancer. 2010;127(7):1547–61.
  18. Leng S, Picchi MA, Liu Y, Thomas CL, Willis DG, Bernauer AM, et al. Genetic variation in SIRT1 affects susceptibility of lung squamous cell carcinomas in former uranium miners from the Colorado plateau. Carcinogenesis. 2013;34(5):1044–50. PubMed Central PMCID: PMC3643420. pmid:23354305
  19. Hung RJ, Christiani DC, Risch A, Popanda O, Haugen A, Zienolddiny S, et al. International Lung Cancer Consortium: pooled analysis of sequence variants in DNA repair and cell cycle pathways. Cancer epidemiology, biomarkers & prevention: a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology. 2008;17(11):3081–9. PubMed Central PMCID: PMC2756735.
  20. Manuguerra M, Saletta F, Karagas MR, Berwick M, Veglia F, Vineis P, et al. XRCC3 and XPD/ERCC2 single nucleotide polymorphisms and the risk of cancer: a HuGE review. Am J Epidemiol. 2006;164(4):297–302. pmid:16707649
  21. Brenner DR, Brennan P, Boffetta P, Amos CI, Spitz MR, Chen C, et al. Hierarchical modeling identifies novel lung cancer susceptibility variants in inflammation pathways among 10,140 cases and 11,012 controls. Hum Genet. 2013;132(5):579–89. Epub 2013/02/02. PubMed Central PMCID: PMC3628758. pmid:23370545
  22. Gomes M, Teixeira AL, Coelho A, Araujo A, Medeiros R. The role of inflammation in lung cancer. Advances in experimental medicine and biology. 2014;816:1–23. pmid:24818717
  23. Kiyohara C, Takayama K, Nakanishi Y. Lung cancer risk and genetic polymorphisms in DNA repair pathways: a meta-analysis. Journal of nucleic acids. 2010;2010:701760. PubMed Central PMCID: PMC2958337. pmid:20981350
  24. Sohns M, Rosenberger A, Bickeboller H. Integration of a priori gene set information into genome-wide association studies. BMC proceedings. 2009;3 Suppl 7:S95. PubMed Central PMCID: PMC2795999.
  25. Peng G, Luo L, Siu H, Zhu Y, Hu P, Hong S, et al. Gene and pathway-based second-wave analysis of genome-wide association studies. European journal of human genetics: EJHG. 2010;18(1):111–7. pmid:19584899
  26. Luo L, Peng G, Zhu Y, Dong H, Amos CI, Xiong M. Genome-wide gene and pathway analysis. European journal of human genetics: EJHG. 2010;18(9):1045–53. pmid:20442747
  27. Rosenberger A, Friedrichs S, Amos CI, Brennan P, Fehringer G, Heinrich J, et al. META-GSA: Combining Findings from Gene-Set Analyses across Several Genome-Wide Association Studies. PLoS One. 2015;10(10):e0140179. PubMed Central PMCID: PMC4621033. pmid:26501144
  28. Kanehisa M, Goto S, Sato Y, Furumichi M, Tanabe M. KEGG for integration and interpretation of large-scale molecular data sets. Nucleic Acids Res. 2012;40(Database issue):D109–D14. pmid:22080510
  29. KEGG Pathway Database [Internet]. Kanehisa Laboratories. 1995–2017. Available from: http://www.genome.jp/kegg/pathway.html.
  30. Tintle N, Lantieri F, Lebrec J, Sohns M, Ballard D, Bickeboller H. Inclusion of a priori information in genome-wide association analysis. Genetic epidemiology. 2009;33 Suppl 1:S74–80. PubMed Central PMCID: PMC2922922.
  31. Fehringer G, Liu G, Briollais L, Brennan P, Amos CI, Spitz MR, et al. Comparison of pathway analysis approaches using lung cancer GWAS data sets. PLoS One. 2012;7(2):e31816. Epub 2012/03/01. PubMed Central PMCID: PMC3283683. pmid:22363742
  32. Holmans P, Green EK, Pahwa JS, Ferreira MA, Purcell SM, Sklar P, et al. Gene ontology analysis of GWA study data sets provides insights into the biology of bipolar disorder. Am J Hum Genet. 2009;85(1):13–24. pmid:19539887
  33. Genomes Project C, Abecasis GR, Auton A, Brooks LD, DePristo MA, Durbin RM, et al. An integrated map of genetic variation from 1,092 human genomes. Nature. 2012;491(7422):56–65. PubMed Central PMCID: PMC3498066. pmid:23128226
  34. Johnson AD, Handsaker RE, Pulit SL, Nizzari MM, O'Donnell CJ, de Bakker PI. SNAP: a web-based tool for identification and annotation of proxy SNPs using HapMap. Bioinformatics. 2008;24(24):2938–9. PubMed Central PMCID: PMC2720775. pmid:18974171
  35. Cunningham F, Amode MR, Barrell D, Beal K, Billis K, Brent S, et al. Ensembl 2015. Nucleic Acids Res. 2014.
  36. Malzahn D, Friedrichs S, Bickeböller H. Comparing strategies for combined testing of rare and common variants in whole sequence and genome-wide genotype data. BMC proceedings. 2015: accepted for publication.
  37. Chang M. Adaptive design theory and implementation using SAS and R: Boca Raton: Chapman & Hall/CRC; 2008.
  38. Chang M. Adaptive design method based on sum of p-values. Statistics in medicine. 2007;26(14):2772–84. pmid:17133651
  39. Hsueh HM, Chen JJ, Kodell RL. Comparison of methods for estimating the number of true null hypotheses in multiplicity testing. Journal of biopharmaceutical statistics. 2003;13(4):675–89. pmid:14584715
  40. Nguyen JD, Lamontagne M, Couture C, Conti M, Pare PD, Sin DD, et al. Susceptibility loci for lung cancer are associated with mRNA levels of nearby genes in the lung. Carcinogenesis. 2014;35(12):2653–9. PubMed Central PMCID: PMC4247514. pmid:25187487
  41. Brinks R, Fischer-Betz R, Sander O, Richter JG, Chehab G, Schneider M. Age-specific prevalence of diagnosed systemic lupus erythematosus in Germany 2002 and projection to 2030. Lupus. 2014;23(13):1407–11. pmid:24928831
  42. Devesa SS, Bray F, Vizcaino AP, Parkin DM. International lung cancer trends by histologic type: male:female differences diminishing and adenocarcinoma rates rising. International journal of cancer Journal international du cancer. 2005;117(2):294–9. pmid:15900604
  43. Chervona Y, Costa M. Histone modifications and cancer: biomarkers of prognosis? American journal of cancer research. 2012;2(5):589–97. PubMed Central PMCID: PMC3433108. pmid:22957310
  44. House NC, Koch MR, Freudenreich CH. Chromatin modifications and DNA repair: beyond double-strand breaks. Frontiers in genetics. 2014;5:296. PubMed Central PMCID: PMC4155812. pmid:25250043
  45. Kazma R, Babron MC, Gaborieau V, Genin E, Brennan P, Hung RJ, et al. Lung cancer and DNA repair genes: multilevel association analysis from the International Lung Cancer Consortium. Carcinogenesis. 2012;33(5):1059–64. Epub 2012/03/03. PubMed Central PMCID: PMC3334518. pmid:22382497
  46. Leffler J, Bengtsson AA, Blom AM. The complement system in systemic lupus erythematosus: an update. Annals of the rheumatic diseases. 2014;73(9):1601–6. pmid:24845390
  47. Ahmad T, Neville M, Marshall SE, Armuzzi A, Mulcahy-Hawes K, Crawshaw J, et al. Haplotype-specific linkage disequilibrium patterns define the genetic topography of the human MHC. Human molecular genetics. 2003;12(6):647–56. pmid:12620970
  48. Alonso MD, Martinez-Vazquez F, Riancho-Zarrabeitia L, Diaz de Teran T, Miranda-Filloy JA, Blanco R, et al. Sex differences in patients with systemic lupus erythematosus from Northwest Spain. Rheumatology international. 2014;34(1):11–24. pmid:23812032
  49. Burkart KM, Manichaikul A, Wilk JB, Ahmed FS, Burke GL, Enright P, et al. APOM and high-density lipoprotein cholesterol are associated with lung function and per cent emphysema. The European respiratory journal. 2014;43(4):1003–17. PubMed Central PMCID: PMC4041087. pmid:23900982
  50. Brenner DR, Hung RJ, Tsao MS, Shepherd FA, Johnston MR, Narod S, et al. Lung cancer risk in never-smokers: a population-based case-control study of epidemiologic risk factors. BMC Cancer. 2010;10:285. pmid:20546590
  51. Fernando MM, Freudenberg J, Lee A, Morris DL, Boteva L, Rhodes B, et al. Transancestral mapping of the MHC region in systemic lupus erythematosus identifies new independent and interacting loci at MSH5, HLA-DPB1 and HLA-G. Annals of the rheumatic diseases. 2012;71(5):777–84. PubMed Central PMCID: PMC3329227. pmid:22233601