Micronuclei in Cord Blood Lymphocytes and Associations with Biomarkers of Exposure to Carcinogens and Hormonally Active Factors, Gene Polymorphisms, and Gene Expression: The NewGeneris Cohort

Background: Leukemia incidence has increased in recent decades among European children, suggesting that early-life environmental exposures play an important role in disease development. Objectives: We investigated the hypothesis that childhood susceptibility may increase as a result of in utero exposure to carcinogens and hormonally acting factors. Using cord blood samples from the NewGeneris cohort, we examined associations between a range of biomarkers of carcinogen exposure and hormonally acting factors with micronuclei (MN) frequency as a proxy measure of cancer risk. Associations with gene expression and genotype were also explored. Methods: DNA and protein adducts, gene expression profiles, circulating hormonally acting factors, and GWAS (genome-wide association study) data were investigated in relation to genomic damage measured by MN frequency in lymphocytes from 623 newborns enrolled between 2006 and 2010 across Europe. Results: Malondialdehyde DNA adducts (M1dG) were associated with increased MN frequency in binucleated lymphocytes (MNBN), and exposure to androgenic, estrogenic, and dioxin-like compounds was associated with MN frequency in mononucleated lymphocytes (MNMONO), although no monotonic exposure–outcome relationship was observed. Lower frequencies of MNBN were associated with a 1-unit increase expression of PDCD11, LATS2, TRIM13, CD28, SMC1A, IL7R, and NIPBL genes. Gene expression was significantly higher in association with the highest versus lowest category of bulky and M1dG–DNA adducts for five and six genes, respectively. Gene expression levels were significantly lower for 11 genes in association with the highest versus lowest category of plasma AR CALUX® (chemically activated luciferase expression for androgens) (8 genes), ERα CALUX® (for estrogens) (2 genes), and DR CALUX® (for dioxins). Several SNPs (single-nucleotide polymorphisms) on chromosome 11 near FOLH1 significantly modified associations between androgen activity and MNBN frequency. Polymorphisms in EPHX1/2 and CYP2E1 were associated with MNBN. Conclusion: We measured in utero exposure to selected environmental carcinogens and circulating hormonally acting factors and detected associations with MN frequency in newborns circulating T lymphocytes. The results highlight mechanisms that may contribute to carcinogen-induced leukemia and require further research. Citation: Merlo DF, Agramunt S, Anna L, Besselink H, Botsivali M, Brady NJ, Ceppi M, Chatzi L, Chen B, Decordier I, Farmer PB, Fleming S, Fontana V, Försti A, Fthenou E, Gallo F, Georgiadis P, Gmuender H, Godschalk RW, Granum B, Hardie LJ, Hemminki K, Hochstenbach K, Knudsen LE, Kogevinas M, Kovács K, Kyrtopoulos SA, Løvik M, Nielsen JK, Nygaard UC, Pedersen M, Rydberg P, Schoket B, Segerbäck D, Singh R, Sunyer J, Törnqvist M, van Loveren H, van Schooten FJ, Vande Loock K, von Stedingk H, Wright J, Kleinjans JC, Kirsch-Volders M, van Delft JHM, NewGeneris Consortium. 2014. Micronuclei in cord blood lymphocytes and associations with biomarkers of exposure to carcinogens and hormonally active factors, gene polymorphisms, and gene expression: The NewGeneris Cohort. Environ Health Perspect 122:193–200; http://dx.doi.org/10.1289/ehp.1206324


Introduction
Cancer incidence among European children, specifically leukemia, has steadily increased over the last three decades (Kaatsch 2010). In view of the relatively short latent period for leukemia and its very early onset in childhood, it has been suggested that fetal exposure to environmental carcinogens may increase susceptibility to this cancer (Wild and Kleinjans 2003).
The European Union (EU)-funded project Newborns and Genotoxic exposure risks (NewGeneris) was designed to evaluate the hypothesis that maternal intake of dietary and other environmental carcinogens results in in utero exposure and early biological effects in the unborn child, possibly leading to increased risk of cancer in later childhood (Merlo et al. 2009). The primary aim of the present study was to investigate the volume 122 | number 2 | February 2014 • Environmental Health Perspectives relationship between biomarkers of exposure to carcinogenic compounds and micro nuclei (MN) frequency in umbilical cord blood lympho cytes from the NewGeneris motherchild birth cohort. The secondary aim was to ascertain whether individual genotypes modify these relationships.
Facilitated by the development of microarray technologies, gene expression-based biomarkers have been developed and applied for human biomonitoring purposes Rager et al. 2011;Ren et al. 2011;van Leeuwen et al. 2008). Gene expression profiling has the potential to identify new biomarkers of exposure that may simultaneously reflect the earliest biological events in disease pathogenesis. Here, we evaluated the expression of 36 genes that were associated with biomarkers of carcinogen exposure by quantitative real-time polymerase chain reaction (qRT-PCR) (Hochstenbach et al. 2012).
MN frequency was assessed as the primary outcome. MN are a potential biomarker of cancer risk, because increased micronucleated binucleated (MNBN) frequencies in T lymphocytes have been shown to be associated with cancer risk in adults (Bonassi et al. 2007). MN are small extranuclear bodies arising in dividing cells that are caused by chromosomal breakage and/or whole chromosome loss (Fenech 2007;Kirsch-Volders et al. 2011). MNBN provide a measure of the lesions that have recently occurred in vivo, whereas micronucleated mononucleated lymphocytes (MNMONO) give an estimation of the genome damage accumulated over a long period in stem cells and circulating lymphocytes (Kirsch-Volders and Fenech 2001).
Furthermore, we performed a genomewide association study (GWAS) to investigate whether associations between exposure biomarkers and MN are modified by genetic variation.

Study population and sample collection.
Pregnant women (n = 1,200) were enrolled between 2006 and 2010 in Heraklion, Crete, Greece; Barcelona and Sabadell, Spain;Bradford, England;Copenhagen, Denmark;and Oslo and Akerhus, Norway (Pedersen et al. 2012). The participation of mothers in the study was based on previously described eligibility criteria (Pedersen et al. 2012). Study protocols were approved by local ethics committees, and informed consent was obtained from all participating mothers before sample collection.
Detailed information on personal characteristics, including demographic, health, and lifestyle factors, was obtained using extensive questionnaires completed by mothers before or around the time of delivery. Information on dietary habits during pregnancy was obtained from country-specific food-frequency questionnaires (FFQs). Information on birth weight, gestational age, sex, and type of delivery was obtained from maternity records. Gestational age (completed weeks) was computed based on last menstrual period or ultrasound-based estimated date of conception.
Blood samples were collected from 1,151 mother-infant dyads following a common protocol as described previously (Merlo et al. 2009). Umbilical cord blood samples were collected immediately after birth from the cord vein of newborns and locally processed. Samples were kept at -20°C or -80°C until shipment on dry ice to the study laboratories.
Biomarkers of exposure and early biological effect. DR CALUX® bioassay. Dioxinlike activity, expressed as aryl hydrocarbon receptor (AhR)-mediated activation of the extractable lipid fraction from plasma, was determined through the DR CALUX bioassay developed by BioDetection System (Murk et al. 1996). Blood was collected in heparinized tubes and plasma was isolated by centrifugation on the day of collection and frozen at -20 o C. One to three milliliters of cord blood plasma was used for extraction of lipophilic compounds. The procedure for the DR CALUX® bioassay has been described in detail previously (Behnisch 2005). Additional information is provided in Supplemental Material (p. 3).
ERα and AR CALUX® bioassays. Estrogenic and androgenic activity in cordblood plasma was determined using the ERα and AR CALUX® Bioassays. The ERα and AR CALUX® bioassays comprise human bone cell lines (U2OS), stably incorporating the firefly luciferase gene coupled to responsive elements (REs) as a reporter gene for the presence of (xeno-) estrogens (ERα CALUX®) and androgens (AR CALUX®) (Sonneveld et al. 2005). Additional information is provided in Supplemental Material (p. 4).
Hb adducts. Erythrocytes were isolated by centrifugation on the day of collection and stored at -20°C. AA-, GA-, and EtO-Hb adducts were simultaneously determined by the adduct FIRE procedure using liquid chromatography tandem mass spectrometry with performance and validation standards as described in detail elsewhere (von Stedingk et al. 2010(von Stedingk et al. , 2011. In total, Hb adduct levels were measured in 1,151 cord blood samples. DNA adducts. DNA was isolated with the Qiagen Midi Kit (no. 13343; Qiagen, Hilden, Germany) with some modifications of the manufacturer's protocol, as reported previously (Kovács et al. 2011). Additional details are provided in Supplemental Material (pp. 4-8).
Immunoslot blot analysis of M 1 dG. M 1 dG was determined by an immunoslot blot method, using a murine M 1 dG monoclonal primary antibody (D10A1), provided by L. Marnett (Vanderbilt University, TN, USA), as described previously (Singh et al. 2001).
Postlabeling analysis of bulky DNA adducts. Bulky DNA adducts were detected with the nuclease P1 modification of the 32 P-postlabeling procedure as detailed elsewhere (Kovács et al. 2011). Interlaboratory differences in levels were adjusted for, as described in Supplemental Material (p. 8).
Cytokinesis block micronucleus assay. The in vitro cytokinesis blocked MN assay was carried out according to the standardized protocol developed for semiautomated image analysis (Decordier et al. 2009) and adapted for umbilical blood . MN were scored in both MNBN and MNMONO T lymphocytes (Kirsch-Volders and Fenech 2001). To harmonize slide preparation, the cohort cytologists were trained by I.D., K.V.L., and M.K.-V. (Vrije Universiteit Brussel; VUB). Slides were sent to VUB, where staining and MN analysis occurred. Quality control after staining included visual selection of slides with good quality, using a light microscope and based on a good spreading, swelling, and amount of cells. The automated scoring procedure followed by visual validation of selected micronucleated cells was carried out by the same researcher, using the PathFinder™ platform installed by IMSTAR S.A. (Paris, France) at the VUB laboratory; this consisted of a PathFinder™ CELLSCAN™ capture station and two PathFinder™ MN analysis workstations. Reproducibility of the automated image analysis combined with the visual validation step was investigated by assessing the intercapturing variability (Decordier et al. 2009. At the end of the processing step, cells containing detected MN are presented one by one on the screen and confirmed or rejected by the scorer, according to the Human MicroNucleus project scoring criteria (Fenech et al. 2003). According to guideline T487 of the Organisation for Economic Co-operation and Development (OECD), only subjects with at least 1,000 BN T lymphocytes counted were considered for statistical analysis (OECD 2010).
Gene expression analysis. To preserve RNA for gene expression analysis, 0.4 mL of heparin-anticoagulated whole blood was mixed with 1.2 mL RNAlater (Ambion/ Applied Biosystems, Nieuwerkerk aan den Ijssel, the Netherlands) as soon as possible after blood collection. Samples were kept at -80°C until shipment on dry ice to the research laboratory at Maastricht University. Total RNA was isolated using the RiboPure-Blood system (Ambion) according to the manufacturer's instructions. RNA integrity was verified by gel electrophoresis (2100 BioAnalyzer; Agilent Technologies, Amstelveen, the Netherlands).
Fluidigm's BioMark™ Dynamic Array (Fluidigm, Amsterdam, the Netherlands) technology was used for gene expression analyses by qRT-PCR, which was conducted by ServiceXS (Leiden, the Netherlands). Thirty-six genes were selected from a whole genome gene-environment interaction study on neonates (n = 84) from the Norwegian cohort (Hochstenbach et al. 2012). Selection was primarily based on correlations (r ≥ 0.75, ≤ -0.75, p < 0.05) of gene expression with toxic dietary exposures (i.e., genotoxic or immunotoxic) estimated based on FFQs, CALUX assay-based evidence of exposure to estrogenic-, androgenic-, and dioxin-like compounds, Hb adduct levels, and MN frequencies. Only mechanistically relevant genes were selected, based on gene ontologies such as DNA repair, cell cycle, apoptosis, and cell proliferation. For each of the correlations, mechanistically relevant genes were selected, resulting in 36 unique genes. Five reference genes were selected, based on low variance across all individuals. TaqMan gene expression assays (Applied Biosystems) were used (see Supplemental Material, Table S1), and qRT-PCR was conducted according to the manufacturer's protocol. Each sample was analyzed in duplicate, and an average C t (threshold cycle) value obtained. On all RT-PCR plates, a reference sample at various dilutions was included for quality control assessment of interplate reproducibility. The raw C t value upper cut-off was set to 26; genes exceeding this value were classified as unexpressed. For normalization, the average C t of the five reference genes was subtracted from the C t value of each gene.
Genome-wide association studies and candidate genes analyses. We conducted a genome-wide scan of approximately 300,000 tagging single nucleotide polymorphisms (SNPs) using the Illumina HumanCytoSNP-12 v1 (Illumina Inc., Hayward, CA, USA) according to the manufacturer's protocols. Genotype calling was done using Illumina GenomeStudio 2010. Genomic DNA was isolated from 900 cord blood samples and was used to genotype each child. Quality control was performed on a per-sample and per-SNP basis. We excluded 33 duplicates, 23 samples with a genotype call rate < 98.5%, and 14 twins, leaving 830 genotyped samples available for analysis. We used a general genetic model retaining the three distinct genotypes and without making any assumption about the direction of the SNP's association in the heterozygote compared with the two homozygote classes. According to nonmutually exclusive SNP-based quality checks, 6,801 SNPs were excluded because of Hardy-Weinberg equilibrium violation (p < 10 -6 ), 35,429 because they had a minor allele frequency (MAF < 1%), and 7,338 because missing genotype was > 10%, resulting in 258,246 of 298,199 SNPs left for statistical analyses. A total of 435 newborns had both SNPs and MN results available, and they were used in GWAS statistical analyses.
In addition, SNPs present in metabolism and DNA repair genes were selected a priori by the consortium as candidate genes based on the available knowledge on functionalities with respect to bioactivation (CYP1A1, CYP2E1, CYP2D6, EPHX1, and EPHX2) and detoxification (GSTM1) of DNA adduct-forming metabolites, base excision repair of oxidative adducts (OGG1), nucleotide excision repair of bulky adducts (XRCC1, ERCC2/XPD, XPA, and XRCC3), repair of alkylated adducts (MGMT, ALKB, and MPG) and of thymine adducts (TDG), and with respect to folate metabolism, which is known to interfere with micronucleus formation (MTHFR, MTR, and MTRR).
Cohort (country), maternal age (continuous), gestational age (continuous), prepregnancy maternal body mass index (continuous), maternal smoking during pregnancy (any or none), environmental tobacco smoke (ETS) exposure during pregnancy (any or none), maternal ethnicity (Caucasian, others), and newborn sex and birth weight (continuous) were selected as potential confounders a priori and included in all models. Observations with missing covariates were excluded from the statistical analysis. We report the relative difference in the frequency of MN for each category of exposure relative to the lowest exposure category and the associations between 1-unit increases in gene expression and MN frequency as the mean ratio (MR) and its 95% CI. The likelihood ratio test was used as a global test of statistical significance over all categories of each exposure biomarker, SNPs allele variants, and the interactions between exposure biomarkers with gene expression and with SNPs.
We estimated associations between the expression of each of the 36 genes evaluated and categorical exposure biomarkers using separate multivariable linear regression models adjusted for the covariates listed above. The F-test was used as a global test of statistical significance over all categories of each exposure biomarker. For each exposure biomarker we report the differences in gene expression associated with the highest versus lowest category of exposure biomarkers.
For gene expression and GWAS analyses, we adjusted the estimated p-values to account for multiple comparisons using standard methods (Benjamini and Hochberg 1995;Hochberg 1988;Holm 1979). This criterion was used to identify SNPs associated with MN as main predictors or as effect modifiers of the exposure biomarkers-MN and gene expression-MN associations. No adjustment was made for p-values estimated from the analyses of a priori-selected candidate genes. p-Values < 0.05 were considered statistically significant. All associations were examined in newborns with MN assay data available (n = 623) and with exposure biomarkers, gene expression, and GWAS data available. Sample sizes for individual association analyses varied as indicated in the results.

Results
Levels of biomarkers of exposure (i.e., Hb and DNA adducts, and AR, ERα, and DR CALUX® activity) detected in newborns are reported in Table 1. The number of observations for each biomarker varied reflecting the variable amount of biological specimens collected from cord blood and the assays prioritization adopted (i.e., Hb adducts, DNA adducts, and CALUX® activity). The largest number of observations was available for AA-Hb adducts (n = 1,151) and the smallest for DR CALUX® (n = 725). For all biomarkers large variations were present (e.g., AA-Hb adducts: median = 14.4 pmol/g Hb; range, 4.4-124.8; M 1 dG-DNA adducts: median = 9.9/10 8 nucleotides; range, 0.5-324.7).
Descriptive statistics for MNBN and MNMONO T lymphocytes are shown in Table 2 by cohort and by sociodemographic, reproductive, and lifestyle factors. Again large interindividual variations were observed within and between cohorts, with the highest level of MN observed in Greece (MNBN mean = 1.79 ± 1.50 per 1,000 binucleated T lymphocytes) and the lowest in the United Kingdom (MNBN mean = 0.55 ± 0.74).
None of the global tests of associations across all categories of exposure were statistically significant, and there was no evidence of monotonic dose-response trends with increasing levels of exposure for associations of AA-, GA-, or EtO-Hb adducts (quintiles); PAH-, bulky-, or O 6 -MG-DNA adducts (quartiles); or DR CALUX® plasma levels (quartiles) and frequencies of MNBN and MNMONO T lymphocytes (Table 3). A significant overall association was found between M 1 dG levels and the frequency of MNBN lymphocytes, although associations relative to the lowest quartile of M1dG were positive for the second and third quartiles and negative for the highest quartile. ERα CALUX® plasma levels were significantly associated with the frequency of MNBN and MNMONO lymphocytes and AR CALUX® with the frequency of MNMONO lymphocytes. No monotonic exposure-outcome association was observed between ERα CALUX® or AR CALUX® and MN. For ERα CALUX® a significant negative association with MNBN was detected for the second quartile, followed by a weak nonsignificant positive association with the third and fourth quartiles while the associations with MNMONO were negative for the second and fourth quartiles. The strongest associations were detected for AR CALUX® and MNMONO T lymphocytes and were positive for the second and third quartiles and negative for the fourth quartile.
One-unit increases in the expression of 7 of the 36 genes evaluated (PDCD11, LATS2, TRIM13, CD28, SMC1A, IL7R, and NIPBL) were associated with significantly lower MNBN frequencies, with MR ranging from 0.81 (95% CI: 0.88, 0.96) for PDCD11 to 0.64 (95% CI: 0.77, 0.97) for NIPBL ( Figure 1A). The frequency of MNMONO was not significantly associated with expression of any of the genes tested (data not shown).  In models with gene expression levels as the dependent variable, expression was significantly higher in association with the highest versus lowest category of bulky DNA adducts and of M 1 dG levels for five and six genes, respectively ( Figure 1B). Conversely, expression levels were significantly lower for a total of 11 genes in association with the highest versus lowest category of plasma AR CALUX® (eight genes), ERα CALUX® (two genes), and DR CALUX® (seven genes) ( Figure 1B). Associations with lower levels of exposure are not reported. Six of the seven genes whose expression was associated with significantly lower MNBN frequency (i.e., all except TRIM13; Figure 1A) were significantly associated with the highest versus lowest category of at least one exposure biomarker (M 1 dG, DR CALUX®, ERα CALUX®, or AR CALUX®; Figure 1B).
GWAS was carried out on 435 newborns with data available for both SNPs and micronuclei. Confounding by population stratification was assessed (see Supplemental Material, Figures S1 and S2) and confirmed that genotype variations occurred between population subgroups (i.e., maternal ethnicity and newborns' country of birth), justifying the need for adjustment in statistical analyses. None of the GWAS SNPs were significant predictors of MNBN frequencies (see Supplemental Material, Figure S3). Investigation of the exposure biomarkers-SNPs interactions on the occurrence of MNBN revealed a cluster of significant SNPs (on chromosome 11) for AR CALUX® modeled as a continuous variable (see Supplemental Material, Figure S4). The four SNPs acting as effect modifiers of the relationship between AR CALUX® and the frequency of MNBN lymphocytes are given in Supplemental Material , Table S2. The association of these SNPs were reported per unit increase of plasma AR CALUX® and varied according to the allele variants. For each of the SNPs shown, there was a significant positive association between a 1-unit increase in plasma AR CALUX® and MNBN frequency among participants with one homozygous genotype, and a significant negative association with the alternate homozygous genotype (e.g., for rs7131537, MR = 2.54; 95% CI: 1.69, 3.75 for CC and MR = 0.36; 95% CI: 0.21, 0.60 for AA, with a null association among AC hetero zygotes compared with an overall estimated association MR = 1.14; 95% CI: 0.88, 1.47; data not shown).
Furthermore, 89 SNPs from the 18 a priori selected candidate genes were investigated for association with MN frequencies. SNPs in EPHX1, EPHX2, and CYP2E1 were significantly associated (unadjusted overall p-value < 0.05) with the frequency of MNBN lymphocytes (Table 4). None of the candidategene SNPs were significantly associated with

Discussion
Here, we show that exposure biomarkers and T lymphocyte MN levels are measurable in cord blood, that large variations exist for these in the European newborn population, and also that some of the exposure biomarkers are associated with MN levels (as independent variables) and with gene polymorphisms (when the biomarkers are modeled as dependent variables). This suggests that the fetus may be exposed to carcinogenic chemicals in utero via the placenta, and that such exposures may be sufficient to exert early biological effects manifested as an increase in the frequency of MNBN, a marker that has been associated with cancer risk in adults (Bonassi et al. 2007). However, our findings should be interpreted with caution given that associations did not show evidence of consistent dose-response relations with increasing levels of exposure. M 1 dG is the major DNA adduct arising from malondialdehyde, a genotoxic by-product of lipid peroxidation of polyunsaturated fatty acids with a high number of double bonds that also can be formed during food preparation (Jeong and Swenberg 2005). A significant overall association was detected between M 1 dG adduct levels and MNBN frequency, although the positive association was limited to the second and third quartiles, with the highest quartile of M 1 dG adducts being associated with the lowest MNBN frequency when compared with the lowest quartile. This association indicates recent exposure to malondialdehyde, because MNBN formation reflects recent genetic damage that results in micronuclei formation when cell replication is induced in vitro. No association was found between Hb adducts with MNMONO frequencies; however, fetal exposure to compounds detected by ERα CALUX and AR CALUX induced significant increases of MNMONO, possibly reflecting genetic damage accumulated during fetal development (Kirsch-Volders and Fenech 2001). The CALUX assays measure estrogenic, androgenic, or dioxin-like activities that could result from a variety of compounds or mixtures of compounds. Consequently, associations cannot be attributed to specific exposures. Infant acute leukemia is a frequent childhood cancer, and maternal exposure to hormones during pregnancy has been reported as a potential risk in disease occurrence (Pombo-de-Oliveira and Koifman 2006). A recent review (Holland et al. 2011) on MN in neonates and children concluded that exposure to environmental pollutants and radiation leads to increased MN; however, no information was provided on possible associations with other biomarkers of exposure and/or early effect, as presented in the present study.
The reduced number of samples available for the statistical analyses of the relationships between exposure biomarkers and MN levels is a limitation of the study and may have introduced false-negative findings. Conversely, some of the detected significant associations may have resulted from the multiple comparisons performed, increasing the chance of false-positive findings. In addition none of the observed associations followed a dose-response pattern.
We explored the expression of 36 genes by qRT-PCR as potential new biomarkers of toxic exposure. The expression of seven genes was negatively associated with MNBN (none with MNMONO), namely SMC1A, LATS2, TRIM13, PDCD11, CD28, IL7R, and NIPBL. The expression of these particular genes has previously been shown to be affected by one or more genotoxic carcinogens in experimental models (Mattingly et al. 2003). However, because detailed exposure data were absent, we could not further substantiate the involvement of specific chemicals. Using the dedicated TRANSFAC® software (BIOBASE Biological Databases, Beverly, MA, USA; http://www.biobase-international. com) for finding transcription factor expression in our transcriptomic data, we identified no transcription factor that could regulate all these genes. Given that MN are formed during metaphase/anaphase/telophase transition, it was of interest that most of the genes identified are involved in progression through the cell cycle, cell division, spindle formation, or DNA damage responses. SMC1A encodes a protein that is part of the cohesin protein complex and is involved in sister chromatid cohesion during the cell cycle (Bauerschmidt et al. 2011). The tumor suppressor gene LATS2 encodes a protein that interacts with centrosome proteins and is required for correct spindle formation (Abe et al. 2006). TRIM13 encodes a kinase involved in many different cellular processes including proliferation and apoptosis (Nakashima 2002). Furthermore, CD28 and PDCD11 are involved in apoptosis (Lacana and D'Adamio 1999;Walker et al. 1998). NIPBL is required for association of cohesin with chromosomes, for early processing of double-strand breaks and for the DNA damage checkpoint (Oka et al. 2011). For IL7R, the biological relevance for its association with MNBN remains unclear.
The expression of six of the seven genes associated with MNBN was also associated with the highest versus lowest level of one or more exposure biomarkers ( Figure 1). CD28, IL7R, and PDCD11A were associated with the mutagenic DNA adduct M 1 dG. CD28 and PDCD11 are mainly involved in processes linked to genotoxic stress, such as apoptosis and cell cycle (Lacana and D'Adamio 1999;Walker et al. 1998). LATS2 and SMC1A were associated with DR CALUX®, through which compounds that activate the transcription factor AhR, such as PCDDs (polychlorinated dibenzodioxins), PCDFs (polychlorinated dibenzofurans), dioxin-like PCBs (polychlorinated biphenyls), and PAHs (Pedersen et al. 2010) are measured; many of the latter are genotoxic. Activation of the AhR participates in pathways such as cell cycle regulation, apoptosis and immune responses (Marlowe and Puga 2005). Although LATS2 and SMC1A are not known to be regulated by AhR, both genes are involved in certain subprocesses of the cell cycle. NIPBL was associated with AR CALUX®, which measures compounds with androgenic activity. Like AhR, AR is a transcription factor and regulates the expression of various genes involved in cell cycle control, apoptosis, cell growth, and differentiation (Heisler et al. 1997). Although NIPBL is not known to be regulated by AR, it is linked to genotoxic stress related processes and is involved in the cell cycle through its mediating function in sister chromatid cohesion (Watrin et al. 2006).
In summary, associations between gene expression profiles and MN induction reflect the origin of MN: Many of the genes are associated with chromosome breakage or loss, and particularly interference with spindle and chromatid segregation. Their associations with exposure biomarkers support their relevance in relation to genotoxic processes.
The analysis of genetic susceptibility was conducted using GWAS. A strong signal was observed on chromosome 11 for an interaction with AR CALUX® on MNBN frequency (see Supplemental Material, Table S2, Figure S4). The gene closest to this hotspot is FOLH1 (folate hydrolase 1) and could thus be the genetic factor that affects this relationship. Several pseudogenes were closer, but were excluded because their function is unclear. FOLH1, also known as PMSA (prostatespecific membrane antigen), is overexpressed in prostate cancer and is negatively regulated by androgen (Ghosh et al. 2005). Furthermore, a polymorphism in FOLH1 associated with lower levels of serum folate and hyperhomocysteinemia has been described (Devlin et al. 2000). Low folate is recognized as a risk factor for chromosome instability (Ames 2001) and MN induction (Fenech and Crott 2002). An interaction between androgen exposure and a polymorphism that modulates FOLH1 expression might affect folate levels and thereby modify MNBN frequencies.
GWAS was carried out on 435 newborns with data available for both SNPs and micronuclei. The relatively small sample size is a limitation of the GWAS analysis and is likely to have introduced a risk of false-negative findings due to reduced statistical power to detect the studied associations. To reduce false-positive findings, we accounted for multiple comparisons in our primary GWAS analysis, although candidate gene analyses were not adjusted for multiple comparisons. We identified significant associations between a priori-selected SNPs in EPHX1, EPHX2, and CYP2E1 and the frequency of MNBN lymphocytes (Table 4). These SNPs do not affect the protein code, but might be in linkage disequilibrium with causative variants. However, noncausal associations cannot be ruled out, and further clarification is required given inconsistent associations reported between these genes and MN in the literature (Dhillon et al. 2011).
In this study, samples from almost 1,200 newborns were collected. Because of limited sample volumes, the number of biomarker measurements varied from 1,151 for the AA-Hb adduct to 623 for MNBN, and 435 newborns had data available for both SNPs and micronuclei. For some analyses data were available for a limited number of observations: between 434 and 424 subjects for the associations between MNBN and candidate SNPs, and < 220 subjects for the interactions SNPsexposure biomarkers on MNBN frequency. Although we were able to conduct association studies between individual exposure markers with MNBN, this seriously limited our ability to investigate the interaction between multiple exposure biomarkers and MNBN.

Conclusions
We demonstrated that gene expression, lymphocyte MN levels, and a variety of biomarkers of environmental (geno)toxic exposure can be measured in newborn cord blood samples, and that there is interindividual variation in these markers in the European population. Associations of exposure biomarkers and genes (at the level of both gene expression and genotype) with MN frequencies may help volume 122 | number 2 | February 2014 • Environmental Health Perspectives generate new hypotheses about mechanisms of carcinogen-induced leukemia. The associations that we report must be interpreted with caution because we did not measure specific exposures, we did not observe monotonic dose-response relations, and we cannot rule out noncausal associations.
Nevertheless, our results suggest that internal exposure of the fetus to toxic chemicals occurs during apparently normal pregnancies, that such exposures may increase the frequency of MN formation [which, although of uncertain relevance in newborns (Holland et al. 2011) has been associated with cancer risk in adults (Bonassi et al. 2007)], and that some children may be more susceptible to genotoxic effects of in utero exposures than others.
Ultimately, information on the effects and sources of in utero genotoxic exposures could be used by regulators and industry to develop policy measures and strategies to reduce such exposures in order to improve children's health and reduce the incidence of childhood cancer.