Dental Amalgam Restorations and Children’s Neuropsychological Function: The New England Children’s Amalgam Trial

Background A concern persists that children’s exposure to mercury vapor from dental amalgams produces neurotoxicity. Objective Our goal was to compare the neuropsychological function of children, without prior exposure to dental amalgam, whose caries were repaired using either dental amalgam or mercury-free composite materials. Methods We conducted a randomized controlled trial involving 534 6- to 10-year-old urban and rural children who were assessed yearly for 5 years using a battery of tests of intelligence, achievement, language, memory, learning, visual–spatial skills, verbal fluency, fine motor function, problem solving, attention, and executive function. Results Although the mean urinary mercury concentration was greater among children in the amalgam group than the composite group (0.9 vs. 0.6 μg/g creatinine), few significant differences were found between the test scores of children in the two groups. The differences found were inconsistent in direction. Analyses using two cumulative exposure indices—surface years of amalgam and urinary mercury concentration—produced similar results. Conclusions Exposure to elemental mercury in amalgam at the levels experienced by the children who participated in the trial did not result in significant effects on neuropsychological function within the 5-year follow-up period.


Research | Children's Health
We previously reported on a randomized clinical trial, the New England Children's Amalgam Trial (NECAT), in which no significant differences were found, over a 5-year follow-up interval, between the neuropsychological scores of children for whom dental amalgam was used to repair caries and the scores of children for whom mercury-free composite materials were used . The Full-Scale IQ score on the Wechsler Intelligence Scale for Children-Third Edition (WISC-III; Wechsler 1991) was the primary end point, and the General Memory Index (GMI) on the Wide Range Assessment of Memory and Learning (WRAML; Sheslow and Adams 1990), and the Visual Motor Composite (VMC) on the Wide Range Assessment of Visual-Motor Ability (WRAVMA; Adams and Sheslow 1995) were the two secondary end points. Each of these is a global score, derived by combining a child's performance on tasks that assess somewhat different abilities.
The additional analyses reported in this article address three issues. First, if mercury vapor liberated from dental amalgams produces specific rather than general neuropsychological effects and are most likely to be evident on tests that assess specific cognitive domains, global scores such as Full-Scale IQ, GMI, and VMC might be relatively insensitive to important treatment-group differences.
Even in the absence of treatment-group differences on global test scores, differences in specific cognitive domains could, depending on their nature and severity, represent morbidities with important consequences for children's health and well-being. Therefore, here we report comparisons of the scores of the amalgam and composite groups on the subscales that contribute to Full-Scale IQ, the GMI, and the VMC, as well as scores on a battery of additional, domain-focused, neuropsychological and educational tests.
Second, the exposure index used in the primary analyses of the trial was treatmentgroup assignment. This could have introduced a form of exposure misclassification insofar as the variability in the treatment needs of the children in the amalgam group resulted in the receipt of variable amounts of amalgam and thus in their potential exposure to mercury. Therefore, we repeated the analyses replacing treatment-group assignment with two continuously distributed indices of exposure: surface-years of amalgam and urinary mercury concentration.
Third, it is possible that only a subset of children experienced adverse neuropsychological effects as the result of exposure to amalgam, either because of behaviors, such as bruxism or frequent gum chewing, that cause enhanced release of mercury (Barregard 2005; Barregard et al. 1995) or because of enhanced sensitivity to mercury. Two recent studies of dental professionals suggest that polymorphisms for brain-derived neurotrophic factor (Echeverria et al. 2005;Heyer et al. 2004) and the coproporphyrinogen oxidase gene (Echeverria et al. 2006) modify the neurotoxicity of elemental mercury. If the prevalence of such enhanced vulnerability to elemental mercury is low or the associated increase in neuropsychological toxicity is modest in magnitude, its impact on the mean scores of children in the amalgam group might not have been sufficiently large to produce significant treatment-group differences. In an attempt to identify the presence of a subgroup of children who are particularly sensitive to amalgam, we compared the distributional characteristics of the scores within the treatment groups.

Study design and participants.
Children were eligible if they were 6-10 years of age, fluent in English, had no known prior or existing amalgam restorations, had two or more posterior teeth with dental caries, and did not have a physician-diagnosed psychological, behavioral, neurological, immunosuppressive, or renal disorder (Children's Amalgam Trial Study Group 2003). Children were recruited from several community dental clinics in the Boston/Cambridge area of Massachusetts, an urban setting, and from a dental clinic in Farmington, Maine, a rural area.
A total of 5,116 children were screened for eligibility. Eligibility was confirmed for 598 children. At baseline visits, children received a dental examination by a study dentist, X rays, and standard preventive dental care (e.g., cleaning, application of sealants). Other baseline visit activities included the collection of blood and urine samples, anthropometric measurements of height, weight, and body fat, neuropsychological testing of the child, a health interview with the child's guardian, and neuropsychological testing of the guardian.
After completion of baseline visits, children were randomized to a study treatment arm. Randomization was stratified by geographic location (Boston/Cambridge vs. Farmington) and number of teeth with caries (two to four vs. five or more), using randomly permuted blocks within each of the four strata.
The NECAT was conducted in accordance with all applicable requirements for the protection of human subjects. All children provided assent and parents provided informed consent. The study protocol was approved by the institutional review boards of the New England Research Institutes, the Forsyth Institute, and the clinics from which children were recruited.
Interventions and follow-up. All children had semiannual dental examinations as well as additional visits required to meet treatment needs. For children in the amalgam arm, a dispersed-phase amalgam was used to restore all posterior teeth with caries at baseline and to repair incident caries during the 5-year trial period. For children in the composite arm, composite material (white filling) was used for all restorations. Following standard clinical practice, for both groups, composite material was used to repair caries in the front teeth, and stainless steel crowns were used to restore primary teeth with extensive lesions that could not otherwise be restored. The choices of dental materials and techniques were standardized across sites and dentists.
Urine samples were collected annually and analyzed for elemental mercury using cold vapor atomic absorption. Values were expressed as micrograms per gram creatinine. The analyses reported use only urinary mercury concentrations in samples collected at 3, 4, and 5 years of follow-up. After 1 February 2000, the detection limit, which had been 1.5 ng/mL, was reduced to 0.45 ng/mL as a result of increasing the volume of sample analyzed for each child. Samples with a mercury concentration below the detection limit were assigned a value of 0.45/√2 .
Participants and dentists could not be blinded to treatment assignment, but all individuals who collected outcome data (e.g., neuropsychological tests) or analyzed specimens (e.g., for mercury) were blinded to children's treatment assignments.
Neuropsychological assessments. At baseline, before randomization and the receipt of any dental treatments, children participated in two 3-hr neuropsychological test sessions. At the first session, the WISC-III (Wechsler 1991) and the Wechsler Individual Achievement Test (WIAT) (Psychological Corporation 1992) were administered. These tests were again administered at 3 and 5 years after baseline. The second baseline session consisted of a battery of domain-focused tests. This test battery, which was again administered at 1, 2, and 4 years after baseline, included the WRAML (Sheslow and Adams 1990), the WRAVMA (Adams and Sheslow 1995), the Trail-Making Test (Spreen and Strauss 1991), finger tapping (the WPS Electronic Tapping Test; Western Psychological Services, Los Angeles, CA), ordered and unordered verbal cancellation (Mesulam 1985), category fluency (McCarthy 1972), the Controlled Oral Word Association Test (letter fluency) (Spreen and Strauss 1991), simple visual reaction time (the Standard Reaction Timer; Software Science, Cincinnati, OH), the Stroop Color-Word Interference Test (Trenerry et al. 1989), and the Wisconsin Card Sorting Test (Heaton et al. 1993).
A total of 14 testers were used at the Boston/Cambridge site and five testers at the Farmington site. Quality control of the assessments was assured by having all examiners trained and certified by one supervising psychologist (D.C.B.) before conducting assessments of children enrolled in the trial, and monitored over the course of data collection. Each testing session was completely rescored by a second individual and errors were corrected. A variety of computerized algorithms were used to check the entered data for internal consistency.
Sample size determination. The trial was designed to achieve 80% power to detect a 3-point difference between treatment arms in 5-year change in Full-Scale IQ score, adjusted for baseline IQ score and randomization stratum. Assuming a retention rate of 75% over the 5-year follow-up period, the recruitment goal was 250 children per treatment arm, for a total sample size of 500 children.
Statistical analysis. We used intention-totreat analyses, using analysis of covariance, to compare the amalgam and composite groups in terms of the changes, over 5 years, in scores on the WISC-III and WIAT and the changes, over 4 years, in scores on the domain-focused tests. These analyses thus indicated whether 2.4 ± 1.9 (1-13) 2.3 ± 1.5 (1-11) a Race was self-reported by parents. Table 2. Dental treatment and amalgam exposure at the end of the 5-year follow-up period, by treatment group [mean ± SD (range)].

Dental treatment Amalgam Composite
No. of restored surfaces in mouth 5.3 ± 5.2 (0-36)* 6.1 ± 6.0 (0-36)* No. of restored amalgam surfaces a 4.0 ± 4.0 (0-21) 0.05 ± 0.6 (0-9) Cumulative no. of surfaces restored (over 5 years) b 14.8 ± 9.5 (2-55)** 16.0 ± 9.8 (2-51)** Cumulative no. of surfaces restored with amalgam b 11.7 ± 7.0 (0-35) 0.05 ± 0.6 (0-9) the central tendencies of the distributions of change scores differed in the amalgam and composite groups. Adjustments were made for baseline covariates: test score, randomization stratum, age, sex, socioeconomic status, hair mercury, and blood lead level. We calculated socioeconomic status using the method developed by Green (1970). Hair mercury was included to control for methylmercury, a form of mercury that is known to be a developmental neurotoxicant (Weiss 2006) but acquired primarily by consumption of contaminated seafood. Elevated blood lead level is a well-known developmental risk factor, with an increased prevalence among children who are socioeconomically disadvantaged (Bellinger 2006). We evaluated children's neuropsychological test scores in relation to two continuously distributed indices of exposure. The first index was surface-years of amalgam (number of amalgam surfaces weighted by number of years present), calculated from the information contained in dental clinic records regarding dates of amalgam placement, the number of tooth surfaces involved in the restoration, the timing of loss of primary teeth containing amalgam restorations, and the like. The second index was urinary mercury concentration. Scores on the WISC-III and WIAT, both of which were administered at year 5 of follow-up, were evaluated in relation to the mean of available urinary mercury concentrations at years 3, 4, and 5. Scores on the other tests, which were administered for the final time at year 4 of follow-up, were evaluated in relation to the mean of available urinary mercury concentrations at years 3 and 4 of followup. We evaluated the associations between these indices of exposure and children's neuropsychological test scores using analysis of covariance, adjusting for the same set of baseline covariates used in the intention-to-treat analyses.
We conducted the analyses using the Kolmogorov-Smirnov test (Stuart and Ord 1991) to compare characteristics other than the central tendency of the distributions of the change scores in the treatment groups. When the results indicated that the change scores of children in the two groups did not come from the same distribution, we determined whether this was attributed to differences between treatment groups in the percentages of children with change scores indicating substantial deterioration of performance over the follow-up interval. Of particular interest was whether, in the absence of a treatment-group difference in mean change score, a greater percentage of children in the amalgam group than in the composite group demonstrated such deterioration.
To evaluate the impact of interexaminer variability on the results, we repeated the intention-to-treat analyses including adjustment for a set of indicator variables representing the neuropsychological examiners. Table 1 shows that the treatment groups did not differ significantly in terms of age, race, household income, education of primary caregiver, Full-Scale IQ, hair and urinary mercury concentrations, blood lead level, and number of decayed tooth surfaces. Females outnumbered males in the composite arm. Children were primarily non-Hispanic white (62%), with non-Hispanic blacks making up an additional 19% of the sample. The mean number of total decayed tooth surfaces at baseline was 9.5, with 1.7 of the surfaces being in permanent teeth. Slightly more than half of the children (54%) had five or more teeth with caries that required restoration, with the rest having two to four carious teeth. Children from the Boston/Cambridge area tended to have more caries than did children from Farmington (10.3 vs. 8.6 carious surfaces, respectively).

Results
Children continued to have dental treatment needs over the course of the 5-year follow-up period, averaging approximately one new filled surface per year. The treatment needs were similar in the treatment groups (Table 2). At the end of the follow-up period, the number of restored surfaces in place ranged from 0 to 36 in both treatment groups, and the mean values did not differ significantly (p = 0.16). The amalgam and composite groups also did not differ significantly in the cumulative number of restored surfaces over the trial (p = 0.10). In the amalgam group, 79% of the surfaces had been restored with amalgam. At the end of the 5-year follow-up period, the mean urinary mercury concentration was significantly  greater among children in the amalgam group (0.9; range, 0.1-5.7 µg/g creatinine) than among children in the composite group (0.6; range, 0.1-2.9 µg/g creatinine) (p < 0.001). Table 3 shows the change scores over the follow-up interval for each test score. Of all the change scores evaluated, only two differed significantly between the amalgam and composite treatment groups. On the Number-Letter Memory subtest of the WRAML, the 4-year change score of the amalgam group was significantly more positive than was the change score of the composite group, indicating greater improvement over time in the amalgam group. The 4-year change in the time required to complete Part B of the Trail-Making Test was significantly more negative in the composite group than in the amalgam group, indicating greater improvement over time in the composite group.
The results of analyses using surface-years of amalgam or urinary mercury concentration as the exposure metric were consistent with those of the intention-to-treat analyses, providing no evidence of a detrimental effect of amalgam on children's test scores. The coefficient for surface-years of amalgam was significant for three scores (Picture Memory and Number-Letter Memory of the WRAML and letter fluency), but for all three scores, the sign was positive, indicating that the score improved with increasing exposure to amalgam (Table 4). Urinary mercury concentration was not significantly associated with any of the test scores ( Table 5).
The results of the Kolmogorov-Smirnov tests indicated that the only two scores for which the distributions of changes scores in the amalgam and composite groups differed significantly were two subtests of the WRAML: Finger Windows and Number-Letter Memory (Table 3). On both tests, however, children in the amalgam group showed greater improvement over time than the children in the composite group, with the difference being significant for Number-Letter Memory. The distributions of change scores were not significantly different on the Trail-Making Test Part B (time to complete), the test on which the composite group showed significantly more improvement than the amalgam group.
Adjustment for neuropsychological examiner did not produce results that were appreciably different (data not shown).

Discussion
These analyses revealed an absence of consistent differences between the scores of children in the amalgam and composite treatment groups on a battery of neuropsychological tests that assessed a wide range of domains, including intelligence, achievement, language, memory, learning, visual-spatial skills, verbal fluency, fine motor function, problem solving, attention, and executive function. The findings were similar when the dichotomous variable treatment-group assignment was replaced by two continuously distributed indices of exposure, one that combined the amount and duration of amalgam a child received and one that was a biomarker, urinary mercury concentration. Furthermore, no evidence was found to support the hypothesis that a subset of children in the amalgam group suffered substantial harm. The number of significant differences observed was similar to that which might have been expected to occur by chance.
Although neuropsychological deficits associated with amalgam exposure have been reported in several studies of dental professionals Echeverria et al. 1995Echeverria et al. , 1998Ngim et al. 1992) and others exposed occupationally to mercury (Rohling and Demakis 2006), our findings are similar to those involving mercury exposure in cohorts drawn from the general population of adults and children (Brownawell et al. 2005). In a cross-sectional study of 550 30-to 49year-old healthy employed adults, scores on tests of verbal memory, nonverbal memory, attention, psychomotor speed, and fine motor coordination were not significantly associated with any of several exposure indices considered (number of visible amalgam surfaces, number of visible occlusal amalgam surfaces, urinary mercury concentration) (Factor-Litvak et al. 2003). The mean urinary mercury concentration in that cohort of adults, 1.7 µg/g creatinine, was higher than the mean concentration of 0.9 µg/g creatinine among the children in the amalgam treatment group in our trial 5 years after placement of their first amalgam restorations. In a study of 1,663 Vietnam-era veterans, the total number of tooth surfaces with amalgam fillings was unrelated to clinical neurological signs (e.g., tremor, coordination, station, gait, strength, sensation, muscle stretch reflexes, or indices of peripheral neuropathy), although it was associated with vibrotacile sensation in nondiabetic participants (Kingman et al. 2005).
In a study of 384 German 6-year-olds, 24-hr urinary excretion of mercury, which averaged 0.16 µg, was not significantly related to scores on a variety of tests, including the Vocabulary and Block Design subtests of the WISC and five tests of the computerized Neurobehavioral Evaluation System 2 (pattern comparison, pattern memory, tapping, simple reaction time, continuous performance test) (Walkowiak et al. 1998). In this cohort, some indices of visual contrast sensitivity did decline with increasing urinary mercury excretion, however (Altman et al. 1998).
Over the course of the follow-up interval, the scores of children in both treatment groups tended to change in the direction of improved performance, even on tests for which scores are standardized for age. Several factors might have contributed to improved performance over time. First, this could represent a type of sampling bias, reflecting the characteristics of families who are motivated to enroll in such a trial and to participate for its full duration. Second, all tests except the WISC-III and WIAT were administered yearly, so the general improvement in scores might reflect the familiarity that children developed with the test materials and expectations. Particularly large improvements tended to be on performance-based tests, such as the WRAVMA pegboard and the Processing Speed composite of the WISC-III, one component of which is Symbol Search, a timed task that involves matching symbols and digits. A substantial improvement was also noted on the WRAML Learning Index, which reflects the rapidity with which a child learns new material, such as sound-symbol pairs, a word list, and the locations of hidden designs. Repeated administration of these tasks, even at yearly intervals, might be expected to result in an increased rate of acquisition of the material.
As noted, the dental treatment needs of the children enrolled in the trial were substantial and exceeded those typical of the general population of U.S. children. For example, among 6-to 11-year-old children who participated in the National Health and Nutrition Examination Survey (NHANES) 1999-2002, the prevalence of dental caries in primary teeth was 22%, and the mean numbers of decayed or filled primary teeth and surfaces were 1.7 and 3.7, respectively (Beltran-Aguilar et al. 2005). The prevalence of dental caries in permanent teeth was 20% (mean number of decayed, missing, and filled  Table 4. Associations between neuropsychological test scores and surface-years of exposure to amalgam restorations [adjusted coefficient ± SE (n)].
a Adjusted for baseline score, randomization stratum, baseline age, sex, baseline family socioeconomic status, baseline hair mercury concentration, and baseline blood lead concentration. b Coefficient represents the change in test score for each unit increase in surface-years of amalgam. c Standard score.  et al. 2005). Therefore, children assigned to the amalgam group in the NECAT are likely to have experienced greater exposures to mercury vapor from amalgams than do most children in the United States. Given our failure to detect significant differences between the amalgam and composite groups in neuropsychological function, these results provide reassurance that the use of dental amalgam to repair caries is not producing substantial neuropsychological morbidity in the general population of children in the United States. The conclusions must be tempered, first, by a recognition that the follow-up interval of 4-5 years might have been too short to allow for the expression of such deficits. Second, the critical window of children's greatest vulnerability to elemental mercury might already have passed by the time the children were enrolled in the trial (≥ 6 years of age). Given the heightened sensitivity of the fetus to methylmercury, prenatal exposure to mercury vapor, which is known to cross the placenta, warrants increased attention. In the NHANES 1999-2000 survey, among women of child-bearing age, an increase of 10 dental surfaces restored with amalgam was associated with an estimated increase of 1.8 µg/L in urinary mercury concentration (Dye et al. 2005). Mercury level in amniotic fluid is weakly associated with number of amalgam fillings (Luglie et al. 2005). A recent case-control study did not, however, find an increased risk of delivering a low-birth-weight infant among women who had up to 11 amalgam restorations placed during pregnancy (Hujoel et al. 2005). The results of studies of the reproductive outcomes of women with dental workplace exposures have been mixed (Dahl et al. 1999;Elghany et al. 1997;Ericson and Kallen 1989). In some of these studies, distinguishing the potential impact of mercury exposure from the impacts of other workplace exposures, such as to disinfectants containing ethanol and benzene, is difficult. Third, the prevalence of children with enhanced sensitivity to elemental mercury might be too low among the children enrolled in the NECAT for us to have been able to detect their effects on the distribution of responses. Fourth, children with preexisting neuropsychological or behavioral disorders were not eligible for enrollment. Our findings therefore do not provide any information about the possibility that amalgam-related exposure to mercury vapor might exacerbate such disorders. Nevertheless, our results indicate that even among children with substantial dental needs, an increased risk of neuropsychological deficits could not be detected among children whose dental restorations contained elemental mercury.

Amalgam and neuropsychological function
Environmental Health Perspectives • VOLUME 115 | NUMBER 3 | March 2007 445 Table 5. Associations between neuropsychological test scores and urinary mercury concentration [adjusted coefficient ± SE (n)].
a Adjusted for baseline score, randomization stratum, baseline age, sex, baseline family socioeconomic status, baseline hair mercury concentration, and baseline blood lead concentration. b Coefficient represents the change in test score for each unit increase in urinary mercury concentration. c Standard score.