Introduction

Tropical forest tree species tend to exhibit spatially clumped distributions (Hubbell, 1979; Hubbell and Foster, 1983; Condit et al, 2000), while having low numbers of conspecific individuals per unit area. Spatially limited seed dispersal contributes to the patterns of clumping. However, in comparisons between tropical and temperate zone trees, it is unclear whether tropical trees experience greater dispersal limitation due to their preponderant reliance on animals for both pollination and seed dispersal (Bawa, 1990). Moreover, it is difficult to predict patterns of gene flow in tropical trees because of the unpredictable effect of animal behavior on gene movement (Nason and Hamrick, 1997; Hamrick and Loveless, 1986). Models of gene flow for tropical forest trees require either (1) direct observation of pollinator movements or seed dispersal agents (Webb and Bawa, 1983; Howe, 1986; Schupp, 1993; Boshier et al, 1995); (2) indirect estimates based on Fst-based statistics or spatial autocorrelation (Hamrick et al, 1993; Hamilton and Miller, 2002; Dick et al, 2003; Vekemans and Hardy, 2004); or (3) use of genetic markers to reconstruct seed and pollen movement through parentage analyses (Aldrich and Hamrick, 1998; Ouborg et al, 1999; Jordano and Godoy, 2002; Sezen et al, 2005). Here we contribute to an emerging body of data on spatial genetic structure in tropical trees through an analysis of spatial genetic structure of the tropical tree Simarouba amara within the 50-ha Forest Dynamics Plot (FDP) on Barro Colorado Island (BCI), Panama where we have two decades of demographic data based on periodic censuses of all S. amara 1 cm diameter at breast height (dbh).

The specific objectives of our study were: (1) to describe the genetic diversity of a mapped population of S. amara within a lowland Neotropical forest using microsatellite and AFLP data; (2) to evaluate the genetic structure of three size classes of individuals (>10, 1–10, and <1 cm dbh) at the 50 ha scale; and (3) to assess the level of heterozygosity across those three size classes. If seed dispersal and pollen movement is limited, our prediction is increased genetic relatedness among individuals in close proximity. However, if effective seed dispersal is widespread relative to the distance between reproductive individuals, or if there is extensive overlap of seed shadows, the result should be a lack of detected spatial genetic structure.

Materials and methods

Study species

S. amara Aubl. (Simaroubaceae) is a widespread Neotropical tree species found in lowland moist forests from upper Mesoamerica (Honduras, Nicaragua) across the lower rim of the Amazon basin (Bolivia and Peru) eastward to French Guiana. The wood of S. amara is used locally for paper, furniture building, interior construction, plywood, and matches (Rodriguez von Platen, 2000). S. amara is dioecious, insect pollinated, and dispersed primarily by vertebrates (birds and large mammals). It grows to 35 m in height with a maximum reported diameter at breast height (dbh) on BCI of 70 cm (Croat, 1978). S. amara requires intermediate to high light environments, and on BCI typically flowers from the end of January through April. The unisexual flowers of S. amara occur in terminal panicles comprised of numerous, small (<1 cm long), pale yellowish flowers. Data from 15 years reporting all flowering and fruiting parts arriving in seed traps suggest that there is overlap in the phenology of S. amara adults (SJ Wright, unpublished data). Flowering persists for approximately 11–15 weeks annually within the 50 ha plot. The floral syndrome of S. amara is typical of pollination by generalist small insects such as small bees and moths (Bawa, 1990), which have been observed visiting the flowers of this species (BDH pers. obs.).

Ripe fruits are produced within 1–3 months after pollination. The fruits are large-seeded and fleshy, approximately 17 mm in length, with a seed of 10–14 mm, and are presented in clusters of 3–5 drupes (Croat, 1978). Seeds of S. amara do not experience dormancy. Primates and birds are the primary dispersers of S. amara. The brightly colored green to purplish-black fruits attract chacalacas, flycatchers, motmots, and thrushes (Duke as cited in Croat, 1978), as well as howler and spider monkeys (Hladik and Hladik, 1969) and tamarins (BDH pers. obs). Leaf-cutter ants have been observed to carry the seeds, and dense seedling carpets of S. amara have been noted only at the two leaf-cutter ant dump sites in the 50 ha FDP (BDH pers. obs).

The year 2000 BCI plot census contained 1230 S. amara individuals 1 cm. The FDP contains 126 individuals 20 cm dbh, which represents the minimum reproductive size, although most individuals do not reproduce below 30 cm dbh (N=72 individuals 30 cm dbh). A total of 48 female trees have been reproductive during the past 3 years according to our observations. Trees flower and fruit annually, though not every reproductive female produces fruit each year (Hardesty, unpubl. data). The mean dispersal distance for S. amara seeds is 39 m based on maximum likelihood best-fit estimates (‘inverse modeling’) of arrival in seed traps using known distances from reproductive-sized adults within the FDP (Muller-Landau, 2001). Current inverse models do not take tree gender into consideration, however, and likely underestimate seed dispersal distances.

Study site

This research was carried out in the 50 ha permanent FDP within the Barro Colorado National Monument (9°10′N, 79°51′W), Republic of Panama. The FDP is located on a plateau near the center of BCI, an island formed by the flooding of Lake Gatun following the construction of the Panama Canal in the early 1900s. The lowland moist tropical forest on BCI receives approximately 2600 mm of rainfall annually, and the island has a pronounced dry season between December and April (Leigh, 1982). Approximately every 5 years beginning in 1982, all freestanding woody plants within the FDP with 1 cm diameter breast height (dbh) are identified, mapped, and measured to assess growth, mortality, and recruitment of new individuals.

Sampling strategy

Using the FDP inventory data, we randomly selected 200 S. amara individuals from two size classes: (1) 1–10 cm dbh (N=100), and (2) >10 cm dbh (N=100) (Figure 1). This was carried out by assigning a random number to each tagged individual, then selecting the top 100 numbers in each category after sorting them in ascending order. In order to assess fine level spatial structure below the 1 cm dbh class, we performed an additional sample of 100 ‘seedlings’ that were randomly selected from approximately 500 individuals with <1 cm dbh that had been collected in a survey across nearly 40 ha of the FDP. Tissue was collected only from healthy plants in which there was sufficient leaf material such that removal of tissue would be unlikely to cause mortality. In patches of high density (>1 seedling/m2), tissue from every fourth plant was collected, whereas in lower density areas, tissue was collected from each small plant encountered.

Figure 1
figure 1

The 50 ha permanent Forest Dynamics Plot on Barro Colorado Island, with 300 S. amara randomly chosen for this study. Locations of 100 individuals of <1 cm dbh are represented with triangles, 100 individuals with a dbh of 1–10 cm are represented with circles, and 100 individuals with a dbh >10 cm are depicted with squares.

Leaf collection and DNA extraction

Leaves were collected directly from branches or by using a sling-shot to drop leaves from the canopy of tall trees. The fresh tissue was placed in plastic bags in a cooler with ice while in the field. In the laboratory, samples were either refrigerated and the DNA extracted within 2 days, or they were put into labeled vials and flash-frozen in liquid nitrogen. DNA extractions using 40–50 mg of the fresh tissue were performed using DNEasy™ kits (Qiagen Corporation).

Microsatellite and AFLP genotyping

Microsatellite protocol

We tested 11 of the 18 published microsatellite primers for S. amara (Rodriguez von Platen et al, 2000), but poor or unreliable PCR amplification promoted use of the following five dye-labeled primer pairs: SA02 (TET), SA05 (FAM), SA06 (FAM), SA27 (HEX), and SA29 (FAM) (Rodriguez von Platen et al, 2000). The PCR (10.0 μl total) contained 250 μm of dNTPs, 25 μM MgCl2, 0.0625 μl of Taq polymerase (Qiagen Corporation), and 0.5 μl of the forward and reverse primer (10 μM stock). All PCRs were performed on an MJ Research PTC-200 thermal cycler, using the following protocol: 5 min at 94°C; 25 cycles of 45 s at 94°C, 1 min at 53°C, and 30 s at 72°C; ending with 15 min at 72°C. Amplification products were electrophoresed using Rox 400 size standard (Applied Biosystems Incorporated, ABI) on an MJ Research Base Station Automated DNA Analyzer. Forward primers were labeled with the same fluorescent dyes (HEX, FAM, or TET) on loci with nonoverlapping size ranges to permit loading of all five loci in a single lane.

AFLP protocol

For amplified fragment length polymorphisms, we used a protocol modified from Gadeul et al (2000): digestion and ligation (11.0 μl total volume): 1.1 μl of T4 Buffer 10 × ; 1.1 μl of 0.5 M NaCl; 0.5 μl BSA; 1 mg/ml MseI (1 U) 0.1 μl; EcoRI (5 U) 0.25 μl; T4-Ligase (1.2 U) 0.2 μl; MseI Adaptors 1.0 μl; EcoRI Adaptors 1.0 μl; ddH2O 0.2 μl. In total, 5.5 μl of 10 mg/μl DNA was added and the reaction was incubated for 2 h at 37°C.

Preselective PCR (total volume 25 μl)

We used 14.9 μl ddH2O, 2.5 μl 10 × buffer; 1.5 μl MgCl2; 2.0 μl of 25 μM dNTPs; 0.5 μl of 10 μM EcoRI (E/A); 0.5 μl of 10 μM MseI (M/C); 0.1 μl Qiagen Taq; and 3.0 μl of digested and diluted (1:10) DNA. The PCR was performed for 2 min at 72°C; 25 cycles of 30 s at 94°C, 1 min at 56°C, and 30 s at 72°C; ending with 10 min at 72°C.

Selective PCR (12.5 μl reaction volume) required

1.25 μl of 15 mM MgCl2 10 × Buffer; 0.5 μl 25 mM MgCl2; 1.0 μl of 25 mM dNTPs; 10 μM 0.05 μl EcoRI (E-AC or -ATG); 0.25 μl MseI (M-CTG or -CTT); 0.1 μl Hotstart Taq; 0.1 μl of 1 mg/ml BSA; 6.75 μl ddH2O; 2.5 μl preselective (20 × dilution) DNA. The PCR was performed for 15 min at 95°C; 13 cycles of 30 s at 94°C, 1 min at 65–55.9°C, 1 min at 72°C (ΔT=−0.7°C/cycle); 23 cycles of 30 s at 94°C, 30 s at 56°C, 1 min at 72°C; ending with 15 min at 72°C. Two primer pairs were used for AFLP analyses: Mse1+AC/EcoR1+/CTG and Mse1+ATG/EcoR1+CTT. Amplification products were electrophoresed and analyzed using the same equipment and software described for microsatellites.

Data analyses

Control individuals were run on different gels to insure consistency between gel runs and to evaluate possible base pair shifts that might result in allele assignment errors for both microsatellite and AFLP data. For the microsatellite and AFLP loci, we had <1 and 3% error rates, respectively, in allele assignment for the 200 individuals genotyped using both techniques, comparing individuals run on multiple gels at all loci. Each of the five microsatellite loci was scored using the program Cartographer (MJ Research), utilizing locus binning rules to avoid scoring stutter bands or erroneous alleles. Cervus assignment software (v. 2.0; Marshall et al, 1998) was used to calculate the number of effective alleles, observed and expected heterozygosity, frequency of null alleles, and exclusion probabilities at each locus with knowledge of one parent. GENEPOP (v. 3.3; Raymond and Rousset, 1995) was used to calculate Fis according to Weir and Cockerham (1984), and departure from Hardy–Weinberg equilibrium was tested using Haldane's exact test (1954), with exact P-values estimated using the Markov chain method. Alleles were permuted among individuals within cohorts. We used 1000 random permutation, and jackknifing was performed over loci to estimate standard errors. We used one-way ANOVA and Student's t-test to compare allelic richness among the size classes, and z-test to compare Ho and He among cohorts.

AFLP bands were scored using Cartographer. After scoring, data were transformed to a 1–0 presence/absence matrix using the program CoTrix (Degen, 1999). We also hand-scored all bands. To ensure consistency, bands with less than 10% of the average peak height were not included in analyses. In CoTrix, the fragment data are first sorted by length. The user then determines bounds for each fragment length category using a measure of ‘coarseness’, whereby fragments differ in length by the number of base pairs selected by the programmer. If coarseness is set too low, the possibility of assigning different lengths to identical fragments increases due to variation in electrophoretic mobility across gels, whereas setting coarseness too high may result in the assignment of fragments of different length to the same band class (Degen, 1999). We set the band coarseness at 1.4 base pair differences possibly erring on the conservative side of false locus assignment, and we excluded individuals with fewer than 30 bands, under the assumption that these samples had amplified poorly.

After generating a presence–absence matrix for AFLP bands or tabulating microsatellite alleles, data were analyzed for the existence of spatial autocorrelation using SGS (Degen et al, 2001b). In SGS, the significance values are verified using permutation tests (n=1000) based on Monte-Carlo simulation to evaluate deviation from a spatially random distribution of each calculated measure (Manly, 1997), against the null hypothesis of no spatial structure. Simulations compared observed values with the distribution obtained after permutations for each distance class using an individual's AFLP or microsatellite profile. SGS uses the Euclidean distance between pairs of data points. Distograms presenting the Tanimoto distance for AFLP's (Gregorius, 1978), and correlograms using Moran's Index of spatial autocorrelation for microsatellite data (Sokal and Wartenberg, 1983) between all pairs of individuals within an assigned spatial distance class, are plotted against the distance class. At least 30 pairs of individuals must be present at each distance class to ensure statistical robustness (Legendre and Fortin, 1989). At small distance intervals (eg <20 m), fewer than 30 pairs of individuals occur. Thus, our study focuses on 20 m distance classes. Due to insufficient number of <1 cm dbh pairs of individuals at some of the further distance classes, we restricted the seedling analyses to distances 640 m. We used the program SGS (Degen et al, 2001b) to calculate an aggregation index based on Clark and Evans's index (R) (Ripley, 1981) for each size class to determine the pattern of spatial distribution of S. amara individuals. Values of R<1 indicate an aggregated distribution, R=1 signifies random distribution, while R>1 is associated with regularly spaced individuals.

Results

The five polymorphic microsatellite loci ranged in observed heterozygosity between 0.12 and 0.75 (He range=0.33–0.72) (Table 1). Of the 50 alleles scored, eight (16.7%) were rare, occurring in less than 1% of the individuals genotyped. In total, 12 alleles were ‘private alleles, occurring in only one cohort, with 5, 4, and 3 alleles occurring solely at the seedling, 1–10 and >10 cm dbh size classes, respectively. The presence of null alleles at locus SA06 is considered probable based on estimates using Cervus (Table 1). The mean numbers of alleles per cohort ranged from 7.6 to 8.2 and were not significantly different (ANOVA, P>0.05). Mean observed and expected heterozygosities (Ho and He) ranged from 0.502 to 0.524 (Ho) and 0.556 to 0.564 (He) (Table 1) and did not differ among cohorts (z-test, P>0.05).

Table 1 Summary table for profiles of microsatellite loci of S. amara on BCI, with number of alleles (k), number of effective alleles (Ae), observed heterozygosity (Ho), expected heterozygosity (He), and null allele frequency (Null) calculated by the program Cervus (Marshall et al, 1998).

Significant deviation from Hardy–Weinberg equilibrium was detected at multiple loci in each of the three cohorts, though it was most frequent at the smallest size class, occurring at four of the five loci (Table 1). The fixation index (Fis) was most pronounced at SA06 in all cohorts, perhaps because of putative null alleles at that locus. While there was a significant heterozygote deficit for each size category (Table 1), there was no significant difference in the magnitude of the multi-locus Fis among size categories.

The mean number of AFLP loci per primer per individual was 43.9 (range=26–71 loci) excluding individuals that failed to amplify. In total, we analyzed 155 AFLP loci for 190 individuals. No genetic structure was detected at short distances for the pooled population of S. amara 1–10 and >10 cm dbh genotyped using AFLP's (not shown), though using microsatellites, structure was evident to 20 m.

When size categories were analyzed separately using Moran's Index, spatial autocorrelation was significant for the >10 cm and 1–10 cm dbh size classes from 0 to 40 and 0 to 20 m, respectively (Figure 2a and b). AFLP analysis for the same classes failed to detect significant spatial autocorrelation at the same distances (Figure 3a and b). The 100 seedlings <1 cm dbh exhibited significant genetic structure at distance classes of 0–40, 80–100, and 120–140 m using microsatellites. We observed a negative autocorrelation at distances between 480 and 540 m (Figure 2c).

Figure 2
figure 2

Correlogram of microsatellite data using Moran's Index to depict genetic spatial autocorrelation for 100 randomly selected S. amara in the 50 ha FDP representing three size categories at 20 m distance classes: >10 cm dbh (a), 1–10 cm dbh (b) and <1 cm dbh (c). The reference lines around the point estimates represent 95% confidence intervals.

Figure 3
figure 3

Distogram of AFLP data using Tanimoto Distance to spatial autocorrelation for S. amara individuals in two size categories at 20 m distance classes within the 50 ha FDP: >10 cm dbh (a) and 1–10 cm dbh (b). The reference lines around the point estimates represent 95% confidence intervals.

Aggregation index values of 0.988, 0.982, and 0.635 for the >10, 1–10, and <1 cm dbh cohorts, respectively, indicate that only at the seedling cohort are individuals more aggregated than would be expected at random. The mean distance to the nearest individual for the >10, 1–10, and the <1 cm dbh cohorts is 34.1, 34.1, and 21.3 m, respectively. Combining the largest two size classes reduces the mean distance between nearest individuals to 23.5 m, though spatial aggregation remains nonsignificant.

Discussion

Theoretical models predict the existence of local spatial genetic structure under locally restricted gene flow (Wright, 1943, 1978), and an absence of spatial structure when gene flow via seeds is extensive. Where seed dispersal is limited, we expect to observe increased relatedness at decreasing spatial scales, particularly in plant populations with low densities of reproductive adults (Loiselle et al, 1995; Peakall and Beattie, 1995; Schnabel et al, 1998), as is generally found in tropical rain forest trees (Hubbell and Foster, 1983). Even with random mating, patches of highly related individuals (half siblings) will result in local spatial genetic structure (Vekemans and Hardy, 2004). Widespread seed dispersal and large neighborhood size or overlapping seed shadows can erase the spatial genetic structure, however, even if pollen flow is restricted (Hamrick and Loveless, 1986; Kalisz et al, 2001).

In comparing AFLP and microsatellite utility in this study, the two marker systems failed to yield identical results at each distance category. In fact, the AFLP analyses failed to find significant spatial autocorrelation, though it was detected using microsatellites in all three cohorts. There has been some difference of opinion regarding the relative usefulness of AFLPs and microsatellites (summarized in Campbell et al, 2003). We found that microsatellite markers demonstrated significant results in some instances where AFLP marker results were not significant, in spite of using a sizeable number of AFLP bands for analysis. This difference may be due to the fact that microsatellite markers are codominant and thus are more informative in this study than were the dominant AFLP markers. In further discussion we refer specifically to microsatellite findings.

We suggest that localized seed fall from related individuals accounts for the spatial genetic structure detected at the shortest distance categories for S. amara. The genetic structure pattern we observed at longer distance classes (between 480 and 600 m) for the <1 cm dbh category results from nearby individuals being more closely related than randomly selected individuals in the population. At further distances, other ‘families’ of trees are encountered, resulting in negative autocorrelation, as fewer alleles are shared than expected by chance. Perhaps, the lack of relatedness we detected at longer distance classes reflects long-distance dispersal events, which have resulted in the recruitment and persistence of individuals unrelated to their neighbors.

The spatial scale of genetic autocorrelation declines from the <1 cm dbh size class to the 1–10 and >10 cm dbh classes, suggesting that demographic thinning may be occurring between seed deposition, seedling establishment, and recruitment to later life history stages. The mean minimum distance between seedlings is less than that of older individuals: 21.3 m (range=0.9–1003.5 m), compared to 34.1 m (range=3.3–1006.1 m) and 34.1 m (range=1.3–1012.8 m) for 1–10 and >10 cm dbh trees, respectively. Also, since our relatedness values for the larger two size classes does not exceed 1.2 and for <1 cm dbh individuals relatedness is even less, it seems likely that some overlapping of seed shadows is present, even at the short distances.

Data from the FDP censuses intervals provide further evidence of demographic thinning in S. amara. Only 438 of the 1239 stems from the 1982 census were alive in the 2000 census, with the highest mortality occurring at the smallest (1–2 cm dbh) size classes. For example, between the 1995 and 2000 censuses, 137 1–2 cm dbh individuals (37%) died, whereas 62 (16%) of the 371 trees that died were 10 cm dbh. Such thinning may contribute to the decay of spatial genetic structure through the culling of half-sibs in aggregations, or selection against inbred individuals. Selection against homozygotes is suggested by the slight decrease in Fis from the <1 cm dbh individuals to the larger size classes, a pattern that has been reported for other tropical trees (eg Alvarez-Buylla et al, 1996; Hufford and Hamrick, 2003). We found that overall levels of heterozygosity at all five loci were remarkably consistent among the three cohorts.

Muller-Landau (2001) estimated a mean dispersal distance of 39 m for S. amara from seed trap data, only slightly less than the mean minimum distance, 43.6 m, between any two of the 48 female trees that have been reproductive during the last 3 years. Nonetheless, the generally low density of >20 cm dbh S. amara on the FDP (2.4 trees per ha−1) suggests that overlapping seed shadows are unlikely to be sufficiently pronounced to result in a mixture of different single tree progenies that might obscure weak spatial structure, and our finding of genetic autocorrelation at short distances in all age classes support this. With an estimated mean dispersal distance of nearly 40 m, one would expect to observe at least some weak spatial genetic structure in S. amara at distances less than 100 m if the seed dispersal models utilized by Muller-Landau (2001) are accurate. We do observe structure at shorter distance classes for all three cohorts, providing support for these models. Current modeling techniques, however, likely underestimate seed dispersal distances because they assume seeds in traps arrive from the nearest adult tree. Models also fail to adequately account for long-distance seed input that may play a substantial role in spatial genetic processes (Oddou-Muratorio et al, 2001), as well as individual variability in fecundity and survivorship (Godoy and Jordano, 2001). Perhaps most importantly for the present study, the current models fail to incorporate dioecy and thus underestimate seed dispersal because any reproductive size adult is considered a potential seed donor.

Our study contributes to a growing database on patterns of spatial genetic structure at the stand level (1 ha to several kilometers) for vertebrate-dispersed tropical tree species (eg Degen et al, 2001a; Dutech et al, 2002). Our experimental design is comparable to that of Degen et al (2001a), who analyzed spatial genetic structure using RAPD markers for eight canopy tree species in French Guiana using 150 m distance categories. They used the same randomized sampling strategy, but only considered trees >10 cm dbh. Four of the animal dispersed species in their study showed no evidence of spatial genetic structure, although the other four species did exhibit significant spatial autocorrelations. Weak spatial genetic structure for both adults and seedlings was also observed within a large population of the vertebrate-dispersed tree, Carapa procera, in French Guiana (Doligez and Joly, 1997), though data were analyzed at the scale of hundreds of meter distance classes. Fine scale genetic structure, even at the shortest (13 m) distance class, was described as ‘weak at most’ by Chung et al (2000) in all five age classes of a bird-dispersed dioecious tree in southern Korea. Their focal area, however, was restricted to a very small (60 × 100 m2) plot.

Hamrick et al (1993) found that two wind-dispersed species exhibited spatial genetic structure to 30 m (Alseis blackiana) and 100 m (Platypodium elegans) on BCI, though only in the <1.0 m tall and 1–10 cm ‘sapling’ size classes; structure was not evident among the >10 cm dbh cohort. Similarly, in a recent study by Latouche-Hallé et al (2003), significant positive spatial correlation was detected for the wind-dispersed tree Dicorynia guianensis for saplings and adults from 50 to ca. 125 m and a negative association was observed for adults from 200 to 300 m, analyzed at 50 m distance intervals.

The variety of seed and pollen dispersal syndromes among tropical trees and the effectiveness of different dispersal vectors make broad-scale generalizations difficult, particularly in highly diverse tropical ecosystems. However, molecular approaches provide useful tools for assessing both seed arrival and effective dispersal in ways that traditional ecological studies cannot. Additional studies at multiple spatial and temporal scales will help improve our knowledge of the mechanisms and impacts of dispersal at both local and large scales.