While longfin smelt, Spirinchus thaleichthys, remain among one of the more common pelagic native species in the San Francisco Bay-Delta, CA, a recent significant population decline at the southern edge of its distribution has been observed (Sommer et al. 2007). This species was recently listed as Threatened under the California Endangered Species Act (CDFG 2009) although a petition for listing under the U.S. Endangered Species Act remains undetermined. Longfin smelt show a diversity of migratory patterns and anadromous and resident populations are native to the Pacific coast of North America. These fish are semelparous, typically undergo a two-year life cycle and spawn in freshwater and the low salinity zone before larval and juvenile fishes are transported downstream to nursery habitats (Hobbs et al. accepted). Juveniles move out of the San Francisco Bay into nearshore coastal waters, while others seem to reside in the Bay-Delta (Moyle 2002). Little is known about San Francisco longfin smelt spawning habitats and ecology, although adults presumably migrate back to the low salinity zone immediately prior to spawning (Rosenfield and Baxter 2007). A better-studied resident population inhabits Lake Washington, WA (Chigbu 2000; Chigbu and Sibley 1994; Chigbu et al. 1998). While longfin smelt are reported to be widespread in nearshore coastal regions and estuaries, few fish have been observed and fewer spawning populations have been identified between San Francisco Bay and Lake Washington (USFWS 1994, CDFG 2009).

Whole genomic DNA was extracted from fin tissue of longfin smelt from Suisun March, CA using the PureGene Genomic DNA Purification Kit. Six libraries enriched for one of four tetranucleotide repeat motifs [AAAC, CAGA, CATC, TAGA] were constructed, screened, and sequenced by Genetic Identification Services according to Börk et al. (2008). Sequences were analyzed with SEQUENCHER version 4.7 (Gene Code Corporation) to remove duplicate sequences. Repeat regions in sequences were identified with MREPS version 2.5 (Kolpakov et al. 2003) and primer pairs flanking these regions were created with PRIMER 3 (Rozen and Skaletsky 2000) for 200 of 227 novel sequences. For screening, one of four 5′ modifications (universal primers: T7T, T7P, M13, SP6) was added to the end of each forward primer using the method of Schuelke (2000), which incorporated a 5′ fluorophore during polymerase chain reaction (PCR). Each primer set was initially tested on eight longfin smelt samples to determine microsatellite amplification and polymorphism. Based on allelic diversity and clean scoring, nineteen primer sets were selected for amplification in a total of 12–15 individuals per collection from the San Francisco Delta, CA and Lake Washington, WA.

Polymerase chain reaction was performed with the following conditions: 20 ng DNA template, 1× GoTaq® Flexi Buffer, 1.875 mM MgCl2, 0.2 mM of each dTNP, 0.1 μM of fluorescently labeled primer and reverse primer, 30 nM tailed forward primer, and 0.4U of Taq DNA polymerase (all reagents from Promega), for a total reaction volume of 10 ul. PCR was performed using a Bio-Rad DNA engine Dyad thermal cycler under the following conditions: 95°C for 5 min; 5 cycles at 95°C for 45 s, 68°C for 5 min with 2°C decrease at each subsequent cycle, followed by 72°C for 1 min; 5 cycles at 95°C for 45 s, 58°C for 5 min with 2°C decrease at each subsequent cycle, followed by 72°C for 1 min; 30 cycles at 94°C for 45 s,°C for 2 min, followed by 72°C for 1 min; final extension at 72°C for 10 min; and held at 8°C. One microliter of fluorescently-labeled amplified product was run undiluted on an ABI 3130xl Genetic Analyzer with a LIZ600 size standard (ABI). GENEMAPPER version 4.0 (ABI) was used to analyze the electropherograms. All analyses were computed with the GENALEX software (Peakall and Smouse 2006) were estimated. This program calculated number of alleles, observed (H o) and expected (H E) heterozygosities, deviations from Hardy–Weinberg equilibrium (HWE), a pairwise F st value, and genetic distance was calculated for Principal Component Analysis (PCA) to assess differentiation between collections.

Seventeen of the nineteen primers optimized in longfin smelt were consistently scoreable (Table 1). The number of alleles per locus ranged from 5 to 16 with an average of 13.8 alleles per locus, and the observed and expected heterosygosities ranged between 0.267–0.929 and 0.331–0.926, respectively. Seven loci deviated significantly from Hardy–Weinberg expectations. A pairwise F st value of 0.091 was calculated between the two collections, and visualization of genetic distances using Principal Component Analysis supports moderate levels of differentiation between these two collection sites (Fig. 1). Genetic analysis of these collections suggested genetic isolation between locations, which are separated by over 800 miles of coast. These results are concordant with a previous allozyme study (Stanley et al. 1995), and support the distinctiveness and genetic isolation of the longfin smelt populations in these locations. These genetic markers will be useful for further evaluation of spatial and temporal genetic variation in S. thaleichthys and characterizing relationships among individuals in populations.

Table 1 Characterization of microsatellite loci: locus, motif, accession number, primer sequence, tail sequence, annealing temperature (Ta), sample size (n), number of alleles (NA), size range of observed allelic variation, H O, and H E . Significant deviations from Hardy–Weinberg expectations are in bold
Fig. 1
figure 1

Principal Component Analysis of two collections of longfin smelt (n = 15 each). Axis 1 accounts for 33.1% of the genetic covariance, while Axis 2 explains an additional 16.2% of the total variation