PRECISION ASTROMETRY WITH ADAPTIVE OPTICS

, , and

Published 2008 December 10 © 2009. The American Astronomical Society. All rights reserved.
, , Citation P. B. Cameron et al 2009 AJ 137 83 DOI 10.1088/0004-6256/137/1/83

1538-3881/137/1/83

ABSTRACT

We investigate the limits of ground-based astrometry with adaptive optics using the core of the Galactic globular cluster M5. Adaptive optics systems provide near diffraction-limit imaging with the world's largest telescopes. The substantial improvement in both resolution and signal-to-noise ratio enables high-precision astrometry from the ground. We describe the dominant systematic errors that typically limit ground-based differential astrometry, and enumerate observational considerations for mitigating their effects. After implementing these measures, we find that the dominant limitation on astrometric performance in this experiment is caused by tilt anisoplanatism. We then present an optimal estimation technique for measuring the position of one star relative to a grid of reference stars in the face of this correlated random noise source. Our methodology has the advantage of reducing the astrometric errors to ${\sim}1/\sqrt{t}$ and faster than the square root of the number of reference stars, effectively eliminating noise caused by atmospheric tilt to the point that astrometric performance is limited by centering accuracy. Using 50 reference stars, we demonstrate a single-epoch astrometric precision of ≈1 mas in 1 s, decreasing to ≲100 μas in 2 minutes of integration time at the Hale 200 inch telescope. We also show that our astrometry is accurate to ≲100 μas for observations separated by 2 months. Finally, we discuss the limits and potential of differential astrometry with current and next-generation large-aperture telescopes. At this level of accuracy, numerous astrometric applications become accessible, including planet detection, astrometric microlensing signatures, and kinematics of distant Galactic stellar populations.

Export citation and abstract BibTeX RIS

1. INTRODUCTION

The benefits of astrometry have long been clear to astronomers. Measurements of parallax and proper motion yield model-independent determinations of fundamental quantities like distance and velocity. It is not surprising that astrometry has motivated a wide variety of observational programs using many different techniques to answer fundamental questions in astrophysics. Potential applications span a wide range of physical scales including planet detection, reconstruction of the Milky Way's formation, and tests of Λ cold dark matter (CDM) cosmology (e.g., Unwin et al. 2008).

The most ubiquitous astrometric measurements have been carried out with ground-based telescopes in the seeing limit. Monet et al. (1992) conducted visible light measurements of 72 stars (V = 15–20) using the 1.55 m US Naval Observatory astrometric reflector. This program achieved a single-epoch measurement precision of ≈4 milliarcseconds (mas), and parallax accuracies ranging from 0.5 to 3 mas over ∼ 5 yr baselines. Pravdo & Shaklan (1996) performed visible light measurements of stars in the cluster NGC 2420 (V = 13–16) and achieved a single-epoch precision of ≈150 μas in 1 hr, which motivated an astrometric survey for low-mass companions to nearby stars (e.g., Pravdo et al. 2004). More recently, 200–300 μas astrometric precision has been demonstrated with the Very Large Telescope (VLT)/FORS in the visible (Lazorenko 2006; Lazorenko et al. 2007). Each of the above programs employed relatively narrow-field visible imagers (a few square arcminutes) to perform differential astrometry; however, the increasing availability of wide angle imagers has motivated studies over larger fields. Anderson et al. (2006) performed similar experiments using a 33'× 34' visible camera on the European Southern Observatory (ESO) 2.2 m telescope, which resulted in 7 mas single-epoch precision.

Ground-based interferometers provide an alternative method for performing high-precision astrometry, typically over very narrow fields relative to a single reference star. The Palomar Testbed Interferometer has used phase-referencing to achieve astrometric accuracies ≈100 μas for a 30'' binary (Lane et al. 2000) and ≈20 μas over years for binaries with separations ≲1'' (Muterspaugh et al. 2006). Due to its 40 cm apertures, this instrument is limited to targets with Ks < 6. Large aperture, ground-based interferometers equipped with adaptive optics systems, such as those at Keck (Colavita & Wizinowich 2003) and the VLT (Glindemann et al. 2000), can perform at levels similar to fainter limiting magnitudes (e.g., Boden et al. 2007).

Differential astrometric accuracies achieved in both single aperture and interferometric ground-based programs are fundamentally limited by atmospheric effects. In the seeing limit, single aperture observations suffer from image quality degradation and interferometers lose visibility fringe coherence due to atmospheric turbulence. In addition, all ground-based single aperture programs suffer from systematic effects due to differential atmospheric refraction and optical distortions.

Space-based observatories are one possible method for avoiding the effects of atmospheric turbulence. Hipparcos was the first space-based mission with primarily astrometric goals, and achieved ≲1 mas astrometry over the mission lifetime on bright targets (V ≲ 9 mag; Perryman et al. 1997). Currently, the only space-based telescope that can perform high-precision astrometry is Hubble. Both the imagers and Fine Guidance Sensor (FGS) have been characterized and well utilized for astrometry at the ≲1 mas level (e.g., Anderson & King 2000, 2003b; Benedict et al. 2003). Two complimentary future space missions are aimed at achieving levels of astrometric performance 2–3 orders of magnitude below the Hubble performance levels. GAIA will catalog roughly one billion stars to V ≈ 20 mag over the entire sky with parallax accuracies ranging from 10 to 300 μas depending on the magnitude (Perryman et al. 2001). The Space Interferometry Mission (SIM) will take a pointed approach, and enable microarcsecond astrometry on Galactic and extragalactic targets (Unwin et al. 2008).

Ground-based adaptive optics (AO) offer an alternative, more easily accessible, and cost-effective method for overcoming atmospheric turbulence over small fields (≲ arcminute). The current generation of astronomical AO systems provide diffraction-limited image quality at near-infrared (NIR) wavelengths. Achieving the telescope's diffraction limit and the resulting boost in the signal-to-noise ratio (S/N) prove to be a powerful combination for astrometry. These two effects reduce the errors in determining stellar centers, increase the number of possible reference stars at small separations, and allow techniques for mitigating systematics (e.g., use of narrowband filters to eliminate chromatic refraction; see Section 2).

The marked improvement in wavefront sensor technology and the development of laser beacons have rapidly increased the usable sky coverage of these systems (e.g., Wizinowich et al. 2006). The increase in sky coverage, operation in the NIR, gain in the S/N, and the diffraction-limited image quality make astrometry with AO amenable to numerous Galactic applications spanning a wide number of fields: detection of astrometric companions, the improved determination of the mass–luminosity relation of stars, and the formation and evolution of compact objects (Unwin et al. 2008).

Here we present an optimal estimation technique, appropriate for mitigating the astrometric errors arising in AO observations, and demonstrate its potential with multiepoch imaging of the core of the globular cluster M5 using the Hale 200 inch telescope. We are able to achieve ≲100 μas astrometric precision in 2 minutes, and have maintained this accuracy over 2 months. In Section 2, we discuss the dominant noise terms that arise in ground-based astrometry and the experimental techniques we have adopted to control them. We describe the framework of our reduction model and illustrate its salient properties with a numerical simulation in Section 4. We describe the observations of M5 and the results of applying the optimal estimation technique to the data in Sections 3 and 5. This is followed in Section 6 by a discussion of the role and potential of AO in ground-based astrometry with current and future large aperture telescopes.

2. ASTROMETRIC ERROR TERMS IN GROUND-BASED ASTROMETRY

Ground-based optical and IR imaging observations suffer from a number of errors that limit the accuracy and precision of astrometric measurements. Relative to seeing-limited observations, the diffraction-limited image quality afforded by AO modifies the relative importance of these error terms. This section describes the four largest effects, and indicates observational considerations utilized in this experiment aimed at mitigating them.

2.1. Differential Tilt Jitter

With AO, the image motion of the guide star is removed with a flat tip-tilt mirror. This stabilizes the image of the guide star with respect to the imager to high accuracy. Any residual tip-tilt error is removed in subsequent analysis by calculating only differential offsets between the target of astrometry (not necessarily the AO guide star) and the reference stars. However, the difference in the tilt component of turbulence along any two lines of sight (LOS) in the field of view (FOV) causes a correlated, stochastic change in their measured separation, known as differential atmospheric tilt jitter.

More specifically, in propagating through the atmosphere to reach the telescope aperture, light from the target star and light from a reference star at a finite angular offset traverse different columns of atmospheric turbulence that are sheared. Differential atmospheric tilt jitter arises from the decorrelation in the tilt component of the wavefront phase aberration arising from this shearing effect. This differential tilt leads to a random, achromatic, and anisotropic fluctuation in the relative displacement of the two objects. The three-term approximation to the parallel and perpendicular components of the variance arising from differential atmospheric tilt jitter, assuming Kolmogorov turbulence, is given by (Sasiela 1994)

Equation (1)

In this equation, D is the telescope diameter and θ is the angular separation of the stars. The turbulence moments μm are defined as

Equation (2)

where h is the altitude, ξ is the zenith angle, and C2n(h) is the vertical strength of atmospheric turbulence. Typical C2n(h) profiles yield σ∥,TJ ≈ 20–30 mas for a 20'' binary when observed with a 5 m aperture. Note that the variance from differential tilt is a random error, and, thus, is also ∝τTJ/t, where τTJ is the tilt jitter timescale (on the order of the wind crossing time over the aperture; see Section 5) and t is the integration time.

2.2. Distortion

The largest instrumental systematic that limits the accuracy of astrometry in any optical system is geometric distortion. These distortions can be stable—resulting from unavoidable errors in the shape or placement of optics—or dynamic—resulting from the flexure or replacement of optics.

If geometric distortions are stable, then a number of strategies can be employed to mitigate their effect. One method is to model the distortion to high accuracy; the most notable example is the calibration of the Hubble Space Telescope (HST; e.g., Anderson & King 2003a). This is particularly important for data sets obtained with multiple instruments or those that use the technique of dithering, since knowledge of the distortion is necessary to place stellar positions in a globally-correct reference frame. Alternatively, one could use a consistent optical prescription from epoch to epoch by using the same instrument and placing the field at the same location and orientation on the detector. Here we use both a distortion solution and a single, consistent dither position to achieve accurate astrometry.

Any changes in the geometric distortion must be tracked through routine, consistent calibration. The question of stability is particularly important at the Hale 200 inch telescope, since the AO system and the imaging camera (Palomar Adaptive Optics System (PALAO) and Palomar High Angular Resolution Observer (PHARO), respectively; see Section 3) are mounted at the Cassegrain focus, and PHARO undergoes a few warming/cooling cycles per month (see Section 3). The PHARO distortion solution1 by Metchev (2006) accounts for changes in the orientation of the telescope (which are relatively small for our experimental design), but the overall stability of the system is best verified with on-sky data. One of the purposes of the data presented here is to track the system stability. We find that the combination of the Hale Telescope, PALAO, and PHARO is capable of delivering ≲100 μas astrometry.

2.3. Atmospheric Refraction

Refraction by the Earth's atmosphere causes an angular deflection of light from a star, resulting in an apparent change in its position. The magnitude of this deflection depends on the wavelength and the atmospheric column depth encountered by an incoming ray. The former effect is chromatic, while the latter is achromatic. The error induced by differential chromatic refraction (DCR) has proven to be an important, and sometimes the dominant, astrometric limitation in ground-based efforts (e.g., Monet et al. 1992; Pravdo & Shaklan 1996; Anderson et al. 2006; Lazorenko 2006). These studies have shown that DCR can contribute ≈0.1–1 mas of error depending on the wavelength and strategy of the observations.

The observations presented here were conducted using a Br-γ filter at 2.166 μm with a narrow bandpass of 0.02 μm to suppress DCR. The increased S/N provided by AO allows sufficient reference stars to be detected even through such a narrow filter in a short exposure time. We reach Ks ≈ 15 mag in our 1.4 s exposures through this filter with the Hale 200 inch telescope (see Section 3). In addition, observations were acquired over a relatively narrow range of airmass (1.17–1.27) at each epoch to minimize the achromatic differential refraction.

In order to estimate the effect of atmospheric refraction on our data, we took the asterism in the core of M5 and refracted it to 37 and 32° elevation with the parallactic angles appropriate for the observations on 2007 May 28 using the slarefro function distributed with the STARLINK library (Gubler & Tytler 1998). The root mean square (rms) deviation in reference star positions between these two zenith angles was ≲250 μas and the shift in the guide star position with respect to the grid (see Section 4) was ≈10 μas. Thus, our consistent zenith angle of observations, narrowband filter, and observations in the NIR (where the refraction is more benign) make the contribution of this effect negligible for our purposes, and we make no effort to correct for it.

Performing a similar experiment using a broadband K filter with a field of ≈5000 K reference stars and a ≈3000 K target would lead to a systematic shift of ≈100 μas between zenith angles separated by 10°, which would be detectable by this experiment. Consequently, for observations where broadband filters are necessary, refraction effects must be considered and corrected.

2.4. Measurement Noise

In the case of a perfect optical system, a perfect detector, and no atmosphere, the astrometric precision is limited to one's ability to calculate stellar centers. The centering precision is determined by measurement noise, and we will use the two terms interchangeably. For a monopupil telescope, the uncertainty is

Equation (3)

(Lindegren 1978). AO allow us to achieve the diffraction limit even in the presence of the atmosphere and substantially boosts the SNR over the seeing-limited case, thereby decreasing measurement noise and improving astrometric precision.

In practice, the centering of a given stellar image is limited by spatial and temporal variations in the AO point-spread function (PSF). A great deal of time and effort has been spent determining the AO PSF and producing software packages to perform PSF fitting (e.g., Diolaiti et al. 2000; Britton 2006). However, any PSF-fitting software package is capable of calculating image positions at a ≲0.01 pixel level in a single image. For the observations considered here, this is ≲2 mas, a factor of 5–10 larger than the measurement noise in Equation (3), but this is much smaller than the tilt jitter mentioned in Section 2.1. As such, we have chosen to use simple and widely-available PSF centering software (DAOPHOT; Stetson 1987; see Section 3).

3. OBSERVATIONS AND DATA REDUCTION

We observed the globular cluster M5 on three dates spanning 2 months (see Table 1 for a summary of observations) using the Hale 200 inch telescope and the PHARO (Hayward et al. 2001) assisted by the PALAO (Troy et al. 2000). The globular cluster M5 was chosen for its relatively large distance of ≈7.5 kpc from the Sun, low velocity dispersion of ≈5 km s−1, and the availability of guide stars near the cluster core (Pryor & Meylan 1993; Harris 1996). This combination of distance and velocity yields an expected cluster dispersion of only 140 μas yr−1 or 20 μas over our 2 month observing span. We acquired 400–600 images per night. A typical image can be found in Figure 1. The guide star is a red giant branch (RGB) member of the globular cluster with V ≈ 12.6 mag (Sandquist & Bolte 2004). The cluster was imaged through the narrowband Br-γ filter (central wavelength is 2.166 μm and bandpass is 0.02 μm) using the 25'' × 25'' narrow-field channel (0farcs025 pixel−1), which oversamples the 87 mas diffraction-limited PSF. The brightest star filled the detector to 10% of the maximum well-depth in the 1.4 s exposure time, well within the linear regime of the detector.

Figure 1.

Figure 1. Top left: image of the core of the globular cluster M5 in 1.4 s through the narrowband Br-γ filter. The AO guide star is labeled as star "A," and is one of 82 detected stars in the image. The additional plots show the measured xy angular separation of each pair of stars denoted by the arrows in 600 images taken on 2007 May 28. These plots show the clear signature of anisotropic differential atmospheric tilt jitter as predicted from Equation (1). The measured (red) and predicted (green) 1σ error ellipses are overplotted. We see that temporal averaging over the 1.4 s exposure time has reduced the measured variance with respect to that predicted from the DIMM/MASS measurements and Equation (1).

Standard image High-resolution image

Table 1. Observations

Date Time Integration Time (sec) Airmass Seeinga (asecs) θ0a (asecs) μ2 (m7/3) μ4 (m13/3) μ14/3 (m15/3)  
2007-05-28 05:18:29–06:19:29 890 1.26–1.18 1.22 2.34 1.01e-5 3.82e3 2.77e6  
2007-05-29 05:58:26–06:48:30 570 1.19–1.17 1.39 2.16 1.14e-5 3.89e3 2.82e6  
2007-07-22 03:57:12–04:40:36 630 1.20–1.27 1.05 1.66 1.74e-5 6.46e3 4.71e6  

Note. aCalculated at a wavelength of 0.5 μm. These quantities scale as λ1/5 and λ6/5, respectively.

Download table as:  ASCIITypeset image

Contemporaneous measurements of the atmospheric turbulence profile were acquired with a differential image motion monitor (DIMM) and multiaperture scintillation sensor (MASS), which have been deployed as a single unit in a dome at Palomar Observatory (Thomsen et al. 2007; Kornilov et al. 2007). These turbulence profile measurements permitted an independent estimate of the magnitude of differential tilt jitter (computed using Equation 1).

We processed the raw images by subtracting dark frames and removing bad pixels from the analysis. Flat-field calibration was performed using twilight sky flats. Sky subtraction was accomplished by forming the median of the dithered frames taken outside of the cluster and subtracting this median from each exposure. The photometry and astrometry of each star were extracted using PSF fitting as implemented by the DAOPHOT package in PyRAF.2 DAOPHOT is not optimized for astrometry (see, e.g., Anderson & King 2000), and since our measurement model reduces the noise due to atmospheric turbulence, our single-epoch precision could be improved with a more careful centering technique (see Sections 4.4 and 5.2). However, our astrometric accuracy over 2 months is not limited by this choice (see Section 5.3). We used the four brightest stars in the field to derive a model PSF that is assumed to be constant over the field, and calibrated the image zeropoints using Two Micron All Sky Survey (2MASS) and find that the guide star has Ks ≈ 9.1 mag.

4. GRID ASTROMETRY FOR GROUND-BASED AO OBSERVATIONS

After controlling for distortion and atmospheric refraction, the dominant astrometric errors are caused by differential atmospheric tilt jitter and measurement noise. In this section, we present a general framework for measuring the position of a star relative to a grid of reference stars in the face of these noise sources. This framework has two key ingredients. The first is the covariance matrix (Σd), which encapsulates the relevant statistical uncertainties for astrometry with AO. The second is the weight matrix (W), which determines how the differential measurements between the target star and the reference stars are combined to calculate the target's position relative to the grid.

4.1. Measurement Model

The fundamental quantity in differential astrometry is the measured angular offset between a pair of stars. We will denote the angular distance between two stars, i and j, as $\vec{d}_{ij}$. Since $\vec{d}_{ij}$ is measured from an image, we will denote its components in the Cartesian coordinate system of the detector, simply

Equation (4)

where we have introduced the notation xijxjxi and likewise for y. The variance in the angular separation between two stars is given by

Equation (5)

where σ2∥,meas is the sum of the squares of the centering errors of each star parallel to the axis connecting the pair (and similarly for the perpendicular variance), and the remaining terms are as defined in Section 2.1.

Measurement of the offset between the target star (which we will denote with a subscript i = 0) and each of the N reference stars results in a set of N vectors, $\vec{d}_{0i}$. For simplicity, we will write these measured offsets as a single-column vector,

Equation (6)

The goal of differential astrometry is to use d to determine the position of the target star with respect to the reference grid of stars at each epoch.

There are many possible ways to construct the position of the astrometric target from a given d. Here we use the most general linear combination of the angular offsets, namely

Equation (7)

where W is the 2 × 2N weight matrix, given by

Equation (8)

We have used the notation wxy,0i to denote the weighting of the offset from the target star to star i in the y-direction, used to determine the x component of the target's position, $\vec{p}$. For example, for a standard average of the x and y measurements to calculate $\vec{p}$, we would assign all the wxx,0i = wyy,0i = 1/N and wxy,0i = wyx,0i = 0.

In principle, we are free to assign weights in any manner we please. However, we find it convenient to choose the weights such that they satisfy

Equation (9)

These constraints ensure that the components of $\vec{p}$ have physical units (e.g., pixels or arcseconds) and that its components are measured in the same coordinate system as d (presumably the detector coordinates). As a consequence, $\vec{p}$ represents the position of the target star in the sense that a proper motion of the target, $\vec{\epsilon }$, with respect to the fixed grid between two epochs will cause a change, $\vec{p} \rightarrow \vec{p} + \vec{\epsilon }$.

In order to determine if any change in $\vec{p}$ over time is meaningful, we must understand its statistical properties. Both differential tilt jitter and measurement errors are assumed to follow Gaussian statistics, so that each instance of target-reference grid offset measurements, d, is drawn from a multivariate normal probability distribution:

Equation (10)

where Σd is the covariance matrix and the bars above symbols denote using the average value of each matrix entry.

The statistics of $\vec{p}$ follow in a straightforward manner from Equation (10), given our choice in Equation (7). Since $\vec{p}$ is a linear function of d, each $\vec{p}$ is also drawn from a multivariate normal probability distribution with the covariance matrix

Equation (11)

and the uncertainties of $\vec{p}$ are described by the eigenvectors and eigenvalues of Σp. Thus, our goal of optimally determining the target's position requires calculating the covariance matrix, Σd, from data or theory, and choosing W to minimize the eigenvalues of Σp.

4.2. The Covariance Matrix

We have chosen to measure positions and offsets in the Cartesian coordinates of the detector, so the form of the covariance matrix, given our above definitions, is

Equation (12)

where we have written as $\Delta x_{ij} \equiv (x_{ij} - \bar{x}_{ij})$ to simplify the notation (likewise for y).

The total covariance matrix has contributions from centering errors and differential atmospheric tilt jitter. Since these contributions are independent, the total covariance matrix can be written as Σd = Σmeas + ΣTJ, and each term can be derived separately.

4.2.1. The Covariance Matrix for Measurement Noise

In the absence of differential tilt jitter, it is straightforward to construct the covariance matrix for measurement noise alone, Σmeas. The diagonal terms can be written as

Equation (13)

where σx,i and σx,0 are the uncertainties in determining the x-position of star i and the target star, respectively. For the off-diagonal terms 〈Δx0iΔx0j〉, we can use the fact that

Equation (14)

where we have used only algebra and the above definitions. Equation (14) is the obvious result of the fact that the measurements of the target star's coordinates are common to all differential measurements, and so its uncertainty appears in all the off-diagonal covariance terms, 〈Δx0iΔx0j〉 and 〈Δy0iΔy0j〉. However, the cross-terms involving both x and y (e.g., 〈Δx0iΔy0j〉) vanish because σx,0 and σy,0 are uncorrelated for measurement noise alone.

4.2.2. The Covariance Matrix for Differential Tilt Jitter

The covariance matrix for differential atmospheric tilt jitter between a pair of stars is diagonal when written in an orthogonal coordinate system with one axis lying along the separation axis of the binary. From Equation (1), we see that it can be written as

Equation (15)

where d and d are the angular offsets parallel and perpendicular to the axis connecting the pair of stars, respectively.

For a general field of N stars, no coordinate system exists that diagonalizes the full tilt jitter covariance matrix, ΣTJ. But we can begin computing the entries by rotating Σpair into our xy coordinates via RTΣpairR, where

Equation (16)

The result is

Equation (17)

where ϕ0i is the angle between $\vec{d}_{0i}$ and our arbitrary Cartesian system measured counterclockwise from the x-axis, and we have introduced the notation that the uncertainty parallel to $\vec{d}_{ij}$ is σ∥,ij and the uncertainty orthogonal to $\vec{d}_{ij}$ is σ⊥,ij as calculated from Equation (1). Thus, we can identify the diagonal terms

Equation (18)

and

Equation (19)

For the off-diagonal terms 〈Δx0iΔx0j〉, we notice that (as used in Equation 14)

Equation (20)

where, in the last step, we have used the fact that xij = x0ix0j and the relations in Equations (18) and (19). The quantities 〈Δy0iΔy0j〉 can be obtained by interchanging sine and cosine in Equation (20).

For the remaining off-diagonal terms 〈Δx0iΔy0j〉, we can use the fact that

Equation (21)

Rearranging gives

Equation (22)

All the terms on the right-hand side are known from Equation (19), and further investigation shows that the two terms on the left-hand side are equal. So, Equations (13), (14), (18)–(20), and (22) contain all the information required to construct the full covariance matrix, Σd.

4.3. The Optimal Weight Matrix

The optimal choice of weights in Equation (3) are those that minimize the eigenvalues in Equation (11). For a 2 × 2 symmetric matrix, the sum of the eigenvalues is the trace of the matrix, so our problem becomes one of minimizing the trace of Σp subject to the constraints in Equation (9). Specifically, we will use the method of Lagrange multipliers (Betts 1980) to find the optimal weights, W', that minimize the quadratic equation

Equation (23)

where

Equation (24)

is a 4N × 4N matrix, and

Equation (25)

is a vector of length 4N. Note that W' has identical entries as W in Equation (8); it is just written as a single vector to cast the minimization problem into a single Equation (23). We want to find the extrema of Equation (23) subject to the linear constraints in Equation (9), which can be written as

Equation (26)

where we define the 4 × 4N symmetric matrix

Equation (27)

and

Equation (28)

In this framework, the optimal weights are those that solve the system of linear equations:

Equation (29)

Here, λ are the Lagrange multipliers, which will not be used further. Equation (29) can be solved via a matrix inversion.

Note that the general constraints on the weights we have written in Equations (9) and (26) have two somewhat unintuitive features. The first is that the y measurements are sometimes used to compute the x position and vice versa. The other property is that they allow for negative weights, meaning that in some cases, certain measurements will be subtracted in calculating the position of the astrometric target, $\vec{p}$. These two facts conspire to exploit the natural correlations inherent in the data. The flexible and possibly negative weights essentially allow the reference grid to be symmetrized, thereby using the known correlations to cancel noise so as to minimize the variance in $\vec{p}$.

4.4. Numerical Simulations

As indicated in the above analysis, the single-epoch uncertainty in the location, $\vec{p}$, of the target relative to the grid of reference stars is represented by the eigenvalues and eigenvectors of the 2 × 2 matrix Σp (Equation 11). This matrix itself depends on the distribution of reference stars, the precision of centering measurements, and the degree of noise correlation due to differential tilt through the matrix Σd. In this way, the intrinsic precision of the measured value of $\vec{p}$ depends on these three factors.

To ascertain the behavior of Σp with the density of available reference stars, we performed a series of numerical simulations. In each simulation, N (2 ⩽ N ⩽ 100) stars were randomly distributed throughout a 25'' × 25'' FOV. We assumed that the target was a bright star in the middle of the field with a centering error of 0.5 mas and the reference stars were fainter, drawn from a Gaussian distribution with a mean centering error of 2 mas and a standard deviation of 1 mas (somewhat analogous to the situation for the guide star in M5; see Section 5). The full covariance matrix, Σd, was computed for each stellar configuration assuming these centering errors, the typical turbulence profile above Palomar Observatory, and a 1.4 s exposure time.

In the first simulation, Σd was contracted as in Equation (11) using standard averaging for W (wxx,0i = wyy,0i = 1/N, wxy,0i = wyx,0i = 0). For the second simulation, Σd was contracted using the optimal W as calculated by using the prescription in Section 4.3. In each case, the geometric mean of the two eigenvalues of the resulting matrix, Σp, was computed to form an estimate of the single-epoch measurement precision of $\vec{p}$. To average away random effects arising from the particular geometry of the random distribution of stars, each numerical simulation was repeated for 100 random distributions of stars for each value of N, and these were averaged to generate a mean value for the single-epoch measurement precision.

The resulting values for the single-epoch measurement precision of $\vec{p}$ are shown in Figure 2 as a function of the number of reference stars, along with the contributions of measurement noise and differential tilt jitter. In both simulations, the error due to measurement noise decreases as N−0.3. However, in the limit of an infinite number of reference stars, this error asymptotes to the target star's measurement error. The rate at which the measurement noise decreases to this value depends on the distribution of reference star measurement errors.

Figure 2.

Figure 2. Top: simulated astrometric precision as a function of the number of reference stars using standard averaging (solid line). The total astrometric precision has contributions from the measurement noise (dash-dotted) and the differential atmospheric tilt jitter (dashed line). Here, the measurement noise was taken to be 0.5 mas for the AO guide star, and the reference stars were drawn from a normal distribution with a mean of 2 mas and a standard deviation of 1 mas. The tilt jitter is that expected in a 1.4 s exposure assuming the turbulence profile measured on the night of 2007 May 28 at Palomar Observatory (see Table 1). Bottom: simulated astrometric precision as a function of the number of reference stars using optimal weighting (lines as above). By using optimal weighting based on the covariance matrix, the effect of atmospheric noise is reduced to values less than measurement noise.

Standard image High-resolution image

The important distinction between the two simulations is the contribution of tilt jitter to astrometric performance. In the simulation utilizing standard averaging, there is very little gain with increased stellar density (N−0.15), and tilt jitter dominates the error budget. However, the optimal estimation algorithm rapidly (N−0.7) eliminates the contribution of differential tilt by taking advantage of the correlations inherent in Σd and the flexibility to symmetrize the reference field through the choice of weights.

5. ANALYSIS AND RESULTS

In the analysis that follows, we will use the measurement model described in Section 4. For a given target star, we will calculate the differential offsets with respect to the grid stars to generate a value of d for each image at each epoch (Equation 6). We then use either these data or the theory in Section 4.2 to generate the full covariance matrix, Σd. From Σd, we use the prescription in Section 4.3 to calculate the optimal weights, W. These weights are used to combine the differential offsets to generate the target star's position, $\vec{p}$, in each image via Equation (7). The statistics of the positional measurements are then described by the covariance matrix, Σp, from Equation (11).

5.1. Differential Tilt Jitter

In order to test our expectation that tilt jitter dominates the astrometric error, we calculate the rms of the angular offsets for pairs of stars in the field (Figures 1 and 3). These results clearly show the characteristic signature of differential tilt. Namely, the rms separation along the axis connecting the two stars is larger than that of the perpendicular axis by a factor of $\approx \sqrt{3}$. However, the magnitude of the tilt jitter is smaller than the theoretical expectations, which suggests that some of the tilt jitter has been averaged away in the 1.4 s exposure time.

Figure 3.

Figure 3. Top: rms deviation in the distance between pairs of stars in the direction parallel to their separation axis on 2007 May 28 (filled circles). The jitter predicted (assuming no measurement noise) from the measured turbulence profiles and Equation (1) (dashed line) is far larger than the measured jitter, indicating that some tilt jitter has been averaged away in 1.4 s. The best-fit model (Equation 5), including averaged tilt jitter and measurement noise, indicates that the tilt timescale is ≈0.2 s (solid line). Bottom: as above, but in this case, the separations and predictions are for the direction perpendicular to the separation axis. The expected rms for the perpendicular direction is lower by the expected factor as seen in Equation (1). Note that not all pairs include the AO guide star.

Standard image High-resolution image

We have no direct measurement of the wind speed profile over the telescope to calculate the expected tilt jitter timescale. Instead, we fit the observed σ2ij and angular offsets using the model in Equation (5) with t = 1.4 s. The best-fit values are σmeas,ij ≈ 2 mas and tTJ ≈ 7. This implies that the characteristic timescale for tilt jitter is ≈0.2 s, resulting in a wind crossing time of 25 m s−1. Turbulence at higher altitudes contributes most to the differential atmospheric tilt jitter, and this velocity is typical of wind speeds in the upper atmosphere (Greenwood 1977). It is also clear from the figure that a number of stars have measurement noise that is much less than 2 mas; thus, this number should only be taken as characteristic of the faint stars.

5.2. Astrometric Precision

The astrometric precision achieved in a single epoch is an important diagnostic of the measurement model. On a given night for a given star, we investigate the use of both the ≈500 images and the theory in Section 4.2 to calculate Σd, leading to the optimal weights. We then apply this weight matrix to the measured offsets to compute the target's position in each image, resulting in a time series in each component of $\vec{p}$ for each epoch. The properties of each time series are best explored by computing its Allan deviation (also known as the square root of the two-sample variance). The Allan deviation is calculated by dividing a time series into chunks, averaging each segment, and computing the rms of the resulting, shorter time series. If the time series is dominated by random errors, its Allan deviation will decrease as $1/\sqrt{t_{\rm avg}}$, where tavg is the length of each chunk. It is also necessary to have sufficiently many segments so that an rms calculation is meaningful. Here, the longest timescale probed is ≈2–3 minutes for each 10–15 minute time series.

We compute the geometric mean of the Allan deviation in each dimension as a function of the averaging time for the AO guide star in Figure 4 after computing the covariance matrix from data. After 1.4 s, the guide star's positional precision is ≈600 μas. The precision subsequently improves as t−0.51±0.08 to ≈70 μas after 2 minutes, and has yet to hit a systematic floor. This suggests a precision of ≈30 μas for the full 10–15 minute data set, assuming that no systematic limit is reached in the interim.

Figure 4.

Figure 4. Allan deviation in the guide star position as a function of integration time for 2007 May 28 (solid curve), May 29 (dash-dotted), and July 22 (dashed). The astrometric precision scales as t−0.51±0.08, and the covariance matrix and optimal weights were derived from data.

Standard image High-resolution image

This level of precision is not limited to the AO guide star; similar performance is obtained on other stars in the core of M5. In Figure 5, we show the astrometric precision obtained on 2007 May 29 after 2 minutes for all detected stars as a function of their Ks magnitude. Precision below 100 μas is achieved on targets as faint as Ks ≈ 13 mag using a narrowband filter and 1.4 s individual exposures. This demonstrates the substantial S/N benefit afforded by AO.

Figure 5.

Figure 5. The astrometric precision (Allan deviation after 2 minutes) using the theoretical covariance matrix (open diamonds) and the covariance matrix from data (filled circles) as a function of Ks magnitude for all 82 detected stars on 2007 May 29. The precision in both cases is essentially constant for Ks ≲ 13 mag. However, the astrometric precision for the theoretical Σd is a factor of 2–4 times larger than when calculated from data.

Standard image High-resolution image

The astrometric precision shown in Figure 5 resulting from the theoretically-determined covariance matrix and optimal weights is ≈300 μas after 2 minutes for stars with Ks ≲ 13 mag. This level of precision is substantially better than the performance of simpler weighting schemes, but it is a factor of 2–4 worse than using the data to calculate the covariance matrix and weighting. There are several possible reasons for this reduction in precision. First, we have only used estimates of the measurement noise for each star used to calculate Σmeas. Second, the turbulence profile used to construct ΣTJ is estimated from the average C2n(h) seen by the DIMM/MASS. This unit is located 300 m from the Hale telescope and uses Polaris to estimate the turbulence profile. As a consequence, there could be important differences between the measured atmospheric turbulence and that encountered by the light from M5. Finally, we have not attempted to capture the time variability of the turbulence, having used only the average values.

In Figure 6, we investigate the improvement of the AO guide star astrometry with the number of reference stars. We drew random subsets of the available grid stars, computed Σd from the data, calculated the optimal weights, and showed the geometric mean of the eigenvalues of Σp. To average over the geometry of a particular draw, we repeated this process ten times for each value of N and averaged the results. We see that the precision rapidly decreases as N−0.60±0.03. This is slightly faster than what our simulations predict for 1.4 s of integration time. However, as noted above, our simulations are meant to approximate M5, but do not capture the true distribution of stellar measurement errors (which are difficult to decouple from tilt jitter) or any evolution in atmospheric turbulence during the observation.

Figure 6.

Figure 6. Allan deviation in 1.4 s for the AO guide star's position as a function of the number of reference stars on 2007 May 28 (solid curve), May 29 (dash-dotted), and July 22 (dashed). The astrometric precision scales as N−0.60±0.03.

Standard image High-resolution image

5.3. Astrometric Accuracy

The goal of astrometry is to measure the position of the target star over many epochs. Astrometrically interesting timescales range from hours to years. Clearly, the optical systems must be stable over these spans for astrometry with AO to be viable. There are several obstacles that could render the single-epoch precision obtained in Section 5.2 meaningless. For example, PHARO is mounted at the Cassegrain focus, which results in flexure of the instrument as the telescope tracks, and undergoes warming and cooling cycles between observing periods (typically twice per month) that could cause small changes in the powered optics. Either of these facts could alter the geometric distortion and make astrometric measurements unrepeatable. In order to probe the system stability, we have designed our experiment to be as consistent as possible, and it has spanned many removal and reinstallations of PHARO over 2 months.

In order to investigate the accuracy of the M5 measurements, we first measured and corrected the small rotational (≲0fdg04) and plate scale (≲10−5) changes between the May 29 and July 22 data and the May 28 images. We also calculate the optimal weights for a given star on all three nights, and average them to create one weighting matrix to use for each epoch. This is not strictly optimal, since each night has different turbulence conditions for example, but it ensures that the scenario that $\vec{p} \rightarrow \vec{p} + \vec{\epsilon }$. In Figure 7, we see that the measured position of the AO guide star is accurate from epoch to epoch at ≈100 μas. The error ellipses are those estimated by continuing to extrapolate the precision found in Figure 4 by $1/\sqrt{t}$ to the full 10–15 minute time series. This is an impressive level of accuracy, but unfortunately is a factor of 3 worse than our expectation. It suggests that there is some instability, likely in the distortion, over the 2 months that limit the astrometric accuracy.

Figure 7.

Figure 7. The position of the AO guide star in an arbitrary coordinate system on three dates: 2007 May 28 (solid), 2007 May 29 (dashed), and 2007 July 22 (dash-dotted; see Section 5.3). The error circles are inferred by averaging the covariance matrix measured from the data and extrapolating to the total 10–15 minute integration time as $1/\sqrt{t}$ (e.g., see Figure 4). The positions agree at the ≲100 μas level—a factor of 2–3 larger than the expected dispersion. This discrepancy indicates that some systematic errors have occurred between epochs, most likely optical distortion.

Standard image High-resolution image

The other stars in M5 show a similar level of astrometric accuracy (Figure 8) up to Ks ≈ 13 mag. This limit can be certainly pushed to be considerably fainter with increased integration time or a larger aperture. The achievement of such high levels of astrometric performance on faint targets, given the modest time investment, short integration time, and narrowband filters, illustrates the substantial S/N gain and potential for astrometry enabled by AO.

Figure 8.

Figure 8. Astrometric accuracy (geometric mean of the rms in each coordinate over the three epochs) vs. Ks magnitude. The level of accuracy is ≲100 μas, and is essentially unchanged for Ks ≲ 13 mag. However, this is a factor of 2–3 above our expectations from the achieved astrometric precision, suggesting a systematic limitation between epochs.

Standard image High-resolution image

6. DISCUSSION AND CONCLUSIONS

Here we have presented a technique for performing high-precision grid astrometry using ground-based telescopes equipped with AO systems. With this technique, the effects of distortion and atmospheric dispersion that give rise to systematic errors are mitigated through the design of the experiment. Random errors arising from differential tilt jitter and measurement noise are minimized through the use of an optimal estimation scheme that accounts for the correlated noise statistics through the covariance matrix Σd. The experimental results obtained on the Hale 200 inch telescope have demonstrated a single-epoch astrometric precision of ≲100 μas in 2 minutes and multiepoch astrometric accuracy at the same level. This level of precision is comparable to that afforded by ground-based interferometry, and is better than the precision obtained in seeing-limited programs on single apertures.

The simulation of astrometric precision afforded by the optimal weighting scheme, shown in Figure 2, illustrates that measurement noise is the dominant residual astrometric error on a 5 m telescope for stellar fields that contain more than a few reference stars. The scaling laws for differential tilt jitter (D−7/6) and measurement noise (D−2) indicate that on larger aperture telescopes, measurement noise will represent a smaller fraction of this residual error. This effect is illustrated in Figure 9, which shows the rms error between pairs of stars for a range of telescope apertures and angular separations.

Figure 9.

Figure 9. rms separation between a pair of stars vs. angular offset and aperture diameter. We have assumed the turbulence profile on 2007 May 28, a measurement error of $1/\sqrt{2}$ mas for each star for a 5 m telescope, and included the geometric mean of each component of Equation (1). Relative to Palomar (solid curve), there are substantial astrometric gains to be made for larger 10 m (dash-dotted) and 30 m (dashed) telescopes due to the reduction of both measurement noise (the y-intercept; ∝D−2) and tilt jitter (∝D−7/6), respectively. Because measurement noise falls off more quickly with D, tilt jitter becomes the dominant source of astrometric error for large aperture telescopes.

Standard image High-resolution image

The values in Figure 9 assume that tilt jitter is resolved by sufficiently short exposures. Longer exposure times will certainly reduce the differential tilt jitter by $1/\sqrt{t}$, but the measurement noise will also be decreased by this factor (for a given stellar brightness). The implication being that if tilt jitter dominates for short exposure times, it will continue to dominate longer exposures.

In situations where fewer images are available, either due to time constraints or longer exposure times per frame, it is difficult or impossible to effectively calculate the covariance matrix directly from the data. Our results show that independent measurements of the turbulence profile, for example from a DIMM/MASS unit, are sufficient to calculate Σd, and result in astrometric precision within a factor of 2–4 of the levels achieved using the data itself. Thus, the astrometric applications of turbulence sensors are twofold; they can be used to independently assess astrometric data quality and predict the AO PSF (Britton 2006).

The scaling laws presented throughout this paper indicate a substantially-improved astrometric performance on large aperture telescopes equipped with AO. We have used the measured performance on M5 with the Hale Telescope combined with these scaling laws to predict the astrometric performance of a single conjugate AO system as a function of aperture diameter and number of reference stars. The relationship can be summarized, using the results of simulation and data analysis, as

Equation (30)

This equation assumes that measurement error is dominated by photon noise (∝D−2), and the other dependences (FOV, stellar brightness distribution, turbulence profile) are identical to those for the M5 experiment.

Figure 10 shows the resulting estimates for astrometric precision as a function of aperture diameter and number of reference stars for a 2 minute exposure. These predictions demonstrate that limits to astrometric precision arising from random errors (dominated by tilt jitter) lie below 10 μas for 30 m telescopes. However, very careful characterization and control of systematic errors will be required to achieve this level of precision in an actual experiment. The extent to which systematic errors can be eliminated will distinguish the scientific goals that can be accomplished with ground-based facilities from those that require a space-based solution.

Figure 10.

Figure 10. Astrometric precision as a function of aperture size and stellar density. We have used Equation (30) with the assumptions of the Palomar turbulence profile, a 25'' × 25'' FOV, the M5 brightness distribution, and photon noise limit as described in Section 6. The astrometric precision demonstrates a very favorable scaling law with aperture diameter, and suggests that orders of magnitude improvement in precision may be available using large aperture, AO-equipped telescopes. In practice, the level of astrometric accuracy will depend on the extent to which current and future facilities can characterize and control systematic errors.

Standard image High-resolution image

Facilities: Hale (PALAO/PHARO).

We thank N. Law, M. Ireland, D. Le Mignant, A. Kraus, M. van Kerkwijk, and A. Gould for useful discussions on astrometry. We also thank Palomar Observatory for providing support for the DIMM/MASS unit used in this study. This work has been supported by NASA, and by the National Science Foundation Science and Technology Center for Adaptive Optics, managed by the University of California at Santa Cruz under cooperative agreement no. AST-9876783.

Footnotes

Please wait… references are loading.
10.1088/0004-6256/137/1/83