Articles

TRANSIT AND ECLIPSE ANALYSES OF THE EXOPLANET HD 149026b USING BLISS MAPPING

, , , , , , , , , and

Published 2012 July 18 © 2012. The American Astronomical Society. All rights reserved.
, , Citation Kevin B. Stevenson et al 2012 ApJ 754 136 DOI 10.1088/0004-637X/754/2/136

0004-637X/754/2/136

ABSTRACT

The dayside of HD 149026b is near the edge of detectability by the Spitzer Space Telescope. We report on 11 secondary-eclipse events at 3.6, 4.5, 3 × 5.8, 4 × 8.0, and 2 × 16 μm plus three primary-transit events at 8.0 μm. The eclipse depths from jointly fit models at each wavelength are 0.040% ± 0.003% at 3.6 μm, 0.034% ± 0.006% at 4.5 μm, 0.044% ± 0.010% at 5.8 μm, 0.052% ± 0.006% at 8.0 μm, and 0.085% ± 0.032% at 16 μm. Multiple observations at the longer wavelengths improved eclipse-depth signal-to-noise ratios by up to a factor of two and improved estimates of the planet-to-star radius ratio (Rp/R = 0.0518 ± 0.0006). We also identify no significant deviations from a circular orbit and, using this model, report an improved period of 2.8758916 ± 0.0000014 days. Chemical-equilibrium models find no indication of a temperature inversion in the dayside atmosphere of HD 149026b. Our best-fit model favors large amounts of CO and CO2, moderate heat redistribution (f = 0.5), and a strongly enhanced metallicity. These analyses use BiLinearly-Interpolated Subpixel Sensitivity (BLISS) mapping, a new technique to model two position-dependent systematics (intrapixel variability and pixelation) by mapping the pixel surface at high resolution. BLISS mapping outperforms previous methods in both speed and goodness of fit. We also present an orthogonalization technique for linearly correlated parameters that accelerates the convergence of Markov chains that employ the Metropolis random walk sampler. The electronic supplement contains light-curve files.

Export citation and abstract BibTeX RIS

1. INTRODUCTION

Discovered in 2005 using Doppler measurements, the Saturn-sized extrasolar planet HD 149026b orbits (in 2.876 days) a G0IV star that is larger (1.45 solar radii), and hotter (6150 ± 50 K) than most stars known to host transiting exoplanets. The planet's small radius and high average density suggest that between 50% and 90% of the planet's mass must be in its rocky or icy core (Knutson et al. 2009, hereafter K09). Invoking current theories, it is difficult to form this exoplanet by gravitational instability (Sato et al. 2005).

Shortly after detection, Fortney et al. (2006) computed models of the atmospheric temperature structure and spectra of HD 149026b. They suggested that the planet was a strong candidate for having a dayside atmospheric temperature inversion. The highly irradiated planet is hot enough to have gaseous TiO and VO molecules in the dayside atmosphere. These molecules are strong optical absorbers and had been previously shown to cause temperature inversions in model atmospheres (Hubeny et al. 2003).

Beginning in 2005, we used the photometric channels of the Spitzer Space Telescope (Werner et al. 2004) to observe HD 149026b during secondary eclipse, when the planet passes behind its parent star, to characterize the planet's dayside atmosphere. Harrington et al. (2007, hereafter H07) found an 8.0 μm eclipse depth of 0.084%+0.012− 0.009, indicating the hottest brightness temperature (2300 ± 200 K) observed at that time. This temperature matches an instantaneous re-emission model (zero albedo) from Fortney et al. (2006) that exhibits a temperature inversion (which tends to enhance the planet/star contrast at 8.0 μm), thus suggesting the presence of absorbers, such as TiO and VO gas molecules, in the atmosphere.

Charbonneau et al. (2006) observed two primary transits, when the planet passes in front of its parent star, using the Fred Lawrence Whipple Observatory telescope through the Sloan g and r filters. Winn et al. (2008) reported on five ground-based transits through Strömgren b and y filters at the Fairborn Observatory. In August of 2007, Nutzman et al. (2009) used Spitzer to monitor a transit of HD 149026b at 8.0 μm. Carter et al. (2009) used the NICMOS detector on board the Hubble Space Telescope to observe four transits of HD 149026b at 1.4 μm. Their data have the best photometric precision to date and, after combining their data with all previous transit measurements, provide improved estimates of orbital parameters, mass, and radius.

In 2008, K09 monitored the system for just over half an orbit to characterize the planet's phase variation at 8.0 μm. Their observations began slightly before the primary transit and finished slightly after the secondary eclipse. Using the final 7.2 hr of data, K09 report an eclipse depth of 0.0411% ± 0.0076%, half that of H07. As part of their paper, K09 reanalyzed the 2005 secondary-eclipse data and found eclipse depths ranging from 0.05% to 0.09%, though the lower values are preferred in most of their models. This large range of eclipse depths depends on the choice of systematic error model, fitting routines, and bad-pixel trimming methods.

HD 149026b is an interesting planet given its extremely unusual bulk abundances; the majority of the planet's mass must be in heavy elements, making the planet perhaps more akin to Uranus and Neptune than Jupiter and Saturn. In the solar system, a bulk composition that is enhanced in metals goes hand in hand with an atmospheric composition enhanced in metals (Marley et al. 2007). This suggests that HD 149026b could have an atmospheric metallicity far greater than that of most transiting exoplanets. Verifying this by measurement would let us understand the makeup of this planet and the role of atmospheric composition in determining temperature structure.

In this paper we present Spitzer Space Telescope secondary-eclipse observations of HD 149026b that resolve the disagreement in eclipse depths at 8.0 μm, characterize the planet's dayside atmosphere, and further constrain its orbital and physical parameters. We give detailed descriptions of our techniques and results because how one handles Spitzer's systematics can lead to best-fit parameters that disagree by more than 1σ, as demonstrated in Section 5.

Below, we describe the observations and data analysis, present a new method for modeling one of Spitzer's systematics, explain how we arrived at the final fits and compare the results to previously published work, discuss implications for the planetary emission spectrum and planetary composition, give improved constraints on the orbital parameters, and state our conclusions.

2. OBSERVATIONS AND DATA ANALYSIS

2.1. Observations

We observed secondary eclipses of HD 149026b at 3.6, 4.5, 5.8, and 8.0 μm with the Infrared Array Camera (IRAC; Fazio et al. 2004) and at 16 μm using the Infrared Spectrograph's (IRS; Houck et al. 2004) photometric blue peak-up array. The program also observed a primary transit at 8.0 μm. Including the four previously analyzed data sets labeled in Table 1, we present 14 observations spanning more than 3.5 years.

Table 1. Observation Information

Labela Observation Date Duration Frame Time Total Frames Spitzer Wavelength Previous
    (minutes) (s)   Pipeline (μm) Publicationsb
HD149bs41 2005 Aug 24 330 0.4 44352 S18.7.0 8.0 H07
HD149bs51 2007 Aug 4 386 14 1050 S18.18.0 16 ...
HD149bs31 2007 Aug 13 386 0.4 54080 S18.7.0 5.8 ...
HD149bp41 2007 Aug 14 478 0.4 67008 S18.7.0 8.0 N09 and C09
HD149bs52 2007 Aug 30 386 14 1050 S18.18.0 16 ...
HD149bp42 2007 Sep 12 386 0.4 54080 S18.7.0 8.0 ...
HD149bs11 2008 Mar 10 386 0.4 54080 S18.7.0 3.6 ...
HD149bs42 2008 Apr 11 386 0.4 54080 S18.7.0 8.0 ...
HD149bs21 2008 May 9 386 0.4 54080 S18.7.0 4.5 ...
HD149bp43 2008 May 11 499 0.4 70000 S18.7.0 8.0 K09
HD149bs43 2008 May 12 432 0.4 60500 S18.7.0 8.0 K09
HD149bs32 2008 Jun 16 386 0.4 54080 S18.18.0 5.8 ...
HD149bs33 2009 Mar 13 386 0.4 54080 S18.18.0 5.8 ...
HD149bs44 2009 Mar 22 386 0.4 54080 S18.18.0 8.0 ...

Notes. aHD149b designates the planet, p/s specifies primary transit or secondary eclipse, and ## identifies the wavelength and observation number. bH07: Harrington et al. (2007); N09: Nutzman et al. (2009); C09: Carter et al. (2009); and K09: Knutson et al. (2009).

Download table as:  ASCIITypeset image

2.2. POET Pipeline

Our Photometry for Orbits, Eclipses, and Transits (POET) pipeline produces systematics-corrected light curves using Spitzer-supplied basic calibrated data, fits a multitude of models with a wide range of analytic forms for systematic effects, chooses the best-fit model, and assesses the uncertainty of each free parameter. Below, we describe each of these steps in detail.

We calculate the Julian date of each image at mid-exposure using the UTCS_OBS and FRAMTIME keywords in the Spitzer-supplied headers. Following Eastman et al. (2010), we convert dates to Barycentric Julian Dates in the Coordinated Universal Time standard (BJDUTC) using the JPL Horizons system4 to interpolate Spitzer's position relative to our solar system's barycenter. Additionally, converting from UTC to the Barycentric Dynamical Time (TDB) standard addresses any discontinuities due to leap seconds. This paper reports both time standards to facilitate comparisons with previous work, which mostly does not apply the leap-second correction, and to ease the transition to the more-accurate standard.

The POET pipeline flags bad pixels (from energetic particle hits and other causes) by grouping sets of 64 frames and doing a two-iteration, 4σ rejection at each pixel location in the set. Stellar centers for photometry come from a Gaussian fit and 5× interpolated aperture photometry (H07 Supplementary Information, SI) produces the light curves. We test a broad range of aperture sizes in 0.25 pixel increments and omit frames with bad pixels in the photometry aperture. The background, subtracted before photometry, is an average of the good pixels within the specified annulus centered on the star in each frame.

2.3. Spitzer Systematics

Exoplanet characterization requires photometric stability well beyond Spitzer's design criteria (Fazio et al. 2004). Detector sensitivity models vary by channel and can have both temporal (detector ramp) and spatial (intrapixel variability) components. The main systematic effect at 3.6 and 4.5 μm is intrapixel sensitivity variations (Charbonneau et al. 2005), in which the photometry depends on the precise location of the stellar center within its pixel. We fit this systematic using the new BiLinearly-Interpolated Subpixel Sensitivity (BLISS) mapping technique described in Section 3. This technique maps the spatial sensitivity variations at high resolution.

The 5.8, 8.0, and 16 μm arrays primarily suffer from temporal variability, attributed to charge trapping (K09) in the 8.0 μm case. Weak spatial dependencies can also occur at these wavelengths (Stevenson et al. 2010), so we consider both systematics when determining our best-fit model. Typically, we omit the initial portion of each light curve from the model fit to avoid the worst of the ramp effect and to allow the telescope pointing and detector to stabilize. The clipping parameter, q, defines the number of unmodeled points from the start of a data set.

2.4. Pixelation

Pixelation is an infrequently discussed systematic error inherent to all array detectors. Sufficiently small stellar center motions between frames will not add or subtract pixels from an aperture, but these motions will cause the total flux within the aperture to vary. This means there is a (potentially large) range of stellar centers that utilizes the same set of aperture pixels, introducing a position dependence to the photometric sensitivity. We provide an illustrative example in Figure 1 and display the magnitude of this effect in Figure 2. A flux-conserving, subpixel image interpolation, combined with precise centering and applied before photometry, mitigates the pixelation effect by decreasing its range and amplitude. As demonstrated by our BLISS maps in Section 3, uninterpolated photometry exhibits strong sensitivity peaks at 1 pixel increments, while 2×- and 5×-interpolated pixels exhibit progressively weaker peaks at 0.5 and 0.2 pixel increments, respectively, in each spatial direction.

Figure 1.

Figure 1. Illustrative example of the pixelation effect, which arises when small changes in stellar centers do not add or subtract pixels within the aperture but do change the collected flux. The left panel uses a solid blue circle to depict a photometry aperture centered at (15.5, 15.5), which is on the corner of 4 pixels (defined by dashed black lines). The 12 shaded pixels have centers (green dots) that fall within the aperture and are summed to determine the stellar flux in this image. All of the incoming photons that land within the dashed blue circle (where there is a greater density of incoming photons) count toward the flux. So long as the stellar center falls within the small black circle, the photometry aperture will encompass the same green dots; thus, the specific pixels that contribute to the total flux will not change. One such example is depicted in the right panel, where we apply an offset of (0.15, 0.15) pixels. In this case, not all of the photons that fall within the dashed red circle count toward the flux. Instead, the shaded pixels count additional flux from photons that land outside the aperture (solid red circle) where the density of photons is less. The net effect is that, due to a change in centering and a non-uniform photon density, the shaded pixels in the right panel will record fewer incoming photons. The result is a position-dependent systematic called pixelation.

Standard image High-resolution image
Figure 2.

Figure 2. Projected flux from HD149bs31 integrated along the x (left) and y (right) axes. The non-uniform flux in both panels is clear evidence of pixelation. We use 5×-interpolated aperture photometry, which results in 0.2 pixel spacing between peaks. In the left panel, low-order polynomial models would fit the pixelation effect poorly at the peaks; the BLISS map (see Section 3) has no such limitation. The systematic is weakly constrained near the edges due to low sampling, as indicated by the large error bars. Whether a specific point on a pixel is a local maximum or minimum (due to pixelation) is a function of aperture size, which defines which subpixels to include in the aperture at any given point on the detector.

Standard image High-resolution image

Pixelation is most apparent with small apertures placed on underresolved point-response functions (PRFs) such as Spitzer's. It may not be apparent in other situations, such as when the aperture contains almost all of the integrated PRF, when the centroid wander is small relative to the subpixel size, and when other noise sources dominate (e.g., systematics and variable PRFs). Increasing the aperture size lessens the pixelation effect by decreasing the fraction of uncaptured light outside the aperture, but doing so may decrease the signal-to-noise ratio (S/N) by increasing the amount of background noise included in the aperture. Thus, choosing the best aperture size may introduce this position-dependent systematic. We have determined the intrapixel variability at 5.8, 8.0, and 16 μm to be a pixelation effect, previously reported at 5.8 μm by Stevenson et al. (2010) and at 8.0 μm by Anderson et al. (2011).

There are several ways to correct pixelation. First, one could shift model PRFs to match each frame's precisely determined stellar center. Dividing the stellar flux by the PRF flux in the aperture should remove the effect, but requires a highly accurate model PRF. Second, using smaller subpixels could decrease the amplitude of the pixelation effect until it is insignificant relative to the noise, but there is a limit: interpolation can only approximate the information destroyed when photons fall into the detector's finite-sized pixels. Third, if the pointing is sufficiently consistent and compact, one could choose an interpolation factor that happens to place the flat portion of the pixelation response on the stellar centers (since the peaks in Figure 2 move with different interpolation factors). Fourth, with high-precision centering, one could use a series of images taken at slightly different positions to model the position sensitivity analytically or with pixel-mapping techniques such as BLISS, but one would first have to remove any time-dependent components from the light-curve model (see Section 2.5). The accuracy of these models depends on the centering and photometric precisions, the former being ∼0.01 pixels for 0.4 s IRAC subarray exposures of bright sources (see below and Stevenson et al. 2010). We test for pixelation in all data sets using BLISS mapping, which corrects the effect when it is significant.

2.5. Light-curve Modeling

The full light-curve model is

Equation (1)

where F(x, y, t) is the measured flux centered at position (x, y) on the detector at time t, Fs is the (constant) system flux outside of secondary eclipse or primary transit, E(t) is the primary-transit or secondary-eclipse model component, R(t) is the time-dependent ramp model component, M(x, y) is the position-dependent intrapixel model component or sensitivity map, V(υ) is the visit sensitivity as a function of visit frame number υ, and P(p) is the flat-field correction at position p. Below, we discuss some of these components in more detail.

2.5.1. Eclipse and Transit Models

The uniform-source and small-planet equations from Mandel & Agol (2002) describe the secondary-eclipse and primary-transit model components, E(t), respectively. Transit light curves at 8.0 μm exhibit weak limb darkening that is not well constrained by fitting limb-darkening models to the data. We follow the method of Beaulieu et al. (2008) in deriving limb-darkening coefficients for HD 149026. Spitzer's 8.0 μm spectral response curve weights the intensities of a Kurucz ATLAS stellar atmosphere model (Castelli & Kurucz 2004; Teff = 6250 K, log (g) = 4.5 cgs, and [M/H] = 0.3), given as a function of wavelength and angle from the star's center. A least-squares minimization of the resulting curve determines the nonlinear limb-darkening coefficients (Claret 2000; a1a4 = 0.51477, −0.80525, 0.75683, −2.6168), which are then fixed for the three transit light-curve fits.

2.5.2. Ramp Models

We consider a multitude of ramp equations, R(t), all of which stem from three basic forms: exponential, logarithmic, and/or polynomial.

Equation (2)

Equation (3)

Equation (4)

Equation (5)

Equation (6)

Equation (7)

Equation (8)

Equation (9)

Equation (10)

Equation (11)

The time, t, is in units of phase (days) for secondary-eclipse (primary-transit) events. We use "+" and "−" subscripts in Equations (2)–(5) to denote the corresponding functional form. For example, Equation (2+) describes an exponentially decreasing, asymptotically constant ramp while Equation (2) describes an exponentially increasing, asymptotically constant ramp.

There is a physical interpretation applicable to the rising exponential ramps (Agol et al. 2010). Consider a population of charge traps due to an impurity in the detector's infrared material and a flux of photoelectrons through the material. The traps collect some fraction of electrons, releasing them randomly with some characteristic timescale that depends on the impurity. Bright sources saturate the traps, decreasing the fraction of captured electrons and raising the detected signal of a steady source according to the asymptotic rising exponential function in Equation (2). A double rising exponential (Equation (5)) approximates a rapidly saturating point spread function (PSF) core and slowly saturating wings (Knutson et al. 2008). It could also represent two impurities; the very short ramps of the HD149bs41 data set's visit sensitivity suggests the presence of an impurity that has a very short timescale and releases many of its electrons over the course of one cycle (∼30 minutes; see H07 Supplementary Figure 6). We tested for a common set of characteristic timescales in all of the rising exponential free parameters; however, even for the same planet in the same array, we did not achieve reasonable fits with all data sets. This may be due to inadvertent and inconsistent preflashing by the objects observed prior to our own observations. Despite its potential for providing a physical explanation for the ramps, the rising exponential does not always provide the best model according to the criteria in Section 2.6. This may be due to our fitting the final photometry and not the individual pixels' data and/or to the pointing instability producing unsteady illumination at the precision levels relevant here.

Figure 6.

Figure 6. BLISS map and pointing histogram of HD149bs11. Top: redder (bluer) colors indicate higher (lower) subpixel sensitivity. The horizontal and vertical black lines depict pixel boundaries. Bottom: colors indicate the number of points in a given bin, which, in this case, is 0.015 pixels in length and width. By recalculating the map at each step of the MCMC or minimizer, this technique substantially improves on that of B10, and beats all tested functional fits.

Standard image High-resolution image

Agol et al. (2010) advocate using a double rising exponential (Equation (5)) for 8.0 μm data due to its improved fit and smaller residuals, weaker dependence on aperture size, and less sensitivity in the clipping parameter. Similarly, Knutson et al. (2011) find that Equation (2) is sufficient for their 8.0 μm preflashed data sets, while Equation (5) is necessary for their non-preflashed data. We test all relevant ramp equations on data sets that exhibit time-dependent systematics and orthogonalize any correlated parameters that inhibit convergence (see Sections 2.5.3 and 2.7). Upon doing so, we find that we cannot corroborate the claims by Agol et al. (2010). Equation (5) should not be used for all 8.0 μm data sets because we typically find correlations between the eclipse depth and its ramp parameters that can double the latter's uncertainty relative to other models (see HD149bp42). Rather, we recommend a comprehensive examination of all relevant ramp equations before selecting the final model. See Sections 4 and 5 for discussion relevant to particular events.

2.5.3. Orthogonalization

The exponential model components have one difficulty: the Markov chain method used to assess the uncertainties does not converge to the posterior probability distribution, according to the criteria discussed in Section 2.7, even after tens of millions of iterations. The problem is a correlation between the exponential ramp parameters. Several recent analyses (e.g., K09; Stevenson et al. 2010; Campo et al. 2011) have "solved" the problem by fixing one of the exponential parameters. This is effectively profiling (slicing) rather than marginalizing (integrating) over the posterior parameter distribution, an approach that ignores correlated errors and may reduce the calculated error bar (see Press et al. 2007, Figure 15.6.3 and related text). In one case discussed below (HD149bp42), fixing parameters incorrectly decreased the calculated eclipse-depth uncertainty by 50%. There are two legitimate escapes from the problem. The first is to write a Monte Carlo sampler that explores the phase space more intelligently than the Metropolis random walk we and many others use. The second is to re-cast the equations in a form that eliminates the nonlinear correlation of the ramp parameters.

For this paper, we have expediently chosen the second approach. The version of Equation (2) presented here produces a more linear correlation between the r0 and r1 parameters than the original version in H07, whose exponent was −r(tt0). In some cases this converges quickly and the job is done. In others it still does not converge, so we run at least 105 iterations to sample the posterior distribution and then rewrite the model with a change of variables that orthogonalizes the most correlated parameters. This method does not modify the number of free parameters, nor involve any interpolations or approximations. For the selected correlated parameters, the orthogonalization shifts the origin to the center of the distribution, divides each parameter by its standard deviation to give a uniform scale in all directions, and rotates the subspace to minimize correlations (see Figure 3). A routine that prepares for principal components analysis (PCA) finds the transformation matrix from the distribution sample (we do not actually perform the PCA). We rerun the Markov chain using the new equations until it converges according to the criteria given below. Then we transform the points back to the familiar equations to assess parameter uncertainties. A simple example of a two-parameter orthogonalization of Equation (2) is

Equation (12)

where c0 and c1 are the ramp parameters in the rotated frame and θ is the rotation angle between coordinate systems. The arctangent of the slope of the best-fit line through the initial sample, projected into the r0r1 plane, determines θ. Such approximate orthogonalization of the posterior distribution has long been standard practice for improving Markov chain Monte Carlo (MCMC) performance (e.g., Hills & Smith 1992; Brooks 1998). Similar discussion can be found in Connolly et al. (1995), Pál (2008), and Cowan et al. (2009).

Figure 3.

Figure 3. Two-parameter orthogonalization example for HD149bs21 with histograms. The physical parameters (top panels) show a strong, nonlinear correlation, and asymmetric histograms; however, the orthogonalized parameters (bottom panels) are nearly uncorrelated and have symmetric histograms. Running a Markov chain method with the orthogonalized parameters reduces the convergence time.

Standard image High-resolution image

In effect, this method is the same as the first, accomplished by rotating the data and using the original sampler rather than writing a new one. This method works best for linearly correlated parameters and achieves moderate success with more exotic correlations. Converting to curvilinear coordinates or implementing manifold learning algorithms from nonlinear dimensionality reduction may offer further improvements in convergence time for the extreme cases (Lee & Verleysen 2007).

2.5.4. Flat-field (Position) Sensitivity Models

Most of the data sets presented here follow the standard time-series observing practice of keeping the object fixed to one location on the array (staring) to minimize position-dependent sensitivity effects. However, the H07 observation (HD149bs41) cycled through nine different nod positions (p = 0–8) in an attempt to use the unobserved positions in each frame to make a high-quality flat field that would correct the entire data set. This approach was unsuccessful and not repeated. Each position in the HD149bs41 data requires a flat-field correction, P(p), to account for the difference in pixel sensitivity. To eliminate correlations with the system flux (i.e., keep the correction from floating), we require the mean of all of the corrections to equal unity. We do this by freely varying p = 1–8 and equating p = 0 to the number of positions minus the sum of the other corrections.

For both 16 μm data sets, the telescope reacquires the target at one-third and two-thirds of the way into the observing runs. This action is similar to a nod because after reacquisition, the three sets of measured stellar centers are non-overlapping. We apply the same model component to these data as with HD149bs41, but only use three flat-field correction parameters (p = 0, 1, 2).

2.5.5. Visit-sensitivity Model

With HD149bs41, Spitzer completed 12 cycles through the 9 nod positions mentioned above for a total of 108 visits. Each visit has a briefer and steeper ramp compared to the overall ramp, R(t). As discussed in their SI, H07 use a 12-knot spline to model the visit-sensitivity effect, V(υ). They fix the final three knots to unity and allow the remaining nine parameters to vary. We model the visit-sensitivity ramp using

Equation (13)

which is identical in form to the model component used by K09. The only difference is that K09 use time as the independent variable while we use visit frame number, υ. In either case, the independent variable resets to zero upon moving to a new nod position.

2.6. Model Selection

For each data set, we test different photometry aperture sizes, detector ramp model components, and intrapixel sensitivity model components looking for the best combination. One must be careful in assessing which model is the "best" because χ2-like comparisons must derive from the same data set and different photometric apertures produce different data sets for this purpose. We use the standard deviation of the normalized residuals (SDNR) when comparing models of the same analytic form to different data sets. Once we have identified the best aperture size, we use the Bayesian information criterion (BIC; Schwarz 1978; Liddle 2007; Stevenson et al. 2010; Campo et al. 2011) to compare models with different numbers of free parameters:

Equation (14)

Here, epsiloni and σi are the residual and uncertainty of the ith data point, k is the total number of free parameters, and n is the number of data points used in the model fit. The Spitzer-supplied uncertainty frames are typically overestimated, so we scale σi such that the reduced χ2, χ2ν, equals unity for our best-fit model. Using the unscaled σi values in a least-squares minimizer improperly weights the data and results in a sub-optimal fit. When selecting between competing models for an observed data set, what matters is how the models compare in predicting the actually observed data, not how variable individual model fits may be among hypothetical realizations of the data. The BIC is defined so that the marginal likelihood ratio—the ratio of predictive probabilities for the observed data—is approximately e0.5ΔBIC. Note that what matters is the difference in BIC values (and thus the difference of maximum likelihood or minimum χ2 values), not the absolute values. Information criteria such as BIC may not apply when comparing fits with intrapixel maps to those with polynomial model components. See Appendix A for more discussion on the matter.

2.7. Error Estimation

We explore phase space and estimate uncertainties using an MCMC routine following the Metropolis random walk algorithm. See Campo et al. (2011) for more discussion on MCMC. The POET pipeline can test any combination of systematic model components, computing the SDNR and BIC for each fit, and can model multiple events simultaneously while sharing parameters such as the eclipse midpoint and depth. Each MCMC run begins with a least-squares minimization, a rescaling of the Spitzer-supplied uncertainties, a second least-squares minimization using the new uncertainties and, finally, at least 105 MCMC iterations to populate the posterior parameter distributions, from which we derive parameter uncertainties. We study parameter correlation plots and the posterior distribution to ensure that we have a reliable result, then publish them so that others may evaluate our work and compare to their own. We test for convergence every 105 steps, terminating only when the Gelman & Rubin (1992) diagnostic for all free parameters has dropped to within 1% of unity using all four quarters of the chain. We also examine trace and autocorrelation plots of each parameter to confirm convergence visually. We estimate the effective sample size (ESS; Kass et al. 1998) and autocorrelation time, τ, for the ith free parameter as follows:

Equation (15)

where m is the length of the MCMC chain, ρki) is the autocorrelation of lag k for the free parameter θi, and k' is the cutoff point such that ρk < 0.01. We use the longest autocorrelation time from each event to determine the number of steps between effectively independent samples for thinning each MCMC chain. The distribution of thinned samples quantifies parameter uncertainties.

3. BLISS MAPPING TECHNIQUE

3.1. Background

The change in pixel sensitivity with respect to stellar position on the detector is a well-known systematic with the Spitzer Space Telescope (Charbonneau et al. 2005; Knutson et al. 2008). This effect is particularly strong in IRAC's 3.6 and 4.5 μm channels but has also been seen at 5.8, 8.0, and 16 μm (Stevenson et al. 2010; Anderson et al. 2011). The position sensitivity in the latter channels is due to a pixelation effect (see Section 2 for a description) rather than an actual change in sensitivity over the pixel surface. In this section, we present a new technique for modeling these position-dependent systematics, called BLISS mapping.

The most common method for removing the intrapixel variability is to fit a polynomial in both spatial directions (Knutson et al. 2008). The polynomial order typically ranges from quadratic to sextic and may include cross terms. Other variations of this modeling method have applied multiple polynomials, one for each pixel quadrant, in order to find the best fit. Polynomial methods work reasonably well for data sets with small stellar position wander, resulting in a smoothly varying intrapixel sensitivity. An analysis becomes exceedingly complicated if the variation is not smooth or if strong correlations arise between parameters. These complications can increase uncertainty estimates in the best case scenario and, in the worst case, lead to incorrect results.

A new approach, pioneered by Ballard et al. (2010, hereafter B10), attempts to map the intrapixel variability on a subpixel-scale grid without assuming a specific functional form. For their particular light curve, they bin their flux and stellar positions into 20 s bins (∼145 points bin−1) before computing a sensitivity correction for each binned point. Each correction considers a set of flux values that does not include in-transit frames or frames from the current binned position. This set of flux values is Gaussian-weighted in both spatial directions relative to the position of the binned point being corrected, summed to a single value, then normalized by dividing by the summed Gaussian weighting function. This method effectively models not only the large-scale features in their data, but also a smaller-scale "corrugation" effect that a low-order polynomial cannot remove.

Moving away from a polynomial model is an excellent concept; however, this particular implementation has some drawbacks that limit its scope. For instance, ignoring the points during secondary eclipse requires that the out-of-eclipse portion of the data set be significantly longer than the in-eclipse portion, which is atypical in primary-transit and secondary-eclipse observations. Also, the weighting function computes too slowly to be used in an MCMC routine and is even slow when using a minimizer. The calculation would be even slower if the data were not binned into relatively long, 20 s time intervals. Figure 4 shows 120 s of vertical pixel positions from HD149bs11, which has a 0.4 s exposure duration versus 0.1 s for B10. The stellar center can vary significantly over a 20 s interval, indicating that positional information is lost when binning over such timescales.

Figure 4.

Figure 4. Vertical pixel positions from HD149bs11. We can track the motion of the stellar center to high precision (∼0.01 pixels) over a duration of only a few seconds. These oscillations are much faster than Spitzer's 1/2–1 hr oscillations, initially reported by Charbonneau et al. (2005). In a span of 20 s, the stellar centers can vary by >0.1 pixels. All positions are zero based.

Standard image High-resolution image

The idea of using a spline to interpolate position-dependent systematics stems from observed fine-scale sensitivity variations in some of our data sets that cannot be modeled by a low-order polynomial (see the pixelation effect in Figures 2 and 5 for examples). We initially attempted to map the pixel surface using a bicubic spline because we wanted a smoothly varying model; however, this type of interpolation is prohibitively computationally expensive. A typical secondary-eclipse observation spans a 0.3 × 0.3 pixel region. Placing knots at 0.05 pixel intervals requires 49 free parameters. Polynomial models typically require less than 10 parameters. Adequately describing the fine-scale sensitivity variations requires a large number of knots, but varying all of these knot parameters at each step of an MCMC routine leads to extremely slow convergence. Our new BLISS mapping technique circumvents the problem of slow convergence by directly computing the knot parameters, rather then allowing them to vary freely. Thus, we can use >1000 knots to map the pixel surface at high resolution (see Figure 5).

Figure 5.

Figure 5. BLISS maps illustrating the position-dependent pixelation effect. The maps use 1× (left), 2× (center), and 5× (right) interpolated, 2.5 pixel-aperture photometry. Redder (bluer) colors indicate more (less) flux within the aperture. The bin size is 0.01 pixels for all maps. The horizontal and vertical black lines depict pixel boundaries. Without subpixel interpolation, the pixelation effect is significant, but it is progressively reduced with 2×- and 5×-interpolated photometry. For time-series data such as these, one can calculate a BLISS map to correct for pixelation.

Standard image High-resolution image

3.2. Implementation

In seeking methods that compute faster than bicubic interpolation, we give up the constraint of differentiability. Nearest-neighbor interpolation (NNI) is simple, assigning each point the value of its nearest knot. Bilinear interpolation (BLI) is a straightforward calculation (see Equation (16)) that maintains continuity over the pixel surface, unlike NNI, and, given sufficiently precise centering, should be more accurate. Using BLI, the flux at a point (x, y) is

Equation (16)

This is a distance-weighted average of the flux of the four nearest knots, FIP(xi, yj), where i and j are horizontal and vertical indices for a rectangular grid of knots. This method computes faster than bicubic interpolation and may achieve comparable smoothness within the errors with less computing time simply by increasing the number of knots (see Figure 2).

We create a rectangular grid of knots that spans the range of centers in x and y. Each point in the data set associates with its nearest knot. For BLI, we compute the distances from each point to its four nearest knots, for Equation (16). If one or more knots in Equation (16) does not have any assigned points, we use NNI there instead, or the calculation would fail. This usually only occurs near the boundary of the grid of knots. We precompute the knot associations and distances prior to initiating the MCMC as they remain constant from iteration to iteration.

We do not treat the knots as MCMC jump parameters. Rather, we step all other free parameters from Equation (1), generate a new model using these new jump parameters, then divide the observed flux by the new model (FIP(x, y) = Fobs/FsE(t)R(t)V(ν)P(p)). Hypothetically, the residuals of FIP(x, y) contain only position-dependent flux variations. The flux value of a particular knot is the mean of FIP(x, y) for the points associated with that knot. We also tried median and weighted average knot values but the results did not improve and these calculations are much slower. Next, we generate the sensitivity map (M[x, y], Figure 6) by interpolating the flux from the knots to all of the observed points using BLI and/or NNI. We tested various weighted smoothing functions when generating the sensitivity maps but, again, there was no improvement in the results. Finally, M(x, y) enters Equation (1) for comparison with the observed flux to obtain an estimate of the goodness of fit and determine the MCMC acceptance probability. This process repeats for each step of the MCMC routine or minimizer.

3.3. Determining the Optimal Bin Size

The accuracy of the BLISS mapping technique depends critically on the bin size, or resolution in position space; however, there is a tradeoff between bin size and speed. Decreasing the bin size requires more knots and runs slower but may be necessary to adequately resolve sensitivity changes on the pixel surface. There is, however, a practical limit to how small the bins can be. A bin for every measurement will always produce a perfect fit, resulting in a negative number of degrees of freedom and leaving the eclipse parameters unconstrained. Bin sizes must be small enough to resolve real, small-scale variations on the pixel surface but large enough to mix in- and out-of-eclipse points. This mixture helps to minimize correlations between the eclipse parameters and the knots in the sensitivity map.

To establish the optimal bin size, we consider a range of bin sizes for both BLI and NNI using the 3.6 μm data set (HD149bs11), which has the strongest position-dependent systematic. We draw several conclusions from the results in Table 2. First, the SDNR (our measure of goodness of fit) decreases with decreasing bin size, indicating a better fit. Unfortunately, the SDNR decreases indefinitely, so it cannot constrain the minimum bin size. Second, BLI fits the data better than NNI for bin sizes greater than 0.015 pixels. The opposite is true for smaller bin sizes, which is counterintuitive because BLI should always outperform NNI, assuming no uncertainty in the position. Thus, the bin size at which NNI outperforms BLI is indicative of the centering precision for a particular data set. We estimate the precision for our analysis of HD149bs11 to be 0.009 pixels in x and 0.007 pixels in y by calculating the root mean squared (rms) frame-to-frame position difference. This agrees well with Figure 4 and is consistent with the crossover bin size of 0.015 pixels, where NNI outperforms BLI. Last, we place an upper limit on the bin size by noting that the eclipse depths become inconsistent with each other for pixel sizes ⩾0.050 using BLI and ⩾0.020 using NNI.

Table 2. BLISS Map Test—Variable Bin Size

Model Bin Size SDNR Eclipse Depth
  (Pixels)   (%)
BLI 0.100 0.0028736 0.063 ± 0.003
BLI 0.050 0.0028222 0.043 ± 0.003
BLI 0.020 0.0028076 0.040 ± 0.003
BLI 0.015 0.0028031 0.040 ± 0.003
BLI 0.010 0.0027917 0.040 ± 0.003
BLI 0.005 0.0027403 0.040 ± 0.003
BLI 0.002 0.0024796 0.039 ± 0.003
NNI 0.100 0.0029435 0.071 ± 0.003
NNI 0.050 0.0028527 0.054 ± 0.003
NNI 0.020 0.0028116 0.044 ± 0.003
NNI 0.015 0.0028024 0.041 ± 0.003
NNI 0.010 0.0027865 0.041 ± 0.003
NNI 0.005 0.0027109 0.040 ± 0.003
NNI 0.002 0.0023773 0.039 ± 0.003

Download table as:  ASCIITypeset image

We conclude that, whenever possible, BLI should be used with a bin size that is independent of the eclipse depth and has a lower SDNR than NNI. We have found cases where BLI is never better than NNI. In those instances, the position dependence is so weak that the intrapixel model component is unnecessary. A better fit with BLI, compared to NNI, is thus a good indicator that a position-dependence systematic is present in a given data set. For the test cases shown in Table 2, using BLI with a bin size between 0.015 and 0.020 pixels is recommended based on our criteria.

Precise centering is important for this method because imprecision limits the smallest meaningful bin size. Our preliminary work (see the SI of Stevenson et al. 2010) indicates that the Gaussian and least-asymmetry centering methods are better than the center of light; additional work is in preparation.

3.4. Comparing Intrapixel Models

To compare the BLISS mapping technique with other intrapixel methods, we fit six different intrapixel models to the HD149bs11 data set. These models are quadratic, cubic, and sextic polynomials (including lower-order cross terms), B10's new weighted sensitivity function (fixing σx to 0.021 and σy to 0.0079), BLI, and NNI. The eclipse depth in B10's model is slightly shallower than the other models, which are all well within 1σ of one another, but the uncertainties are essentially identical (see Table 3). The BLISS models show significant improvement in SDNR compared to the others. We cannot use the BIC to compare the BLISS model components with the polynomial model components for the reasons discussed in Appendix A.

Table 3. BLISS Map Test—Comparing to Other Intrapixel Models

Model Bin Size SDNR Eclipse Depth
      (%)
Ballard 2.4 sa 0.0028230 0.034 ± 0.003
Cubic ... 0.0028180 0.039 ± 0.003
Sextic ... 0.0028157 0.040 ± 0.003
Quadratic ... 0.0028186 0.041 ± 0.003
BLI 0.015 pixels 0.0028031 0.040 ± 0.003
NNI 0.015 pixels 0.0028024 0.041 ± 0.003

Notes. aLonger bin sizes were considered but produced worse results. Shorter bin sizes are prohibitively expensive to compute and are below the limit of detectable motion.

Download table as:  ASCIITypeset image

BLISS mapping represents a substantial improvements over polynomial model components because BLI and NNI can model real structure (such as pixelation) that cannot be modeled with low-order polynomials, they encounter fewer correlations between free parameters, and require fewer iterations to assess parameter uncertainties.

4. PRIMARY-TRANSIT FITS AND RESULTS

We present the scaling of the rms model residuals versus bin size (a test of correlation in time) in Figure 7, the best-fit transit light curves in Figure 8, a comparison between two fits in Table 4, and the full set of best-fit transit parameters in Appendix B. The electronic supplement contains light-curve files. Below, we discuss each observation to explain how we arrived at the final results.

Figure 7.

Figure 7. rms residual flux vs. bin size for three HD 149026b transits. Black vertical lines at each bin size depict 1σ uncertainties on the rms residuals (${\rm rms} / \sqrt{2N}$, where N is the number of bins). The red line shows the predicted standard error for Gaussian noise, the dotted vertical blue line indicates the ingress/egress timescale, and the dashed vertical green line indicates the transit duration timescale. Any rms residuals that are several σ above the red line would indicate correlated noise at that bin size. When considering the effects of correlations on transit depth, the bin size of interest is the transit duration and not the ingress/egress time. The shorthanded legend labels correspond to the last three characters in each event's label (e.g., p41 = HD149bp41).

Standard image High-resolution image
Figure 8.

Figure 8. Raw (left), binned (center), and systematics-corrected (right) primary-transit light curves of HD 149026b at 8.0 μm. The results are normalized to the system flux and shifted vertically for ease of comparison. The colored lines are best-fit models, the black curves omit their transit model components, and the error bars are 1σ uncertainties. The shorthanded legend labels correspond to the last three characters in each event's label (e.g., p41 = HD149bp41). The pixelation effect (see Section 2.4) is most prevalent in HD149bp43.

Standard image High-resolution image

Table 4. Individual Transit Model Fits

Label R(t) M(x,y) SDNR ΔBIC Rp/R
HD149bp41 3 BLI 0.0083449 0.0 0.0502 ± 0.0011
HD149bp41 8 BLI 0.0083444 0.4 0.0517 ± 0.0009
HD149bp41 6 BLI 0.0083444 0.4 0.0517 ± 0.0010
HD149bp42 8 BLI 0.0083565 0.0 0.0503 ± 0.0008
HD149bp42 6 BLI 0.0083564 4.0 0.0513 ± 0.0009
HD149bp42 5 BLI 0.0083562 5.6 0.0502 ± 0.0016
HD149bp43 2 BLI 0.0083681 0.0 0.0536 ± 0.0008
HD149bp43 8 BLI 0.0083682 6.5 0.0525 ± 0.0010
HD149bp43 3 BLI 0.0083685 7.0 0.0516 ± 0.0011
HD149bp43 6 BLI 0.0083682 7.2 0.0527 ± 0.0010

Download table as:  ASCIITypeset image

4.1. Three Fits at 8.0 μm

At each 0.25 pixel increment in photometry aperture size, we model the time-dependent systematic with the ramps listed in Equations (2)–(11). An aperture size of 3.5 pixels produces the lowest SDNR values for all ramps and for all transit events. We estimate the background flux using an annulus from 7 to 15 pixels centered on the star. We follow the method described in Section 3.3 when determining the optimal bin sizes of the BLISS maps.

4.1.1. HD149bp41

There are 26 consecutive frames (19,494–19,519) shifted horizontally by exactly 1 pixel; we flag these frames as bad. After clipping the first 5000 data points (∼33.3 minutes, q = 5000), many of the ramps fit the time-dependent systematic equally well, but the fits exhibit a large range of radius ratios (Rp/R = 0.0494 to 0.0517). Of the top three models shown in Table 4, Equation (3) has the lowest BIC value and favors a moderate radius ratio of 0.0502 ± 0.0011. For comparison, Nutzman et al. (2009) and Carter et al. (2009) report values of 0.05158 ± 0.00077 and 0.5188 ± 0.00085, respectively. To model the ramp, both adopt a quadratic function in ln (t) Equation (8) with t0 fixed to a time a few minutes prior to the first observation. Fixing parameters can cause an MCMC run to underestimate uncertainties of the remaining free parameters (see Section 2.5.3). Instead, we orthogonalize the correlated parameters (system flux and all three ramp parameters in Equation (3) to provide a coordinate system in which our MCMC routine can sample efficiently. There is a weak position dependence (see Figure 8) in both x- and y-directions that we model with the BLISS mapping technique using 0.030 pixel bins.

4.1.2. HD149bp42

For all ramps with reasonable fits, we find that the planet-to-star radius ratios range from 0.0444 to 0.513. The three best models appear in Table 4 with a range of uncertainties in Rp/R. The ramp parameters from Equation (5) correlate most strongly with the radius ratio, resulting in an uncertainty that is twice that of Equation (8). Fixing ramp parameters in Equation (5) erroneously improves the radius ratio uncertainty to 0.008%. Equation (8) has the lowest BIC value, so we use it to fit the full data set (q = 0). The data exhibit only a minor position dependence in the x-direction; however, the significant improvement in SDNR indicates that we should include the BLISS map during the joint model fit.

4.1.3. HD149bp43

Unlike the previous two data sets, HD149bp43 was preflashed to mitigate the ramp effect (see K09 for details). Thus, we do not need to clip a significant initial portion of the data set or use a double rising exponential. Instead, we clip only the first 1000 data points (∼6.7 minutes) and use Equation (2) to model the time-dependent systematic. As seen in Table 4, the HD149bp43 transit depth is consistently deeper than the other two data sets. The Rp/R parameter is independent of our choice of q but is dependent on the choice of ramp model components, ranging from 0.0516 to 0.0536. BIC favors the deepest transit depth, resulting in a radius ratio that is larger than other best-fit ratios by ∼4σ. For comparison, K09 use Equation (8) with a fixed t0 parameter and report a radius ratio of 0.05253 ± 0.00076. We achieve the same underestimated uncertainty using a similarly constrained t0 parameter. A relatively strong position dependence is evident in Figure 8 and is modeled with a BLISS map using 0.050 pixel bins.

4.2. Joint Fit

We perform two joint-model fits, each requiring less than 2 × 106 iterations to estimate uncertainties. The first considers only the three transits analyzed here while the second also considers the more precise NICMOS data from Carter et al. (2009) by placing priors on i and a/R. Both fits in Table 5 are consistent with previous results from Knutson et al. (2010; Rp/R = 0.0522 ± 0.0008) and Carter et al. (2009; Rp/R = 0.0519 ± 0.0008) and have improved estimates of the radius ratio. The uncertainties in the duration and ingress/egress times for the independent fit are significantly larger than those from the fit with Carter et al. (2009) priors.

Table 5. Joint Transit Model Fits

Parameters Independent Fit Carter et al. (2009) Priorsa
Rp/R 0.0514 ± 0.0006 0.0518 ± 0.0006
i (°) 87.2+1.6− 2.1 84.6 ± 0.5
a/R 6.8+0.3− 0.7 5.98 ± 0.17
Impact parameter 0.33+0.21− 0.19 0.57 ± 0.04
Transit depth (%) 0.264 ± 0.006 0.268 ± 0.006
Duration (t1t4, hr) 3.23+0.04− 0.02 3.286 ± 0.019
Ingress/egress (hr) 0.178+0.043− 0.015 0.234 ± 0.012
HD149bp41    
Midpoint (MJDUTC)b 4327.3719 ± 0.0005 4327.3720 ± 0.0005
Midpoint (MJDTDB)b 4327.3726 ± 0.0005 4327.3727 ± 0.0005
OC (minutes)c −1.0 ± 0.7 −0.9 ± 0.8
SDNR 0.0083440 0.0083440
HD149bp42    
Midpoint (MJDUTC)b 4356.1316 ± 0.0005 4356.1316 ± 0.0005
Midpoint (MJDTDB)b 4356.1323 ± 0.0005 4356.1323 ± 0.0005
OC (minutes)c 0.2 ± 0.7 0.0 ± 0.8
SDNR 0.0083550 0.0083556
HD149bp43    
Midpoint (MJDUTC)b 4597.7070 ± 0.0004 4597.7068 ± 0.0005
Midpoint (MJDTDB)b 4597.7077 ± 0.0004 4597.7075 ± 0.0005
OC (minutes)c 0.8 ± 0.7 0.6 ± 0.6
SDNR 0.0083690 0.0083691

Notes. aWe place priors on i and a/R using values from Carter et al. (2009). bMJD = BJD−2450,000. cComputed using the period and ephemeris from K09; (p = 2.8758925 ± 0.0000023 days, t0 = 2454597.70645 ± 0.00018 BJDUTC).

Download table as:  ASCIITypeset image

5. SECONDARY-ECLIPSE FITS AND RESULTS

There were 11 secondary-eclipse observations. We considered whether the eclipse duration is consistent with the more precise transit duration in these fits, which is likely if the orbit is nearly circular. In a joint fit of all secondary-eclipse events, the strong signal from HD149bs11 dominates the shared eclipse duration. The best-fit eclipse duration was 4.5 ± 3.3 minutes longer than, but still consistent with, the transit duration from Carter et al. (2009), and the mid-eclipse phases were in all but one case within 1.5σ of 0.5, together indicating circularity. Since the transit and eclipse durations are consistent, we apply priors to the eclipse duration and ingress/egress times using the values given in Table 5. Unless otherwise stated, we estimated the background flux using an annulus from 7 to 15 pixels that was centered on the star. We present the rms model residuals in Figure 9, the best-fit light curves in Figure 10, and the best-fit parameters in Appendix B. The electronic supplement contains light-curve files. Below, we discuss each observation in detail.

Figure 9.

Figure 9. Same as Figure 7 except for 11 HD 149026b eclipses. The shorthanded legend labels correspond to the last three characters in each event's label (e.g., s11 = HD149bs11).

Standard image High-resolution image
Figure 10.

Figure 10. Binned (left) and systematics-corrected (center and right) secondary-eclipse light curves of HD 149026b in five Spitzer channels. The results are normalized to the system flux and shifted vertically for ease of comparison. The colored lines are best-fit models and the error bars are 1σ uncertainties. The shorthanded legend labels correspond to the last three characters in each event's label (e.g., s11 = HD149bs11).

Standard image High-resolution image

5.1. Fit at 3.6 μm: HD149bs11

For this data set, we find that BLI outperforms NNI down to a bin size of 0.015 pixels when we exclude bins with less than four measurements. Bins with fewer data points are insufficiently sampled to compute a reliable mean flux for the knot value. The linear ramp Equation (10) fits best. The posterior parameter distributions (histograms) and parameter correlation plots (including knot values in the BLISS map) are in Figures 1113.

Figure 11.

Figure 11.

Parameter histograms for HD149bs11. We plot every 200th step in the MCMC chain to decorrelate parameter values. The BLISS map knots are similarly distributed. (The complete figure set (13 images) and a color version of this figure are available in the online journal.)

Standard image High-resolution image
Figure 12.

Figure 12.

Parameter correlations for HD149bs11. The background color depicts the absolute value of the correlation coefficient. The uncertainties produced from the MCMC method fully account for correlations between free parameters (e.g., eclipse flux ratio and system flux). We plot every 200th step in the MCMC chain to decorrelate parameter values. (The complete figure set (13 images) and a color version of this figure are available in the online journal.)

Standard image High-resolution image
Figure 13.

Figure 13. Correlation coefficients between eclipse depth and computed BLISS map knots for HD149bs11. The presence of relatively strong correlation regions (in red) indicates that computing the BLISS map at each step of an MCMC routine is necessary to assess the uncertainty on the eclipse depth correctly, as opposed to fixing the map as is done by B10. In this case, fixing the BLISS map to its post-minimizer values leads to an erroneous 13% decrease in the eclipse-depth error estimate.

Standard image High-resolution image

5.2. Fit at 4.5 μm: HD149bs21

Using the strategy described in Section 3.3, BLI achieves a better fit than NNI with bin sizes of 0.012 pixels along x and 0.006 pixels along y. Additionally, we fit the intrapixel sensitivity with three different polynomial models ranging between second and sixth order. After clipping the first 5000 points (q = 5000), the eclipse depths using various ramp and intrapixel model components are consistent (see Table 6); however, the SDNR clearly favors BLI and the BIC favors Equation (2+) to model the systematics. To minimize the convergence time in our MCMC chains, we orthogonalize the eclipse depth, system flux, and both ramp parameters (r0 and r1). All of the model fits exhibit, to various degrees, bimodal distributions in the eclipse-midpoint histograms. The lesser peak occurs at a phase of 0.497 and is likely a result of the model trying to fit the eclipse egress to the points from phases of 0.514 to 0.520, which are consistently above the secondary eclipse by 1σ–2σ (see Figure 10, center panel).

Table 6. HD149bs21—Comparing Model Fits

R(t) M(x, y) SDNR ΔBIC Ecl. Depth
        (%)
2+ BLI 0.0038800 0.0 0.034 ± 0.006
11 BLI 0.0038800 1.4 0.033 ± 0.007
3+ BLI 0.0038800 10.8 0.033 ± 0.006
2+ Quad. Poly. 0.0038961 ... 0.032 ± 0.006
2+ Cubic Poly. 0.0038954 ... 0.033 ± 0.006
2+ Sextic Poly. 0.0038941 ... 0.034 ± 0.006

Download table as:  ASCIITypeset image

5.3. Three Fits at 5.8 μm

Using the processes described below, we choose the best-fit model for each event, then perform a joint fit with a single eclipse-depth parameter.

5.3.1. HD149bs31

Initially reported by Stevenson et al. (2010), we find clear evidence of pixelation at 5.8 μm (see Figure 2). A relatively large bin size of 0.04 pixels is appropriate for HD149bs31 using BLI in combination with Equation (2+) to express the time-dependent flux variation. We orthogonalize the system flux and both ramp parameters when computing uncertainties. The eclipse-midpoint histogram peaks at a phase of 0.502 and has a broad uncertainty of 0.005. The best-fit eclipse depth is 0.044% ± 0.016%.

5.3.2. HD149bs32

The pixelation effect in HD149bs32 is weak because stellar centers fall predominantly near the middle of an interpolated subpixel (i.e., away from the blue peaks in the right panel of Figure 5). As such, an intrapixel model component is unnecessary and we model its systematics solely by Equation (2+). We orthogonalize the same parameters as with HD149bs31. The eclipse-midpoint histogram shows a strong bimodal distribution, with peak phase values of 0.496 and 0.501. The latter is favored with an eclipse depth of 0.038% ± 0.017%. This data set points to the illegitimacy of fixing one of the ramp parameters (see Section 2.5.3). The eclipse depth is correlated with the ramp parameters, so fixing one of them erroneously improves the eclipse-depth uncertainty to 0.012%.

5.3.3. HD149bs33

For the same reasons as with HD149bs32, no intrapixel model component is needed with HD149bs33. We model the declining flux using Equation (2+), orthogonalize the same three parameters as above, and find an eclipse depth of 0.047% ± 0.016%. The histogram of eclipse midpoints is clearly bimodal but favors the best-fit value of 0.500+0.003− 0.005.

5.3.4. 5.8 μm Joint Fit

To improve S/N on the eclipse depth, we share this parameter in a joint fit of all three data sets. We retain individual eclipse-midpoint times for the subsequent orbital analysis. The MCMC chain converged after 6 × 105 iterations. The combined light curve in Figure 14 illustrates the improvement when compared to the three 5.8 μm light curves in Figure 10. The best simultaneous fit favors an eclipse depth of 0.044% ± 0.010%.

Figure 14.

Figure 14. Combined and binned eclipse light curves at 5.8 and 8.0 μm. Note the improved S/N achieved with combined modeling, compared to Figure 10. The best joint-fit HD149bs32 and HD149bs41 models are plotted for comparison.

Standard image High-resolution image

5.4. Four Fits at 8.0 μm

Similarly to the fits at 5.8 μm, we fit models to each data set individually then use the best models in a 1.1 × 106 iteration joint fit that shares a common eclipse depth.

5.4.1. HD149bs41

The significant improvements in our pipeline since the original analysis by H07 warrant a new analysis of HD149bs41. We follow all of our current techniques described in Section 2 and test all of the listed ramp model components. As with H07 and K09, we find that Equation (2) best describes the overall ramp. The smaller ramps associated with each telescope movement are best described by Equation (13), according to BIC; however, we also present H07's 12-point spline for comparison (see Table 7). Each model employs a constant-flux offset at each of the nine nod positions, of which eight are free parameters as described in Section 2.

Table 7. HD149bs41—Comparing Model Fits

R(t) M(x, y) Visit Sensitivity SDNR ΔBIC Ecl. Depth
          (%)
2 ... Equation (13) 0.0084575   0 0.049
2 ... 12-pt Spline 0.0084559  59 0.049
2 9 ×Linear Equation (13) 0.0084510 126 0.050
2 9 ×Linear 12-pt Spline 0.0084494 184 0.050

Note. Bold indicates the model selected for the joint fit.

Download table as:  ASCIITypeset image

Due to the nodding motion with this particular data set, BLI and NNI are inappropriate models to use. We can see in Figure 15, which illustrates 1 of the 9 nod positions, that the pixel position is slightly different for each of the 12 visits to this position. This behavior introduces a strong time dependence in the position sensitivity correction that cannot be disentangled. Our best attempt to correct for the position sensitivity uses a linear correction in two dimensions for each of the nine nod positions (9 ×Linear). Table 7 compares the four best model combinations. Compared to K09, our flux offsets are multiplicative rather than additive (see Equation (1)) and our final model does not fix either of the ramp parameters. As a result, our eclipse-depth uncertainty is larger (0.049% ± 0.016%).

Figure 15.

Figure 15. Pointing histogram for one of nine nod positions of HD149bs41. The small, 0.01 pixel bin size clearly shows that the positions of the 12 visits have very little overlap, resulting in a time-dependent position sensitivity and making the data impossible to model accurately using a BLISS map. The small footprint size demonstrates the difficulty of making a definitive IRAC intrapixel map using all the stellar staring data in each channel. The horizontal and vertical black lines represent pixel boundaries.

Standard image High-resolution image

5.4.2. HD149bs42

We use a 0.04 pixel bin size and only consider bins with at least eight points to ignore an outlier near subpixel location (14.36, 15.24). Table 8 contains ΔBIC values and best-fit eclipse depths from our least-squares minimizer for three different values of the clipping parameter: q = 0, 5000, and 10,000. This parameter ignores the given number of data points from the beginning of the observation and is a common procedure (Knutson et al. 2011) when trying to find the best-fitting ramp. The table indicates that all but one of the eclipse depths are consistent for q = 10, 000 and that Equations (4) and (5) are consistent at all three q values. Our MCMC routine finds strong nonlinear correlations in all ramp model components except Equation (11), which exhibits linear correlations that are easily handled by orthogonalizing the system flux and both ramp parameters. We use Equation (11) with q = 10,000 in the joint model fit. The eclipse depth for the competing solution (Equation (2)) differs by less than 1σ.

Table 8. HD149bs42—Comparing Model Fits

q: 0 0 5000 5000 10,000 10,000
R(t) Ecl. Depth ΔBIC Ecl. Depth ΔBIC Ecl. Depth ΔBIC
  (%)   (%)   (%)  
2 0.149  59 0.111  1 0.068  0
3 0.030   4 0.056  2 0.068 11
4 0.065   7 0.065  9 0.063 19
5 0.065   8 0.062  9 0.061 20
6 0.132  47 0.099  6 0.067 10
7 0.096  37 0.086 15 0.065 20
8 0.087   0 0.071  0 0.068 10
9 0.062  16 0.052 17 0.049 28
11 0.197 205 0.128 31 0.064 0

Note. Bold indicates the model selected for the joint fit.

Download table as:  ASCIITypeset image

5.4.3. HD149bs43

We choose a relatively large bin size of 0.03 pixels because the intrapixel effect is minimal (position dependence is flat) and we do not want to overfit the edges of position space where there are few data points. Smaller bin sizes result in similar eclipse depths. Without BLI, we find an eclipse depth of 0.040% ± 0.008%, nearly identical to K09; however, this fit does not have the lowest BIC value (ΔBIC = 112). There is a small but clear position dependence that, once accounted for, results in a marginally deeper eclipse of 0.044% ± 0.008%. We confirm that the eclipse midpoint is noticeably earlier (by 4σ) than the expected phase value of 0.5, but do not claim a detection of eclipse timing variation because HD149bs21 measures the preceding eclipse with a much stronger S/N and is consistent with a circular orbit. Fixing the phase of mid-eclipse to 0.5 results in a marginally shallower eclipse depth (0.039% ± 0.008%) and a larger SDNR value.

5.4.4. HD149bs44

Relative to the other 8.0 μm events, HD149bs44 exhibits significantly larger SDNR and uncertainty scaling factor values (see Appendix B). The uncertainty scaling factor renormalizes the error bars such that χ2ν = 1, so a smaller scaling factor indicates a better fit. For this noisy data set, the models achieve relatively poor fits compared to other 8.0 μm data sets.

The ramp models that produce the most consistent eclipse depths in Table 9 are Equations (4) and (11) for q = 5000, 7500, and 10,000. Smaller values of q produce inconsistent eclipse depths for all ramp models; larger q values do not provide sufficient out-of-eclipse baselines. For q = 7500, most of the ramps find eclipse depths that are in agreement with the consistent values given by Equations (4) and (11). Of these ramps, Equations (2) and (11) share the lowest BIC value; however, the quadratic parameter in Equation (11) correlates strongly with the eclipse depth, resulting in a larger uncertainty (0.017 versus 0.013), so we select Equation (2) with q = 7500 for our final joint model. This data set also applies a BLISS map with 0.025 pixel bin sizes and at least 10 points bin−1.

Table 9. HD149bs44—Comparing Model Fits

q: 5000 5000 7500 7500 10,000 10,000
R(t) Ecl. Depth ΔBIC Ecl. Depth ΔBIC Ecl. Depth ΔBIC
  (%)   (%)   (%)  
2 0.072 0 0.066 0 0.085 0
3 0.075 11 0.069 11 0.085 11
4 0.073 23 0.068 22 0.069 24
5 0.072 22 0.066 22 0.085 21
6 0.074 11 0.069 11 0.082 11
7 0.089 17 0.096 10 0.086 22
9 0.073 33 0.067 22 0.081 32
11 0.073 1 0.069 0 0.069 3

Note. Bold indicates the model selected for the joint fit.

Download table as:  ASCIITypeset image

5.5. Two Fits at 16 μm

Each event consists of 1050 exposures, divided serially into three 350-image sequences at non-overlapping stellar centers on the detector. The second sequence (p = 1) is completely within the secondary eclipse and, because of the free position offset parameter, has no impact on the eclipse depth (but still helps to model the systematics). Unfortunately, this means that the eclipse depth is completely determined by the small number of points after ingress in the first sequence and before egress in the last sequence.

To avoid residual effects from potentially bright previous targets, neither of the two 16 μm observations was positioned at the center of the array. As a result, the outer radius of the sky annulus is limited to 11 and 12 pixels for HD149bs51 and HD149bs52, respectively, to avoid the flux falloff at the edges of the blue peak-up array. We apply both aperture and optimal photometry (Horne 1986; Deming et al. 2005); the former produces cleaner results for both data sets.

Figure 16 displays the best-fit eclipse depths and corresponding SDNR values versus aperture size for four competing models in each data set. The first applies no intrapixel correction, the second uses three linear components (one at each position), the third uses BLI, and the final model uses NNI. The first two models also apply a position offset to account for the differences in pixel sensitivity. This offset is redundant for BLI and NNI.

Figure 16.

Figure 16. Best-fit eclipse depths and corresponding SDNR values for HD149bs51 and HD149bs52. Both are plotted as a function of photometry aperture size for the HD149bs51 (upper panel) and HD149bs52 (lower panel) data sets. The labels refer to the type of intrapixel model component used. Here, "None" uses no model, "3*Ln" uses a linear function in x and y at each of the three positions, and "BLI" and "NNI" both use our new BLISS mapping technique with 0.02 pixel bins. In all but one case, the best aperture size is 2.0 pixels, according to SDNR. Due to the weak eclipse signal, the phase of mid-eclipse is fixed to 0.5. A typical 1σ eclipse-depth uncertainty is 0.037%.

Standard image High-resolution image

Although these models find similar (<1σ) eclipse depths at the best aperture size of 2.0 pixels, this is not the case for other aperture sizes. Models using BLI or NNI produce the most consistent eclipse depths and result in lower SDNR values (see Tables 10 and 11). We test a range of bin sizes for BLISS mapping but find that NNI consistently outperforms BLI. We take this as evidence that there are insufficient data points in most bins to model the weak position dependence accurately. As such, we consider only the last two models and select no intrapixel model for HD149bs51 and the 3  × linear model for HD149bs52 to perform the joint fit. For improved convergence in our MCMC chains, we orthogonalize the system fluxes and both ramp terms in Equations (11) and (2) for HD149bs51 and HD149bs52, respectively. Due to the weak eclipse signal, the midpoint is fixed to a phase of 0.5 for the individual fits; the joint fit achieves a sufficient S/N to share the eclipse midpoint as a free parameter. The final model fit uses 1.6 × 106 iterations and is in Appendix B.

Table 10. HD149bs51—Comparing Model Fits

R(t) M(x, y) SDNR ΔBIC Ecl. Depth
        (%)
11 NNI 0.002890 0 0.059 ± 0.038
11 BLI 0.002924 21 0.061 ± 0.035
11 ... 0.003119 169 0.068 ± 0.038
11 3 ×Linear 0.003066 175 0.060 ± 0.037

Download table as:  ASCIITypeset image

Table 11. HD149bs52—Comparing Model Fits

R(t) M(x, y) SDNR ΔBIC Ecl. Depth (%)
2 NNI 0.003166 0 0.081 ± 0.036
2 BLI 0.003211 23 0.068 ± 0.035
2 3 ×Linear 0.003378 175 0.074 ± 0.038
2 ... 0.003574 243 0.066 ± 0.034

Download table as:  ASCIITypeset image

We find that the eclipse depths for both data sets vary with the choice of eclipse midpoint. Using the midpoint from our joint model (0.5015, see Appendix B), we find that the best-fit eclipse depth differs by up to 1σ from the values listed in Tables 10 and 11, where the midpoint is fixed to 0.5. We also test joint fits without the data at p = 1 and find a comparable joint-fit eclipse depth of 0.099% ± 0.035%. The small change is likely the result of weak correlations with the ramp parameters. We include all positions in the final fit as this results in a lower SDNR and a small improvement in the eclipse-depth uncertainty.

6. ATMOSPHERE

In order to understand the atmosphere of the planet better, we compare our measured flux ratios to those generated from a model atmosphere. We simulate the atmosphere of HD 149026b using the model presented by Fortney et al. (2005, 2006, 2008). See Fortney et al. (2008) for a description of the heritage of the model, which includes solar system planets and brown dwarfs in addition to exoplanets. The chemical mixing ratios used assume chemical equilibrium, following Lodders & Fegley (2002), at both solar metallicity ("1 × solar") as well as 30 × solar, using the abundances of Lodders (2003). The opacity database is described by Freedman et al. (2008), with an update to include CO2 opacity.

We have generated chemistry/opacity grids with and without the opacity of gaseous TiO/VO. These gases, which are strong absorbers of optical flux, may be responsible for the temperature inversions diagnosed in the atmosphere of some planets (e.g., Hubeny et al. 2003; Fortney et al. 2006, 2008), but see Spiegel et al. (2009) for important caveats. The mid-infrared flux ratios observed for hot Jupiters are quite diverse, but high flux ratios (and corresponding large brightness temperatures) in the mid-infrared, together with small 3.6–4.5 μm ratios, have been found in models with temperature inversions (Fortney et al. 2006; Burrows et al. 2007, 2008; Madhusudhan & Seager 2009; Spiegel & Burrows 2010), and these models have had some success in comparisons with data (see Seager & Deming 2010 for a review). For HD 149026b, we find a 3.6–4.5 μm ratio >1, similar to HD 189733b, which indicates no temperature inversion.

We compare the measured flux ratios to three different models in Figure 17. All assume redistribution of absorbed stellar flux over the dayside of the planet only (f = 1/2; as described by Fortney & Marley 2007). We show a 1 × solar model with TiO/VO opacity (which yields an inversion) and the same model with TiO/VO opacity neglected. We also plot a similar no-inversion model at 30 × solar metallicity. This last model leads to mixing ratios ∼30 and ∼900 times larger for CO and CO2, respectively, compared to the solar metallicity case (Zahnle et al. 2009). The dramatic increases in CO and CO2 lead to the most notable spectral difference, the much deeper absorption due to the overlapping 4.5 μm band of CO and 4.2 μm band of CO2. Such a high metallicity for this atmosphere may well be realistic, as Saturn is ∼10 × solar in carbon (Flasar et al. 2005), while Uranus and Neptune are ∼30–60 × enriched.

Figure 17.

Figure 17. Atmospheric models of the dayside of HD 149026b. The red line depicts a model with a temperature inversion, f = 0.50, solar metallicity, and an effective dayside temperature, Teff, of 2067 K. The blue model has no temperature inversion, f = 0.50, solar metallicity, and Teff = 2056 K. The green model is similar to the blue model but with enhanced metallicity (30 × solar) and Teff = 2081 K. Model band-averaged ratios are shown as squares while the data points are orange diamonds. Models that lack temperature inversions and include a high atmospheric metallicity best match the observed data.

Standard image High-resolution image

Models with a temperature inversion similar to the red line shown in Figure 17 are clearly disfavored, given the 3.6–4.5 μm ratio >1 and the large flux ratios at redder wavelengths. Our best fit for the three models is the 30 × enhanced, no-inversion model. Only the 3.6 μm point is outside our 1σ error bar. Even with the strong CO/CO2 absorption, we still cannot quite match the observed 3.6–4.5 μm ratio. At face value, this would imply even larger CO and/or CO2 mixing ratios; however, this may not necessarily be the case, as some other modelers (e.g., Burrows et al. 2008) generally show a deeper absorption feature at 4.5 μm than we obtain with our models. This is likely due to differences in the temperature gradient as a function of height in the different models, as this helps to control the depth of absorption features. A steeper gradient leads to deeper absorption features.

Less-efficient redistribution of absorbed flux leads to a hotter dayside, and still would yield a satisfactory fit to the observations, albeit near the top of the 1σ error bars. Given the 8.0 μm phase curve of K09, which showed modest day/night phase variation, such a hot dayside (which would leave little energy for the night side) is not favored. We recommend that a coupled three-dimensional dynamics/radiative transfer model be run for the system, to understand if the implied dayside and nightside temperatures can be matched. Showman et al. (2009) had good success in matching the phase curve and dayside photometry of HD 189733b. These models tend to show better day–night homogenization of temperature contrasts than one would assume from the fact that our best-fit one-dimensional (1D) model assumes all absorbed flux is re-radiated on the planet's dayside. Additionally, near-infrared fluxes from the JHK band (e.g., Croll et al. 2010), where this planet is brightest, would help to understand the dayside luminosity; however, given the small planet-to-star radius ratio, this may have to wait for the James Webb Space Telescope.

Knutson et al. (2010) suggest that the absence of a temperature inversion within an exoplanet atmosphere correlates with higher levels of chromospheric activity from the host star. The lack of a temperature inversion in HD 149026b does not agree with HD 149026's relatively low activity level, but this may be due to the exoplanet's high density (Knutson et al. 2010).

The pressure–temperature profiles for the three atmospheric models are shown in Figure 18. Also plotted are the contribution functions (e.g., Chamberlain & Hunten 1987) for thermal flux in each of the five Spitzer bandpasses. Contribution functions trend toward lower pressure with enhanced metallicity for all bandpasses, but move most dramatically at 4.5 μm due to CO and CO2. At constant metallicity, a temperature inversion tends to smear the contributions over a wider pressure range, favoring lower pressures due to the hot upper atmosphere, but to a lesser extent compared to models with increasing metallicity.

Figure 18.

Figure 18. Contribution functions and atmospheric pressure–temperature profiles. Left three panels: the contribution functions in the five Spitzer bandpasses are for the three models shown in Figure 17. Emission generally comes from higher in the atmosphere for the metal-enriched model (left) and, to a lesser extent, the model that features a temperature inversion (right). Rightmost panel: the atmospheric pressure–temperature profiles are for these same models, colored to match Figure 17. The 30× solar no-inversion model is everywhere warmer than the 1× solar no-inversion model, but they have very similar Teff values, since the emission from the 30× model comes from much higher in the atmosphere.

Standard image High-resolution image

7. ORBIT

We have collected a total of 11 individual Spitzer secondary-eclipse observations with useful timing of HD 149026b over a 3.5 year baseline. These times constrain ecos ω, where e is the eccentricity and ω is the argument of periapsis, and can be used to establish eccentricity limits on the planet's orbit. We use BJDTDB given in Appendix B and correct for the eclipse-transit light time (42 s). The mean eclipse phase, using the K09 ephemeris, is 0.49997 ± 0.00028, suggesting that ecos ω = −0.00003 ± 0.00044. The data are consistent with a circular orbit (e < 0.0013). The times of secondary eclipse do not show any significant trends and do not have a period that differs significantly from the period determined from transit and radial velocity (RV) data. Such a difference would indicate apsidal motion or other secular effects (Giménez & Bastero 1995; Heyl & Gladman 2007). An MCMC ephemeris fit to our secondary-eclipse times gives a period of 2.875884 ± 0.000006 days, and a fit of all the available transit times gives a period of 2.8758922 ± 0.0000015 days. The difference between the two periods is not significant (1.4σ). If the measured period difference is due to apsidal motion, then it would indicate that $\dot{\omega }e\sin \omega = (9 \pm 7) \times 10^{-5}$ ° day−1 (Giménez & Bastero 1995), where $\dot{\omega }$ is the rate of apsidal precession. Further secondary-eclipse observations will refine the secondary-eclipse period.

We use our primary-transit and secondary-eclipse data to perform a fit, as described by Campo et al. (2011), that also incorporates other available transit data (Carter et al. 2009; Winn et al. 2008; Charbonneau et al. 2006) and RV data (Sato et al. 2005; Butler et al. 2006). When we fit an eccentric orbit to the available data (Table 12), we determine that e = 0.154 ± 0.016. Although this is a 10σ eccentricity, it is almost completely dominated by the esin ω component, leading us to believe that the eccentricity may be an overestimate (Laughlin et al. 2005). This is possible when the peaks of the RV curve, where the waveform is most sensitive to changes in esin ω, are undersampled. The eccentricity affects the symmetry of the RV curve, so when both peaks are not well sampled, the best-fit solution may misrepresent the actual eccentricity of the planet's orbit. Indeed, 16 of the 23 usable RV data points were taken at a transit phase greater than 0.5 and there are no data points between phase values of 0.1 and 0.3. The dearth of RV measurements near 0.25 signifies that only one of the two peaks is adequately constrained. To best refine the value of esin ω, we require additional RV measurements between phases 0 and 0.5, particularly near 0.25.

Table 12. Eccentric Orbital Model

Parameter Value
esin ω 0.154 ± 0.016
ecos ω −0.00037 ± 0.00044
e 0.154 ± 0.016
ω (°) 90.14 ± 0.16
P (days) 2.8758919 ± 0.0000014
T0 (MJDTDB)a 4597.70716 ± 0.00016
K (ms−1) 47.4 ± 1.1
γ (ms−1) −4.3 ± 0.6
BIC 123

Note.a MJDTDB = BJDTDB−2450,000.

Download table as:  ASCIITypeset image

In a comparison fit assuming a circular orbit (see Table 13), where the RV curve is perfectly sinusoidal and symmetric, BIC is worse than for the eccentric fit. Despite this, the undersampled RV data and the high degree of consistency of the eclipse phases with 0.5 make it unlikely that the orbit of this planet has an eccentricity greater than the maximum value of |ecos ω|. A near-perfect alignment of the system's semimajor axis with our line of sight (ω ∼ 90°) would be necessary, but the agreement between the transit and eclipse durations (see Section 5) argues against this scenario. Acknowledging that our secondary-eclipse timing measurements yield little information about esin ω, we present both solutions without judgment. Although an eccentric orbit is unlikely, it cannot be ruled out with the data currently available.

Table 13. Circular Orbital Model

Parameter Value
P (days) 2.8758916 ± 0.0000014
T0 (MJDTDB)a 4597.70713 ± 0.00016
K (ms−1) 42.6 ± 0.9
γ (ms−1) −1.6 ± 0.6
BIC 179

Note. aMJDTDB = BJDTDB−2450,000.

Download table as:  ASCIITypeset image

8. CONCLUSIONS

Over 3.5 years, Spitzer observed 3 primary transits and 11 secondary eclipses of HD 149026b in 5 infrared wavelengths. We utilize multiple observations for channels with the weakest eclipse depths to improve S/N estimates and better constrain the dayside atmospheric composition. The addition of a third transit event at 8.0 μm confirms previous results (Nutzman et al. 2009; Carter et al. 2009; K09) and offers an improved constraint on the planet-to-star radius ratio (Rp/R). A new eclipse analysis of HD149bs41 confirms the findings from K09, namely, that an eclipse depth of ∼0.05% fits best at 8.0 μm. However, we find a larger uncertainty due to correlations between the eclipse depth and ramp parameters that were not fully explored because one of the ramp parameters was previously fixed.

The atmosphere is explained well by a 1D chemical-equilibrium model. A temperature inversion is no longer favored when fitting the observed planet-to-star flux ratios. The best-fit model includes large amounts of CO and CO2, moderate heat redistribution (f = 0.5), and strongly enhanced metallicity (30 × solar). Using the times from our secondary-eclipse observations, we find no deviations from a circular orbit at the 1σ level. However, given the available RV data, we cannot completely rule out an eccentric orbit with an unlikely orbital alignment.

We present a new technique, called BLISS mapping, to model Spitzer's position-dependent systematics (intrapixel variability and pixelation). In all cases tested to date, BLISS mapping outperforms previous methods in both speed and goodness of fit. We also apply an orthogonalization technique for linearly correlated parameters that accelerates the convergence of Markov chains that employ the Metropolis random walk sampler.

We thank N. Lust, contributors to SciPy, Matplotlib, and the Python Programming Language, the free and open-source community, the NASA Astrophysics Data System, and the JPL Solar System Dynamics group for software and services. This work is based on observations made with the Spitzer Space Telescope, which is operated by the Jet Propulsion Laboratory, California Institute of Technology under a contract with NASA. Support for this work was provided by NASA through an award issued by JPL/Caltech.

APPENDIX A: MODEL COMPARISON WITH COMPLEX MODELS

In Section 3, we show that BLISS mapping improves the SDNR of models that include an intrapixel sensitivity term. Elsewhere in this paper and in our earlier work (Campo et al. 2011) we use the BIC to compare models of other systematic error components (e.g., rival parametric ramp models), but we did not use the BIC to compare intrapixel sensitivity models. In this Appendix, we discuss the approximations underlying the BIC to elucidate when it is useful, and in particular, to explain why we do not use it (or similar statistical criteria) for comparison of intrapixel sensitivity models.

The BIC provides an asymptotic approximation to quantities that may be used for Bayesian quantification of model uncertainty. The most simple use of the BIC is to approximate Bayesian explanatory model selection. A basic distinction among model selection criteria is between explanatory criteria (that seek the model that best describes the processes that produced the available data) and predictive criteria (that seek the model that will make the most accurate predictions of future data based on the available data). For example, the well-known Akaike information criterion (AIC = χ2 + 2k) is a predictive criterion. Although different from the BIC in rationale, its derivation invokes similar asymptotic approximations to those we describe here for the BIC, and we consider it to be similarly hampered for comparing large models. To illuminate the nature of the approximations underlying the BIC, we sketch its derivation here.

Consider a set of models {Mi} for observed data D. Let θi denote the parameters of model i, with ki dimensions. The (Bayesian) posterior probability for model i is proportional to the product of a prior probability for the model, and the model's marginal likelihood,

Equation (A1)

where πii) ≡ pi|Mi) is the prior probability density function (PDF) for the model's parameters and $\mathcal {L}_i(\theta _i) \equiv p(D|\theta _i, M_i)$ is the likelihood function, the probability for the data presuming the model holds with parameters θi (the likelihood function is proportional to exp [ − χ2i)/2] for our models). As its name suggests, the role of the marginal likelihood in quantifying model uncertainty is completely analogous to the role of the more familiar likelihood function in quantifying parameter uncertainty (within a particular model). Note that $\mathcal {M}_i$ is an average of the likelihood function for model i, not a maximum: in Bayesian inference, the weight of the evidence for a model is given by the typical value of the likelihood function for its parameters, not the optimum (largest) value.

For nonlinear models with more than a few dimensions, calculation of the integral in Equation (A1) is not feasible using standard quadrature methods. The development of algorithms for accurate calculation of marginal likelihoods is an active research area, and existing algorithms are typically problem-specific and computationally expensive (see Clyde et al. 2007 for a recent review targeting astronomers).

The BIC approximates $-2\ln \mathcal {M}$ for straightforward models in the limit of voluminous data, i.e., asymptotically (we drop the model index, i, when referring to a generic model, to simplify notation). In this limit, the likelihood function $\mathcal {L}(\theta)$ will be strongly peaked at the maximum likelihood estimate, $\hat{\theta }$, and it will be much more strongly concentrated than the prior; Figure 19 depicts the situation. As a first step in approximating the integral in Equation (A1), we evaluate the prior at $\hat{\theta }$ and pull it out of the integral, so

Equation (A2)

The prior PDF has dimensions of [1/θ], and we can express $\pi (\hat{\theta })$ as the inverse of a local prior scale, Δθ (for a normalized flat prior spanning a range Δθ, we have $\pi (\hat{\theta }) = 1/\Delta \theta$ exactly). We next approximate the integral of the likelihood function in Equation (A2) as the product of its height (the maximum likelihood value, $\mathcal {L}(\hat{\theta })$) and a characteristic width, δθ (describing the posterior uncertainty in θ). This gives

Equation (A3)

With suitable definitions of the prior and posterior scales, we can make this equation exact (e.g., for a 1D model with a Gaussian likelihood function of standard deviation σ, and a flat prior with range Δθ ≫ σ, as long as $\hat{\theta }$ is not near a boundary of the prior range, then setting $\delta \theta = \sigma \sqrt{2\pi }$ ≈1.06 times the full width at half-maximum makes the equation accurate). The factor multiplying the maximum likelihood is sometimes called the Ockham factor and will be ⩽1; it quantifies how much of the parameter space of the model is wasted, in the sense of including parameter values that are ruled out by the data. Note that for dimension k > 1, these scales are volumes, i.e., products of scales in each dimension.

Figure 19.

Figure 19. Ingredients for the derivation of the BIC approximation to the marginal likelihood. Curves show the likelihood function (solid blue) and prior PDF (dashed green), with characteristic widths δθ and Δθ. Points show the maximum likelihood parameter estimate, $\hat{\theta }$, and the values of the likelihood and prior at $\hat{\theta }$.

Standard image High-resolution image

Asymptotically, for a simple model of fixed dimension, we expect the uncertainty for each parameter to scale like $1/\sqrt{N}$ for sample size N. So for a model of dimension k, we expect δθ to eventually decrease proportional to Nk/2. Let Na be the sample size where the asymptotic behavior kicks in, and δθa be the typical scale of the uncertainties at that sample size. Then we expect

Equation (A4)

In the simple case of estimating linear parameters for data with additive Gaussian noise of fixed noise variance, we expect asymptotic behavior right away, so Na = 1. Now take the logarithm and group terms according to their dependence on N:

Equation (A5)

The first term may have a nontrivial N dependence; the last term is constant with respect to N. Schwarz (1978) derived a more precise expression like this, explicitly calculating the first term in the case of linear models with sampling distributions in the exponential family (which includes, e.g., normal, Poisson, and multinomial distributions). In that case the maximum log-likelihood term is expected to grow increasingly negative, roughly proportionally to N.

The BIC keeps the N-dependent terms in Equation (A5) and multiplies by −2;

Equation (A6)

It is attributed to Schwarz (and sometimes called the Schwarz criterion), but notably, Schwarz did not drop the N-independent term in his approximation to $\ln \mathcal {M}$, although he termed it a "residual" with respect to the N-dependent terms.

If the models under consideration are considered equally probable a priori, the most probable model is the one with the largest marginal likelihood. BIC-based model selection uses the BIC to approximate the logarithm of the marginal likelihood, choosing the model with the smallest value of the BIC. The derivation sketched above provides some insight regarding when we might expect this procedure to identify the highest likelihood model. There are two main considerations.

First, the BIC is an asymptotic criterion. Its accuracy requires sample sizes large enough so that the parameter uncertainties are decreasing at the O(Nk/2) rate. For complex models with many parameters, this is not simply a matter of sample sizes being "large enough." For some models—e.g., nonparametric models—the number of parameters may grow with sample size (explicitly or effectively); for others, some parameters may be sensitive to only a subset of the data. For example, the BLISS model uses a piecewise linear intrapixel sensitivity map, so a particular coefficient is determined by only a subset of the image pixels. In these cases, a BIC-like criterion may be valid, but with the kln N term replaced with a term that more accurately describes the asymptotic behavior of parameter uncertainties. Determining the form of such a term can be subtle (see Kass & Raftery 1995, Section 4.2). In these settings, it may take very large sample sizes to reach asymptotic behavior.

Second, the BIC drops a constant (in N) term from the logarithm of the marginal likelihood—Schwarz's "residual." That is, it drops a multiplicative factor from the estimated marginal likelihood. This factor depends on the prior volume, Δθ. For models with many similar degrees of freedom, like the various intrapixel sensitivity models, the prior volume is the product of the ranges of many variables. It can vary sensitively with the choice of a priori scale per parameter, and if the competing models have different types of parameters, the omitted residual terms may be very different from one model to another. The difference between the residual terms can be large when the models are large. As a result, the change in the BIC between two large models cannot be relied upon for identifying the model with the larger marginal likelihood.

Kass & Wasserman (1995, hereafter KW95) have examined the role of the residual term in the asymptotic approximation of the log marginal likelihood, arguing that for some problems there may be a reasonable argument for it to be negligible. Note that the last term in Equation (A5) will vanish if

Equation (A7)

When the asymptotic sample size Na = 1 (e.g., for linear models, like estimating the mean of a normal distribution with known variance), this requirement corresponds to having the prior range equal to the width of the likelihood for a single-sample data set. For Na > 1, the Nk/2a factor scales the δθa likelihood volume to what the single-sample volume would be if the model were asymptotic starting with N = 1. Thus KW95 dubbed a prior satisfying Equation (A7) a unit information prior, i.e., a prior that is as informative as a single datum.5 To the extent that one could consider the uncertainty scale associated with a single measurement to reflect prior uncertainty, the BIC may be an accurate approximation to $\ln \mathcal {M}$.

However, even when a unit information prior scale appears reasonable, for large models, even a small variation from this scale for the prior range of each parameter could produce a large net residual term. We thus do not consider the unit information prior argument to provide a sound justification for using the BIC to compare large models.

For these reasons, in our work we limit use of the BIC to comparing small models, or large models that are nested, so that rival models share the vast majority of parameters. For example, we rely on the BIC to compare different ramp models that share a common BLISS map model, but we do not consider the BIC to be valid for comparing, say, a model using a BLISS map to a model using B10's intrapixel variability correction, or to polynomial intrapixel models.

The residual issue, in part, reflects an inherent weakness of marginal-likelihood-based model comparisons with large models. Small changes in the per-parameter prior scale for such models can lead to large changes in marginal likelihoods, even with an accurate numerical calculation of the marginal likelihoods. This motivates developing a way to allow the scale to adapt to the data. In a Bayesian framework, this can be accomplished using a hierarchical model to implement regularization. One considers the prior range to be an adjustable parameter itself that is learned from the data. This can be done in a manner that essentially lets the data determine the effective number of free parameters (the total number of knots) required for the intrapixel map. We would then compare our best fit using BLISS mapping to those obtained using other intrapixel models and select the appropriate model component. Such calculations are beyond the scope of this paper, but would be a productive avenue for future research.

APPENDIX B

Table 14 displays the best-fit parameter values and uncertainties from our joint transit light-curve fit. Tables 15 and 16 display the best-fit parameter values and uncertainties from our joint eclipse light-curve fits. The tables also contain information to help evaluate the goodness of fit.

Table 14. Best-fit Joint Transit Light-curve Parameters

Parameter HD149bp41 HD149bp42 HD149bp43
Wavelength (μm) 8.0 8.0 8.0
Array position ($\bar{x}$, pixel) 14.93 14.96 15.14
Array position ($\bar{y}$, pixel) 15.14 14.56 14.47
Position consistencyax, pixel) 0.013 0.011 0.011
Position consistencyay, pixel) 0.012 0.013 0.013
Aperture size (pixel) 3.5 3.5 3.5
Sky annulus inner radius (pixel) 7.0 7.0 7.0
Sky annulus outer radius (pixel) 15.0 15.0 15.0
System flux Fs (μJy) 117229 ± 14 117460 ± 60 117356 ± 13
Transit midpointb (MJDUTC) 4327.3720 ± 0.0005 4356.1315 ± 0.0005 4597.7067 ± 0.0005
Transit midpointb (MJDTDB) 4327.3727 ± 0.0005 4356.1323 ± 0.0005 4597.7075 ± 0.0005
Rp/R 0.0518 ± 0.0006 0.0518 ± 0.0006 0.0518 ± 0.0006
cos i 0.095 ± 0.009 0.095 ± 0.009 0.095 ± 0.009
a/R 5.98 ± 0.17 5.98 ± 0.17 5.98 ± 0.17
Ramp equation (R(t)) 3 8 2
Ramp, r0 24 ± 4 0 18 ± 2
Ramp, r1 −0.6 ± 0.7 0 4.1 ± 1.4
Ramp, r2 0.0110 ± 0.0015 0 0
Ramp, r6 0 0.0009 ± 0.0005 0
Ramp, r7 0 −0.00048 ± 0.00011 0
Ramp, t0 0 −0.0248 ± 0.0011 0
BLISS map (M(x, y)) Yes Yes Yes
Min. number of points per bin 4 4 4
Total frames 67008 54080 70000
Rejected frames (%) 0.714840 0.377219 0.504286
Frames usedc 61520 53865 68646
Free parameters 8 5 4
AIC value 184044 184044 184044
BIC value 184216 184216 184216
SDNR 0.0083440 0.0083556 0.0083691
Uncertainty scaling factor 0.818677 0.818525 0.821463
Photon-limited S/N (%) 83.1 83.1 82.6

Notes. arms frame-to-frame position difference. bMJD = BJD−2450,000. cWe exclude frames during instrument/telescope settling, for insufficient points at a given knot, and for bad pixels in the photometry aperture.

Download table as:  ASCIITypeset image

Table 15. Best-fit Joint Eclipse Light-curve Parameters

Parameter HD149bs11 HD149bs21 HD149bs31 HD149bs32 HD149bs33
Wavelength (μm) 3.6 4.5 5.8 5.8 5.8
Array position ($\bar{x}$, pixel) 14.58 14.57 14.50 14.42 14.78
Array position ($\bar{y}$, pixel) 15.66 14.99 14.37 14.17 14.67
Position consistencyax, pixel) 0.009 0.009 0.020 0.020 0.015
Position consistencyay, pixel) 0.007 0.004 0.018 0.013 0.017
Aperture size (pixel) 3.75 2.75 2.75 2.75 2.75
Sky annulus inner radius (pixel) 7.0 7.0 7.0 7.0 7.0
Sky annulus outer radius (pixel) 15.0 15.0 15.0 15.0 15.0
System fluxb Fs (μJy) 528456 ± 10 334037 ± 40 216010 ± 70 214102 ± 40 212621 ± 80
Eclipse depth (%) 0.040 ± 0.003 0.034 ± 0.006 0.044 ± 0.010 0.044 ± 0.010 0.044 ± 0.010
Brightness temperature (K) 2000 ± 60 1650 ± 120 1600 ± 200 1600 ± 200 1600 ± 200
Eclipse midpointc (orbits) 0.5003 ± 0.0004 0.4994 ± 0.0007 0.502 ± 0.004 0.501+0.002− 0.005 0.500+0.003− 0.005
Eclipse midpointd (MJDUTC) 4535.8756 ± 0.0010 4596.2669 ± 0.0019 4325.941 ± 0.013 4633.658 ± 0.010 4903.989 ± 0.012
Eclipse midpointd (MJDTDB) 4535.8764 ± 0.0010 4596.2676 ± 0.0019 4325.942 ± 0.013 4633.659 ± 0.010 4903.990 ± 0.012
Eclipse duration (t4–1, hr) 3.29 ± 0.02 3.29 ± 0.02 3.29 ± 0.02 3.29 ± 0.02 3.29 ± 0.02
Ingress/egress time (t2–1, hr) 0.230 ± 0.011 0.234 ± 0.012 0.234 ± 0.012 0.234 ± 0.012 0.234 ± 0.012
Ramp equation (R(t)) 10 2+ 2+ 2+ 2+
Ramp, r0 0 29 ± 9 30 ± 6 42 ± 5 26 ± 5
Ramp, r1 0 7 ± 4 8 ± 3 14 ± 2 6.4 ± 2.0
Ramp, r2 −0.0038 ± 0.0008 0 0 0 0
BLISS map (M(x, y)) Yes Yes Yes No No
Min. number of points per bin 4 5 5 ... ...
Total frames 54080 54080 54080 54080 54080
Rejected frames (%) 0.308802 0.160873 0.268121 0.277367 0.312500
Frames usede 50769 48769 50919 50930 53911
Free parameters 6 7 7 4 4
AIC value 50775 48776 155775 155775 155775
BIC value 50828 48837 155924 155924 155924
SDNR 0.00280365 0.00388070 0.0120115 0.0119551 0.0119188
Uncertainty scaling factor 0.401020 0.489317 0.914694 1.02508 1.02476
Photon-limited S/N (%) 93.9 89.9 73.6 73.7 74.3

Notes. arms frame-to-frame position difference. bWe multiply the HD149bs32 and HD149bs33 measured system fluxes by 0.968 to correct for an IRAC flux conversion issue in the S18.18 pipeline. cBased on the period and ephemeris time given by K09. dMJD = BJD−2450,000. eWe exclude frames during instrument/telescope settling, for insufficient points at a given knot, and for bad pixels in the photometry aperture.

Download table as:  ASCIITypeset image

Table 16. Best-fit Joint Eclipse Light-curve Parameters

Parameter HD149bs41 HD149bs42 HD149bs43 HD149bs44 HD149bs51 HD149bs52
Wavelength (μm) 8.0 8.0 8.0 8.0 16 16
Array position ($\bar{x}$, pixel) 15.06 14.40 15.04 14.64 26.07 23.95
Array position ($\bar{y}$, pixel) 14.45 15.09 14.13 15.08 22.16 20.65
Position consistencyax, pixel) 0.091 0.013 0.011 0.012 0.016 0.018
Position consistencyay, pixel) 0.077 0.011 0.012 0.011 0.021 0.018
Aperture size (pixel) 4.0 3.5 3.75 3.5 2.0 2.0
Sky annulus inner radius (pixel) 7.0 7.0 7.0 7.0 6.0 6.0
Sky annulus outer radius (pixel) 15.0 15.0 15.0 15.0 11.0 12.0
System fluxb Fs (μJy) 117190 ± 100 116960 ± 10 117818 ± 6 118365 ± 70 18618 ± 6 18805 ± 11
Eclipse depth (%) 0.052 ± 0.006 0.052 ± 0.006 0.052 ± 0.006 0.052 ± 0.006 0.085 ± 0.032 0.085 ± 0.032
Brightness temperature (K) 1650 ± 110 1650 ± 110 1650 ± 110 1650 ± 110 1800 ± 600 1800 ± 600
Eclipse midpointc (phase) 0.5002 ± 0.0007 0.5010 ± 0.0009 0.4961 ± 0.0012 0.4988 ± 0.0006 0.5013 ± 0.0014 0.5013 ± 0.0014
Eclipse midpointd (MJDUTC) 3606.962 ± 0.002 4567.513 ± 0.003 4599.133 ± 0.003 4912.613 ± 0.002 4317.311 ± 0.004 4343.194 ± 0.004
Eclipse midpointd (MJDTDB) 3606.963 ± 0.002 4567.513 ± 0.003 4599.134 ± 0.003 4912.614 ± 0.002 4317.311 ± 0.004 4343.194 ± 0.004
Eclipse duration (t4–1, hr) 3.29 ± 0.02 3.29 ± 0.02 3.29 ± 0.02 3.29 ± 0.02 3.29 ± 0.02 3.29 ± 0.02
Ingress/egress time (t2–1, hr) 0.234 ± 0.012 0.234 ± 0.012 0.234 ± 0.012 0.234 ± 0.012 0.234 ± 0.012 0.234 ± 0.012
Ramp equation (R(t)) 2 11 10 2 11 2
Ramp, r0 15.4 ± 1.2 0 0 14 ± 8 0 62 ± 10
Ramp, r1 3.1 ± 0.5 0 0 0 ± 4 0 24 ± 4
Ramp, r2 0 0.083 ± 0.002 −0.0014 ± 0.0016 0 0.413 ± 0.012 0
Ramp, r3 0 −0.67 ± 0.11 0 0 −5.1 ± 0.2 0
BLISS map (M(x, y)) No Yes Yes Yes No No
Min. number of points per bin  ⋅⋅⋅ 8 4 10  ⋅⋅⋅  ⋅⋅⋅
Total frames 44352 54080 60500 54080 1050 1050
Rejected frames (%) 0.47574 0.432692 0.634711 0.488166 0.285714 0.571429
Frames usede 44041 48801 60100 46274 1047 1044
Free parameters 15 4 3 4 10 12
AIC value 194243 194243 194243 194245 2113 2113
BIC value 194518 194518 194518 194540 2237 2237
SDNR 0.00845740 0.00833381 0.00837245 0.00847674 0.00311603 0.00337752
Uncertainty scaling factor 0.816657 0.813220 0.819215 0.981709 0.374578 0.402919
Photon-limited S/N (%) 82.3 83.1 82.4 81.6 27.2 24.9

Notes. arms frame-to-frame position difference. bWe multiply the HD149bs44 measured system flux by 0.973 to correct for an IRAC flux conversion issue in the S18.18 pipeline. cBased on the period and ephemeris time given by K09. dMJD = BJD−2450,000. eWe exclude frames during instrument/telescope settling, for insufficient points at a given knot, and for bad pixels in the photometry aperture.

Download table as:  ASCIITypeset images: 1 2

Footnotes

  • The KW95 derivation is of course more careful than that sketched here. They account for correlations between parameters, replacing Equation (A7) with a relationship between Hessian matrices of the prior and likelihood.

Please wait… references are loading.
10.1088/0004-637X/754/2/136