Deriving Physical Properties from Broadband Photometry with Prospector: Description of the Model and a Demonstration of its Accuracy Using 129 Galaxies in the Local Universe

Joel Leja; Benjamin D. Johnson; Charlie Conroy; Pieter G. van Dokkum; Nell Byler

doi:10.3847/1538-4357/aa5ffe

1. Introduction

One of the primary goals of galaxy evolution studies is to understand how galaxies assembled their stars over time. The most straightforward way to accomplish this is to measure star formation rates (SFRs) and stellar masses as a function of redshift for representative samples of the galaxy population. This has now been done by many studies out to high redshift, both for the stellar mass function (e.g., Marchesini et al. 2009; Davidzon et al. 2013; Ilbert et al. 2013; Moustakas et al. 2013; Muzzin et al. 2013; Tomczak et al. 2014) and for the SFR–mass relationship (e.g., Brinchmann et al. 2004; Daddi et al. 2007; Noeske et al. 2007; Peng et al. 2010; Karim et al. 2011; Whitaker et al. 2012, 2014; Speagle et al. 2014; Ilbert et al. 2015).

The star-forming sequence and stellar mass function are derivative-integral pairs, and as a consistency check, the stellar mass growth rates as a function of mass and redshift implied by each can be compared. This approach minimizes systematics by using SFRs and masses derived from the same data sets, and in contrast to techniques which integrate over mass, is largely insensitive to the mass completeness limits (Leja et al. 2015; Tomczak et al. 2016). This technique finds broad consistency at $z\lt 1$ (Bell et al. 2007; Weinmann et al. 2012): however, at $z=1-3$ , the observed SFRs and stellar masses are systematically incompatible with one another by a factor of ∼2 (Leja et al. 2015; Contini et al. 2016; Tomczak et al. 2016). The most straightforward explanation is that these systematics arise from the models used to convert observed fluxes into stellar masses and/or SFRs.

Compilations of galaxy SFRs and masses in the literature tell a more complex story. Comparisons of observed SFR densities to the evolution of the stellar mass density show systematic differences of a factor of 1.5–2 (Madau & Dickinson 2014; Yu & Wang 2016). Empirical abundance matching models can appear able to reconcile these two measurements (e.g., Behroozi et al. 2013; Moster et al. 2013). These models contain a greater degree of complexity, which may be important in reconciling SFRs and stellar mass densities, but some of these additional model ingredients are yet to be thoroughly tested. Part of the challenge associated with interpreting literature compilations is that systematic uncertainties vary across techniques, and so combining literature results likely washes away substantial systematic offsets.

The apparent inconsistencies in observational quantities make it difficult for simulations of galaxy evolution to simultaneously satisfy observed SFR and mass constraints. Hydrodynamical simulations (e.g., Genel et al. 2014; Torrey et al. 2014; Furlong et al. 2015), semi-analytical models (e.g., Mitchell et al. 2014; Henriques et al. 2015), and pure analytic models (e.g., Lilly et al. 2013; Dekel & Burkert 2014) are all systematically offset from observations of masses, SFRs, or both at the level of 0.2–0.5 dex. This is a particularly interesting comparison because simulated SFRs and stellar masses are self-consistent by definition.

This systematic factor of two uncertainty in basic galaxy properties has direct implications for our understanding of how galaxies assemble their stars. Massive galaxies are thought to double in stellar mass from z = 2 to the present epoch (e.g., van Dokkum et al. 2010; Patel et al. 2013b). It is not possible to assess the uncertainty in this result without accurate models, nor in the evolution (or lack thereof) of the massive end of the mass function (e.g., Moustakas et al. 2013). Milky Way-mass galaxies are thought to grow by a factor of ∼5 since z = 2, and M31-mass galaxies by ∼3, with most of the growth occurring between z = 1 and z = 2 (Patel et al. 2013a; van Dokkum et al. 2013; Papovich et al. 2015). Systematic adjustment of stellar masses and SFRs at $z=1-3$ could change these results significantly.

Furthermore, in order to convert dynamical measurements into the dark matter fractions within galaxies, it is critical to know the stellar masses to within a factor of two (e.g., Cappellari et al. 2012). A systematic factor of two uncertainty in stellar mass results in no constraint for the dark matter fraction within the average quiescent (van de Sande et al. 2015) or star-forming (Wuyts et al. 2016) galaxy at $z=1-3$ .

In order to advance the field, basic stellar mass and SFR estimates must be improved. The dominant source of uncertainty in stellar populations fitting is no longer instrumental noise, sample size, or sample selection, but instead the degeneracies and limitations in synthesizing and fitting complex stellar populations to spectral energy distributions (SEDs) (Conroy et al. 2009; Wuyts et al. 2009; Behroozi et al. 2010; Walcher et al. 2011; Mobasher et al. 2015; Santini et al. 2015).

These parameter degeneracies arise because galaxy SEDs are the result of many complex physical variables, and it is challenging to uniquely determine these variables in individual galaxies with only broadband photometry. The age–dust–metallicity degeneracy is a well-known example of this (Bell & de Jong 2001), and uncertainty in these parameters can result in systematic changes to both stellar mass and SFR estimates. Another example is the shape of the dust attenuation curve, which is thought to change from galaxy to galaxy based on both dust geometry and composition (Witt & Gordon 2000). Since the effect of extinction is strongest at blue wavelengths, there is a strong degeneracy between the shape of the attenuation curve and the total column density of dust, with most studies requiring spectra or infrared (IR) photometry to separate the two (Johnson et al. 2007; Wild et al. 2011; Reddy et al. 2015). These parameter degeneracies complicate comparisons between different codes, because the outputs are strongly dependent on the priors in each SED fitting routine. The prior of a model parameter is the probability distribution which expresses the belief about the distribution of a parameter before the observations are fit: priors can either be explicitly applied in the likelihood calculation, or implicitly applied in the chosen parameterization of the variable, e.g., by using a decaying τ model to fit star formation histories (SFHs). Complicating the situation further, the priors for each model are often not clearly defined or explored.

There are also substantial uncertainties in the underlying stellar populations synthesis models. One challenge is the difficulty in sourcing accurate spectra for short-lived phases of stellar evolution and nonsolar stellar metallicities. For example, the contribution of thermally pulsing asymptotic giant branch stars (TP-AGB) may or may not dominate the near-IR emission of galaxies of intermediate age (Maraston 2005; Marigo et al. 2008; Conroy & Gunn 2010). This uncertainty alone can result in systematic differences of a factor of two in both stellar mass and age estimates (Bruzual 2007), though recent analyses imply that the data favor a relatively low TP-AGB contribution (e.g., Kriek et al. 2010).

Galaxy SED-fitting routines thus far have largely used simple models to derive stellar masses, with fixed or discrete dust attenuation curves, limited or fixed stellar metallicities, simple parameterized SFHs, and simple chi-squared minimization routines (Bolzonella et al. 2000; Brammer et al. 2008; Kriek et al. 2009). Yet, observed galaxies show significant variation in stellar metallicity (Tremonti et al. 2004) and attenuation curves (Reddy et al. 2015; Salmon et al. 2016), and neglecting this variation can bias the parameters determined from SED fitting (Mitchell et al. 2013). SFH recovery is critical to accurately recover stellar masses (Lee et al. 2009; Maraston et al. 2010; Pforr et al. 2012), and simple exponentially declining SFHs fail to reproduce the mass function of galaxies at earlier epochs by several orders of magnitude (Wuyts et al. 2011a). They also fit simulated SFHs poorly, leaving significant biases in stellar mass estimates (Simha et al. 2014). Simple chi-squared fitting routines cannot properly estimate the uncertainties when there are substantial degeneracies, as is the case when fitting broadband photometry of galaxies. Progress has been made in fitting SEDs with nonparametric SFHs (e.g., STECMAP (Ocvirk et al. 2006), VESPA (Tojeiro et al. 2007), LITTLE THINGS (Zhang et al. 2012), CSI (Kelson 2014; Dressler et al. 2016)); however, this practice is not yet widespread, as nonparametric SFHs require either very high signal-to-noise (S/N) data or a fitting algorithm which can handle significant degeneracies.

It has also been shown that the SFRs derived from fitting the ultraviolet (UV)–near-infrared (NIR) SED are often biased low for highly star-forming galaxies (Wuyts et al. 2011b), largely because they cannot account for star formation obscured by dust. SFRs are instead often derived separately, with simple "recipes" to convert observed UV, mid-infrared (MIR), or emission line fluxes into SFRs (e.g., Kennicutt 1998). This approach also suffers from known biases: the conversion from emission line fluxes to SFRs is affected by the ionization state of the gas, extra dust attenuation toward H ii regions, and stellar metallicity. Inferring total IR luminosities with only MIR broadband photometry can be strongly affected by galaxy-to-galaxy variability in polycyclic aromatic hydrocarbon (PAH) emission in the MIR (Draine et al. 2007) and by an active galactic nucleus (AGN) contribution (Kirkpatrick et al. 2015). The interpretation of L_IR itself is complicated by the contribution of evolved stars to the IR luminosity (Cortese et al. 2008; Hayward et al. 2014; Utomo et al. 2014). UV luminosities are sensitive to stellar metallicity and recent SFH, and very sensitive to the amount of dust attenuation, which can be estimated from the UV slope β but with serious limitations (Viaene et al. 2016).

Many of the issues in estimating SFRs with "recipes" can be avoided by instead performing a full-SED fit, which consolidates all available information regarding the physical condition of the stars, dust, and gas. Parameter degeneracies can be marginalized over with Bayesian inference techniques and physically motivated priors. Some codes have begun to take advantage of Bayesian approaches in fitting SEDs: CIGALE (Burgarella et al. 2005; Noll et al. 2009), MAGPHYS (da Cunha et al. 2008), GALMC (Acquaviva et al. 2011), BAYESED (Han & Han 2014), and BEAGLE (Chevallard & Charlot 2016). Yet due to the complexity in choosing fit parameters, priors, and stellar population synthesis techniques, SED fitting codes with multiple, degenerate parameters often produce different solutions for key physical parameters like stellar mass and SFRs (Santini et al. 2015). This motivates a thorough investigation into the accuracy of the posteriors in SFRs, stellar masses, and other key physical parameters determined from broadband photometry.

In this paper, we create and test the Prospector-α model within the Prospector inference framework (B. D. Johnson et al. 2017, in preparation) using the Flexible Stellar Populations Synthesis (FSPS) stellar populations code (Conroy et al. 2009). The Prospector-α model includes many of the advances mentioned above in a single, self-consistent framework. The model is fit to the broadband photometry from the Brown et al. (2014) spectral atlas, which is unique in that it has optical spectra which are aperture-matched to the photometry. These data allow us to probe the accuracy of our model fits by comparing posterior probability functions (i.e., probability distributions for parameters of interest after the data are taken into account) for diagnostic spectral features to the observed features from the aperture-matched spectroscopy. This posterior check is a strong test of the output SFRs, SFH, dust attenuation model, and stellar metallicities derived from fitting the Prospector-α model to broadband photometry.

In Section 2, the galaxy sample is described, along with the available broadband photometric and spectroscopic observations. In Section 3.1, the Prospector-α model is introduced, and the Prospector sampling procedure is described. In Section 4, a comparison between the Prospector-α model posteriors and the observed spectroscopic quantities is shown, including the Hα and Hβ luminosities, Balmer decrements, D_n4000, Hδ absorption, and stellar metallicities. In Section 5, the implications of the posterior checks are discussed, and past and future changes to the Prospector-α model are discussion in Section 6. The conclusion is presented in Section 7. A WMAP9 cosmology (Hinshaw et al. 2013) and a Chabrier (2003) initial mass function (IMF) are adopted for all relevant calculations.

2. Data

2.1. Broadband Photometry

We use broadband photometry and spectra from the Brown et al. (2014) spectral atlas of 129 local galaxies. There are 26 photometric bands available from a range of surveys and missions, including Swift/UVOT, Galaxy Evolution Explorer (GALEX), the Sloan Digital Sky Survey (SDSS), the Two Micron All Sky Survey (2MASS), Spitzer, and the Wide-field Infrared Space Explorer (WISE), giving wavelength coverage from the far-UV (FUV) to the MIR.

Figure 1 shows color cutouts of the sample from the online data catalog provided by Brown et al. (2014), along with the SFRs and stellar masses relative to the star-forming sequence. The sample covers a wide range of morphological types and stellar masses. The sample selection of the Brown et al. (2014) catalog is based on the availability of the aperture-matched spectra, and is not volume-complete.

Considerable care was taken with the absolute calibration of the photometry: the atlas includes corrections for foreground dust extinction, SDSS photometric calibration, scattered light in IRAC imaging, and errors in the pre-launch WISE W4 filter curve (Brown et al. 2014). The photometry is aperture-matched across all filters.

Swift/UVOT U and V bands and Spitzer/IRS Blue and Red Peak Up imaging channels are not included in the fits. These spectral regions are well-covered by SDSS optical imaging and Spitzer/MIPS and WISE imaging, respectively.

For 26 of these galaxies, we add Herschel PACS and SPIRE imaging from the KINGFISH survey (Kennicutt et al. 2011), spanning 70–500 μm. Galaxy emission at these wavelengths is dominated by thermal dust emission, which in the Prospector-α model, is self-consistently modeled with the stellar emission. The effect of the Herschel photometry on the model posteriors is explored in Appendix C.

2.2. Spectra

We utilize optical spectra from multiple ground-based telescopes, MIR Spitzer spectra (Houck et al. 2004), and Akari spectra (Onaka et al. 2007), all provided in the Brown et al. (2014) spectral atlas. The optical spectra largely come from Moustakas & Kennicutt (2006) and Moustakas et al. (2010), with some additional spectra from Kennicutt (1992) and Gavazzi et al. (2004). The resolution of the optical spectra is $R\sim 650$ and the wavelength coverage spans 3650 to 6900 Å.

The assembly of these spectra is described in detail in Brown et al. (2014) and in references therein. In brief, the optical spectra use drift-scan techniques, allowing them to be aperture-matched to the photometry. The spatial matching between the optical spectroscopy and the photometry is a key feature of the atlas: these two data products describe the same stellar populations without the uncertainty typically introduced by aperture corrections. The spectra have also been flux-normalized to be consistent with the photometry. These features allow direct comparison of the posteriors from fitting the photometry to the observed spectral features.

We measure the strength of key diagnostic emission and absorption features from the spectra, and compare these to expectations from fits to the photometry. For each galaxy, we measure the following features: Hδ, [O iii] 4959, [O iii] 5007, Hβ, [N ii] 6549, [N ii] 6583, Hα, and D_n4000. To ensure uniform quality, these spectral quantities are only measured when the spectra in question have a minimum resolution of R = 400 at Hα wavelengths. This removes IRAS 08572 + 3915, NGC 4860, and IRAS 17208-0014 from the spectral analysis.

2.2.1. Measuring Optical Emission Line Fluxes

In order to extract emission line fluxes, we perform simultaneous fits to the [O iii] 4959, [O iii] 5007, Hβ, [N ii] 6549, [N ii] 6583, and Hα features. The simultaneous fits allow overlapping lines to be properly separated. Each emission and absorption feature is modeled with a Gaussian, with a variable amplitude and width. The center of each line is fixed relative to one another, but the overall redshift of the model is allowed to vary around the fiducial redshift provided by the Brown et al. (2014) catalog. For doublet emission lines ([O iii] and [N ii]), the relative amplitudes of the emission lines are fixed to their fundamental ratios, and the line widths are also fixed to be equal.

The best-fit Prospector-α spectrum (see Section 4.2) is adopted as the stellar continuum. A simple approach to measuring emission line strength from spectra in the case of mixed absorption and emission is to separately measure the underlying absorption in the model stellar continuum, and add that to the observed emission line strength. However, the observed spectra often show gas emission resolved separately from stellar absorption, especially when the width of the stellar absorption line profile is dominated by age-dependent stellar broadening. In these cases, the line emission fills the center of the line, but the wings of the absorption are not filled with emission. To account for this complex line profile, the underlying stellar continuum is subtracted from the observed spectra, and the emission line flux is measured from the residuals.

Before subtracting the stellar continuum model, it must first be smoothed to the appropriate resolution and normalized to the observed spectrum. The observed spectral resolution is controlled by instrumental broadening, atmospheric effects, and the intrinsic line of sight velocity dispersion. The appropriate smoothing resolution is determined by performing a first-pass fit to the observed spectrum with the Gaussian model for absorption and emission lines. The two emission lines with the highest equivalent width are selected, and the average of their line widths is used to smooth the model spectrum. This implicitly assumes the gas and stars have similar velocity dispersions. The normalization is done by fitting two separate linear components for the continuum: one between 4700 Å $\lt \,\lambda \,\lt \,5100$ Å, and one between 6350 Å $\lt \,\lambda \,\lt \,6680$ Å. This is done for both the model spectrum from the best-fit to the photometry and the observed spectrum, and the ratio between the linear components is used to renormalize the model spectrum separately in each wavelength range. Since the model is fit to the photometry, and the observed photometry and observed spectra have consistent normalizations by design (Brown et al. 2014), this normalization factor is typically <5%.

The Brown et al. (2014) spectral atlas does not include error estimates for the spectral fluxes. To estimate errors, the smoothed, normalized stellar continuum model is subtracted from the observed spectra, with known emission lines masked. A Gaussian is fit to the histogram of the residuals. The error on the spectra fluxes is taken to be the width of the Gaussian. This error is typically between 3%–15% of the flux. This provides a conservative estimate of the true instrumental noise, as it also includes inevitable mismatches between the model spectrum and the observed spectrum.

To derive errors on the line fluxes, a bootstrap analysis is performed: for each pixel, the flux is perturbed with a random value drawn from a Gaussian centered at zero, with a width equal to the errors derived above, then the line fluxes are re-fit. This is performed 100 times for each galaxy, and the final line fluxes are taken to be the 50th percentile of this distribution, with the +/− 1σ errors taken to be the 84th and 16th percentiles, respectively. For Hα and Hβ, an additional error due to uncertainty in the absorption correction is added, equal to the 1σ variance in the underlying model stellar absorption as sampled from the model posteriors.

2.2.2. D_n4000

D_n4000 is measured using the Balogh et al. (1999) definition: the ratio of the average flux density ${{\rm{F}}}_{\nu }$ in the narrow wavelength bands 3850–3950 Å and 4000–4100 Å. This is narrower and thus less sensitive to reddening effects than the original Bruzual (1983); Hamilton (1985) definition. This definition is used to measure D_n4000 in both the observed and the model spectra, and both are convolved to a common resolution of 200 km s⁻¹ before measurement. This is done to remove any potential systematics from different spectral resolutions between the model and the observed spectrum.

2.2.3. Hδ

Hδ is measured separately from the other line features. This is done for two reasons: (1) the Hδ emission and absorption are often of comparable size, resulting in a complex line morphology with a weak net signal (often poorly fit by a Gaussian), and (2) because nearby CN features affect simple linear continuum fits, which introduces a dependence on the detailed elemental abundance ratios (Prochaska et al. 2007). The relatively low S/N of the optical spectra also make the measurement of the intrinsically weak Hδ feature challenging.

To measure Hδ, fixed wavelength indices are defined over which to measure the continuum and the strength of the Hδ feature (e.g., Worthey et al. 1994; Worthey & Ottaviani 1997; Nelan et al. 2005). Defining these indices is challenging, since the intrinsic Hδ absorption line profile is sensitive to the age of the composite stellar populations (Prochaska et al. 2007),. For example, K-dwarf stars have Hδ full-widths at half-maximum (FWHMs) of ∼1–2 Å, while A stars have FWHMs of ∼20 Å, with the wings of the absorption profile extending ∼50 Å from the center (Prochaska et al. 2007).

In order to accurately measure the strength of the Hδ feature for a variety of line profile shapes, we thus adopt a wide and narrow set of indices. The narrow set of indices measures Hδ as the average flux between 4095 Å < λ < 4114 Å, the blue continuum as the average flux between 4072 Å < λ < 4095 Å, and the red continuum as the average flux between 4114 Å < λ < 4130 Å. The broad set of indices measures Hδ as the average flux between 4082 Å < λ < 4124 Å, the blue continuum as the average flux between 4042 Å < λ < 4082 Å, and the red continuum as the average flux between 4124 Å < λ < 4151 Å.

In accordance with the age dependence of the line profile, the wide index is used to measure the strength of the Hδ feature for galaxies with a half-mass assembly time of $\lt 8\,\,\mathrm{Gyr}$ , while the narrow index is adopted for galaxies with a half-mass assembly time of $\gt 8\,\,\mathrm{Gyr}$ . The half-mass assembly time is measured directly from the model posteriors for each galaxy. This classification scheme qualitatively correlates the width of the Hδ feature in the observed spectra. The same index to measure the Hδ strength is used in both the model and the observations, and the model measurement is performed at the same 200 km s⁻¹ resolution. In the observed spectra, this method of measuring line fluxes inherently measures the sum of Hδ absorption and emission. Errors are derived with the same bootstrapping method described in Section 2.2.1.

3. Definition of the Physical Model and Inference of Model Posteriors from SEDs

The process of generating and fitting a galaxy model to an observed SED is described here. Section 3.1 describes the relevant fixed and free parameters in our physical model for galaxies, Prospector-α, and the associated priors for each parameter. Section 3.2 describes the methodology used to fit this model to the observed galaxy photometry.

3.1. Main Features of the `Prospector-`α Model

Here the salient features of our physical model for galaxies are described, including all relevant free and fixed parameters, and the priors for each free parameter. A simple illustration of the effect of each model parameter on the SED is shown in Figures 2 and 3. The physical origin of each parameter is described, and the change in the SED from varying each free parameter while holding the others fixed is shown. The example SED in these figures is a relatively quiescent galaxy, similar to the Milky Way (specific SFR (sSFR) $\sim 1\times {10}^{-11}$ yr⁻¹): we note that the effect of some parameters (e.g., birth-cloud dust, stellar metallicity) is highly dependent on the sSFR.

**Figure 2.** Six of the 13 free parameters in the `Prospector-`α model. For each parameter, we show from left to right: a panel showing the physical significance of the given parameter, a model SED with three different values of the given parameter, and the relative change in the SED for these three different values. The first panel shows the stellar mass function for stellar mass, and the full history of star formation for the five SFH bins. We note that the sixth SFH bin is not a free parameter, as it is inferred from the values of the other five bins (see Section 3.1.1 for further details).
Download figure:
Standard image High-resolution image

**Figure 3.** Seven of the 13 free parameters in the `Prospector-`α model. For each parameter, we show from left to right: a panel showing the physical significance of the given parameter, a model SED with three different values of the given parameter, and the relative change in the SED for three different values of the given parameter. For the stellar metallicity, the first panel shows the triangular weighting scheme used in the metallicity distribution function. For the dust absorption parameters, the first panel shows the optical depth for a single component of the dust model. For the dust emission parameters, the first panel shows either the distribution of starlight intensity incident on the dust or the grain size distribution for carbon dust.
Download figure:
Standard image High-resolution image

For stellar population synthesis, the Flexible Stellar Population Synthesis (FSPS) package⁴ is used (Conroy et al. 2009). The python-fsps ⁵ package is used to communicate with FSPS in a Python interface. All of the parameters and choices described below are implemented in the FSPS source code. We use the default SPS parameters in FSPS,⁶ except the MILES stellar library is adopted.

3.1.1. Stellar Mass and SFH

Stellar mass, defined as the mass of existing stars and stellar remnants, is a free parameter in the Prospector-α model. Stellar emission powers all sources of radiation in our model (gas, dust, and stars); thus, the stellar mass sets the overall normalization of the SED. Since the normalization constant is defined in mass instead of luminosity, it often has covariances with the SFH parameters. A flat prior is adopted on the logarithm of the stellar mass, 5 < log(M/M_⊙) < 14, and a Chabrier (2003) IMF is used.

A nonparametric SFH prescription is adopted in the Prospector-α model. Using a nonparametric SFH has the great advantage of avoiding the poorly quantified systematics introduced by using a parameterized SFH (Conroy 2013): for example, exponentially declining τ models inherently couple the early-time to late-time SFH (Simha et al. 2014). These systematics are particularly challenging when determining proper errors on the derived SFH.

In the literature, nonparametric SFHs have primarily been used when performing full-spectrum fitting to high-resolution, high S/N data, e.g., STECMAP (Ocvirk et al. 2006) or VESPA (Tojeiro et al. 2007), or both low-resolution spectra and photometry (Kelson et al. 2014; Dressler et al. 2016). This is because, when performing classical minimization routines, non-parametric SFHs are very flexible and have too many degenerate parameters, leading to poorly behaved and noisy best-fit solutions. Bayesian statistics are more suitable for this problem, particularly when coupled with a Monte Carlo Markov chain (MCMC) sampler which can fully explore the posterior probability for poorly constrained parameters. Using a non-parametric SFH with Bayesian statistics will result in clean, accurate errors on SFH parameters compared to previous approaches, at the expense of longer computational time.

Fundamentally, the selection of the number, size, and location of SFH bins is a tradeoff between the required computational time and maximizing the amount of SFH information extracted from the data. This is challenging for a model which is fit to a large sample of galaxies with diverse SFHs, as the recoverability of the SFH given a set of data is highly dependent on the SFH of the galaxy itself (e.g., Tojeiro et al. 2007). To inform our selection of the SFH bins, we use a procedure which is described in detail in B. D. Johnson et al. (2017, in preparation). Here, it is briefly summarized.

The goal is to estimate the fractional uncertainty on the SFH bins given a set of data (i.e., broadband filters) and the associated noise parameters. The fractional uncertainty on each bin is used to select the bin spacing and size. This is done with the Cramer–Rao bound, which is the matrix inverse of the Fisher information matrix. The Fisher information matrix measures the fractional curvature of the likelihood function near the location of maximum likelihood: small curvature implies a broad posterior, and thus more uncertainty in the parameters. In practice, the SFH recoverability is estimated assuming only a single metallicity and no dust. These simplifying assumptions mean that this procedure will specify a lower limit on the uncertainties in the bin sizes.

As a simple estimate, two exponentially declining SFHs are used: a star-forming galaxy with τ = 1 Gyr observed after t = 1 Gyr, and a quiescent galaxy with τ = 1 Gyr observed after 10 Gyr. This procedure suggests that the six time bins shown in the bottom six panels of Figure 2 recover all SFH information for the chosen model galaxies with the Brown et al. (2014) photometry. The spectrum of each SFH bin is shown separately in Figure 4. The model SFR is constant within each bin.

**Figure 4.** The SED for each of the six bins of the non-parametric SFH are shown here. Each spectrum has 1 solar mass of stars formed at t = 0. The youngest bin is the only bin which produces nebular emission lines.
Download figure:
Standard image High-resolution image

The most straightforward way to fit these bins to the data would be to fit directly for the mass in each bin. In practice, the fractional mass in each bin (i.e., the "shape" of the SFH) is used instead, and the mass normalization, M, is fit as a separate parameter. This separation helps MCMC convergence by restricting the SFH parameter space to a smaller, well-defined volume, and also provides cleaner posteriors. We fit for the weight in each bin, f_n, such that

$\begin{eqnarray}&&{m}_{n}=\displaystyle \frac{{t}_{n}{f}_{n}}{{\displaystyle \sum }_{N}{t}_{n}{f}_{n}},\end{eqnarray} \tag{ 1 }$

where m_n is the fractional mass in each bin, N is the number of bins, and t_n is the amount of time in each bin in years. The meaning of f_n is more clear in the following equation:

$\begin{eqnarray}&&{\mathrm{SFR}}_{n}={\rm{M}}\displaystyle \frac{{m}_{n}}{{t}_{n}}={\rm{M}}\displaystyle \frac{{f}_{n}}{{\displaystyle \sum }_{N}{t}_{n}{f}_{n}},\end{eqnarray} \tag{ 2 }$

where M is the total mass and SFR_n is the SFR in a bin. The weights f_n are thus proportional to the sSFR (and the SFR) within each time bin. This maps directly into the prediction of physical quantities such as the age of the stellar populations and the strength of spectral absorption and emission features. We require that ${f}_{n}\geqslant 0$ , i.e., no bin can have a negative amount of stars formed.

In order to ensure a unique mapping between the weights f_n and the shape of the resulting SFH, the weights are constrained during the fitting process such that

$\begin{eqnarray}&&\displaystyle \sum _{n}\ {f}_{n}=1.\end{eqnarray} \tag{ 3 }$

In practice, given the sampling algorithm, applying this constraint directly would be highly inefficient. This is because in each sampling step, the sampler chooses a new set of parameters without taking constraints into account, then tests whether the parameters are within the constraints: for this constraint, the probability of randomly chosen parameters which fulfill this constraint is vanishingly small.

To alleviate this inefficiency, we instead allow $N-1$ bin fractions to be free model parameters with the constraint that

$\begin{eqnarray}&&\displaystyle \sum _{n-1}\ {f}_{n}\leqslant 1.\end{eqnarray} \tag{ 4 }$

The Nth bin fraction is then determined by

$\begin{eqnarray}&&{f}_{N}=1-\displaystyle \sum _{1}^{N-1}{f}_{n}.\end{eqnarray} \tag{ 5 }$

These constraints means that, when the data have little to no constraining power over the SFH, the fractional variables f_n will follow a flat Dirichlet distribution. We have verified numerically that this puts a Dirichlet prior on both the N-1 explicit fractional variables and the Nth implicit fractional variable. The marginal probability distribution for a single f_n, which corresponds to the effective Dirichlet prior on each f_n, is

$\begin{eqnarray}&&\mathrm{PDF}={(1-x)}^{N-2}\displaystyle \frac{{\rm{\Gamma }}(N)}{{\rm{\Gamma }}(N-1)}.\end{eqnarray} \tag{ 6 }$

which is a Beta distribution. So for the SFR in a single bin with N = 6 bins, the prior probability distribution is $p{({f}_{n})\propto (1-{f}_{n})}^{4}$ with $0\leqslant {f}_{n}\leqslant 1$ .

The upshot of the above parameterization for the SFH is that there is a roughly Gaussian-shaped prior on the logarithm of the sSFR in each time bin, i.e., on log(SFR _bin/M_tot). The center of this prior is at log(1/t_univ) ∼ −10.1 yr⁻¹, which is a constant SFR. The FWHM of the Gaussian prior is roughly 1.5 dex. Thus, in the absence of strong evidence from the data (where "strong" is determined by the FWHM of the prior), Prospector-α will assign a constant SFR(t) to a galaxy.

A clear advantage of the nonparametric SFH formulation over libraries of SFHs from simulations is that the prior on sSFR(t) can be explicitly written down as has been done here. The SFH posterior for an individual galaxy is a combination of the prior and the constraining power of the data. By knowing the prior, it is straightforward to discern how much of the posterior comes directly from the data, and how much of it comes from assumptions that are built-in to every SFH formulation.

3.1.2. Stellar Metallicity

Stellar metallicity affects the optical-to-NIR flux ratio (see Figure 3), which is important in determining ages, dust attenuation, and masses (Bell & de Jong 2001; Mitchell et al. 2013). The stellar metallicity also helps to set the normalization and shape of the SED blueward of the Lyman limit, which is used to generate emission line fluxes.

When fitting galaxy SEDs, it is important to interpolate between metallicities rather than using a discrete set of metallicities, else the resulting stellar mass estimates will be biased (Mitchell et al. 2013). We refine this approach further and use a metallicity distribution function rather than interpolated metallicities. Simple interpolation between stellar spectra of different metallicities will introduce non-physical features in the interpolated spectra, because stellar SEDs have a nonlinear response to changes in metallicity. The metallicity distribution function avoids interpolation, producing more physically motivated SEDs.

Stellar metallicity does not vary with age in Prospector-α, and there is no age–metallicity relationship implemented. This is in contrast to expectations in galaxies, where an age–metallicity relationship is expected due to the build-up of heavy elements with the passing of cosmic time. In testing, we have found that implementing an age–metallicity relationship can impart a unique shape to the UV-NIR SED which cannot be replicated by a fixed stellar metallicity, though this depends on both the assumed SFH and metal enrichment history. For example, quiescent galaxies are less sensitive to an age–metallicity relationship, as their SFR timescales are shorter. The complexity of implementation combined with the poor constraining power of broadband photometry prevent inclusion of an age–metallicity relationship in Prospector-α. Ideally, implementation would be guided by performing a detailed comparison of stellar metallicities recovered by Prospector-α to robust spectroscopic metallicities from a sample of galaxies with a wide range in sSFR. This is a topic for future work.

The range in metallicity in our model is determined by the coverage of the Padova isochrones (Marigo & Girardi 2007; Marigo et al. 2008) in FSPS, which extend from $-2.00\lt \mathrm{log}({\rm{Z}}/{{\rm{Z}}}_{\odot })\lt 0.20$ . The metallicity distribution function is implemented by summing simple stellar populations at discrete metallicity values, using a triangular weighting scheme. This is illustrated in Figure 3. The metallicity is specified in units of log(Z/Z ${}_{\odot }$ ), where log(Z/Z ${}_{\odot }$ ) defines the center of the triangular weighting scheme. A flat prior over −2.0 < log( ${\rm{Z}}/{{\rm{Z}}}_{\odot })\lt 0.2$ is used in the Prospector-α model.

3.1.3. Dust Attenuation

We use the two-component Charlot & Fall (2000) dust attenuation model, which postulates separate birth-cloud and diffuse dust screens. The birth-cloud dust simulates the embedding of young stars in molecular clouds and H ii regions, the net effect of which is extra attenuation toward young stars. We allow the normalization of the birth-cloud and diffuse dust components to vary separately in the fits, along with a power-law index describing the shape of the attenuation curve for the diffuse dust component. The variance of the SED with these parameters is shown in Figure 3.

It is difficult to achieve tight posteriors on dust attenuation parameters from UV-NIR broadband photometry alone, due to the well-known age–dust degeneracy (Papovich et al. 2001) (see Section 3.1.6 for further discussion). This degeneracy can be broken either by directly measuring the dust emission with MIR or FIR broadband filters (Burgarella et al. 2005), or with excellent photometric coverage of the age-sensitive D_n4000 break (MacArthur et al. 2004). In the Brown et al. (2014) sample, the MIR is sampled via WISE and Spitzer photometry, and for a limited subsample, the FIR is measured with Herschel photometry.

The birth-cloud component ( ${\hat{\tau }}_{\lambda ,1}$ ) in our model has an attenuation curve that scales with wavelength as ${\lambda }^{-1}$ , and attenuates stellar emission only from stars formed in the last t = 10 Myr, the typical timescale for the disruption of a molecular cloud (Blitz & Shu 1980). It also attenuates nebular emission. It has the following functional form:

$\begin{eqnarray}&&{\hat{\tau }}_{\lambda ,1}={\hat{\tau }}_{1}{(\lambda /5500\mathring{\rm A} )}^{-1.0}.\end{eqnarray} \tag{ 7 }$

The diffuse component ( ${\hat{\tau }}_{\lambda ,2}$ ) has a variable attenuation curve, and attenuates all stellar and nebular emission from the galaxy. For the wavelength scaling, we use the following prescription from Noll et al. (2009):

$\begin{eqnarray}&&{\hat{\tau }}_{\lambda ,2}=\displaystyle \frac{{\hat{\tau }}_{2}}{4.05}[{k}^{\prime }(\lambda )+D(\lambda )]{\left(\displaystyle \frac{\lambda }{{\lambda }_{V}}\right)}^{n}.\end{eqnarray} \tag{ 8 }$

The free parameters in this equation are ${\hat{\tau }}_{\lambda ,2}$ , which controls the normalization of the diffuse dust, and n the diffuse dust attenuation index. $k^{\prime} (\lambda$ ) is the (fixed) Calzetti et al. (2000) attenuation curve. D(λ) is a Lorentzian-like Drude profile describing the UV dust bump. We tie the strength of the UV dust absorption bump to the best-fit diffuse dust attenuation index, following the results of Kriek & Conroy (2013). We adopt a flat prior over $0\lt {\hat{\tau }}_{\lambda ,2}\lt 4.0$ , a flat prior over $0\lt {\hat{\tau }}_{\lambda ,1}\lt 4.0$ , and a flat prior over $-2.2\lt {\text{}}n\lt 0.4$ . The upper limit on ${\text{}}n$ is chosen to disallow a flat attenuation curve, which would cause ${\hat{\tau }}_{\lambda ,2}$ to be nearly fully degenerate with the normalization of the SED.

The similar effects of ${\hat{\tau }}_{\lambda ,1}$ and ${\hat{\tau }}_{\lambda ,2}$ on the stellar SED means that they are often degenerate (though this is SFH-dependent). It is important to distinguish between these parameters to properly predict emission line equivalent widths. Previous work suggests that the total optical depth toward nebular emission lines is ∼twice that of the stellar component (Calzetti et al. 1994; Price et al. 2014), though the exact value varies with SFR and galaxy inclination (Wild et al. 2011). Translating this result into the Charlot & Fall (2000) model implies that ${\hat{\tau }}_{\lambda ,1}\sim {\hat{\tau }}_{\lambda ,2}$ , as ${\hat{\tau }}_{\lambda ,1}$ applies to the entire galaxy, whereas star-forming regions only experience attenuation from ${\hat{\tau }}_{\lambda ,2}$ . We adopt a joint prior on the ratio of the two: $0\lt ({\hat{\tau }}_{\lambda ,1}/{\hat{\tau }}_{\lambda ,2})\lt 2.0$ , which allows some reasonable variation around the fiducial results in the literature.

3.1.4. Dust Emission

To model the dust emission from galaxies, we assume energy balance, where all starlight attenuated by the dust is re-emitted in the IR (da Cunha et al. 2008). This assumption means that the MIR and FIR photometry are additional constraints both on the total amount of dust attenuation, and on the dust-free stellar SED. Simultaneous modeling of the dust and stellar emission is key to accurate SFRs, particularly for galaxies with lower sSFRs where dust-heating via old stellar populations becomes more important (e.g., Utomo et al. 2014). While a useful tool, energy balance cannot be used to determine L_IR from the UV-MIR SED alone without making some assumptions about the shape of the IR SED (see Appendix C for further discussion).

We use the Draine & Li (2007) dust emission templates to describe the shape of the IR SED. This model is based on the silicate-graphite-PAH model of interstellar dust (Mathis et al. 1977; Draine & Lee 1984). The size distribution of carbonaceous and silicate grains is calibrated to be consistent with the wavelength-dependent extinction in the Milky Way. We note that in our model, the shape of the dust emission is modeled independently from the shape of the dust attenuation curve. This is a reasonable implementation because the shape of the attenuation (not extinction) curve is largely set by the geometry and column depth of the dust, rather than the composition of the dust (Chevallard et al. 2013). In the future, it may be useful to tie the strength of the 2175 Å extinction bump to the strength of the PAH emission, as it has long been suggested that these features both arise from the same dust grains (Stecher & Donn 1965; Draine 1989).

The Draine & Li (2007) model has three free parameters controlling the shape of the IR SED: U_min, ${\gamma }_{{\rm{e}}}$ , and Q_PAH. The effect of these three parameters on the resulting SED is demonstrated in Figure 3.

U_min and ${\gamma }_{{\rm{e}}}$ together control the shape and location of the thermal dust emission bump in the IR SED. Physically, they describe the fraction of dust mass exposed to the distribution of starlight intensities U, which is parameterized in the Draine & Li (2007) model by a combination of a delta function and a power-law distribution over U_min $\lt \,U\lt {U}_{\max }$ :

$\begin{eqnarray}\displaystyle \frac{{{dM}}_{\mathrm{dust}}}{{dU}} & = & (1-{\gamma }_{{\rm{e}}}){M}_{\mathrm{dust}}\delta (U-{U}_{\min })\\ & & +{\gamma }_{{\rm{e}}}{M}_{\mathrm{dust}}\displaystyle \frac{(\alpha -1)}{({U}_{\min }^{1-\alpha }-{U}_{\max }^{1-\alpha })}{U}^{-\alpha },\end{eqnarray} \tag{ 9 }$

with ${{dM}}_{\mathrm{dust}}$ is the mass of dust heated by starlight intensities between U and U + dU, ${M}_{\mathrm{dust}}$ is the total dust mass, and α is a power-law index. Draine & Li (2007) finds that MIR photometry of galaxies in the SINGS survey (Kennicutt et al. 2003) is well-reproduced with $\alpha =2$ and ${U}_{\max }={10}^{6}$ , which we adopt here. ${M}_{\mathrm{dust}}$ is determined by the normalization of the SED.

Specifically, U_min represents the minimum starlight intensity to which the dust mass is exposed, and ${\gamma }_{{\rm{e}}}$ represents the fraction of dust mass which is exposed to this minimum starlight intensity. The minimum starlight intensity roughly represents the intensity experienced by dust in the general diffuse interstellar medium (ISM). The inclusion of this delta function at the minimum starlight intensity is the major difference between the Draine & Li (2007) and Dale & Helou (2002) models for ${{dM}}_{\mathrm{dust}}/{dU}$ .

The final free parameter, Q_PAH, describes the fraction of total dust mass that is in PAHs. This parameter is particularly important because a substantial fraction or a majority of the MIR emission comes from strong PAH emission features.

The PAH ionized fractions and absorption cross-sections in Draine & Li (2007) are updated from the Li & Draine (2001) model by using observations of SINGS galaxies with 5–38 μm Spitzer IRS spectra (Smith et al. 2007). Draine & Li (2007) provide IR spectral templates for 0.1 < Q_PAH < 4.58. We note that with the excellent MIR coverage of the Brown et al. (2014) spectral atlas, Q_PAH can be very well-determined photometrically (see Appendix A for further details).

We adopt a flat prior of $0.1\,\lt$ U_min $\lt \,15$ and $0.0\,\lt$ ${\gamma }_{{\rm{e}}}$ $\lt \,0.15$ . For Q_PAH, we extrapolate the Spitzer IRS spectra out past Q_PAH = 4.58, and adopt a flat prior over $0.1\,\lt$ Q_PAH $\lt \,7.0$ . The extrapolation is linear in both flux and Q_PAH. In brief, the adopted priors are consistent with both the SINGS sample (Draine et al. 2007) and the Brown et al. (2014) galaxies with Herschel photometry, and adopting more permissive priors would bias the L_IR, SFR, and dust attenuation in galaxies without Herschel photometry. We discuss the origin and effects of these priors further in Appendix C.

3.1.5. Nebular Emission

We generate nebular continuum and line emission using the CLOUDY (Ferland et al. 1998, 2013) implementation within FSPS. This is described in detail by Byler et al. (2016), and summarized briefly here.

The CLOUDY implementation in FSPS is a grid in the following parameters: (1) simple stellar population (SSP) age, (2) SSP and gas-phase metallicity, and (3) the ionization parameter, U, which is the ratio of ionizing photons to the total hydrogen density. Before the SSPs within FSPS are combined into composite stellar populations, the number of ionizing photons in each SSP is calculated and removed from the SSP SEDs. The energy removed from the SSPs is then rerouted into line luminosities and nebular continua. This is only done for SSPs with ages $\lt 10\,\,\mathrm{Myr}$ , as the ionizing contribution from older SSPs is very small. This also implicitly assumes an ionizing photon escape fraction of zero.

Within the Prospector-α model, the gas-phase metallicity is set equal to the model stellar metallicity, and the ionization parameter U is fixed to 0.01 in all fits. We note that the ionization parameter has very little effect on the Balmer emission line luminosities (e.g., Hα or Hβ), though it has a strong effect on the luminosity of forbidden lines ([O ii], [O iii], etc.). In principle, it could be fit as well, though broadband photometry alone provides very little constraining power. For the Brown et al. (2014) sample, the contribution to the broadband photometry from emission lines is relatively low (≲5%) due to the relatively low sSFRs, so U is fixed for simplicity. In the future, it may be useful to allow U to vary, perhaps with a joint prior between sSFR and U, as suggested by recent work which finds a strong relationship between the two (e.g., Dickey et al. 2016). This would be particularly relevant for galaxies with high sSFR, where the nebular contribution to the broadband flux can become significant (for typical star-forming galaxies at z ∼ 1–2, it is 5%–10% or greater in certain rest-frame optical bands; Pacifici et al. 2015).

In the Prospector-α model, after production, emission lines fluxes experience attenuation from both the birth-cloud and diffuse dust component (see Section 3.1.3). The emission line equivalent widths are unaffected by attenuation from the diffuse dust component, as it attenuates the stellar emission equally. However, the emission line equivalent widths are sensitive to the birth-cloud attenuation, since it only attenuates emission from young stars and nebular emission. Both nebular continuum and line emission are added to the full SED model, and their effects are included in the broadband photometry.

3.1.6. Example Model Posteriors

Here we describe the typical posteriors for an application of the Prospector-α model to the Brown et al. (2014) photometry. Figure 5 shows a "corner"⁷ plot for NGC 0628, a mildly star-forming galaxy chosen to illustrate several important parameter degeneracies.

$\pm 1\sigma $ — **Figure 5.** A "corner" plot, where the posterior probability distribution functions are shown for each model parameter along the diagonal panels, and the correlations between model parameter posteriors are shown along the columns. Above each probability distribution, the median of the parameter posterior is printed, along with the $\pm 1\sigma$ error bars. In the upper right panels, the posterior probability functions for the half-mass time, the SFR, and the sSFR are also shown. The object shown is NGC 0628, a mildly star-forming galaxy. This object was chosen to best illustrate important parameter degeneracies, including the dust–metallicity-–degeneracy (dust2-logzsol-sfr_fraction1, respectively), and the amount of dust and the shape of the attenuation curve (dust2 and dust_index).
Download figure:
Standard image High-resolution image

The panels along the diagonal show the posterior probability functions for each model parameter. The median of each posterior is printed above the diagonals, with the ±1σ credible intervals also indicated. Most model parameters have Gaussian-like posteriors, and the exceptions in the SFH and dust parameters are a result of explicit or implicit priors in the Prospector-α model. The fractional SFH parameters largely have $p{({f}_{n})\propto (1-{f}_{n})}^{4}$ , as a result of the implicit prior set by the SFH implementation described in Section 3.1.1. This is not true for the youngest SFH time bin, where the evidence from the photometry is strong enough to reshape the posterior into a Gaussian. The birth-cloud dust parameter posterior, dust1, is truncated where dust1 = dust2, as a result of the dust priors described in Section 3.1.3. Finally, the dust_index parameter, which controls the shape of the dust attenuation curve, is against the prior limit described in Section 3.1.3.

The panels in each column show the correlation between the posteriors for each model parameter. These illustrate several degeneracies which are important in understanding the limited precision for certain parameters, including the dust–metallicity–SFR degeneracy (dust2-logzsol-sfr_fraction1, respectively), and the amount of dust and the shape of the attenuation curve (dust2 and dust_index). The parameters controlling the shape of the dust emission (duste_gamma, duste_qpah, and duste_umin) are also mildly degenerate with one another, due to the limited sampling of the IR SED. These degeneracies can be accurately explored by Prospector due to the on-the-fly model generation and the MCMC sampling.

3.2. Fitting Procedure

In this section we describe the algorithms used to generate and fit model SEDs to the observed photometry.

3.2.1. Inference Framework

We construct the Prospector-α galaxy model within Prospector.⁸ Prospector is a Python package which serves as a framework to bring together a galaxy model with a noise model in a posterior probability function. That probability function is then passed to algorithms which either maximize or sample the posterior probability function. Prospector has a highly flexible, modular design: input observations can be photometric, spectroscopic, or both; the sampling/minimizing procedure can be changed to suit the problem at hand; and the likelihood function, physical model, and priors are cleanly separated from one another.

We adopt a simple chi-squared noise model, i.e., we model the photometric noise with Gaussian independent errors of known dispersion. We note that Prospector has a number of more sophisticated noise models available, including estimation of covariant noise between sets of photometric filters, and estimating an overall jitter term from the available photometric data.

3.2.2. Minimization and Sampling

We use a two-step process to find the posterior probability distribution function for our model. The first step is a simple minimization scheme intended to find the best-fit model parameters. The second step is an MCMC sampler, which is initialized in the region of the best-fit parameters from the previous step.

The minimization step uses the Powell minimization scheme from the scipy Python package. We take advantage of multiprocessing to perform this process. For N available processors, N sets of initial parameter values are generated randomly within the constraints of the priors, and then each processor performs an individual Powell minimization scheme for a specified number of iterations, or to a specified likelihood tolerance, whichever is reached first. The best-fitting model parameters from the Powell routine are used as the initial conditions for the next step.

We use the emcee package (Foreman-Mackey et al. 2013) for the sampling step. emcee is a Python-based MCMC sampler based on an affine-invariant algorithm. In brief, emcee uses an ensemble of walkers to explore probability space. Each walker will attempt a specified number of steps in parameter space. A step is attempted by first proposing a new position, and then calculating the likelihood of the new position based on the data. The walker has a chance to move to the new position based on the ratio of the likelihood of its current position to that of the new position. After a specified number of steps, the algorithm is completed and the position and likelihood of each walker as a function of step number is written out. These positions and likelihoods are used to generate the posterior probability distribution function for each model parameter.

To initialize emcee, we generate walker positions in a Gaussian ball around the best-fitting model parameters from the Powell minimization phase. The dispersion of the Gaussian ball is customized for each parameter. The dispersions are chosen to represent the typical uncertainty in each parameter. If the first minimization process returns a set of best-fit parameters close to the global maximum, then convergence is swift and the probability space around the global maximum will be well-sampled. If the initial conditions are far from the global maximum, the walkers will attempt to reach and explore the global maximum before the process is completed. In some cases the process completes before the walkers have reached an equilibrium around the global maximum (before they have burned in), and the resulting parameter probability distributions will not be useful. We visually inspect plots of the walker location in each parameter as a function of iteration step for each galaxy to ensure they are burned in. If the MCMC process is not burned in, the fit is repeated with more iterations until equilibrium is achieved. Future versions of Prospector will have quantitative criteria to assess convergence.

It is critical that we use an MCMC algorithm to explore our model posterior, as a standard grid approach with 13 parameters and sufficient grid spacing would take up a prohibitive amount of memory. One way to limit the amount of grid space to explore is to use generating functions to sample reasonable combinations of parameters, and create static template libraries from this sampling of parameter space. However, this method necessarily results in opaque priors. On-the-fly stellar populations generation with FSPS combined with MCMC sampling allows Prospector to explore model parameter space free from the complex priors from pre-generated template libraries, at the expense of increased computational time.

4. Evaluating the Quality of the Model Fits

In this section we discuss the results of fitting the photometry from the Brown et al. (2014) sample with the Prospector-α model. Section 4.1 describes the residuals from the fits to the photometry. Section 4.2 describes overall trends in the residuals between the best-fit model spectra and the observed spectra; the spectra are not included in the fits, so this is an excellent test of the Prospector-α model. The remaining sections quantify the predictive value of specific features or combinations of features in the observed spectra from fits to the photometry: Hα and Hβ emission (Section 4.3), Balmer decrements (Section 4.5), D_n4000 (Section 4.4), Hδ absorption (Section 4.6), and stellar metallicity (Section 4.7). The spectral quantities presented from the Prospector-α model are calculated from the median of the model posterior, and the error bars are the 16th and 84th percentiles of the model posteriors.

4.1. Photometric Residuals

In Figures 6 and 7 we show example model fits to the photometry of a star-forming and a quiescent galaxy, respectively, along with the photometric residuals, the best-fit and marginalized SFHs, and a comparison between the best-fit model spectrum and the observed spectra.

**Figure 6.** Broadband photometry of a star-forming galaxy (top panel), NGC 6090, along with the best-fit photometry and spectrum of the `Prospector-`α model. The 16th and 84th percentiles of the model spectrum are shaded in gray. The `Prospector-`α model is fit to the broadband photometry only. The 16th, 50th, and 84th percentiles of the SFH are in the inset panel. The photometric residuals are in the lower attached panel. Comparisons between the best-fit model spectrum and the optical, *Akari*, and *Spitzer* IRS spectra are shown in the lower three panels. For the optical and *Akari* spectra, the models are smoothed to the resolution of the data; for the *Spitzer* IRS spectra, the data are smoothed to the resolution of the models. Disagreement in the emission line fluxes at [O ii 3727], [O iii 4959], and [O iii 5007] is expected, as the ionization parameter is fixed in the `Prospector-`α model (see Section 3.1.5).
Download figure:
Standard image High-resolution image

**Figure 7.** Broadband photometry of a quiescent galaxy (top panel), NGC 4125, along with the best-fit photometry and spectrum of the `Prospector-`α model. The 16th and 84th percentiles of the model spectrum are shaded in gray. The `Prospector-`α model is fit to the broadband photometry only. The 16th, 50th, and 84th percentiles of the SFH are in the inset panel. The photometric residuals are in the lower attached panel. Comparisons between the best-fit model spectrum and the optical and *Spitzer* IRS spectra are shown in the lower two panels. For the optical spectra, the models are smoothed to the resolution of the data; for the *Spitzer* IRS spectra, the data are smoothed to the resolution of the models.
Download figure:
Standard image High-resolution image

We show the photometric and spectral residuals for the entire catalog in Figure 8. The residuals from star-forming and quiescent galaxies are separated, where star-forming is defined by an sSFR cut of 10⁻¹¹ yr⁻¹.

**Figure 8.** Residuals between the observations and model predictions as a function of rest-frame wavelength. Star-forming galaxies (defined as having sSFR(`Prospector-`α) $\gt {10}^{-11}$ yr⁻¹) are shown in blue, while quiescent galaxies are shown in red. The residuals for individual galaxies are shown as light lines, while the medians over star-forming and quiescent galaxies are shown separately as heavy lines. The panels show, from top to bottom, the residuals for broadband photometry, the optical spectra, the *Akari* spectra, and the *Spitzer* IRS spectra. The panels on the right-hand side show histograms of the ${\chi }^{2}$ and RMS values. Strong emission lines are masked when calculating the optical RMS. Systematic features are clearly visible: for example, large residuals at 3.3 μm in the *Akari* spectra suggest that the Draine & Li (2007) model overpredicts the 3.3 μm PAH emission line strength.
Download figure:
Standard image High-resolution image

The photometric residuals are lowest at well-studied optical and NIR wavelengths, and increase in the UV and MIR bands. A significant minority of the galaxies show IR colors and emission line ratios indicative of AGN activity (Brown et al. 2014). This may contribute to the residuals at MIR wavelengths by adding significant amounts of non-stellar emission. The larger MIR residuals also reflect the well-known difficulty of modeling diffuse dust, PAHs, and circumstellar disks heated by AGB stars, star formation, and AGN (Mentuch et al. 2009; Lange et al. 2016).

4.2. Spectral Residuals

Here we describe the residuals between the observed spectra and the model spectra. The model spectra here are generated from the posteriors of the fit to the broadband photometry; the observed spectra are not fit.

The spectral residuals in the optical are flat and have RMS deviations at the <10% level for the majority of both quiescent and star-forming galaxies. The strongest residuals are in emission lines, particularly the [O ii] and [O iii] lines. This is due to their sensitivity to the ionization parameter (Kewley et al. 2013). The ionization parameter is fixed in the Prospector-α model. This has a negligible effect on the fit for most galaxies, as forbidden lines generally contribute a small fraction of the flux in the broadband photometry, with the exception of highly star-forming galaxies (Pacifici et al. 2015).

The Akari and Spitzer IRS spectral residuals are also relatively flat with wavelength. There is substantial variation in the absorption feature at 10 μm, which is likely related to AGN activity (e.g., Nenkova et al. 2008a, 2008b. The dominant residuals come from PAH emission features. This is not unexpected, as there is significant galaxy-to-galaxy variation in the strength of PAH features (Smith et al. 2007; O'Dowd et al. 2009). Specifically, the strength of many of the PAH emission features at $6\lt \lambda \lt 20$ μm show substantial variation, while the 3.3 μm feature is strongly overpredicted. As discussed in Section 3.1.4, the Draine & Li (2007) dust emission model is calibrated to Spitzer IRS 5–38 μm spectra of SINGS galaxies, and so the 3.3 μm feature was not in the calibration range of the Draine & Li (2007) model. The systematic differences in the remaining PAH features may be related to sample selection differences between the Brown et al. (2014) spectral catalog and the SINGS survey (Kennicutt et al. 2003). The Brown et al. (2014) selection is based on the availability of the drift-scan spectroscopy and is thus highly heterogeneous, with significant diversity in stellar masses and sSFRs (see Figure 1), whereas the SINGS survey targets "normal" (∼L*) star-forming galaxies (Kennicutt et al. 2003). The correlation between the strength of the PAH features and the variation in Q_PAH in the Prospector-α fits is discussed further in Section 5.2.2.

There are also substantial residuals in the absorption feature at 10 μm for star-forming galaxies. This is likely related to AGN activity (e.g., Nenkova et al. 2008a); see the discussion in Section 6 for more details.

4.3. Balmer Emission Lines

The upper panels of Figure 9 show the observed Hα and Hβ luminosities versus the model Hα and Hβ luminosities, as predicted by the Prospector-α model. The points are color-coded by their position on the Baldwin–Phillips–Telervich (BPT) diagram, which is a diagnostic for their primary excitation mechanism (Baldwin et al. 1981). Only galaxies with observed Hα and Hβ S/N > 5 are shown. We note that these cuts preferentially select galaxies with higher sSFRs: the average Prospector-α sSFR for the entire sample is 3 × 10⁻¹⁰ Gyr⁻¹, with a range of 3 × 10⁻¹⁴ yr⁻¹ to 6 × 10⁻⁹ yr⁻¹, while after the S/N cuts, the average sSFR is 6 × 10⁻¹⁰ yr⁻¹, with a range of 6 × 10⁻¹¹ yr⁻¹ to 6 × 10⁻⁹ yr⁻¹.

**Figure 9.** Predicted Hα and Hβ from fitting the `Prospector-`α model to the broadband photometry, compared to observed values from the spectrum. The increase in scatter from Hα to Hβ may be related to the intrinsically weaker Hβ emission being more affected by uncertainty in the underlying stellar absorption. Only galaxies with S/N (Hα, Hβ) $\gt 5$ are shown. Galaxies are color-coded by their BPT classification. The equivalent widths are in Å, while the luminosities are in ${L}_{\odot }$ .
Download figure:
Standard image High-resolution image

There is excellent agreement between the models and observations, with a small offset of ∼0.13 (0.07) dex and ∼ 0.18 (0.2) dex scatter in Hα (Hβ) across a wide range of galaxy types and stellar masses (6.3 < log M/M ${}_{\odot }\lt 11.4$ ). The good agreement between predicted and observed Hα suggests that galaxy SFRs do not strongly vary over 100 Myr timescales.

Figure 9 also shows the same comparison in equivalent width (EW). The EW and luminosity comparisons roughly measure the ability of the fitter to recover sSFR and SFR, respectively. The EWs show a similar amount of scatter and offset to the luminosity comparison, suggesting that sSFR and SFR are equally well-recovered.

Hβ is recovered with a similar accuracy to Hα, with slightly more scatter. One possible source of this extra scatter is errors in the model dust attenuation curve, which would have a larger effect on Hβ than Hα since it is at bluer wavelengths. Errors in the absorption corrections are another possibility: since Hβ line emission is intrinsically weaker than Hα, the absorption correction makes up a larger percentage of the obsered flux.

The excellent agreement between observed Hα and Hα from Prospector-α fits to the broadband photometry is a key result of the paper, and we spend much of the remainder of the manuscript discussing it. In Section 5.1, we show that the primary determinants of the Hα luminosity from a galaxy are the SFR over the last 10 Myr, the dust attenuation, and the stellar metallicity. We discuss the correlation between the Hα and Hβ residuals in Section 5.1.3, and show that this means the dominant source of error in predicting the Balmer lines is not dust reddening, but rather the overall normalization of the Balmer lines. We also use the spread between the predictions and the observations to show that the width of the model Hα posteriors is accurate to within ∼20%. In Appendix B, we demonstrate that an SED-fitting code of similar complexity, MAGPHYS, shows considerably more scatter in model–observational Hα predictions.

4.4. D_n4000

Figure 10 shows the D_n4000 measured from optical spectroscopy versus the D_n4000 predicted from fits to the photometry. D_n4000 is a classic spectral indicator of both the stellar metallicity and stellar age of a galaxy (Bruzual 1983; Hamilton 1985; Balogh et al. 1999). The Prospector-α model recovers D_n4000 from the broadband photometry alone across a wide range of stellar ages and metallicities.

The Prospector-α model fits slightly underpredict the age and/or metallicities of galaxies with larger D_n4000. This is likely related to the maximum galaxy age in the Prospector-α model, which, due to the spacing of the SFH bins, is ∼10 Gyr (representing the average stellar age when all SFR occurs in the oldest SFH bin).

Appendix A explores the ability of the Prospector-α model to recover D_n4000 in mock tests. D_n4000 is recovered with no bias and very little scatter in the mock tests. This is not unexpected, as the SFH priors in the mock tests are the same as the SFH priors in the Prospector-α model; however, this is likely not true when fitting real galaxies. Since the SFH at ≳1 Gyr recovered from broadband photometry alone can be very prior-dependent, it may be useful to adjust the priors on the SFH parameters in future analyses to minimize the residuals in plots such as Figure 10.

4.5. Balmer Decrements

The Balmer decrement is the observed ratio of Hα/Hβ emission line fluxes. For case B recombination, the intrinsic ratio of these fluxes is set to 2.86 by atomic physics. Any deviation from this ratio can be converted directly into the reddening along lines of sight to H ii regions (e.g., Osterbrock 1989; Calzetti et al. 2000; Brinchmann et al. 2004).

In Figure 11, we show the observed Balmer decrements compared to the Balmer decrements predicted from fitting the Prospector-α physical model to the photometry. The units of the plot are the logarithm of the amount of reddening between Hα and Hβ. The S/N and EW cuts are identical to the Hα and Hβ comparisons. The model Balmer decrements are calculated directly from the posteriors for the model dust parameters; all three dust parameters ( ${\hat{\tau }}_{\lambda ,1},$ ${\hat{\tau }}_{\lambda ,2}$ , and n) are used to calculate the reddening toward H ii regions. The dust reddening toward AGN is systematically overpredicted, which is expected; Prospector-α fits the excess MIR emission caused by the AGN-heated dust by adding an excess of dust to the stellar model.

**Figure 11.** Predicted reddening toward H ii regions from fitting the `Prospector-`α model to the broadband photometry compared to the observed reddening. Reddening is calculated from the observed Balmer decrement for each galaxy, and converted into the difference in dust attenuation toward the birth-cloud regions between Hα wavelengths and Hβ wavelengths. Classically, accurate nebular dust attenuation measurements have required expensive spectroscopic observations; this comparison suggests they may be recovered accurately from the photometry instead. Galaxies are color-coded by their BPT classification. Only galaxies with S/N (Hα, Hβ) $\gt \,5$ are shown.
Download figure:
Standard image High-resolution image

Overall, we find excellent agreement between the spectroscopic reddening measurement and the reddening inferred by the photometric model. This is a particularly encouraging result, because the effect of the dust parameters on the observed photometry is often highly degenerate (see Figure 3). The sample size of studies that rely on expensive spectroscopic Balmer decrements for reddening measurements can be greatly expanded by using full SED fits to estimate the reddening from simple broadband photometry (e.g., Gallazzi et al. 2005; Shivaei et al. 2016a).

4.6. Hδ Absorption

Hδ is a hydrogen Balmer line that will appear in either emission or absorption, depending on the mix of stellar populations in the observed galaxy. Hδ absorption is a well-known proxy for stellar age, as it measures the temperature of the main-sequence turnoff (Worthey & Ottaviani 1997; Kauffmann et al. 2003). To a lesser extent, it is also affected by stellar metallicity (Worthey & Ottaviani 1997).

In Figure 12, we show the observed Hδ EW versus the Prospector-α Hδ EW marginalized over the posterior of the fit parameters., Both an absorption-only model and an absorption+emission model are compared. We select galaxies where the Hδ feature has an S/N > 5. The Hδ absorption is stronger than the emission for all points shown.

**Figure 12.** Hδ EWs measured from the observed spectra compared to predictions from `Prospector-`α fits to the broadband photometry. The Hδ feature is, in most galaxies, a mixture of absorption and emission, both of which are modeled by `Prospector`. The left panel compares to only the model absorption, while the right panel compares to the modeled emission+absorption. Here, higher values of EW indicate more absorption. The absorption-only models show good correlation, though they are systematically stronger than the observed values. Adding the expected nebular emission removes this offset at the expense of added scatter. Galaxies are color-coded by their BPT classification. Only galaxies with S/N Hδ absorption $\gt 5$ are shown. The equivalent widths are in Å.
Download figure:
Standard image High-resolution image

Figure 12 shows reasonable agreement between the absorption-only model and observed Hδ strength (left panel), with a small scatter of 0.87 in EW. There is a systematic offset such that the absorption-only model produces stronger absorption than is observed. When adding in the expected nebular emission (right panel), this bias is reduced and instead a small offset is produced in the other direction, such that the feature is systematically weaker than observed. This overestimation of the strength of Hδ nebular emission is consistent with the mild overestimation of Hα and Hβ nebular emission in Figure 9. The enhanced scatter when using the absorption+emission model is not unexpected, given the size of the model error bars. The large model error bars come from the model marginalizing over both emission and absorption in a regime where both are nearly of equal strength, and the fact that uncertainties in the dust attenuation model affect Hδ more strongly than Hα or Hβ, since it is at a bluer wavelength.

4.7. Stellar Metallicity

We compare photometric stellar metallicities to spectroscopic metallicities from the ATLAS-3D survey (Cappellari et al. 2011), presented in McDermid et al. (2015). ATLAS-3D simultaneously estimates metallicities, ages, and alpha abundances using Schiavon (2007) simple stellar populations matched to observed measurements of the Lick indices (Worthey et al. 1994) for Hβ, Fe5015, and Mg b. The overlap between the Brown et al. (2014) sample contains only quiescent galaxies. The ATLAS-3D metallicities are measured within the effective radius, R_e, whereas the Brown et al. (2014) spectroscopic aperture varies substantially from galaxy to galaxy (see Figure 1), though it is generally larger than R_e. This may introduce systematics on the order of ∼0.1 dex into this comparison, as early-type galaxies on average have negative stellar metallicity gradients with radius (e.g., Zheng et al. 2017 measure a gradient of −0.09 ± 0.01 dex/R_e in nearby ellipticals). The ATLAS-3D metallicities are compared with Prospector-α metallicities in Figure 13.

**Figure 13.** Comparing the photometric stellar metallicities to the ATLAS-3D stellar metallicities. ATLAS- 3D metallicities are derived from spectra using Lick indices. Lick indices are then translated into metallicities by comparison with simple stellar population models. `Prospector` metallicities come from fits to the broadband photometry. The agreement between the two measurements is excellent. We note that only quiescent galaxies are in the overlap with the ATLAS-3D survey.
Download figure:
Standard image High-resolution image

There is an excellent one-to-one correlation between the photometric and spectroscopic metallicities, with little scatter or offset. The scatter is consistent with the error bars on the ATLAS-3D metallicities and the Prospector-α metallicities. This agreement is encouraging, given the very different methodology used to estimate stellar metallicities. It would be useful in the future to also compare to spectroscopic metallicities from star-forming galaxies, as the photometric recovery of stellar metallicities may be a strong function of sSFR.

5. Interpretation of the Results

5.1. A Critical Look at the Balmer Emission Line Luminosities

In this section we critically examine the factors determining the Balmer line luminosities within the Prospector-α model, and show that the observed scatter is consistent with predictions from the model to within 20%–30% for Hα and Hβ.

Specifically, in Section 5.1.1, we examine in turn each of the factors in the Prospector-α model affecting the Balmer line luminosity: SFR, stellar metallicity, and dust attenuation. The agreement between the observed and predicted lines validates the accuracy of these model parameters. It is also demonstrated that neglecting the effect of stellar metallicity in the Hα luminosity results in a substantial increase of the scatter between observations and model predictions. In Section 5.1.2, we investigate how Hα can be determined from the broadband photometry, and argue that Hα is best predicted from the broadband photometry by full UV-IR SED fits. Finally, in Section 5.1.3, we explore the origins of the scatter between the model and observed Hα and Hβ emission, showing that it is (a) not dominated by errors in the reddening curve, and (b) the scatter is consistent with the width of the posteriors in emission line luminosity from the Prospector-α model, which is ultimately determined by the size of the errors in the broadband photometry.

5.1.1. Which Model Parameters Determine Balmer Line Luminosity?

Balmer emission lines are produced when hydrogen recombines after being ionized by high-energy photons. The conversion from number of ionizing photons to the intrinsic Hα luminosity is effectively a constant. The primary stellar source of ionizing photons is young, massive stars that live for a very short period of time (<10 Myr; e.g., Kennicutt 1998; Conroy 2013 and references therein). Testing model predictions of the Hα luminosity is thus, in large part, a test of the recent SFR.

In addition to SFR, the dust attenuation and the stellar metallicity affect the emission line luminosity. In the Prospector-α galaxy model, the dust attenuation of the line emission is determined by a combination of three parameters: the optical depth of the diffuse dust, the optical depth of the birth-cloud dust, and the shape of the diffuse dust attenuation curve. The birth-cloud dust and diffuse dust are largely degenerate in fits to the broadband photometry (see Figure 3): the ratio of the two is constrained in the model priors to mitigate uncertainty.

Stellar metallicity affects the Hα and Hβ luminosities by changing the production rate of ionizing photons at a fixed SFR: at lower metallicities, there is an increased number of ionizing photons, resulting in a higher Hα luminosity. To illustrate the effect of stellar metallicity on the Hα luminosities, we use the Kennicutt (1998) conversion between recent SFR and predicted Hα luminosity:

$\begin{eqnarray}&&\mathrm{SFR}({M}_{\odot }\,{\mathrm{yr}}^{-1})=\displaystyle \frac{L({{\rm{H}}}_{\alpha })}{1.26\times {10}^{41}\ \mathrm{erg}\ {{\rm{s}}}^{-1}}.\end{eqnarray} \tag{ 10 }$

This is derived using a Salpeter (1955) IMF and assumes a fixed, solar metallicity. We correct our SFRs from a Chabrier to a Salpeter scale by multiplying by a factor of 1.7, which accounts for the additional low-mass stars in a Salpeter IMF.

We plug the SFR inferred from the photometric fit into Equation 10 to give another estimate of Hα luminosity (in addition to the CLOUDY estimates), and attenuate the resulting Hα luminosities with the Prospector-α model dust attenuation.

In the left panel of Figure 14, we compare the analytical, fixed-metallicity Hα luminosities from Equation 10 to the observations. There is an additional scatter of 0.05 dex over the same comparison with the CLOUDY Hα predictions (Figure 9).

**Figure 14.** Observed Hα luminosity in units of ${L}_{\odot }$ compared to the Hα luminosity calculated from Equation 10, and attenuated by the model dust properties (left panel). The scatter is 0.23 dex, compared to the scatter of 0.18 dex from the `CLOUDY` Hα predictions which incorporate model stellar metallicities. The right panel shows the ratio between `CLOUDY` vs. Kennicutt Hα luminosities. The residuals are nearly a monotonic function of stellar metallicity. This demonstrates that incorporating the stellar metallicity improves the prediction of Hα luminosities. Galaxies are color-coded by their BPT classification. Only galaxies with S/N (Hα, Hβ) $\gt 5$ are shown.
Download figure:
Standard image High-resolution image

The difference between the two estimates is because the CLOUDY predictions integrate the ionizing flux directly from the stellar model, thus including the effect of a variable stellar metallicity on the number of ionizing photons. We demonstrate this directly by comparing the ratio of the analytical Hα luminosities to the CLOUDY predictions in the right panel of Figure 14, as a function of model stellar metallicity. The ratio of the two is a nearly monotonic function of stellar metallicity.

Because the gas-phase and stellar metallicities are set equal in the Prospector-α model (Section 3.1.5), the right panel of Figure 14 includes the effect of changing the gas-phase abundance within CLOUDY. However, this effect is negligible compared to the effect of stellar metallicity.

The increased predictive power of the CLOUDY Hα predictions, compared to the Hα predictions from Equation 10, is indirect evidence that the Prospector-α metallicities for star-forming galaxies derived from the broadband photometry are accurate.

We have shown here that neglecting the effect of stellar metallicities can dominate the scatter between model and observed Hα. We thus caution against interpreting observational differences between Hα SFRs and other SFR indicators as variation in SFRs over a short timescale, without first taking stellar metallicity into account.

5.1.2. How is Hα Luminosity Predicted from the Broadband Photometry?

In this section we demonstrate how the model Hα luminosity is sensitive to the shape and normalization of the observed SED, and argue that full-SED fitting is the best way to predict Hα from the photometry.

Typical mock SEDs at three different Hα luminosities are generated by drawing model parameters that affect Hα luminosity (dust, metallicity, and SFH) randomly within their priors, while other model parameters are held fixed. The typical SEDs at each Hα luminosity are shown in the left panel of Figure 15.

The primary signature of increasing Hα luminosity in the galaxy SED is an increase in the total IR and UV luminosities, L_IR + L_UV. This is because L_UV and L_IR are directly correlated with the recent SFR (Kennicutt 1998).

Measuring L_IR + L_UV alone does not uniquely determine the Hα luminosity. The right panel of Figure 15 shows individual model SEDs created at a fixed L_IR + L_UV. The SEDs are color-coded by the resulting observed Hα luminosity, which spans a range of ∼0.7 dex. This variation in Hα luminosity comes from variation in the dust attenuation at Hα wavelengths, which is visible in the right panel of Figure 15 as a gradual increase in L_IR/L_UV with decreasing Hα luminosity.

The upshot of Figure 15 is that accurate predictions of the Hα luminosity require: (1) estimates of the energy budget available for Hα photon production (L_UV + L_IR), and (2) the dust attenuation toward nebular regions at Hα wavelengths.

With only UV-MIR broadband photometry, it is possible to obtain a crude estimate of the total energy budget available for Hα photon production from measurements of L_UV and the UV slope β by using the L_UV/L_IR (known as "IRX")–β relationship⁹ (Meurer et al. 1999; Hao et al. 2011). This method will also give the nebular dust attenuation, if one assumes a shape for the dust attenuation curve and a conversion from stellar to nebular attenuation. However, the IRX–β relationship is sensitive to variations in the intrinsic UV slope from variation in the recent SFH (Kong et al. 2004; Boquien et al. 2012); furthermore, we show in Section 5.2.1 that the dust attenuation curve shows strong galaxy-to-galaxy variation.

Hao et al. (2011) perform the above conversion, from observed UV slope and luminosities to the dust-free UV luminosity, finding an RMS scatter of 0.23 dex between the observed and predicted Hα. However, they use the spectroscopic Balmer decrement to correct the Hα luminosity, rather than attempting to estimate ${A}_{{\rm{H}}\alpha }$ directly. They find that the scatter between ${A}_{{\rm{H}}\alpha }$ and A_FUV without FIR fluxes to be on the order of ∼0.6 magnitudes or ∼0.25 dex (see their Figure 15): this means that using UV-MIR photometric data only would increase the scatter further.

We conclude that the full UV-IR SED modeling approach that we present in this work is the most robust way to estimate Hα luminosities from the broadband photometry, as it requires no assumptions about the intrinsic stellar populations or the shape of the attenuation curve. We note that having direct L_IR measurements from Herschel reduces the uncertainty in both approaches, though it may be more helpful for the IRX–β approach, as L_IR recovery from UV-MIR imaging alone after applying the appropriate priors on the shape of the IR SED is excellent (see Appendix C).

5.1.3. Model Error Budget in the Emission Line Predictions

In this section we investigate the scatter between the predicted and observed Balmer emission line luminosities in Figure 9, and to what extent this scatter is consistent with the uncertainty on the model predictions.

We first compare the residuals in Hα to the residuals in Hβ in Figure 16. Since the intrinsic flux ratio between the Balmer lines is nearly invariant, the correlations between emission line residuals are informative. Correlated residuals are due to errors in normalization of the Balmer lines: intrinsic emission line strength (errors in SFR or stellar metallicity), normalization of the dust attenuation, or errors in the underlying continuum absorption. Uncorrelated errors are due to errors in the reddening curve, which is a function of ${\hat{\tau }}_{\lambda ,1}$ , ${\hat{\tau }}_{\lambda ,2}$ , and n in the Prospector-α model.

**Figure 16.** Correlation in the residuals between Hα and Hβ luminosities. Since the ratio of Hα and Hβ photons is fixed by atomic physics, the only parameter which has a different effect on Hα and Hβ residuals is the reddening curve. The strong correlation in the Hα and Hβ residuals shows the scatter comes from errors in the production rate of Balmer photons (from errors in the SFR, metallicity, or other effects), which dominate over errors in the reddening curve. Galaxies are color-coded by their BPT classification. Only galaxies with S/N (Hα, Hβ) $\gt 5$ are shown.
Download figure:
Standard image High-resolution image

Figure 16 makes it clear that the dominant source of scatter is from the normalization of the Balmer lines, and errors in the reddening curve are a secondary contribution. This can also be calculated directly from the observed scatter in the Balmer decrements in Figure 11: reddening errors add a scatter of ∼ 0.08 dex to the Hα measurements, compared to an overall scatter of 0.18 dex.

We also investigate whether the observed scatter in emission line luminosities is consistent with predictions from the Prospector-α model. Each fit to the photometry results in some posterior probability distribution for the emission line fluxes. The observational errors on emission line fluxes are much smaller than the typical width of this posterior probability function.¹⁰ Thus, we can use the observations to test the model posteriors.

In Figure 17, we show the sum of the model posteriors for the Hα luminosity. The posteriors are stacked such that the median of each posterior is located at the origin, and the global offset is removed (the offset is shown in the corner of Figure 9). We also show the location of the observations for each of these fits, again stacked relative to the median of each posterior. If the model posteriors accurately describe all relevant emission line production and attenuation processes in these galaxies, then the two histograms would match each other to within Poisson errors.

**Figure 17.** Extent to which the model posterior probability functions accurately capture the observed distribution of Hα and Hβ. Red histograms show the location of the observed Hα and Hβ luminosities relative to the center of the model posterior, with the offset global between the observations and model removed. Blue histograms show the stacked model posterior probability functions. If the adopted `Prospector-`α model perfectly described emission line production and attenuation, these two histograms would overlap within Poisson errors. The `Prospector-`α error bars describe the distribution of true emission line fluxes with good accuracy: the Balmer line distributions are correct to within 20%–30%, . Only galaxies with S/N (Hα, Hβ) $\gt 5$ are shown.
Download figure:
Standard image High-resolution image

The agreement between the two histograms is good, suggesting that the posterior probability distributions are accurate. This means that the scatter in Figure 16 is largely consistent with the errors. We quantify this statement by calculating the fraction of observations that fall within the model 1σ range (i.e., the 16th and 84th percentiles of the posterior) and 2σ range (the 2.5th and 97.5th percentiles of the posterior). These are shown in the upper-left corner of Figure 17. This test shows that the model error bars are accurate to within 20%–30% for Hα and Hβ.

5.2. Dust Properties

Here we explore correlations in the model dust properties. We show a strong trend in the relationship between dust optical depth and the shape of the attenuation curve, and compare to expectations from simulations in Section 5.2.1. In Section 5.2.2, we demonstrate the recovery of the PAH mass fraction from the photometric fits, and explore trends between the stellar mass and the PAH mass fraction.

5.2.1. Relationship Between the Dust Attenuation Curve and the Dust Optical Depth

A relationship between the dust attenuation optical depth and the shape of the attenuation curve in galaxies has been predicted both analytically (Bruzual et al. 1988) and numerically, with radiative transfer codes (Witt et al. 1992; Witt & Gordon 2000; Gordon et al. 2001; Chevallard et al. 2013). The wavelength-dependent scattering properties of dust and the mixed star-dust geometry both contribute to this effect. In brief, at low optical depths, red light is scattered more isotropically and escapes the galaxy, while blue light is more forward-scattered and is subjected to more absorption. This results in a net steepening of the attenuation curve. At high optical depths, in a mixed star-dust geometry, the observed radiation primarily comes from stars at an optical depth less than unity. Redder photons travel more physical distance than bluer photons before absorption or scattering; the net effect is that larger optical depths result in a grayer attenuation curve.

In Figure 18, we plot the optical depth of the diffuse dust versus the slope of the attenuation curve at 5500 Å (d logτ/dλ), compared to a compilation of dusty radiative transfer models from Chevallard et al. (2013). The qualitative agreement is remarkable: as the observed dust optical depth increases, the observed attenuation curves become grayer. There is clear evidence for a systematic relationship between the dust optical depth and the shape of the attenuation curve, although the models predict a somewhat grayer attenuation curve than is observed at large optical depths. We caution that there is a correlation between the optical depth of the dust and the attenuation curve for individual galaxies which could partially mimic this relationship; however, it is not large enough to explain the observed sample trend.

The dust attenuation curve is often a fixed parameter when fitting the photometric SEDs of galaxies, even for complex SED fitters with many free parameters. Yet there is substantial evidence that the attenuation curve varies: both theoretically, as discussed above, and observationally, with serious differences between the Milky Way (Cardelli et al. 1989), Small Magellanic Cloud (SMC; Prevot et al. 1984; Bouchet et al. 1985), and Large Magellanic Cloud (LMC; Fitzpatrick 1986), as well as in high redshift galaxies (Buat et al. 2012; Reddy et al. 2015; Salmon et al. 2016).

The assumption of a fixed attenuation curve can have profound effects on the results of SED fits. For example, assuming a fixed Calzetti et al. (2000) attenuation curve will underpredict the 1500 Å luminosity dust correction by ∼2 at low optical depths, and overpredict the dust corrections by a factor of 2–5 at high optical depths (Salmon et al. 2016). Scatter in the relationship between L_IR/L_UV and UV slope could be reduced substantially by taking into account the variation in the slope of the attenuation curve (Buat et al. 2012). Assuming the shape of the attenuation curve also affects the bulk properties of the galaxy population, Marchesini et al. (2009) show that the low-mass slope of the stellar mass function steepens and the normalization decreases by 20%–50%.

Given that the shape of the dust attenuation curve is well-recovered in photometric mock tests (Appendix A), the excellent accuracy in which the model fits predict the reddening from the Balmer decrements (Figure 11), and the biases introduced by a fixed attenuation curve in SED fits, it is wise to use a variable dust law when fitting galaxy SEDs.

5.2.2. The PAH Mass Fraction

In this section we investigate the PAH mass fraction, parameterized in the Draine & Li (2007) dust emission model as Q_PAH. Many studies use broadband photometry in the MIR to estimate total IR luminosities, either by using fixed IR templates (e.g., Daddi et al. 2007; Wuyts et al. 2008; Buat et al. 2009; Whitaker et al. 2014) or by fitting IR models to a limited wavelength range (e.g., Chang et al. 2015). The PAH mass fraction is of particular interest because PAH emission can dominate the integrated flux at MIR wavelengths. Thus, the natural variation of the PAH mass fraction in galaxies has serious consequences for estimating SFRs from photometry: unless Q_PAH can be constrained with available photometry, systematic errors in SFR estimates will be introduced by assuming a fixed IR template.

In Figure 19, we investigate the recovery of Q_PAH from photometric fits to the full SED. We show the Spitzer IRS spectra stacked in bins of median Q_PAH, which come from the posteriors of the fits to the broadband photometry. Only galaxies with log(M) $\gt \,10$ are included, to minimize confounding trends in stellar mass. The bin sizes are chosen so that the number of galaxies is the same for each bin (N = 20). Individual spectra are normalized such that the total flux observed in the Spitzer IRS spectrum is equal to unity, prior to stacking: this means that each galaxy contributes an equal amount of information to the stacks. The data are smoothed for presentation purposes. The following PAH features dominate the total PAH luminosity, and are marked with dashed lines: the 6.2 μm (∼10% of L_PAH), 7.7 μm (∼30% of L_PAH), 8.3 μm + 8.6 μm (∼15% of L_PAH), 11.3 μm (∼15% of L_PAH), 12.6 μm (∼10% of L_PAH, but blended with [Ne ii]), and 17 μm (∼10% of L_PAH) (Smith et al. 2007). Strong nebular emission lines are masked in gray bands.

**Figure 19.** Observed *Spitzer* IRS spectra, stacked in bins of median Q_PAH. Q_PAH is derived from the posteriors of the fits to the broadband photometry, which is a completely independent measurement from the spectra. Prominent PAH emission features are marked with dashed lines. Strong nebular emission lines are masked in gray bands. The strong correlation between the photometric Q_PAH and the observed luminosity in PAH features is an excellent validation of the photometric fits. Only galaxies with log(M) $\gt 10$ are included, to minimize confounding trends in stellar mass.
Download figure:
Standard image High-resolution image

It is clear from Figure 19 that the EW of the PAH features increases as the Q_PAH from the photometric fits increases. This is an excellent validation of the photometric Q_PAH values derived from fitting the full SED. This result implies that PAH emission can be cleanly distinguished from other sources of MIR emission (e.g., circumstellar dust around AGB stars, or AGN; Lange et al. 2016) with full-SED modeling.

Measuring total IR luminosity in PAH features for individual galaxies, rather than stacks, would provide additional compelling spectral evidence for the photometric Q_PAH values. However, this measurement is challenging: it is difficult to define the continuum in the MIR region, resulting in biased EWs by factors of ∼2–3 (Smith et al. 2007). Furthermore, individual PAH EWs are sensitive to the intensity of the radiation field (particularly the presence of an AGN) and the galaxy metallicity (Smith et al. 2007; O'Dowd et al. 2009), which makes the interpretation of PAH EWs complex. A full validation of the IR SED models would involve a simultaneous fit to the observed Spitzer IRS, Akari spectra, and the photometry, with the Draine & Li (2007) models and a model for the IR emission of the AGN. We defer this more careful treatment to a later investigation.

Figure 20 shows the relationship between stellar mass and Q_PAH. There is a clear link with stellar mass, such that Q_PAH is very low below a stellar mass of log(M/M ${}_{\odot }$ ) ∼ 8.8. Assuming a tight relationship between gas-phase metallicity and stellar mass, this is consistent with previous observations which show the PAH mass fraction is very low for galaxies below a gas-phase metallicity of log(O/H) + 12 ∼ 8.1, with considerable natural variation above this threshold (Roche et al. 1991; Engelbracht et al. 2005; Hunt et al. 2005; Draine et al. 2007). Full-SED modeling of the broadband photometry reproduces these trends in the PAH mass fraction. We note that the MIR is particularly well-sampled for these galaxies, with Spitzer/IRAC at 3–8 μm, WISE at 3–22 μm, and Spitzer/MIPS at 24 μm, and that Q_PAH is very well-recovered in mock tests; see Appendix A.

It has been demonstrated in this section that Q_PAH, and trends in Q_PAH with stellar mass, can be reproduced with a combination of MIR photometry and full-SED modeling. This, along with encouraging mock test results in Appendix A, suggests that Q_PAH is accurately recovered by fits to the photometry. It is likely that full-SED modeling with MIR photometry can tighten the 8 μm/L_IR relationship (Elbaz et al. 2010, 2011), and thus refine SFR estimates, by including constraints from optical–IR energy conservation. It has recently been shown at z = 2 that L_IR can be accurately estimated with the FSPS stellar populations combined with the Draine & Li (2007) dust emission model (Shivaei et al. 2016a). This is particularly pertinent in light of recent results from Shivaei et al. (2016b), which demonstrate that systematic trends in PAH emission with mass bias global SFR estimates by ∼30% at $z\sim 2$ . In future work, we will explore the use of Prospector to estimate accurate L_IR, 8 μm/L_IR, and SFRs of $z\gt 1$ galaxies.

6. Past and Future Changes in the `Prospector-`α Model

Here, we include a short list of model SFH choices that were found to be impractical in the adopted fitting framework. This list may be useful to those building models within the Prospector interface, or other such interfaces, in future work. We also briefly discuss some of the planned additions to the Prospector-α model. The Prospector parameter files for the Prospector-α model used to create the results in this paper are the "brownseds_np" files at https://github.com/jrleja/threedhst_bsfh/, Github commit hash 84d32c0ece57b11d0c8424b44be160eb5a735537.

6.1. Unsuccessful SFH Parameterizations

Fitting complex galaxy models to broadband photometry is a poorly constrained problem, and as a result, the model posteriors often have substantial degeneracies. This lack of constraints, combined with the realities of an imperfect sampler and a finite amount of computing time, mean that the output posteriors can be sensitive to the parameterization of the model SFH. Listed here are a number of SFHs that we attempted to incorporate into the model, which, while accurately describing physical states which are known to occur in galaxies, prove difficult or impractical to implement.

1.
Two separate galaxy SEDs, with distinct declining-τ SFH models and dust attenuations, which are then summed to create the composite SED. This model was intended to mimic the spatially distinct bulge-and-disk system regularly seen in resolved images of galaxies. Each model component would have a unique SFH and dust attenuation model, to simulate e.g., a star-forming disk and a quiescent bulge. However, mock tests showed that this model had too many multimodalities for the emcee sampler to sample efficiently in a reasonable amount of time. With only 5% errors on the photometry, the sampler would find solutions widely separated in parameter space with significantly different masses and SFRs than the input values, and also with better ${\chi }^{2}$ values.
2.
A delayed τ SFH with a truncation, followed by a linear ramp, as described in Simha et al. (2014). This parameterization works very well in describing the observed photometry. However, the SFH posteriors in this parameterization are highly multimodal, and often have multi-dimensional flared, curving degeneracies. In order to achieve accurate posteriors on masses and SFRs using this parameterization, the sampler must properly explore multiple widely separated solutions with large degeneracies. This meant that while the best-fit results were often reasonable, the error bars were substantially underestimated. These truncated error bars also affected the accurate recovery of metallicity and dust parameters, which are highly sensitive to the SFH posteriors due to the age–dust–metallicity degeneracy.
3.
A single burst component in the SFH, with two parameters describing the time and strength of the burst. Without data that can constrain changes in SFH on shorter timescales, such as spectra, bursts in the SFH would often create unnecessary multimodalities and would decouple SFR(10 Myr) from SFR(100 Myr) in ways that are not uniquely constrained by the broadband photometry. It is more efficient to use the adopted nonparametric SFH and tailor the time bins to the resolving power of the data, as described in Section 3.1.1 and B. D. Johnson et al. (2017, in preparation).

6.2. Future Improvements

Here we briefly describe ways in which the Prospector-α model will be expanded upon in future work.

1.
Galaxies with strong AGN contributions are known to have issues in the current Prospector-α model; in particular, red MIR colors indicative of obscured AGNs are currently instead fit with very strong 100-300 Myr components, as the emission from circumstellar dust around AGB stars can mimic the MIR colors of AGNs (Alatalo et al. 2016). The Nenkova et al. (2008a, 2008b) AGN models implemented in the FSPS source code will be tested and implemented in future versions of the Prospector-α model.
2.
The SFH at ≳1 Gyr is poorly constrained from the broadband photometry (see the mock tests in Appendix A). When the data do not put strong constraints on a parameter, the posteriors are prior-dominated. The current priors on the SFH parameters have a Gaussian shape on the logarithm of the sSFR in each bin, log(SFR_n/M_total), as shown in Section 3.1.1. The center of the prior is at log(1/t_univ) ∼ −10.1 yr⁻¹ and the FWHM is roughly 1.5 dex. However, it is known from observations of distant galaxies that the SFRs were much higher in the past (e.g., Madau & Dickinson 2014), so this prior which places a preference on constant SFHs does not agree with observations. The effects of the SFH prior on the model posteriors will be explored in future work.

7. Conclusion

In this paper, we present the Prospector-α model for fitting the SEDs of galaxies, based on the FSPS stellar populations code and built in the Prospector inference framework. The Prospector-α model includes a six-component nonparametric SFH, nebular emission, a variable attenuation curve, and dust attenuation and re-radiation. It uses the on-the-fly model generation and MCMC implementation within the Prospector framework to explore the model posteriors in a thorough, unbiased manner. We demonstrate the power of this framework by fitting the Brown et al. (2014) broadband photometry, and comparing the model predictions to aperture-matched optical spectroscopy. Prospector-α predicts observed Hα luminosities from fits to the photometry with a scatter of 0.18 dex and an offset of ∼0.1 dex, a strong verification of the model SFRs, dust attenuation, and stellar metallicities, across a wide range of galaxy types and stellar masses. We also find good agreement in SFH indicators (D_n4000 and Hδ absorption), direct tests of the reddening curve (Balmer decrements), stellar metallicities, and PAH mass fractions.

The accurate prediction of Hα luminosities is a key component of these results, and it is explored in some detail. The Hα luminosities are sensitive to the recent SFR, the stellar metallicities, and the dust attenuation in the Prospector-α model. It is demonstrated that including stellar metallicities is key to achieving tight scatter in Hα comparisons. It is shown that the remaining scatter of 0.18 dex is consistent with the width of the model posteriors to within 20%–30% for Hα and Hβ, and that it is not dominated by errors in the reddening curve. The low scatter in this comparison implies that SFRs do not vary strongly over 100 Myr timescales.

The accuracy of recovered trends in the dust attenuation and re-emission model is also explored. In Section 5.2.1, it is shown that the Prospector-α model, using flat priors, naturally recovers the relationship between the dust optical depth and the shape of the attenuation curve predicted in dust radiative transfer models. In Section 5.2.2, it is demonstrated using stacked Spitzer IRS spectra that the PAH mass fractions are well constrained by the full-SED fits.

We test the extent to which the model posteriors describe the spread in true galaxy properties by performing mock tests (Appendix A). It is demonstrated that the Prospector-α model produces accurate posteriors with well-estimated error bars, even for model parameters which are poorly constrained by the data. The Prospector-α model performs better at predicting Hα luminosities than SED-fitting codes of similar complexity (Appendix B), largely due to different choices in the dust and SFH implementation. It is also shown that the FIR Herschel fluxes can be predicted from fits to the UV to MIR photometry alone after applying reasonable physical priors to the shape of the IR SED (Appendix C).

Broadband galaxy SEDs are composed of a complex mix of stellar populations, gas, and dust, but models which contain this complexity are often only somewhat constrained by the data. Thus, the parameters derived from these SEDs can be highly sensitive to the model parameterization. Model posterior checks for SED fitters are the key ingredient necessary to move the field forward toward precise, unbiased estimates of stellar masses and SFRs across a variety of redshifts. Here we have performed these tests for stellar metallicities, SFRs, SFHs, dust reddening, and PAH emission, and found excellent agreement with the data. However, important components of the Prospector-α model remain untested, in particular the stellar masses and the behavior of the model at higher redshifts. These aspects will be explored in future work.

The demonstrated flexibility and accuracy of the Prospector framework strongly motivates a fresh look at basic properties of the galaxy population, such as the stellar mass function and star-forming sequence as a function of redshift. Furthermore, Prospector allows broadband photometry to describe galaxy properties to a level of detail that was previously only reserved for spectra. The stellar mass–metallicity relationship, systematic variations in SFH with stellar mass and SFR, variations in the PAH fraction with stellar mass and metallicity, and variations in the total dust attenuation and attenuation curve with galaxy properties can all now be accurately explored with broadband photometry alone. This tremendously increases the sample sizes available to these types of studies. Going even further, the simultaneous fitting of broadband photometry and spectra can put tight constraints on quantities that would otherwise be highly degenerate, such as the timescale of SFH variations and the ratio between stellar and nebular attenuation. We plan to re-examine SFR–mass relations and the evolution of the stellar mass function to see if this framework is capable of producing self-consistent SFRs and masses over most of cosmic time.

We thank Marijn Franx, Eric Bell, Frank van den Bosch, and Nikhil Padmanabhan for valuable discussion and comments, and Michael Brown for the excellent spectrophotometric catalog upon which this work is based. C.C. acknowledges support from NASA grant NNX13AI46G, NSF grant AST-1313280, and the Packard Foundation. P.V.D. acknowledges support from NASA grants NNX13A146G and HST-GO-12177. This work used the Extreme Science and Engineering Discovery Environment (XSEDE, Towns et al. 2014), which is supported by National Science Foundation grant number ACI-1053575. This time was granted under XSEDE allocations TG-AST140054 and TG-AST150015. Some of the computations in this paper were run on the Odyssey cluster supported by the FAS Division of Science, Research Computing Group at Harvard University.

Appendix A: Mock Tests

In this section we generate and fit 200 mock galaxies with the Prospector-α model, with the goal of investigating which model parameters can be accurately constrained with the available Brown et al. (2014) photometry.

To generate mock galaxy SEDs, free parameters from the Prospector-α model are selected randomly within the model priors. The model priors are described in detail in Section 3.1. In order to produce galaxies with realistic amounts of dust, we allow the diffuse dust optical depth to only vary within $0.0\lt {\tau }_{\mathrm{diffuse}}\lt 0.5$ , and, for simplicity, we also constrain mass to be $10\,\lt$ log(M/M ${}_{\odot }$ ) $\lt \,11$ . These changes are made during mock generation, but are not reflected in any adjustment of the priors during model fitting.

Photometric noise is simulated by perturbing the mock photometric fluxes. For each photometric band, the flux is adjusted by sampling from a Gaussian with a mean of zero and a width equal to 5% of the flux value. The statistical properties of this noise are reported accurately to the sampler.

Figure 21 examines the ability of the fitter to recover the free model parameters for the mock galaxies. The stellar mass, 0–100 Myr SFH, diffuse dust content, stellar metallicity, and dust emission parameters are all well-constrained by the available photometry. The birth-cloud dust, however, is not uniquely determined from the mock photometry: this parameter has a subtle effect on the photometry even in star-forming galaxies (see Figure 3), and for quiescent galaxies with no recent star formation, will be completely unconstrained. The diffuse dust index, which controls the shape of the attenuation curve, is recovered with considerable scatter, though with no significant bias.

The SFH beyond 100 Myr is recovered modestly well: notably, the ability to accurately recover the SFH becomes increasingly poor with lookback time. This is consistent with the fact that older stellar populations have increasingly similar SED shapes, and also tend to be out-shone by younger generations of stars. These results are also consistent with mock tests of another six-component nonparametric SFH implementation in Zhang et al. (2012). Notably, fractional SFH parameters $\gtrsim 0.4$ have poor recovery rates. This is in part due to the implicit Dirichlet prior from the fractional parameterization, which favors a constant SFH over a dominant contribution from a single SFH bin: see Section 3.1.1 for an extensive discussion of this. This prior does not affect mass or SFR estimates significantly, as demonstrated by the mock tests shown here, but the SFH priors would be very relevant in any future studies attempting to recover the SFH at ≳1 Gyr from broadband photometry.

Some model parameters will not be uniquely determined from the photometry alone, or may only be recoverable for particular SFHs. As long as these parameters are assigned broad posteriors which accurately describe the constraining power of the data, then the fitter is functioning properly. Figure 22 examines the distribution of true model parameters relative to the model posteriors. Similar to Figure 17, we show the sum of the model posteriors for each model fit parameter. The posteriors are first shifted such that the median of each posterior is located at the origin. We also show a histogram of the truth values for each of these parameters. If the model posteriors are accurate, then the two histograms will match each other.

**Figure 22.** Extent to which the model posterior probability functions accurately capture the true distribution of mock parameters. Red histograms show the location of the true mock galaxy parameters relative to the center of the model posterior. Blue histograms show the stacked model posterior probability functions. Error bars capture the model fit parameters to within 10% for all fit parameters except for the stellar metallicity and the dust attenuation curve, where the error bars are incorrect by 15% and 25%, respectively. This is likely due to large-scale parameter multimodalities arising from the well-known dust–metallicity degeneracy (see Section 3.1.6.
Download figure:
Standard image High-resolution image

To quantify this test, we calculate the fraction of truths that fall within the posterior 1σ range (i.e., the 16th and 84th percentiles of the posterior) and 2σ range (the 2.5th and 97.5th percentiles of the posterior), shown in the upper-left corner of the panels in Figure 22. The posteriors are accurate to within ∼10% for all model parameters except the stellar metallicity and the shape of the attenuation curve. The posteriors for stellar metallicity and the diffuse dust index often exhibit multimodality, which is challenging for the emcee sampling algorithm to sample properly. The recovery of these parameters would likely be improved by using nested sampling algorithms such as MultiNest (Feroz & Hobson 2008).

In addition to recovering the free model parameters, it is also interesting to evaluate the recovery of parameters derived from the SFH, such as SFR, sSFR, and the half-mass assembly time. The recovery rate for these derived parameters is shown in Figure 23, and a posterior test analogous to Figure 22 is shown in Figure 24.

**Figure 23.** Recovery rate for select physical properties of the mock galaxies. The x-axes show the true physical properties of the mock galaxies, while the y-axes show the marginalized outputs from the `Prospector` fits to the mock photometry. SFRs and sSFRs are well-recovered, while half-mass assembly times are more uncertain.
Download figure:
Standard image High-resolution image

**Figure 24.** Extent to which the model posterior probability functions accurately capture the true distribution of mock parameters. Red histograms show the location of the true mock galaxy parameters relative to the center of the model posterior. Blue histograms show the stacked model posterior probability functions. Since the fits to the mock photometry accurately describe the mock parameters, these two histograms overlap within Poisson errors. Error bars describe the distribution of mock values to within 10% for SFR, sSFR, and half-mass time.
Download figure:
Standard image High-resolution image

SFR and sSFR are well-recovered by the mock tests, with the recovery rate increasing with increasing SFR/sSFR. This makes sense: the fraction of young stars is recovered with increasing fidelity as the young stars dominate more and more of the light. The half-mass time is poorly constrained from the available photometry, consistent with the modest recovery of the SFH parameters in Figure 21. The posteriors for all three of these parameters are well-behaved, and the error bars are accurate to within 10%.

Finally, in Figure 25, we demonstrate the capacity of the fitter to recover key spectral parameters (Hα, and Hβ nebular emission fluxes, D_n4000, and the Balmer decrement) from fits to the photometry. The overall prediction of these spectroscopic features is of excellent quality. With 5% photometric errors, the scatter in the Hα and Hβ nebular emission lines is similar to the observed scatter in the Hα and Hβ nebular emission (0.18 dex, Figure 9); both scatters are consistent with the scatter predicted from the width of the model posteriors. The observational S/N > 5 cut is relevant to this comparison, as the ability of the model to predict Hα nebular emission increases with increasing sSFR. The average sSFR of the mock galaxies is lower than the average sSFR of the Brown et al. (2014) sample (see Section 4.3), so the measured scatter in mock nebular emission flux predictions is an upper limit to the scatter in a truly analogous sample. The recovery of D_n4000 is slightly improved relative to the observations, with a decrease in scatter by 0.05. This is expected, as D_n4000 is sensitive to the SFH. The SFH parameters are drawn directly from the SFH priors in the mock tests, whereas this is not guaranteed for observed galaxies. The Balmer decrement, parameterized by the magnitude of the differential extinction between Hα and Hβ wavelengths, is better recovered in the mock tests, with a scatter of 0.02 dex compared to 0.08 dex in the observations. This is likely because the dust geometry in real galaxies is considerably more complex than our model approximation.

Appendix B: Comparison to `MAGPHYS`

In this section, the results of running the full-SED fitting code MAGPHYS (da Cunha et al. 2008) on the Brown et al. (2014) galaxy catalog are described. MAGPHYS is chosen as it is similar to Prospector-α in a number of ways: it has a similar number of free parameters, it simultaneously models the dust and stellar emission from galaxies, and it has a Bayesian implementation. In this section we compare basic physical parameters derived from the MAGPHYS fits to those from Prospector-α, contrast the ability of Prospector-α and MAGPHYS to predict Hα fluxes using the Kennicutt (1998) prescription, and show how both model SFRs and dust attenuations contribute to the difference in Hα predictions between MAGPHYS and Prospector-α. We note that the MAGPHYS results we show in this section are very similar to the MAGPHYS fits published in Brown et al. (2014).

In brief, MAGPHYS fits Bruzual & Charlot (2003) stellar populations models to the observed SEDs of galaxies. The MAGPHYS stellar templates are generated with exponentially declining SFHs with random bursts superimposed, and a variable stellar metallicity. MAGPHYS uses a two-component Charlot & Fall (2000) dust attenuation model, with both the birth-cloud and diffuse dust components as free parameters. It employs energy balance to re-emit the energy absorbed by the dust as thermal radiation in the IR. The shape of the IR SED is controlled by four free parameters in the MAGPHYS model, described in detail by da Cunha et al. (2008). A Chabrier (2003) IMF is the default in MAGPHYS, which is the same as used in Prospector-α.

Particularly relevant physical differences between the two codes include the SFH implementation (nonparametric SFHs in Prospector-α, compared to parametric SFHs with random bursts superimposed in MAGPHYS), the dust attenuation curve (a free parameter in Prospector-α, and a fixed parameter in MAGPHYS), the parameterization of the SED from dust emission, the inclusion of nebular emission lines in the model broadband photometry (CLOUDY models in Prospector-α, but not modeled in MAGPHYS). The model priors also have substantial differences: for example, MAGPHYS has a template library of generated SFHs, which imposes some fixed prior on the SFH fits, while Prospector generates stellar populations on-the-fly and can freely explore parameter space, at the cost of increased runtime.

Figure 26 compares important physical parameters derived from the SED fits between MAGPHYS and Prospector-α. We plot the marginalized MAGPHYS parameters with errors where they are available (mass, SFR); otherwise, we plot the best-fit values. On average, MAGPHYS predicts that galaxies are considerably less massive (0.25 dex), with slightly higher SFRs (0.1 dex). Despite these galaxies having excellent photometric wavelength coverage and high S/N photometry, there is a biweight scatter of a factor of 2.5 in derived SFRs between MAGPHYS and Prospector-α. There is little agreement in the photometric metallicities between the two codes. On average, the optical depth of the diffuse dust components agree reasonably well, though with a substantial amount of scatter. Finally, MAGPHYS assigns considerably more dust attenuation toward H ii regions, attenuating the flux from young stars by an extra a factor of ${e}^{1.11}$ = 3.1, and producing on average 0.15 dex of additional reddening between Hα and Hβ wavelengths.

To compare the recovery of Hα, we calculate the Kennicutt Hα luminosity using Equation 10. We adopt the average SFR over the past 10 Myr from each model as the SFR. We apply the model dust attenuation to the resulting Hα luminosity, and compare the attenuated model Hα luminosity to the observations. The results are shown in Figure 27. Overall, MAGPHYS systematically underpredicts the Hα fluxes by 0.31 dex, with a scatter of 0.54 dex.

**Figure 27.** Hα luminosities in ${L}_{\odot }$ measured from the aperture-matched spectra compared to model Hα luminosities predicted from fits to the broadband photometry. The model Hα luminosity is calculated from the model SFR using Equation 10, and attenuated by the model dust properties. The left panel shows Prospector-α, the right panel shows `MAGPHYS`. Galaxies are color-coded by their BPT classification. Only galaxies with S/N (Hα, Hβ) $\gt 5$ are shown. The left panel is a reproduction of the left panel in Figure 14 for the convenience of the reader.
Download figure:
Standard image High-resolution image

There are systematic trends in the MAGPHYS comparison, including a "parallel" set of galaxies offset from the 1:1 relationship by ∼1 dex. We examine the source of systematics in Figure 28 by looking at the inputs to the Hα luminosity calculation from each code. The majority of the scatter between the two models comes from the difference in model SFRs over the last 10 Myr. The offset between the predictions comes from the much larger dust attenuation toward H ii regions predicted by the MAGPHYS fit. The MAGPHYS reddening model also systematically overpredicts the observed Balmer decrements (not shown), confirming thatMAGPHYS predicts too much dust attenuation in these galaxies.

**Figure 28.** Inputs to the empirical Hα calculation from both `MAGPHYS` and `Prospector-`α. There is considerable scatter in the SFRs over 10 Myr timescales, 0.2 dex more than the scatter in SFRs over 100 Myr timescales (Figure 26). This may be due to the random bursts superimposed on the `MAGPHYS` SFHs. There is both scatter and a large offset in the dust attenuation at Hα wavelengths, such that `MAGPHYS` predicts much more dust attenuation. Galaxies are color-coded by their BPT classification. Only galaxies with S/N (Hα, Hβ) $\gt 5$ are shown.
Download figure:
Standard image High-resolution image

**Figure 28.** Inputs to the empirical Hα calculation from both `MAGPHYS` and `Prospector-`α. There is considerable scatter in the SFRs over 10 Myr timescales, 0.2 dex more than the scatter in SFRs over 100 Myr timescales (Figure 26). This may be due to the random bursts superimposed on the `MAGPHYS` SFHs. There is both scatter and a large offset in the dust attenuation at Hα wavelengths, such that `MAGPHYS` predicts much more dust attenuation. Galaxies are color-coded by their BPT classification. Only galaxies with S/N (Hα, Hβ) $\gt 5$ are shown.
Download figure:
Standard image High-resolution image

The difference in model dust attenuations is likely due to the lack of a limit on the ratio of birth-cloud dust to diffuse dust in MAGPHYS. The birth-cloud dust has a very small effect on the SED and is thus poorly constrained by photometry alone (see Figure 3). The dominant effect of the birth-cloud dust on the Hα attenuation in the MAGPHYS model can be inferred by comparing the difference between the total optical depth and the diffuse optical depth in Figure 26.

The origin of the difference in model SFRs is likely the different SFH prescriptions. MAGPHYS imposes a set of random bursts on top of a smoothly varying SFH, so that SFR(10 Myr) can be significantly different than SFR(100 Myr). In contrast, Prospector-α enforces a constant SFH over the previous 100 Myr. While both SFH recipes reproduce the observed SED to similar levels of accuracy (as measured by the ${\chi }^{2}$ in the fit to the photometry), they predict substantial differences in SFR indicators which probe different timescales. This is relevant when trying to predict the Hα fluxes (∼5 Myr timescales) with photometric UV fluxes (∼tens of Myr) and IR fluxes (∼hundreds of Myr). This conclusion is in agreement with Smith & Hayward (2015), which found that the best-fit MAGPHYS SFH is a poor approximation of simulation SFHs. The agreement in Smith & Hayward (2015) improves dramatically when the public version of MAGPHYS is modified to marginalize over all of the library SFHs, which mitigates the effect of the random bursts.

We conclude by suggesting that the best way to resolve the substantial difference in predicted physical properties between SED fitters is with further posterior checks against spectroscopic features. This will lead to an across-the-board improvement in the recipes and prescriptions in SED models, and consequently, in the accuracy of their predicted physical properties.

Appendix C: Effect of the Herschel Photometry

In the Prospector-α model, dust emission is tied to the UV-NIR SED by assuming energy balance: all energy attenuated by dust is re-radiated at IR wavelengths. In principle, this means L_IR can be constrained from fitting the UV-NIR SED alone. This is a crucial point, as accurate determination of L_IR is necessary for measuring accurate total SFRs and dust attenuations.

We investigate the extent to which L_IR can be recovered from the UV-MIR SED. First, more permissive priors on the dust emission parameters are adopted. This prevents L_IR from being constrained solely from a combination of the MIR fluxes and a fixed shape to the IR SED; instead, it must be determined from energy balance in the UV-NIR SED and the MIR fluxes. The full parameter space of the Draine & Li (2007) dust emission model in FSPS is used: flat priors over $0.1\,\lt$ U_min $\lt \,25$ , $0.0\,\lt$ ${\gamma }_{{\rm{e}}}$ $\lt \,1.0$ , and $0.1\,\lt$ Q_PAH $\lt \,10.0$ . Then, the 26 galaxies that are covered by Herschel PACS and SPIRE imaging are fit with the Prospector-α model. We fit them in three different ways:

1.
Including the FIR photometry, with wide IR priors.
2.
Masking the FIR photometry, with wide IR priors.
3.
Masking the FIR photometry, with constrained IR priors (described in Section 3.1.4).

In Figure 29, the posteriors for the dust emission parameters from each of the above fits are stacked. The fits to the Herschel fluxes give a tight constraint on the dust emission parameters. Fits without the Herschel fluxes and with broad priors have broader posteriors on the dust emission parameters, though ${\gamma }_{{\rm{e}}}$ and Q_PAH are constrained to some extent by the MIR photometry. Fits without the Herschel fluxes but with tight priors have posteriors that more closely resemble the posteriors of the Herschel fit.

**Figure 29.** Stacked posteriors for the dust emission parameters for all of the galaxies with FIR photometry. The left panel shows the posteriors for Q_PAH, the middle panel for U_min, and the right panel for ${\gamma }_{{\rm{e}}}$ . Posteriors from fits including the *Herschel* photometry are shown in red. These are fit with flat, wide priors on the dust emission parameters, described in the text in Appendix C. Posteriors from fits without the *Herschel* photometry with the same wide priors are shown in orange, while fits without the *Herschel* photometry but with the narrower priors described in Section 3.1.4 are shown in blue. Narrower priors are chosen based on the range of IR parameters from the fits to the *Herschel* fluxes, and are consistent with the best-fit parameters from Draine et al. (2007). When the fits without *Herschel* photometry are supplied with tighter priors, the posteriors more closely resemble the fits that include the *Herschel* photometry.
Download figure:
Standard image High-resolution image

The implications of these different dust emission posteriors are explored in Figure 30. This figure shows the median χ in each band when the FIR photometry is not fit, for the two different sets of priors. The error floor on each photometric band is 5%, so a χ of 10 represents a difference of 50%. The tight priors reproduce the FIR fluxes with less offset than the broad priors, but the UV-MIR SED is almost equally well reproduced with either.

The other three panels compare the L_IR, SFR, and the optical depth of the dust attenuation at 5500 Å between the fits with FIR photometry, and those without. When fitting without the FIR photometry and with broad priors, the L_IR and SFR are biased low by ∼0.15 dex, and the dust attenuation by ∼0.15 in optical depth. When fitting with tighter priors, this offset largely disappears. This has fascinating implications. The first is that the UV-MIR SED alone cannot uniquely determine L_IR. This is clear from the upper-left panel of Figure 30, where the UV-MIR is almost equally well fit regardless of the FIR priors, but the implied FIR fluxes have a large variance. It is also visible in the upper-right panel, where the fitter assigns large error bars on L_IR when the FIR fluxes are not fit, and very small L_IR error bars when they are fit. This is likely related to the dust energy problem (Bianchi et al. 2000; Popescu et al. 2000; Misiriotis et al. 2001; Baes et al. 2010; Holwerda et al. 2012; Mosenkov et al. 2016), where FIR fluxes can be underpredicted by factors of 3–4 when fitting only the UV-NIR emission.

The second implication is that, given that L_IR cannot be uniquely determine without FIR fluxes, it is critical to have tight priors on the shape of the FIR emission; otherwise, the large uncertainty on the range of possible FIR shapes is translated into a large uncertainty and, in this case, a bias in L_IR, SFR, and dust attenuation. This may be particularly relevant when performing full-SED modeling in the case of having limited MIR and no FIR photometry, as is often true at higher redshifts.

Given these results, we have adopted the tighter priors on the dust emission parameters in our fiducial model. These tight priors are also better descriptors of the fits to the SINGS sample in Draine et al. (2007). One potential caveat is that the Herschel photometry in the Brown et al. (2014) sample comes from overlap with the KINGFISH survey (Kennicutt et al. 2011), and that the galaxies in this overlap may not be representative of the whole Brown et al. (2014) sample.