A COMPREHENSIVE ANALYSIS OF UNCERTAINTIES AFFECTING THE STELLAR MASS–HALO MASS RELATION FOR 0 < z < 4

Peter S. Behroozi; Charlie Conroy; Risa H. Wechsler

doi:10.1088/0004-637X/717/1/379

1. INTRODUCTION

A variety of physical processes are thought to be responsible for the observed distribution of galaxy properties, and distinguishing among them is one of the principal goals of modern galaxy formation theory. Among the relevant mechanisms are those responsible for galaxy growth, such as star formation and galaxy mergers, as well as those responsible for regulating growth, including energetic feedback by supernovae, active galactic nuclei (AGNs), cosmic ray pressure, and long gas cooling times.

A fruitful approach for separating the influence of different mechanisms is to constrain the redshift-dependent relation between physical characteristics of galaxies, such as stellar mass, and the mass of their dark matter halos. This is possible because it is expected that many of these physical processes depend primarily on the mass of a galaxy's dark matter halo. By connecting galaxies to their parent halos, one is able to more clearly identify and constrain the physical processes responsible for galaxy growth.

The galaxy stellar mass–halo mass (SM–HM) relation has additional utility because many properties of both galaxies and halos are tightly correlated with halo mass. In addition, the SM–HM relation provides a mechanism for connecting predictions for the halo mass function and the mass-dependent spatial clustering of halos to the abundances and clustering of galaxies. If a model for galaxy evolution is able to reproduce the intrinsic galaxy mass–halo mass relation in the correct cosmological model, then such a model will match both the observed stellar mass function and the stellar mass-dependent clustering of galaxies. Simultaneously matching these two observational quantities and their evolution has been difficult with either hydrodynamic simulations or semi-analytic models of galaxy formation (e.g., Weinberg et al. 2004; Li et al. 2007).

There are several ways to constrain the galaxy mass–halo mass relation. The first type of approach attempts to directly measure the mass of galactic halos. Techniques include weak lensing (e.g., Guzik & Seljak 2002; Sheldon et al. 2004; Mandelbaum et al. 2006) and the use of satellite galaxy or stellar velocities as tracers of the halo potential well (e.g., Ashman et al. 1993; Zaritsky & White 1994; Prada et al. 2003; van den Bosch et al. 2004; Conroy et al. 2007). While such methods are a relatively direct probe of halo mass, they are limited in dynamic range; current observations probe halo masses from roughly 10¹²–10¹⁴ M_☉ at the present epoch, and a smaller range at higher redshift. A second approach is to identify groups and clusters of galaxies either through optical or X-ray selected cluster catalogs, and then directly measure their galaxy content (e.g., Lin & Mohr 2004; Hansen et al. 2009; Yang et al. 2007). This is limited to relatively massive halos (although for optically identified groups it could extend to lower masses as new surveys probe dimmer galaxies in large enough volumes), and it also requires accurate knowledge of the mass–observable relation (Yang et al. 2007; Hansen et al. 2009).

An alternative approach is to assume that the properties of the halo population are known, for example, from cosmological simulations, and then find a functional form relating galaxies to halos which achieves agreement with a variety of observations. This approach is less direct but can be applied over a much larger dynamic range. Halo occupation (e.g., Berlind & Weinberg 2002; Bullock et al. 2002; Cooray & Sheth 2002; Tinker et al. 2005; Zheng et al. 2007) and conditional luminosity function (CLF) modelings (e.g., Yang et al. 2003; Cooray 2006) fall into this category.

In the past decade, a number of studies have found that this latter approach can be greatly simplified using a technique called abundance matching. In its most basic form, the technique assigns the most massive (or the most luminous) galaxies to the most massive halos monotonically. The technique thus requires as input only the observed abundance of galaxies as a function of mass, namely the galaxy stellar mass function (alternatively the galaxy luminosity function) and the abundance of dark matter halos as a function of mass, namely the halo mass function. This technique has been shown to accurately reproduce a variety of observational results including various measures of the redshift- and scale-dependent spatial clustering of galaxies (Colín et al. 1999; Kravtsov & Klypin 1999; Neyrinck et al. 2004; Kravtsov et al. 2004; Vale & Ostriker 2004, 2006; Tasitsiomi et al. 2004; Conroy et al. 2006; Shankar et al. 2006; Berrier et al. 2006; Marín et al. 2008; Guo et al. 2010; Moster et al. 2010). In the context of this technique, not only central halos but also subhalos (halos contained within the virial radii of larger halos) are included in the matching process, meaning that satellite galaxies can be accounted for without any additional parameters.

Applications of the abundance matching technique have typically focused on using the default modeling assumptions to derive statistical information about the galaxy–halo connection. Uncertainties in the derived galaxy mass–halo mass relation have received little systematic attention in the context of this technique (though see Moster et al. 2010, for a recent treatment of the attendant uncertainties). An accurate assessment of the uncertainties is necessary to make strong statements regarding the underlying physical processes responsible for the derived galaxy–halo relation. In the present work, we undertake an exhaustive exploration of the uncertainties relevant in constructing the galaxy SM–HM relation from the abundance matching technique. We consider a range of uncertainties related to the observational stellar mass function, the theoretical halo mass function, and the underlying technique of abundance matching. The resulting galaxy SM–HM relation and associated uncertainties will provide a benchmark against which galaxy evolution models may be fruitfully tested.

This paper is divided into several sections. In Section 2, we detail known sources of uncertainty which may affect our results. Our methodology for modeling the effects of uncertainties is discussed in Section 3, providing simple conversions where possible to allow for different modeling choices. We present our resulting estimates of the galaxy SM–HM relation for z < 1 in Section 4 and describe the contribution of each of the uncertainties to the overall error budget. Estimates for the evolution of the relation out to z ∼ 4, for which the uncertainties are significantly less well-understood, are presented in Section 5. Finally, we discuss the implications of this work in Section 6, and summarize our conclusions in Section 7.

Stellar masses throughout are quoted assuming a Chabrier (2003) initial mass function (IMF), the stellar population synthesis (SPS) models of Bruzual & Charlot (2003), and the age and dust models in Blanton & Roweis (2007). We consider multiple cosmologies in this paper, but the main results assume a WMAP5+SN+BAO concordance ΛCDM cosmology (Komatsu et al. 2009) with Ω_M = 0.27, Ω_Λ = 1 − Ω_M, h = 0.7, σ₈ = 0.8, and n_s = 0.96.

2. UNCERTAINTIES AFFECTING THE SM–HM RELATION

Uncertainties in the abundance matching technique for assigning galaxies to dark matter halos can be conceptually separated into three classes. The first is uncertainty in the abundance of galaxies as a function of stellar mass. This class includes both uncertainties in counting galaxies due to shot noise and sample variance, as well as uncertainties in the stellar mass estimates themselves. The second class concerns the dark matter halos and includes uncertainties in cosmological parameters, the impact of baryon condensation, and substructure. Finally, there are uncertainties in the process of matching galaxies to halos arising primarily from the intrinsic scatter between galaxy stellar mass and halo mass. Each of these sources of uncertainty are described in detail below. The detailed modeling of these uncertainties is described in Section 3.

2.1. Uncertainties in the Stellar Mass Function

Galaxy stellar masses are not measured directly, but are instead inferred from photometry and/or spectra. In particular, as the observed stellar light is a function of many physical processes (e.g., stellar evolution, star formation history (SFH) and metal-enrichment history, and wavelength-dependent dust attenuation), stellar masses are estimated via complicated models to find the best fit to galaxy observations in a very large parameter space. Assumptions and simplifications in these models, along with the fact that the best fit may be only one of a number of likely possibilities, mean that there can be substantial uncertainties in calculated stellar masses.

Different types of observations can yield different uncertainties in these calculations. Spectroscopic surveys generally recover more spectral features per galaxy and more accurate redshifts than photometric surveys. However, spectroscopy requires substantially more telescope time than photometry. Therefore, spectroscopic samples tend to be limited both in area and depth, which translates into limitations in both volume and the minimum stellar mass probed. An additional problem for spectroscopic surveys, such as the Sloan Digital Sky Survey (SDSS; York et al. 2000), is that the spectra probe only the central regions of galaxies (the SDSS spectra gather on average 1/3 of the total flux from galaxies at z = 0.1). For galaxies containing both a bulge and a disk, or galaxies with radial gradients, the spectra will therefore not provide a fair sample of the entire galaxy (e.g., Kewley et al. 2005); furthermore, this bias will be a function of redshift. For these reasons, especially for robust comparisons of evolution, we confine this paper to photometric-based stellar masses; however, samples with spectroscopic redshifts are used where available.

2.1.1. Principal Uncertainties in Stellar Mass Functions

In this paper, we analyze seven main sources of uncertainties in stellar mass functions applicable to photometric surveys, listed below in rough order of importance.

1.
Choice of stellar IMF. The luminosity of stars scales as their mass to a large power (dlnL/dlnM ∼ 3.5), while the IMF, defined as the number of stars formed per unit mass, scales approximately as ξ ∝ M^−2.3 (Salpeter 1955). Thus, the light from a galaxy is dominated by the most massive stars, while the total stellar mass is dominated by the lowest mass stars. This fact, which has been known for decades (e.g., Tinsley & Gunn 1976), implies that the assumed form of the IMF at M ≲ 0.5 M_☉ will have no effect on the integrated light from galaxies, but will have a large effect on the total stellar mass. Nonetheless, constraints on the total dynamical mass of spheroidal systems provide a valuable independent check on the form of the IMF at low masses. Cappellari et al. (2006) find, for example, that the IMF proposed by Chabrier (2003) for the solar neighborhood is consistent with dynamical constraints on masses of nearby ellipticals.We do not marginalize over IMFs in our error calculations for the reason that it is relatively simple to convert from our choice of IMF (Chabrier 2003) to other IMFs. For our purposes, the IMF becomes a serious source of systematic uncertainty only if the IMF varies with galaxy properties or if it evolves. It should be noted, however, that the IMF can also introduce more complicated systematic effects associated with the inferred star formation rate (SFR), which may in turn impact stellar masses in non-trivial ways (e.g., Conroy et al. 2010).
2.
Choice of SPS model. SPS modeling efforts have grown substantially in sophistication in the past decade (e.g., Leitherer et al. 1999; Bruzual & Charlot 2003; Le Borgne et al. 2004; Maraston 2005); yet, significant uncertainties remain (e.g., Charlot et al. 1996; Charlot 1996; Yi 2003; Lee et al. 2007; Conroy et al. 2009). Treatments of convection vary, leading to different main sequence turn off times for intermediate mass stars. Advanced stages of stellar evolution—including blue stragglers, thermally pulsating AGB stars (TP-AGB), and horizontal branch stars—are poorly understood, both observationally and theoretically. The theoretical spectral libraries contain known deficiencies, especially for M giants, where the effective temperatures are low and where hydrodynamic effects become important. Empirical stellar libraries to some extent circumvent these issues, although they are plagued by incomplete coverage in the Hertzsprung–Russell diagram and difficulties associated with deriving stellar parameters. See Conroy et al. (2009) and Percival & Salaris (2009) for recent reviews.Different SPS models treat these issues differently, which can result in large systematic differences in the derived stellar mass. For instance, the model of Maraston (2005) compared to Bruzual & Charlot (2003) has systematic differences of 0.1 dex in stellar mass (Salimbeni et al. 2009; Pérez-González et al. 2008). However, Salimbeni et al. (2009) reports that the model in Bruzual (2007; with a revised treatment of TP-AGB stars) yields systematic differences in stellar mass relative to Bruzual & Charlot (2003), which ranges from 0.05 dex for 10¹¹ M_☉ galaxies to 0.3 dex for 10^9.5 M_☉ galaxies. Conroy et al. (2009) show that uncertainty in the luminosity of the TP-AGB phase can shift stellar masses by as much as ±0.2 dex.
3.
Parameterization of SFHs. In order to estimate stellar masses, model libraries are constructed with a large range in SFHs, dust attenuation (see below), and, often but not always, metallicity. The adopted functional form of the SFH is another source of systematic uncertainty, as typically very simple functional forms are assumed. Several authors have investigated various aspects of this problem. When attempting to model observed photometry, Pérez-González et al. (2008) found that a single stellar population model (in particular, star formation proportional to e^−t/τ, which is a commonly used parameterization) systematically underpredicts stellar mass by 0.18 dex compared to a double stellar population model (exponentially decaying star formation followed by a later starburst). The particular parameterization of SFHs may also lead to systematic differences as a function of stellar mass. Lee et al. (2009) analyzed a sample of mock Lyman-break galaxies at z ∼ 4–5 and found that simple SFHs produced best-fit stellar masses that were under- or overestimated by ∼±50% depending on the rest-frame galaxy color. This bias was attributed to the chaotic SFHs of the mock galaxies.
4.
Choice of dust attenuation model. Because dust reddens starlight, it is difficult to separate the effects of dust from stellar population effects, especially when fitting optical photometry. The effects of dust are also known to change depending on galaxy inclination (e.g., Driver et al. 2007). Hence, the choice of dust attenuation law has a non-trivial effect on the inferred stellar population ages and, consequently, SFHs derived from photometric and even spectroscopic surveys (Panter et al. 2007). In terms of the effect on stellar masses, Pérez-González et al. (2008) compared the dust models of Calzetti et al. (2000) and Charlot & Fall (2000), finding a systematic difference of 0.10 dex. Panter et al. (2007) found a similar difference between Calzetti et al. and models based on extinction curves from the Small and Large Magellanic Clouds. The effects of varying dust attenuation models have also been explored recently by Marchesini et al. (2009) and Muzzin et al. (2009), with similar results. We have used the stellar population fitting procedure described in Conroy et al. (2009) to compare the Calzetti et al. dust attenuation law to the dust model used in the kcorrect package (Blanton & Roweis 2007). We find a median offset of 0.02 dex but also a systematic trend such that two galaxies whose stellar masses are estimated with Calzetti dust attenuation and are separated by 1.0 dex will have kcorrect masses separated by (on average) only 0.92 dex.
5.
Statistical errors in individual stellar mass estimates. Stellar mass estimates for each galaxy are subject to statistical errors due to uncertainty in photometry as well as uncertainty in the SPS parameters for a given set of model assumptions. We treat this here as a random statistical error. While it may seem that random scatter in individual stellar masses should on average have no systematic effect, it in fact introduces a systematic error analogous to the Eddington bias (Eddington 1940) observed in luminosity functions. As the stellar mass function drops off steeply beyond a certain characteristic stellar mass, there are many more low stellar mass galaxies that can be upscattered than there are high stellar mass galaxies that can be downscattered by errors in stellar mass estimates. This asymmetric scatter implies that the drop-off in number density at high masses becomes shallower in the presence of scatter. We discuss this effect in detail in Section 3.1.2 (also see the Appendix in Cattaneo et al. 2008).
6.
Sample variance. Surveys of finite regions of the universe are susceptible to large-scale fluctuations in the number density of galaxies. This is no longer a dominant source of uncertainty for the volumes probed at low redshift by the SDSS, but it is an important consideration for higher-redshift surveys, which cover much smaller comoving volumes. Most authors who consider sample variance attempt to minimize it by averaging over several fields (e.g., Pérez-González et al. 2008; Marchesini et al. 2009). Very few surveys at z>0 attempt to estimate the magnitude of the error except by computing the field-to-field variance, which is often an underestimate when insufficient volume is probed (Crocce et al. 2010). We detail a more accurate method based on simulations to model the error arising from sample variance in Section 3.2.4.
7.
Redshift errors. Photometric redshift errors blur the distinction between GSMFs at different redshifts. While a galaxy may be scattered either up or down in redshift space, volume-limited survey light cones will contain larger numbers of galaxies at higher redshifts, meaning that the GSMF as reported at lower redshifts will be artificially inflated. Moreover, as galaxies at earlier times have lower stellar masses, surveys will tend to report artificially larger faint-end slopes in the GSMF. However, as these errors are well known, it is easy to correct for their effects on the stellar mass function, as has been done for the data in Pérez-González et al. (2008; see the Appendix of Pérez-González et al. 2005 for details on this process).

For completeness, we remark that galaxy–galaxy lensing will also result in systematic errors in the GSMF at high redshifts because galaxy magnification will result in higher observed luminosities. However, from ray-tracing studies of the Millennium simulation (Hilbert et al. 2007), the expected scatter in galaxy stellar masses from lensing is minimal (e.g., 0.04 dex at z = 1) compared to the other sources of scatter above (e.g., 0.25 dex from different model choices). For that reason, we do not model galaxy–galaxy lensing effects in this paper.

2.1.2. Additional Systematics at z>1

Recently, it has become clear that current estimates of the evolution in the cosmic SFR density are not consistent with estimates of the evolution of the stellar mass density at z>1 (Nagamine et al. 2006b; Hopkins & Beacom 2006b; Pérez-González et al. 2008; Wilkins et al. 2008a). The origin of this discrepancy is currently a matter of debate. One solution involves allowing for an evolving IMF with redshift (Davé 2008; Wilkins et al. 2008a). While such a solution is controversial, a number of independent lines of evidence suggest that the IMF was different at high redshift (Lucatello et al. 2005; Tumlinson 2007a, 2007b; van Dokkum 2008). Reddy & Steidel (2009) offer a more mundane explanation for the discrepancy. They appeal to luminosity-dependent reddening corrections in the ultraviolet luminosity functions at high redshift and demonstrate that the purported discrepancy then largely vanishes.

In contrast to results at z>1, there does seem to be an accord that for z < 1 both the integrated SFR and the total stellar mass are in good agreement if one assumes (as we have) a Chabrier (2003) IMF (see Wilkins et al. 2008b; Pérez-González et al. 2008; Hopkins & Beacom 2006a; Nagamine et al. 2006a; Conroy & Wechsler 2009).

Because of the discrepancy between reported SFRs and stellar masses in the literature, it is clear that estimates of uncertainties in GSMFs and SFRs at z>1 tend to underestimate the true uncertainties; for this reason, we separately analyze results for z < 1 in Section 4 and z>1 in Section 5 of this paper.

2.2. Uncertainties in the Halo Mass Function

Dark matter halo properties over the mass range 10¹⁰–10¹⁵ M_☉ have been extensively analyzed in simulations (e.g., Jenkins et al. 2001; Warren et al. 2006; Tinker et al. 2008), and the overall cosmology has been constrained by probes such as the Wilkinson Microwave Anisotropy Probe (WMAP; Spergel et al. 2003; Komatsu et al. 2009). As such, uncertainties in the halo mass function have on the whole much less impact than uncertainties in the stellar mass function. We present our primary results for a fixed cosmology (WMAP5), but we also calculate the impact of uncertain cosmological parameters on our error bars. We do not marginalize over the mass function uncertainties for a given cosmology, as the relevant uncertainties are constrained at the 5% level (when baryonic effects are neglected, see below; Tinker et al. 2008). Additionally, in Appendix A, a simple method is described to convert our results to a different cosmology using an arbitrary mass function. For completeness, we mention the three most significant uncertainties here.

1.
Cosmological model. The SM–HM relation has dependence on cosmological parameters due to the resulting differences in halo number densities. We investigate this both by calculating the relation for two specific cosmological modes (WMAP1 and WMAP5 parameters) and then by calculating the uncertainties in the relation over the full range of cosmologies allowed by the WMAP5 data. We find that in all cases these uncertainties are small compared to the uncertainties inherent in stellar mass modeling (Section 2.1.1), although they are larger than the statistical errors for typical halo masses at low redshift.
2.
Uncertainties in substructure identification. Different simulations have different methods of identifying and assigning masses to substructure. Our matching methods make use only of the subhalo mass at the epoch of accretion (M_acc) as this results in a better match to clustering and pair-count results (Conroy et al. 2006; Berrier et al. 2006), so we are largely immune to the problem of different methods for calculating subhalo masses. Of greater concern is the ability to reliably follow subhalos in simulations as they are tidally stripped. Two related issues apply here. The first is that it is not clear how to account for subhalos which fall below the resolution limit of the simulation. The second is that the formation of galaxies will dramatically increase the binding energy of the central regions of subhalos, potentially making them more resilient to tidal disruption. Hydrodynamic simulations suggest that this latter effect is small except for subhalos that orbit near the centers of the most massive clusters (Weinberg et al. 2008). However, while these details are important for accurately predicting the clustering strength on small scales (≲1 Mpc), they are not a substantial source of uncertainty for the global halo mass–stellar mass relation because satellites are always sub-dominant (≲20%) by number. We discuss the analytic method we use to model the satellite contribution to the halo mass function in Section 3.2.2.
3.
Baryonic physics. Recent work by Stanek et al. (2009) suggests that gas physics can affect halo masses relative to dark-matter-only simulations by −16% to +17%, leading to number density shifts of up to 30% in the halo mass function at 10¹⁴ M_☉. Without evidence for a clear bias in one direction or the other—the models of gas physics still remain too uncertain—we do not apply a correction for this effect in our mass functions. Uncertainties of this magnitude are larger than the statistical errors in individual stellar masses at low redshift, but are still small in comparison to systematic errors in calculating stellar masses.

For completeness, we note that the effects of sample variance on halo mass functions estimated from simulations are small. Current simulations readily probe volumes of 1000 (h⁻¹ Mpc)³ (Tinker et al. 2008), and so the effects of sample variance on the halo mass function are dwarfed by the effects of sample variance on the stellar mass function; we therefore do not analyze them separately in this paper.

We also remark on the issue of mass definitions. Although abundance matching implies matching the most massive galaxies to the most massive halos, there is little consensus on which halo mass definition to use, with popular choices being M_vir (mass within the virial radius), M₂₀₀ (mass within a sphere with mean density 200ρ_crit), and M_fof (mass determined by a friends-of-friends particle linking algorithm). We choose M_vir for this paper and note that the largest effect of choosing another mass algorithm will be a purely definitional shift in halo masses. We expect that scatter between any two of these mass definitions is degenerate with and smaller than the amount of scatter in stellar masses at fixed halo mass (the latter effect is discussed in Section 2.3).

2.3. Uncertainties in Abundance Matching

Finally, there are two primary uncertainties concerning the abundance matching technique itself:

1.
Nonzero scatter in assigning galaxies to halos. While host halo mass is strongly correlated with stellar mass, the correlation is not perfect. At a given halo mass, the halo merger history, angular momentum properties, and cooling and feedback processes can induce scatter between halo mass and galaxy stellar mass. This is expected to result in scatter in the stellar mass of ∼0.1–0.2 dex at a given halo mass; see Section 3.3.1 for discussion. The scatter between halo mass and stellar mass will have systematic effects on the mean relation for reasons analogous to those mentioned for statistical error in stellar mass measurements. At the high-mass end where both the halo and stellar mass functions are exponential, scatter in stellar mass at fixed halo mass (or vice versa) will alter the average relation because there are more low-mass galaxies that are upscattered than high-mass galaxies that are downscattered.
2.
Uncertainty in assigning galaxies to satellite halos. It is not clear that the SM–HM relation should be the same for satellite and central galaxies. Once a halo is accreted onto a larger halo, it starts to lose halo mass because of dynamical effects such as tidal stripping. While stripping of the halo appears to be a relatively dramatic process (e.g., Kravtsov et al. 2004), the stripping of the stellar component probably does not occur unless the satellite passes very near to the central object because the stellar component is much more tightly bound than the halo. It is clear from the observed color–density relation (Dressler 1980; Postman & Geller 1984; Hansen et al. 2009) that star formation in satellite galaxies must eventually cease with respect to galaxies in the field. It is less clear how quickly star formation ceases, and whether or not there is a burst of star formation upon accretion. All of these issues can potentially alter the relation between halo and stellar mass for satellites (although the modeling results of Wang et al. 2006 suggest that the halo–satellite relation is indistinguishable from the overall galaxy–halo relation).

3. METHODOLOGY

Our primary goal is to provide a robust estimate of the SM–HM relation over a significant fraction of cosmic time via the abundance matching technique. We aim to construct this relation by taking into account all of the relevant sources of uncertainty. This section describes in detail a number of aspects of our methodology, including our approach for incorporating uncertainties in the stellar mass function (Section 3.1), a summary of the adopted halo mass functions and associated uncertainties (Section 3.2), the uncertainties associated with abundance matching (Section 3.3), our choice of functional form for the SM–HM relation, including a discussion of why certain functions should be preferred over others (Section 3.4), and the Markov Chain Monte Carlo (MCMC) parameter estimation technique (Section 3.5). For readers interested in the general outline of our process but not the details, we conclude with a brief summary of our methodology (Section 3.6).

3.1. Modeling Stellar Mass Function Uncertainties

As discussed in Section 2, there are several classes of uncertainties affecting the way the stellar mass function is used in the abundance matching process. In this section, we discuss systematic shifts in stellar mass estimates and the effects of statistical errors on the stellar mass function.

3.1.1. Modeling Systematic Shifts in Stellar Mass Estimates

Most studies on the GSMF report Schechter function fits as well as individual data points; many also provide statistical errors. However, even when systematic errors are reported (either in Schechter parameters or at individual data points), the systematic error estimates are of limited value unless one is also able to model shifts in the GSMF caused by such errors.

Fortunately, based on the discussion in Section 2.1.1, there seem to be two main classes of systematic errors causing shifts in the GSMF.

1.
Over/underestimation of all stellar masses by a constant factor μ. This appears to cover the majority of errors, including most differences in SPS modeling, dust attenuation assumptions, and stellar population age models.
2.
Over/underestimation of stellar masses by a factor which depends linearly on the logarithm of the stellar mass (i.e., depends on a power of the stellar mass). This covers the majority of the remaining discrepancies between different SPS models and different stellar age models.

Both forms of error are modeled with the equation

$\begin{equation} \log _{10}\left(\frac{M_{\ast,\mathrm{meas}}}{M_{\ast,\mathrm{true}}}\right) = \mu + \kappa \,\log _{10}\left(\frac{M_{\ast,\mathrm{true}}}{M_0}\right). \end{equation} \tag{ 1 }$

Without loss of generality, we may take M₀ = 10^11.3 M_☉ (the fixed point of the variation between the Bruzual 2007 and Bruzual & Charlot 2003 models found by Salimbeni et al. 2009), allowing the prior on M₀ to be absorbed into the prior on μ.

For the prior on μ, we consider four contributing sources of uncertainty. We adopt estimates of the uncertainty from the SPS model (≈0.1 dex), the dust model (≈0.1 dex), and assumptions about the SFH (≈0.2 dex) from Pérez-González et al. (2008) as detailed in Section 2.1.1. Additionally, we have the variation in κ log₁₀(M₀) (at most 0.1 dex, as |κ| ≲ 0.15—see below). Assuming that these are statistically independent, they combine to give a total uncertainty of 0.25 dex, which is consistent with the accepted range for systematic uncertainties in stellar mass (Pérez-González et al. 2008; Kannappan & Gawiser 2007; van der Wel et al. 2006; Marchesini et al. 2009). For lack of adequate information (i.e., different models) to infer a more complicated distribution, we assume that μ has a Gaussian prior. As more studies of the overall systematic shift, μ, become available, our assumptions for the prior on μ and the probability distribution will likely need corrections. We remark, however, that our results can easily be converted to a different assumption for μ, as μ simply imparts a uniform shift in the intrinsic stellar masses relative to the observed stellar masses.

For the prior on κ, the result of Salimbeni et al. (2009) would suggest |κ| ≲ 0.15. As mentioned in Section 2.1.1, we found that |κ| ≈ 0.08 between the Blanton & Roweis (2007) and Calzetti et al. (2000) models for dust attenuation. Li & White (2009) finds |κ| ≲ 0.10 between Blanton & Roweis (2007) and Bell et al. (2003) stellar masses. Without a large number of other comparisons, it is difficult to robustly determine the prior distribution for κ; however, motivated by the results just mentioned, we assume that the prior on κ is a Gaussian of width 0.10 centered at 0.0.

We remark that some authors have considered much more complicated parameterizations of the systematic error. For example, Li & White (2009) consider a four-parameter hyperbolic tangent fit to differences in the GSMF caused by different SPS models, as well as a five-parameter quartic fit. However, we do not consider higher-order models for systematic errors for several reasons. First, given that second- and higher-order corrections will result only in very small corrections to the stellar masses in comparison to the zeroth-order correction (μ ≈ 0.25 dex), the corrections will not substantially affect the systematic error bars. Second, we do not know of any studies which would allow us to construct priors on the higher-order corrections. Finally, with higher-order models, there is the serious danger of over-fitting—that is, with very loose priors on systematic errors, the best-fit parameters for the systematic errors will be influenced by bumps and wiggles in the stellar mass function due to statistical and sample variance errors. Hence, the interpretive value of the systematic errors becomes increasingly dubious with each additional parameter.

3.1.2. Modeling Statistical Errors in Individual Stellar Mass Measurements

In addition to the systematic effects discussed in the previous section, measurement of stellar masses is subject to statistical errors. Even for a fixed set of assumptions about the dust model, SPS model, and parameterization of SFHs, stellar masses will carry uncertainties because the mapping between observables and stellar masses is not one-to-one. This additional source of uncertainty has unique effects on the GSMF. Observers will see a GSMF (ϕ_meas) which is the true or "intrinsic" GSMF (ϕ_true) convolved with the probability distribution function of the measurement scatter. For instance, if the scatter is uniform across stellar masses and has the shape of a certain probability distribution P, we have

$\begin{equation} \phi _\mathrm{meas}(M) = \int _{-\infty }^\infty \phi _\mathrm{true}(10^y) \, P(y - \log _{10}(M)) dy, \end{equation} \tag{ 2 }$

where y is the integration variable, in units of log₁₀ mass. As derived in Appendix B, the approximate effect of the convolution is

$\begin{equation} \log _{10} \left(\frac{\phi _\mathrm{meas}(M)}{\phi _\mathrm{true}(M)}\right) \approx \frac{\sigma ^2}{2}\ln (10)\,\left(\frac{d\log \phi _\mathrm{true}(M)}{d\log M}\right)^2, \end{equation} \tag{ 3 }$

where σ is the standard deviation of P. That is to say, the effect of the convolution strongly depends on the logarithmic slope of ϕ_true. Where the slope is small (i.e., for low-mass galaxies), there is almost no effect. Above 10¹¹ M_☉, where the GSMF becomes exponential, there can be a dramatic effect, with the result that ϕ_true is more than an order of magnitude less than ϕ_meas because it becomes far more likely that stellar mass calculation errors produce a galaxy of very high perceived stellar mass than it is for there to be such a galaxy in reality (see, for example, Cattaneo et al. 2008).

For the observed z ∼ 0 GSMF, we take the probability distribution P to be log-normal with 1σ width 0.07 dex from the analysis of the photometry of low-redshift luminous red galaxies (LRGs; Conroy et al. 2009). Kauffmann et al. (2003) found similar results regarding the width of P. This function only accounts for the statistical uncertainties mentioned above and does not include additional systematic uncertainties. In light of Equation (3), we use LRGs to estimate P because LRGs occupy the high stellar mass regime where measurement errors are most likely to affect the shape of the observed GSMF. However, the single most important attribute of the distribution P is its width; the main results do not change substantially if an alternate distribution with non-Gaussian tails beyond the 1σ limits of P is used.

For higher redshifts, we scale the width of the probability distribution to account for the fact that mass estimates become less certain at higher redshift (e.g., Conroy et al. 2009; Kajisawa et al. 2009):

$\begin{equation} P(\Delta \log _{10} M_\ast, z) = \frac{\sigma _0}{\sigma (z)} P_0\left(\frac{\sigma _0}{\sigma (z)} \Delta \log _{10} M_\ast \right), \end{equation} \tag{ 4 }$

where P₀ is the probability distribution at z = 0 (as discussed above), σ₀ is the standard deviation of P₀, and σ(z) gives the evolution of the standard deviation as a function of redshift.

Conroy et al. (2009) did not give a functional form for σ(z), but they calculate for a handful of massive galaxies that σ(z = 2) is ≈0.18 dex, as compared to σ(z = 0)≈ 0.07 dex. Kajisawa et al. (2009) performed a similar calculation (albeit with a different SPS model) of the distribution in several redshift bins; their results show gradual evolution for σ(z) out to z = 3.5 for high stellar mass galaxies consistent with a linear fit:

$\begin{equation} \sigma (z) = \sigma _0 + \sigma _z z. \end{equation} \tag{ 5 }$

The results of Kajisawa et al. (2009) suggest that σ_z = 0.03–0.06 dex for LRGs. As this is consistent with the value of σ_z = 0.05 dex which would correspond to Conroy et al. (2009), we adopt the linear scaling of Equation (5) with a Gaussian prior of σ_z = 0.05 ± 0.015 dex.

Note that the effect of this statistical error on the stellar mass function is minimal below 10¹¹ M_☉, and therefore does not affect the SM–HM relation for halos below ∼10¹³ M_☉, as discussed in Section 4.2. While this scatter does have an effect on the shape of the stellar mass function for high-mass galaxies, the qualitative predictions we make from this analysis are generic to all types of random scatter.

3.2. Halo Mass Functions

The halo mass function specifies the abundance of halos as a function of mass and redshift. A number of analytic models and simulation-based fitting functions have been presented for computing mass functions given an input cosmology (e.g., Press & Schechter 1974; Jenkins et al. 2001; Warren et al. 2006; Tinker et al. 2008). For most of our results, we will adopt the universal mass function of Tinker et al. (2008), as described below. Analytic mass functions are preferable as they (1) allow mass functions to be computed for a range of cosmologies and (2) do not suffer significantly from sample variance uncertainties, because the analytic relations are typically calibrated with very large or multiple N-body simulations.

For some purposes, it will be useful to also consider full halo merger trees derived directly from N-body simulations that have sufficient resolution to follow halo substructures. The simulations used herein will be described below, in addition to our methods for modeling uncertainties in the underlying mass function, including cosmology uncertainties, sample variance in the galaxy surveys, and our models for satellite treatment.

3.2.1. Simulations

For the principal simulation in this study (L80G), we used a pure dark matter N-body simulation based on Adaptive Refinement Tree (ART) code (Kravtsov et al. 1997; Kravtsov & Klypin 1999). The simulation assumed flat, concordance ΛCDM (Ω_M = 0.3, Ω_Λ = 0.7, h = 0.7, and σ₈ = 0.9) and included 512³ particles in a cubic box with periodic boundary conditions and comoving side length 80 h⁻¹ Mpc. These parameters correspond to a particle mass resolution of ≈3.2 × 10⁸ h⁻¹ M_☉. For this simulation, the ART code begins with a spatial grid size of 512³; it refines the grid up to 8 times in locally dense regions, leading to an adaptive distance resolution of ≈1.2 h⁻¹ kpc (comoving units) in the densest parts and ≈0.31 h⁻¹ Mpc in the sparsest parts of the simulation.

In this simulation, halos and subhalos were identified using a variant of the Bound Density Maxima algorithm (Klypin et al. 1999). Halo centers are located at peaks in the density field smoothed over a 24 particle SPH kernel (for a minimum resolvable halo mass of 7.7 × 10⁹ h⁻¹ M_☉). Nearby particles are classified as bound or unbound in an iterative process; once all the locally bound particles have been found, halo parameters such as the virial mass M_vir and maximum circular velocity V_max may be calculated. (See Kravtsov et al. 2004 for complete details on the algorithm.) The simulation is complete down to V_max ≈ 100 km s⁻¹, corresponding to a galaxy stellar mass of 10^8.75 M_☉ at z = 0.

The ability of L80G to track satellites with high mass and force resolution gives it several uses. Merger trees from L80G inform our prescription for converting analytical-central-only halo mass functions to mass functions which include satellite halos (see Section 3.2.2). Additionally, the merger trees allow for evaluation of different models of satellite stellar evolution with full consistency (see Section 3.3.2). Finally, the knowledge of which satellite halos are associated with which central halos allows for estimates of the total stellar mass (in the central and all satellite galaxies)–halo mass relation (see Section 4.3.6).

We also make use of a secondary simulation from the Large Suite of Dark Matter Simulations (LasDamas Project, http://lss.phy.vanderbilt.edu/lasdamas/) in our sample variance calculations. The L80G simulation is too small for use in calculating the sample variance between multiple independent mock surveys, but the larger size of the LasDamas simulation (420 h⁻¹ Mpc, 1400³ particles) makes it ideal for this purpose. However, the LasDamas simulation has poorer mass resolution (a minimum particle size of 1.9 × 10⁹ M_☉) and force resolution (8 h⁻¹ kpc), making it unable to resolve subhalos (particularly after accretion) as well as L80G. The LasDamas simulation assumes a flat, ΛCDM cosmology (Ω_M = 0.25, Ω_Λ = 0.75, h = 0.7, and σ₈ = 0.8) which is very close to the WMAP5 best-fit cosmology (Komatsu et al. 2009). Collisionless gravitational evolution was provided by the gadget-2 code (Springel 2005). Halos are identified using friends of friends with a linking length of 0.164. The subfind algorithm Springel (2005) is used to identify substructure.

As mentioned, the primary use of the LasDamas simulation is in sampling the halo mass functions in mock surveys to model the effects of sample variance on high-redshift pencil-beam galaxy surveys. The mock surveys are constructed so as to mimic the observations in Pérez-González et al. (2008). In each mock survey, three pencil-beam light cones (matching the angular sizes of the three fields in Pérez-González et al. 2008) with random orientations are sampled from a random origin in the simulation volume out to z = 1.3. Thus, by comparing the halo mass functions in individual mock surveys to the mass function of the ensemble, the effects of sample variance may be calculated with full consideration of the correlations between halo counts at different masses.

3.2.2. Analytic Mass Functions

The analytic mass functions of Tinker et al. (2008) are used to calculate the abundance of halos in several cosmological models. We calculate mass functions defined by M_vir, using the overdensity specified by Bryan & Norman (1998).⁵ This results in an overdensity (compared to the mean background density) Δ_vir which ranges from 337 at z = 0 to 203 at z = 1 and smoothly approaches 180 at very high redshifts. Following Tinker et al. (2008), we use spline interpolation to calculate mass functions for overdensities between the discrete intervals presented in their paper.

The mass functions in Tinker et al. (2008) only include central halos. We model the small (≈20% at z = 0) correction to the mass function introduced by subhalos to first order only, as the overall uncertainty in the central halo mass function is already of order 5% (Tinker et al. 2008). In particular, we calculate satellite (mass at accretion) and central mass functions in our simulation (L80G) and fit Schechter functions to both, excluding halos below our completeness limit (10^10.3 M_☉). Then, we plot the difference between the Schechter parameters (the difference in characteristic mass, Δlog₁₀M_*; the difference in characteristic density, Δlog₁₀ϕ₀; and the difference in faint-end slopes, Δα) as a function of scale factor (a). This gives the satellite mass function (ϕ_s) as a function of the central mass function (ϕ_c), which allows us to use this (first-order) correction for central mass functions of different cosmologies:

$\begin{equation} \phi _s(M) = 10^{\Delta \log _{10} \phi _0} \, \left(\frac{M}{M_0 \cdot 10^{\Delta \log _{10} M_0}}\right)^{-\Delta \alpha } \, \phi _c(M / 10^{\Delta \log _{10} M_0}). \end{equation} \tag{ 6 }$

From our simulation, we find fits as shown in Figure 1:⁶

$\begin{eqnarray} \Delta \log _{10} \phi _0(a) &=& {-}0.736 - 0.213 \, a, \nonumber \\ \Delta \log _{10} M_0(a) &=& 0.134 - 0.306 \, a,\\ \Delta \alpha (a) &=& {-}0.306 + 1.08 \, a - 0.570 \, a^2. \nonumber \end{eqnarray} \tag{ 7 }$

The mass function used here may be easily replaced by an arbitrary mass function, as detailed in Appendix A.

**Figure 1.** Differences between the fitted Schechter-function parameters for the satellite halo mass (at accretion) function and the central halo mass function as a function of scale factor; e.g., Δlog₁₀ϕ₀ corresponds to log₁₀(ϕ_0,sats/ϕ_0,centrals). The black lines are calculated from a simulation using WMAP1 cosmology (L80G), and the red lines represent the fits to the simulation results in Equation (7).
Download figure:
Standard image High-resolution image

3.2.3. Modeling Uncertainties in Cosmological Parameters

Our fiducial results are calculated assuming WMAP5 cosmological parameters. In order to model uncertainties in cosmological parameters, we have sampled an additional 100 sets of cosmological parameters from the WMAP5+BAO+SN MCMC chains (from the models in Komatsu et al. 2009) and generated mass functions for each one according to the method in the previous section. Hence, to determine the variance in the derived SM–HM relation caused by cosmology uncertainties, we recalculate the relation for each sampled mass function according to the method described in Appendix A.

3.2.4. Estimating Sample Variance Effects for the Stellar Mass Function

Large-scale modes in the matter power spectrum imply that finite surveys will obtain a biased estimate of the number densities of galaxies and halos as compared to the full universe. That is to say, matching observed GSMFs measured from a finite survey to the halo mass function estimated from a much larger volume will introduce systematic errors into the resulting SM–HM relation. These errors cannot be corrected unless one has knowledge of the halo mass function for the specific survey in question, which is in general not possible.

However, we can still calculate the uncertainties introduced by the limited sample size. While we cannot determine the true halo mass function for the survey, we can calculate the probability distribution of halo mass functions for identically shaped surveys via sampling light cones from simulations. If we rematch galaxy abundances from the observed GSMF to the abundances of halos in each of the sampled light cones, then the uncertainty introduced by sample variance is exactly captured in the variance of the resulting SM–HM relations.

In detail, we create our distribution of halo mass functions by sampling one thousand mock surveys from the LasDamas simulation (see Section 3.2.1) corresponding to the exact survey parameters used in Pérez-González et al. (2008). We fit Schechter functions to the halo mass functions of each mock survey (over all redshifts), and we calculate the change in Schechter parameters (Δlog₁₀ϕ₀, Δlog₁₀M₀, and Δα) as compared to a Schechter fit to the ensemble average of the mass functions. Using the distribution of the changes in Schechter parameters, we may mimic to first order the expected distribution of halo mass functions for any cosmology. In particular, we use an equation exactly analogous to Equation (6) to convert the mass function for the full universe (ϕ_full) and the distribution of Δlog₁₀ϕ₀, Δlog₁₀M₀, and Δα into a distribution of possible survey mass functions (ϕ_obs):

$\begin{eqnarray} \phi _{\rm obs}(M) &=& 10^{\Delta \log _{10} \phi _0} \, \left(\frac{M}{M_0 \cdot 10^{\Delta \log _{10} M_0}}\right)^{-\Delta \alpha } \nonumber\\ &&\times \phi _\mathrm{full}(M / 10^{\Delta \log _{10} M_0}). \end{eqnarray} \tag{ 8 }$

Hence, to obtain the variance in the SM–HM relation caused by finite survey size, we recalculate the relation for each one of the survey mass functions thus computed according to the method described in Appendix A.

3.3. Uncertainties in Abundance Matching

3.3.1. Scatter in Stellar Mass at Fixed Halo Mass

An important uncertainty in the abundance matching procedure is introduced by intrinsic scatter in stellar mass at a given halo mass. Suppose that M_*(M_h) is the average (true) galaxy stellar mass as a function of host halo mass. For a perfect monotonic correlation between stellar mass and halo mass, i.e., without scatter between stellar and halo mass, it is straightforward to relate the true or "intrinsic" stellar mass function (ϕ_true) to the halo mass function (ϕ_h) via

$\begin{equation} \frac{dN}{d\log _{10} M_\ast } = \frac{dN}{d\log _{10} M_h} \, \frac{d\log M_h}{d\log M_\ast }, \end{equation} \tag{ 9 }$

where N is the number density of galaxies, so that

$\begin{equation} \phi _\mathrm{true}(M_\ast (M_h)) = \phi _h(M_h) \, \left(\frac{d\log M_\ast (M_h)}{d\log M_h}\right)^{-1}. \end{equation} \tag{ 10 }$

Intuitively, as the halos of mass M_h get assigned stellar masses of M_*(M_h), the number density of galaxies with mass M_*(M_h) will be proportional to the number density of halos with mass M_h. The above equations are simply a mathematical representation of the traditional abundance matching technique.

Equation (10) remains useful in the presence of scatter. If we know the expected scatter about the mean stellar mass, say in the form of a probability density function P_s(Δlog₁₀M_*|M_h), then we may still relate ϕ_true to ϕ_h via an integral similar to a convolution:

$\begin{eqnarray} \phi _\mathrm{true}(x) & =& \int _{0}^\infty \phi _h(M_h(M_\ast)) \, \frac{d \log M_h(M_\ast)}{d\log M_\ast } \nonumber \\ && {\times}\, P_s\left(\log _{10} \frac{x}{M_\ast } \bigg| M_h(M_\ast)\right) d\log _{10} M_\ast, \end{eqnarray} \tag{ 11 }$

where M_h(M_*) is the inverse function of M_*(M_h).

This similarity to a convolution is no coincidence—mathematically, it is analogous to how we model random statistical errors in stellar mass measurements in Section 3.1.2. Namely, if we define ϕ_direct to equal the right-hand side of Equation (10),

$\begin{equation} \phi _\mathrm{direct}(M_\ast) \equiv \phi _h(M_h(M_\ast)) \, \frac{d \log M_h}{d\log M_\ast }, \end{equation} \tag{ 12 }$

and if we assume a probability density distribution independent of halo mass (i.e., scatter in stellar mass at fixed halo mass is independent of halo mass), then ϕ_true is exactly related to ϕ_direct by a convolution:

$\begin{equation} \phi _\mathrm{true}(M_\ast) = \int _{-\infty }^\infty \phi _\mathrm{direct}(10^y) \, P_s(y - \log _{10} M_\ast) dy, \end{equation} \tag{ 13 }$

which is mathematically identical to Equation (2) in Section 3.1.2.

Then, if one calculates ϕ_direct from ϕ_true, one may find M_h(M_*) via direct abundance matching. Namely, integrating Equation (12), we have

$\begin{equation} \int _{M_h(M_\ast)}^\infty \phi _h(M)d\log _{10} M = \int _{M_\ast }^\infty \phi _\mathrm{direct}(M_\ast)d\log _{10} M_\ast. \end{equation} \tag{ 14 }$

Equivalently, letting $\Phi _h(M_h) \equiv \int _{M_h}^\infty \phi _h(M) d\log _{10} M$ be the cumulative halo mass function, and letting $\Phi _\mathrm{direct}(M_\ast) \equiv \int _{M_\ast }^\infty \phi _\mathrm{direct}(M_\ast) d\log _{10} M_\ast$ be the cumulative "direct" stellar mass function, we have

$\begin{equation} M_h(M_\ast) = \Phi _h^{-1}(\Phi _\mathrm{direct}(M_\ast)), \end{equation} \tag{ 15 }$

and one may similarly find M_*(M_h) by inverting this relation.

Our approach in all equations except for Equation (13) allows a halo mass-dependent scatter in the stellar mass, but to date the data appears to be consistent with a constant scatter value. For example, using the kinematics of satellite galaxies, More et al. (2009) find that the scatter in galaxy luminosity at a given halo mass is 0.16 ± 0.04 dex, independent of halo mass. Using a catalog of galaxy groups, Yang et al. (2009b) find a value of 0.17 dex for the scatter in the stellar mass at a given halo mass, also independent of halo mass. Here, we thus assume a fixed value for the scatter in stellar mass at fixed halo mass, ξ, to specify the standard deviation of P_s(Δlog₁₀M_*). As the Yang et al. (2009b) value is consistent with the More et al. (2009) value, we set the prior using the More et al. (2009) value and error bounds on ξ. We assume a Gaussian prior on the probability distribution for ξ, and we assume that the scatter itself is log-normal.

3.3.2. The Treatment of Satellites

When a galaxy is accreted into a larger system, it will likely be stripped of dark matter much more rapidly than stellar mass because the stars are much more tightly bound than the halo. It has been demonstrated that various galaxy clustering properties compare favorably to samples of halos where satellite halos—i.e., subhalos—are selected according to their halo mass at the epoch of accretion, M_acc, rather than their current mass (e.g., Nagai & Kravtsov 2005; Conroy et al. 2006; Vale & Ostriker 2006; Berrier et al. 2006). These results support the idea that satellite systems lose dark matter more rapidly than stellar mass.

As commonly implemented (e.g., Conroy et al. 2006), the abundance matching technique matches the stellar mass function at a particular epoch to the halo mass function at the same epoch, using M_acc rather than the present mass for subhalos. As M_acc remains fixed as long as the satellite is resolvable, the standard technique implies that the satellite galaxy's stellar mass will continue to evolve in the same way as for centrals of that halo mass. Therefore, a subtle implication of the standard technique is that satellites may continue to grow in stellar mass, even though M_acc remains the same. A different model for satellite stellar evolution (e.g., in which stellar mass which does not evolve after accretion) would therefore involve different choices in the satellite matching process.

The fiducial results presented here use the standard model where satellites are assigned stellar masses based on the current stellar mass function and their accretion-epoch masses. However, we also present results for comparison in which satellite masses are assigned utilizing the stellar mass function at the epoch of accretion, corresponding to a situation in which satellite stellar masses do not change after the epoch of accretion. In order to maintain self-consistency for the latter method, we use full merger trees (from L80G, the simulation described in Section 3.2.1) to keep track of satellites and to assure that, e.g., mergers between satellites before they reach the central halo preserve stellar mass.

Finally, we note that any specific halo-finding algorithm may introduce artifacts in the halo mass function in terms of when a satellite halo is considered absorbed/destroyed. This can have a small effect on satellite clustering as well as number density counts. Wetzel & White (2010) suggest an approach that avoids some of the problems associated with resolving satellites after accretion. Namely, they suggest a model where satellites remain in orbit for a duration that is a function of the satellite mass, the host mass, and the Hubble time, after which time they dissolve or merge with the central object. Although we have not modeled this explicitly, our satellite counts are consistent with their recommended cutoff—they suggest considering a satellite halo absorbed when its present mass is less than 0.03 times its infall mass; in our simulation, only 0.1% of all satellites fall below this threshold.

3.4. Functional Forms for the SM–HM Relation

In order to determine the probability distribution of our underlying model parameters, we must first define an allowed parameter space for the SM–HM relation. Ideally, one would like a simple, accurate, physically intuitive, and orthogonal parameterization; in practice, we seek the best compromise with these four goals in mind. We consider one of the most popular methods for choosing a functional form (indirect parameterization via the stellar mass function) before discussing the method we use in this paper (parameterization via deconvolution of the stellar mass function).

3.4.1. Parameterizing the Stellar Mass Function

In abundance matching, knowledge of the halo mass function and the stellar mass function uniquely determines the SM–HM relation. Hence, parameterizing the stellar mass function yields an indirect parameterization for the SM–HM relation as well. Numerous papers (e.g., Cole et al. 2001; Bell et al. 2003; Panter et al. 2004; Pérez-González et al. 2008) have found that the GSMF is well approximated by a Schechter function:

$\begin{equation} \phi (M_\ast,z) = \phi ^\star (z)\left(\frac{M_\ast }{M(z)}\right)^{-\alpha (z)}\exp \left(-\frac{M_\ast }{M(z)}\right), \end{equation} \tag{ 16 }$

where the Schechter parameters ϕ^⋆(z), M(z), and α(z) evolve as functions of the redshift z. In many previous works on abundance matching (e.g., Conroy et al. 2009), it is the Schechter function for the stellar mass function that sets the form of the SM–HM relation.

More recently, however, several authors have noted that the GSMF cannot be matched by a single Schechter function for z < 0.2 to within statistical errors (e.g., Li & White 2009; Baldry et al. 2008), in part because of an upturn in the slope of the GSMF for galaxies below 10⁹ M_☉ in stellar mass. It is possible that a conspiracy of systematic errors causes the observed deviations, but there is no fundamental reason to expect the intrinsic GSMF to be fit exactly by a Schechter function (see discussion in Appendix C). In any case, our full parameterization–either the stellar mass function or the error parameterization—must be able to capture all the subtleties of the observed stellar mass function. Hence, we are inclined to adopt a more flexible model than the Schechter function of Equation (16). Other authors, wrestling with the same problem, have chosen to adopt multiple Schechter functions, including the eleven-parameter triple piecewise Schechter-function fit used by Li & White (2009). While accurate, these models often add complication without increasing intuition.

3.4.2. Deconvolving the Observed Stellar Mass Function

Rather than attempting to parameterize the stellar mass function, we could use abundance matching directly to derive the SM–HM relation for the maximal-likelihood stellar mass function, and then find a fit which can parameterize the uncertainties in the shape of the relation. This process is complicated by the various errors which we must take into account. Recall from Equations (2) and (13) that

$\begin{equation} \phi _\mathrm{meas}(M_\ast) = \phi _\mathrm{direct}(M_\ast) \circ P_s(\Delta \log _{10} M_\ast) \circ P(\Delta \log _{10} M_\ast), \end{equation} \tag{ 17 }$

where "○" denotes the convolution operation, P_s is the probability distribution for the scatter in stellar mass at fixed halo mass, and P is the probability distribution for errors in observed stellar mass at fixed true stellar mass. However, if we obtain ϕ_direct by deconvolution of the observed stellar mass function ϕ_meas, we may use direct abundance matching (Equation (15)) to determine the maximum likelihood form of M_h(M_*).

Figure 2 shows the result of calculating M_h(M_*) at z ∼ 0.1 via deconvolution and direct matching of the stellar mass function as described in the previous section. We choose the maximum-likelihood value for the distribution function P_s (namely, 0.16 dex log-normal scatter), and we use WMAP5 cosmology for the halo mass function ϕ_h in the derivation.

While deconvolution plus direct abundance matching gives an unbiased calculation of the relation, there are several problems which prevent it from being used directly to calculate uncertainties.

1.
Deconvolution will tend to amplify statistical variations in the stellar mass function—that is, shallow bumps in the GSMF will be interpreted as convolutions of a sharper feature.
2.
Deconvolution will give different results depending on the boundary conditions imposed on the stellar mass function (i.e., how the GSMF is extrapolated beyond the reported data points), the effects of which may be seen at the edges of the deconvolution in Figure 2.
3.
Deconvolution becomes substantially more problematic when the convolution function varies over the redshift range, as it does for our higher-redshift data (z>0.2).
4.
Deconvolution cannot extract the relation at a single redshift; instead, it will only return the relation averaged over the redshift range of galaxies in the reported GSMF.

For these reasons, we choose to find a fitting formula instead. In the discussion that follows, we fit M_h(M_*) (the halo mass for which the average stellar mass is M_*) rather than the more intuitive M_*(M_h) (the average stellar mass at a halo mass M_h) primarily for reasons of computational efficiency. From Equations (12) and (17), the calculation of what observers would see (ϕ_meas) for a trial SM–HM relation requires many evaluations of M_h(M_*) and no evaluations of M_*(M_h). If we had instead parameterized M_*(M_h), and then inverted as necessary in the calculation of ϕ_meas, our calculations would have taken an order of magnitude more computer time.

3.4.3. Fitting the Deconvolved Relation

It is well known from comparing the GSMF (or the luminosity function) to the halo mass function that high-mass (M_* ≳ 10^10.5 M_☉) galaxies have a significantly different SM–HM scaling than low-mass galaxies, which is usually attributed to different feedback mechanisms dominating in high-mass versus low-mass galaxies. The transition point between low-mass and high-mass galaxies—seen as a turnover in the plot of M_h(M_*) around M_* = 10^10.6 M_☉ in Figure 2—defines a characteristic stellar mass (M_*,0) and an associated characteristic halo mass (M₁). Hence, we consider functional forms which respect this general structure of a low stellar mass regime and a high stellar mass regime with a characteristic transition point:

$\begin{eqnarray*} \log _{10}(M_h(M_\ast)) &= &\log _{10}(M_1) \; \hbox{[characteristic halo mass]}\\ &+&\!\! f_{\rm low}(M_\ast / M_{\ast,0})\; \hbox{[low-mass functional form]}\\ & +&\!\! f_{\rm high}(M_\ast / M_{\ast,0}) \; \hbox{[high-mass functional form],} \end{eqnarray*}$

where f_low and f_high are dimensionless functions dominating below and above M_*,0, respectively.

For low-mass galaxies (M_* < 10^10.5 M_☉), we find the SM–HM relation to be consistent with a power law:

$\begin{eqnarray} \frac{M_h(M_\ast)}{M_1} & \approx & \left(\frac{M_\ast }{M_{\ast,0}}\right)^\beta,\hbox{ or} \nonumber \\ \log (M_h(M_\ast)) & \approx & \log (M_1) + \beta \log \left(\frac{M_\ast }{M_{\ast,0}}\right). \end{eqnarray} \tag{ 18 }$

For high-mass galaxies, we find the SM–HM relation to be inconsistent with a power law. In particular, the logarithmic slope of M_h(M_*) changes with M_*, with dlog M_h/dlog M_* always increasing as M_* increases. This may seem like a small detail; after all, by eye, it appears that a power law could be a reasonable fit for high-mass galaxies in Figure 2. In addition, because previous authors (e.g., Moster et al. 2010; Yang et al. 2009a) have used power laws, it may not seem necessary to use a different functional form.

In order to explore this issue, we tried a general double power-law functional form for M_h(M_*) which parameterized a superset of the fits used in Moster et al. (2010) and (Yang et al. 2009a; in particular, the same form as in Equation (C2) in Appendix C). We found that this approach had two major problems common to any such power-law form.

1.
As the logarithmic slope of M_h(M_*) increases with increasing M_*, the best-fit power law for high-mass galaxies will depend on the upper limit of M_* in the available data for the GSMF. Thus, the best-fit power law will depend on the number density limit of the observational survey used rather than on any fundamental physics. Moreover, for studies such as this one which consider redshift evolution, the different number densities probed at different redshifts result in a completely artificial "evolution" of the best-fit power law.
2.
The best-fit power-law will not depend on the highest-mass galaxies alone; instead, it will be something of an average over all the high-mass galaxies. Because the logarithmic slope is increasing with M_*, this means that the best-fit power law for M_h(M_*) will increasingly underestimate the true M_h(M_*) at high M_*. Namely, the fit will underestimate the halo mass corresponding to a given stellar mass, and therefore (as lower-mass halos have higher number densities) will result in stellar mass functions systematically biased above observational values. However, a systematic bias in our functional form will influence the best-fit values of the systematic error parameters. The systematic bias caused by assuming a power law form turns out to be most degenerate with the scatter in stellar mass at fixed halo mass (ξ). As a result, for the MCMC chains which assumed a double power law form for M_h(M_*), the posterior distribution of ξ was 0.09 ± 0.02 dex, which just barely lies within 2σ of the constraints from More et al. (2009).

These problems are not as significant if one only considers the stellar mass function at a single redshift, or if one does not allow for the systematic errors which change the overall shape of the stellar mass function (κ, ξ, and σ(z)). However, we find that the issues listed above exclude the use of a power law for our purposes. Instead, we find that M_h(M_*) asymptotes to a sub-exponential function for high M_*: namely, a function which climbs more rapidly than any power-law function, but less rapidly than any exponential function. We find that high–mass galaxies (M_*>10^10.5 M_☉) are well fit by the relation

$\begin{eqnarray} M_h(M_\ast) & \stackrel{\sim }{\propto }& 10^{(\frac{M_\ast }{M_{\ast,0}})^\delta }, \quad\hbox{ or} \nonumber \\ \log _{10}(M_h(M_\ast)) & \rightarrow & \log _{10}(M_1) + \left(\frac{M_\ast }{M_{\ast,0}}\right)^\delta, \end{eqnarray} \tag{ 19 }$

where δ sets how rapidly the function climbs; δ → 0 would correspond to a power law, and δ = 1 would correspond to a pure exponential. Typical values of δ at z = 0 range from 0.5to0.6. It is not obvious what physical meaning can be directly inferred from the choice of a sub-exponential function—after all, the stellar mass of a galaxy is a complicated integral over the merger and evolution history of the galaxy—but it could suggest that the physics driving the M_h(M_*) relation at high mass is not scale-free.

Although this form now matches the asymptotic behavior for the highest and lowest stellar mass galaxies, one additional parameter is necessary to match the functional form of the deconvolution. That is to say, galaxies in between the extremes in stellar mass will lie in a transition region, as they may have been substantially affected by multiple feedback mechanisms. The width of this transition region will depend on many things, e.g., how long galaxies take to gain stellar mass, how much of the stellar mass present came from quiescent star formation as opposed to mergers, and the degree of interaction between multiple feedback mechanisms. Hence, instead of having M_h(M_*) become suddenly sub-exponential for galaxies larger than M_*,0, we allow for a slow "turn-on" of the more rapid growth. The behavior of M_h(M_*) is best fit by modifying the previous equation to

$\begin{equation} \log _{10}(M_h(M_\ast)) \rightarrow \log _{10}(M_1) + \frac{\big(\frac{M_\ast }{M_{\ast,0}}\big)^\delta }{1+ \big(\frac{M_{\ast }}{M_\ast,0}\big)^{-\gamma }}. \end{equation} \tag{ 20 }$

The denominator, 1 + (M_*/M_*,0)^−γ, is large for M_* < M_*,0, and it falls to unity for M_*>M_*,0 at a rate controlled by γ. A larger value of γ implies a more rapid transition between the power-law and sub-exponential behavior (typical values for (γ) at z = 0 are 1.3–1.7). As the non-constant piece of M_h(M_*) in Equation (20) is $\frac{1}{2}$ for M_* = M_*,0, we add a final factor of $-\frac{1}{2}$ to compensate so that M_h(M_*,0) = M₁.

To summarize, our resulting best-fit functional form has five parameters:

$\begin{eqnarray} \log _{10}(M_h(M_\ast)) &=& \log _{10}(M_1) + \beta \,\log _{10}\left(\frac{M_\ast }{M_{\ast,0}}\right)\nonumber\\ && + \frac{\big(\frac{M_\ast }{M_{\ast,0}}\big)^\delta }{1 + \big(\frac{M_{\ast }}{M_{\ast,0}}\big)^{-\gamma }} - \frac{1}{2}, \end{eqnarray} \tag{ 21 }$

where M₁ is a characteristic halo mass, M_*,0 is a characteristic stellar mass, β is the faint-end slope, and δ and γ control the massive-end slope. The best fit using this functional form is shown in Figure 2, and it achieves excellent agreement over the entire range of stellar masses.

Deconvolving the GSMF at higher redshifts does not suggest that anything more than linear evolution in the parameters is necessary, at least out to z = 1. While the characteristic mass of the GSMF and the characteristic mass of the halo mass function certainly evolve, the change in the shapes of the two functions is relatively slight. As we wish for the functional form to have a natural extension to higher redshifts, we parameterize the evolution in terms of the scale factor (a):

$\begin{eqnarray} \log _{10}(M_1(a)) & = & M_{1,0} + M_{1,a} \, (a-1), \nonumber \\ \log _{10}(M_{\ast,0}(a)) & = & M_{\ast,0,0} + M_{\ast,0,a} \, (a-1), \nonumber \\ \beta (a) & = & \beta _0 + \beta _a \, (a-1),\\ \delta (a) & = & \delta _0 + \delta _a \, (a-1), \nonumber \\ \gamma (a) & = & \gamma _0 + \gamma _a \, (a-1), \nonumber \end{eqnarray} \tag{ 22 }$

where a = 1 is the scale factor today.

3.5. Calculating Model Likelihoods

We make use of an MCMC method to generate a probability distribution in our complete parameter space of stellar mass function parameters (M_1,0, M_1,a, M_*,0,0, M_*,0,a, β₀, β_a, δ₀, δ_a, γ₀, γ_a), systematic modeling errors (κ, μ, σ_z), and the scatter in stellar mass at fixed halo mass (ξ). A brief summary of each of these parameters appears in Table 1 along with a reference to the section in which it was first described. Using this full model, we may calculate the stellar mass functions expected to be seen by observers (ϕ_expect) for a large number of points in parameter space, and compare them to observed GSMFs (Li & White 2009; Pérez-González et al. 2008). Note that, as the observational data always cover a range of redshifts, we must mimic this in our calculation of ϕ_expect:

$\begin{equation} \phi _\mathrm{expect} = \frac{\int _{z_1}^{z_2} \phi _\mathrm{fit}(z) dV_C(z)}{\int _{z_1}^{z_2} dV_C(z)}, \end{equation} \tag{ 23 }$

where dV_C(z) is the comoving volume element per unit solid angle as a function of redshift. Then, we can write the likelihood as $\mathcal {L} = \exp (-\chi ^2/2)$ , where

$\begin{equation} \chi ^2 = \int \left[\frac{\log _{10}[\phi _\mathrm{expect}(M_\ast) / \phi _\mathrm{meas}(M_\ast)]}{\sigma _\mathrm{obs}(M_\ast)}\right]^2 d\log _{10}(M_\ast), \end{equation} \tag{ 24 }$

and where σ_obs(M_*) is the reported statistical error in ϕ_meas as a function of stellar mass.

Table 1. Summary of Model Parameters

Symbol	Description	Prior^a	Section
M_h(M_*)	The halo mass for which the average stellar mass is M_*	N/A	3.4.3

M₁	Characteristic halo mass	Flat (Log)	3.4.3
M_*,0	Characteristic stellar mass	Flat (Log)	3.4.3
β	Faint-end power law (M_h ∼ M^β_*)	Flat (Linear)	3.4.3
δ	Massive-end sub-exponential (log₁₀(M_h) ∼ M^δ_*)	Flat (Linear)	3.4.3
γ	Transition width between faint- and massive-end relations	Flat (Linear)	3.4.3
(x)₀	Value of the variable (x) at the present epoch, where (x) is one of (M₁, M_*,0, β, δ, γ)	(See above)	3.4.3
(x)_a	Evolution of the variable (x) with scale factor	(Same as for (x)₀)	3.4.3

μ	Systematic offset in M_* calculations	G(0, 0.25) (Log)	3.1.1
κ	Systematic mass-dependent offset in M_* calculations	G(0, 0.10) (Linear)	3.1.1
σ_z	Redshift scaling of statistical errors in M_* calculations	G(0.05, 0.015) (Log)	3.1.2
ξ	Scatter in M_* at fixed M_h	G(0.16, 0.04) (Log)	3.3.1

Note. ^aSee Equations (1), (5), and (21)–(23). G(x, s) denotes a Gaussian prior centered at x with standard deviation s, in either linear or logarithmic units. "Flat" denotes a uniform prior in either linear or logarithmic units.

Download table as: ASCII Typeset image

Note that, as defined above, the equation for χ² contains the assumption that there is only one independent observation point for the GSMF per decade in stellar mass (from the weight of dlog₁₀(M)). We may tune this assumption introducing another parameter n—the number of non-correlated observations per decade in stellar mass—which would change the likelihood function to $\mathcal {L} = \exp (-n\chi ^2/2)$ . Here, we assume that each of the data points reported by Li & White (2009) and Pérez-González et al. (2008) are independent, such that n = 10 for the former paper and n = 5 for the latter paper.

The MCMC chains each contain 2²² ≈ 4 × 10⁶ points. We verify convergence according to the algorithm in Dunkley et al. (2005); in all cases, the ratio of the sample mean variance to the distribution variance (the "convergence ratio") is below 0.005.

3.6. Methodology Summary

Our procedure to calculate the SM–HM relation, taking into account all mentioned uncertainties, may be summarized in seven steps.

1.
We select a trial point in the parameter space of SM–HM relations as well as a trial point in our parameter space of systematics (μ, κ, σ_z, ξ). A complete list of parameters and descriptions is given in Table 1.
2.
The trial SM–HM relation gives a one-to-one mapping between halo masses and stellar masses, giving a direct conversion from the halo mass function to a trial GSMF (corresponding to ϕ_direct in Section 3.3.1).
3.
This trial GSMF is convolved with the probability distributions for scatter in stellar mass at fixed halo mass (controlled by ξ; see Section 3.3.1) and for scatter in observer-determined stellar mass at fixed true stellar mass (partially controlled by σ_z; see Section 3.1.2).
4.
The resulting GSMF is shifted by a uniform offset in stellar masses (controlled by μ) to account for uniform systematic differences between our adopted stellar masses and the true underlying masses. Also, its shape is stretched or compressed to account for stellar mass dependent offsets between our masses and the true underlying masses (controlled by κ, see Section 3.1.1).
5.
We repeat steps 2–4 for all redshifts in the range covered by the observed data set. We may then calculate the expected GSMF in each redshift bin for which observers have reported data. The likelihood of the expected GSMFs given the measured GSMFs is then used to determine the next step in the MCMC chain.
6.
To account for sample variance in the observed stellar mass functions above z ∼ 0.2, we recalculate each SM–HM relation in the chain for an alternate halo mass function taken from a randomly sampled mock survey (see Section 3.2.4) and refit our functional form to the redshift evolution of the relation. Similarly, for the results which include cosmology uncertainty, we recalculate each SM–HM relation for an alternate halo mass function randomly selected from the MCMC chain used to determine the WMAP5 cosmology uncertainties.
7.
We repeat steps 1–6 to build a joint probability distribution for the SM–HM relation and the systematics parameter space. The steps are repeated until the joint probability distribution has converged to the underlying posterior distribution.

4. RESULTS FOR 0 < z < 1

We now present the results of this approach to determine the SM–HM relation and related quantities. In Section 4.1, we compare GSMFs generated from our best fits to observed data and comment on the effects of systematic observational biases. We present our best-fitting results for the SM–HM relation with full error bars in Section 4.2. We evaluate the relative importance of each of the contributing types of error in Section 4.3 and summarize the most relevant contributions in Section 4.3.7. Finally, our derived SM–HM relation is compared to other published results in Section 4.4.

4.1. GSMFs

To demonstrate that our functional form for M_h(M_*) is capable of reproducing observed GSMFs, we show a comparison between our best-fit models and the observed data in Figures 3 and 4 at several redshifts. For our best-fit models, both ϕ_true (the true or "intrinsic" stellar mass function) and ϕ_meas (the GSMF that observers would measure) are shown. Recall that ϕ_meas incorporates the effects of the systematic observational biases, namely, the overall shift in stellar mass calculations, μ; the linearly mass-dependent shift, κ; and the statistical errors in stellar mass calculations for individual galaxies, σ(z). The fact that the best values of the systematic parameters (μ, κ, ξ, σ_z) are very close to the centers of their prior distributions provides confirmation that the functional form for the SM–HM relation outlined in Section 3.4.3 does not bias our best-fit results.

**Figure 3.** Comparison of the best fit ϕ_true (the true or "intrinsic" GSMF) in our model to the resulting ϕ_meas (what an observer would report for the GSMF, which includes the effects of the systematic biases μ, κ, and σ) at z = 0. Since the best-fit values of μ and κ are very close to zero, the difference between ϕ_meas and ϕ_true almost exclusively comes from the uncertainty in measuring stellar masses (σ).
Download figure:
Standard image High-resolution image

**Figure 4.** Comparison of the best fit ϕ_true (the true or "intrinsic" GSMF) to the resulting ϕ_meas (as in Figure 3), for z = 0.5 and z = 1.15. Statistical errors in individual stellar masses have a larger effect at higher redshift, resulting in a steeper intrinsic bright end than measured.
Download figure:
Standard image High-resolution image

As our best-fit values for μ and κ are close to zero (see Table 2), the difference between ϕ_true and ϕ_meas is almost exclusively due to the scatter σ(z) in calculated stellar masses. The difference between ϕ_true and ϕ_meas only becomes evident for galaxies above 10¹¹ M_☉, where the falling slope of the GSMF becomes severe enough for the scatter σ(z) to significantly raise number counts in the observed GSMF. At z ∼ 0, the systematic effect of σ(z) puts the intrinsic GSMF well below the small statistical error bars.

At higher redshifts, although the effect of σ(z) is larger, current surveys at z>0.2 do not yet cover sufficient volume to constrain the shape of the GSMF well at the massive end. Nonetheless, for future wide-field surveys at z>0.2, correction to the GSMF for scatter in calculated stellar masses will be an important consideration.

4.2. The Best-fit SM–HM Relations

We plot the average stellar mass as a function of halo mass for z = 0–1 in Figure 5 to show the evolution of the SM–HM relation. Note that as the stellar mass at a given halo mass has a log-normal scatter (see Section 2.3), we use geometric averages for stellar masses rather than linear ones. To highlight the effects of halo mass on star formation efficiency, we also present the SM–HM relation in terms of the average stellar mass fraction (stellar mass/halo mass) for z = 0–1 as a function of halo mass in the same figure. We focus on this quantity for the remainder of the paper. The best-fit parameters for the function M_h(M_*) are given in Table 2, and the numerical values for the stellar mass fractions are listed in Appendix D.

Table 2. Best Fits for the Redshift Evolution of M_h(M_*)

Parameter	Free (μ, κ)	μ = κ = 0	Free (μ, κ)
	0 < z < 1	0 < z < 1	0.8 < z < 4
M_*,0,0	10.72^+0.22_−0.29	10.72^+0.02_−0.12	11.09^+0.54_−0.31
M_*,0,a	0.55^+0.18_−0.79	0.59^+0.15_−0.85	0.56^+0.89_−0.44
$M_{\ast,0,a^2}$	N/A	N/A	6.99^+2.69_−3.51
M_1,0	12.35^+0.07_−0.16	12.35^+0.02_−0.15	12.27^+0.59_−0.27
M_1,a	0.28^+0.19_−0.97	0.30^+0.14_−1.02	−0.84^+0.87_−0.58
β₀	0.44^+0.04_−0.06	0.43^+0.01_−0.05	0.65^+0.26_−0.20
β_a	0.18^+0.08_−0.34	0.18^+0.06_−0.34	0.31^+0.38_−0.47
δ₀	0.57^+0.15_−0.06	0.56^+0.14_−0.05	0.56^+1.33_−0.29
δ_a	0.17^+0.42_−0.41	0.18^+0.41_−0.42	−0.12^+0.76_−0.50
γ₀	1.56^+0.12_−0.38	1.54^+0.03_−0.40	1.12^+7.47_−0.36
γ_a	2.51^+0.15_−1.83	2.52^+0.03_−1.89	−0.53^+7.87_−2.50

μ	0.00^+0.24_−0.25	N/A	0.00^+0.25_−0.25
κ	0.02^+0.11_−0.07	N/A	0.00^+0.14_−0.04
ξ	0.15^+0.04_−0.02	0.15^+0.04_−0.01	0.16^+0.07_−0.01
σ_z	0.05^+0.02_−0.01	0.05^+0.02_−0.01	0.05^+0.02_−0.01

Note. See Table 1 and Equations (1), (5), (21)–(23), and (25).

Download table as: ASCII Typeset image

The stellar mass fractions for central galaxies consistently show a maximum for halo masses near 10¹² M_☉. While the location of this maximum evolves with time, it clearly illustrates that star formation efficiency must fall off for both higher- and lower-mass halos. The slopes of the SM–HM relation above and below this characteristic halo mass are indicative of at least two processes limiting star formation efficiency, although mergers complicate direct analysis for high-mass halos. At the low-mass end, the SM–HM relation scales as M_* ∼ M^2.3_h at z = 0 and as M_* ∼ M^2.9_h at z = 1. However, given the lack of information about low stellar mass galaxies at z>0.5, the statistical significance of this evolution is weak; no evolution in the low-mass slope of the relation is consistent within our 1σ errors. Several studies (most recently, Baldry et al. 2008; Drory et al. 2009) have reported that the GSMF has an upturn in slope for very low stellar masses, particularly below 10^8.5 M_☉; this would imply that our best fits may overestimate the scaling relation for galaxies below 10^8.5 M_☉. At the high-mass end, our best-fitting function results in a progressively shallower relation for the growth of stellar mass with halo mass, so that no single power law can describe the scaling. However, for halos close to 10¹⁴ M_☉, the best-fit relation scales locally as M_* ∼ M^0.28_h at z = 0 and M_* ∼ M^0.34_h at z = 1, in accord with previous studies (see Section 4.4). The results for high-mass halos are also consistent with no evolution in the slope of the SM–HM relation.

Figure 6 shows the stellar mass fraction for 0 < z < 1 excluding the effects of systematic shifts in stellar mass calculations (i.e., assuming μ = κ = 0). Under the assumption that systematic errors in stellar mass calculations result in similar biases in stellar masses at z = 0 as they do at higher redshifts, this allows us to consider the evolution in normalization of the SM–HM relation. Low-mass halos (below 10¹² M_☉) display clearly higher stellar mass fractions at late redshifts than they do at early redshifts. By contrast, the evolution in the stellar mass fractions for high-mass halos (above 10^13.5 M_☉) is not statistically significant, and it is constrained to be substantially less than for low-mass halos. In the time since z = 1, this means that the SFRs for high-mass halos typically fall relative to their dark matter accretion rates, whereas the opposite is true for low-mass halos (Conroy & Wechsler 2009). The best-fitting parameters for the SM–HM relation assuming μ = κ = 0 appear in Table 2, and the data points in Figure 6 appear in Appendix D.

4.3. Impact of Uncertainties

4.3.1. Systematic Shifts in Stellar Mass Calculations

By far the largest contributor to the error budget of the SM–HM relation is the systematic error parameter μ. As the effect of μ is to multiply all stellar masses by a constant factor, and as the width of the error bars in Figure 5 corresponds almost exactly to the prior on μ, we may conclude that reducing the error on the systematic shifts in stellar mass calculations would represent the single largest improvement in our understanding of the shape of the SM–HM relation. Figure 6 shows the substantially smaller error bars that result if systematic errors (μ and κ) in the stellar mass calculations are neglected.

4.3.2. Scatter in Stellar Mass at Fixed Halo Mass

The effect of ignoring scatter in stellar mass at fixed halo mass (i.e., setting ξ = 0) is shown at two redshifts in Figure 7. We find that the change is insignificant below halo masses of 10¹² M_☉, and is within statistical error bars below 10¹³ M_☉ for z = 1. This is a result of the fact that the slope of the stellar mass function below 10^10.5 M_☉ in stellar mass (corresponding to 10¹² M_☉ in halo mass) is not steep enough for scatter to have significant impact (also see Tasitsiomi et al. 2004). Because ξ>0 results in high stellar mass galaxies being assigned to lower-mass halos than they would be otherwise (due to the higher number density of lower-mass halos), the effect is that higher-mass halos contain fewer stars on average than they would for ξ = 0. The effect of setting ξ = 0 exceeds systematic error bars only for the very highest mass halos, above 10^14.5 M_☉.

We note that our posterior distribution constrains ξ to be less than 0.22 dex at the 98% confidence level. Higher values for ξ would result in GSMFs inconsistent with the steep falloff of the Li & White (2009) GSMF (also see discussion in Guo et al. 2010).

4.3.3. Statistical Errors in Stellar Mass Calculations

The significance of including or excluding random statistical errors in stellar mass calculations, σ(z), is also shown in Figure 7. The effect of this type of scatter on the SM–HM relation is mathematically identical to the effect of scatter in stellar mass at fixed halo mass. As σ(z = 0) (∼0.07 dex) is much smaller than the expected value of ξ (∼0.16 dex), the convolution of the two effects is only marginally different from including ξ alone at z = 0; this results in only a minor effect on the SM–HM relation. The effect becomes more pronounced at z = 1 for the reason that σ(z = 1) (∼0.12 dex) becomes more comparable to ξ; and so, including the effects of statistical errors in stellar mass becomes as important as modeling scatter in stellar mass at fixed halo mass.

4.3.4. Cosmology Uncertainties

In Figure 8, we show a comparison of best fits for the stellar mass fraction using abundance matching with three different halo mass functions: analytic prescriptions for WMAP5 and WMAP1 (see Section 3.2.2), as well as the mass function taken directly from the L80G simulation (see Section 3.2.1). The difference between the L80G simulation and the analytic WMAP1 mass function is slight, as the L80G simulation uses WMAP1 initial conditions (h = 0.7, Ω_m = 0.3, Ω_Λ = 0.7, σ₈ = 0.9, n_s = 1); the difference is consistent with sample variance for the relatively small (80 h⁻¹ Mpc) size of the simulation. The difference between SM–HM relations using WMAP1 and WMAP5 cosmologies is within the systematic errors at all masses. When systematic errors are neglected, the two cosmologies yield SM–HM relations that are noticeably different only at low halo masses (M < 10¹² M_☉).

Figure 9 shows the results of including uncertainties in the WMAP5 cosmological parameters. As described in Section 3.1, this is done using halo mass functions calculated with parameters resampled from the cosmological parameter chains provided by the WMAP team. Only at z ∼ 0 are the changes in error bars significant enough to justify mention. Here, the uncertainty in cosmology begins to exceed other sources of statistical error for halos below 10¹² M_☉ due to the small errors on the GSMF at the stellar masses associated with such halos (Li & White 2009). However, the cosmology uncertainties are still well within the systematic error bars.

4.3.5. Sample Variance

Because of the large volume of the SDSS, sample variance contributes insignificantly to the error budget for the SM–HM relation below z = 0.2. Above that redshift, the comparatively limited survey volume of Pérez-González et al. (2008) results in sample variance becoming an important contributor to the statistical error for halos below 10¹² M_☉ (Poisson noise dominates for larger halos). If the effects of sample variance were ignored, the statistical error spreads for our derived SM–HM relations at z = 1 would shrink from 0.12 dex to 0.09 dex for 10¹¹ M_☉ halos, and from 0.05 dex to 0.04 dex for 10^12.25 M_☉ halos. As with other types of errors, these considerations are well below the limits of the systematic error bars.

We caution that our error bars including sample variance at z>0 have a very specific meaning. Namely, they include the standard deviation in our fitting form which might be expected if the survey in Pérez-González et al. (2008) had been conducted on alternate patches of the sky. Sample variance at redshifts z>0 impacts only the linear evolution of the SM–HM relations we derive, as the large volume probed by the SDSS constrains the SM–HM relation very well at z ∼ 0. Because our fit is matched to the ensemble of reported data between 0 < z < 1, it is less vulnerable to the effects of sample variance in individual redshift bins. Instead, it is affected most by overall shifts in the number densities reported for the entire high-redshift survey. While this means that our fit gives a more robust SM–HM relation at all redshifts, some caution must be used when comparing our relation to results derived from the GSMF in a single redshift bin (e.g., 0.2 < z < 0.4). These will have much larger uncertainties due to sample variance than SM–HM relations derived (like ours) from GSMFs along a light cone probing a large redshift range.

To demonstrate the credibility of our approach for calculating the appropriate error bars including sample variance, we repeated our analysis of the SM–HM relation using GSMFs from Drory et al. (2009; which appeared as we were completing this work) instead of Pérez-González et al. (2008) for 0.2 < z < 1 and retaining the GSMF from Li & White (2009) for z < 0.2. Although the COSMOS survey in Drory et al. (2009) covers a much larger area (∼9× the area in Pérez-González et al. 2008), the fact that it is a single field means that the expected sample variance is only slightly smaller than for the combined fields in Pérez-González et al. (2008). As might be expected, the SM–HM relation at z = 0.1 using the Drory et al. (2009) GSMF is identical to the result in Figure 6 because of the strong constraining power of the SDSS data sample. At z = 1, the SM–HM relation generated by using the Drory et al. (2009) GSMF is within our quoted statistical and sample variance errors, as shown in Figure 12. This may not be surprising unless one considers that clustering results suggest an overdensity at the 2σ–3σ level in the COSMOS redshift bin z = 1 (Meneux et al. 2009). However, because our method fits the Drory et al. (2009) GSMFs across the entire redshift range, the excess at z = 1 is partially offset by an underdensity at z = 0.5. This demonstrates the robustness of our fitting method to the effects of sample variance except on the scale of the entire survey.

4.3.6. Satellite Treatment

Finally, we consider the changes in both the stellar mass fraction and the total stellar mass fraction (total stellar mass in central galaxy and all satellites/total halo mass) induced by different satellite evolution models (see Section 3.3.2 for details on the two models). As shown in Figure 10, fixing satellite stellar mass at the redshift of accretion (lines labeled as SMF_acc) has virtually no effect on either fraction as compared to allowing satellite stellar mass to evolve the same way as centrals with the same mass (labeled as SMF_now). Because comparison of different satellite evolution models requires tracking satellites through merger trees, Figure 10 shows results only for satellites in the L80G simulation.

**Figure 10.** Comparison between stellar mass fractions and total stellar mass fractions (labeled as Total M_*/*M_h*) derived by assuming different matching epochs for satellite galaxies. The L80G simulation was used here in order to follow the accretion histories of the subhalos. The relations terminate at high masses where the halo statistics become unreliable due to finite-volume effects.
Download figure:
Standard image High-resolution image

The treatment of satellites may have a somewhat larger impact on the total stellar mass fraction, including the stellar mass of both central and satellite galaxies within a halo. This is shown for both models in Figure 10. Because of the steep falloff in stellar mass for low-mass galaxies, the total stellar mass fraction has only minimal contribution from satellites for low-mass halos and deviates significantly from the stellar mass fraction for centrals only at halo masses M_h>10^12.5 M_☉. At cluster-scale masses (M_h ∼ 10¹⁴ M_☉), accreted satellites have on average a higher ratio of stars to dark matter than the central galaxy, and the total stellar mass fraction can be many times the central stellar mass fraction. However, the impact of the two models for satellite treatment on this ratio is small. Profiles of satellite galaxies in clusters should be able to distinguish better between such models.

4.3.7. Summary of Most Important Uncertainties

Systematic stellar mass offsets resulting from modeling choices result in the single largest source of uncertainties (∼0.25 dex at all redshifts). The contribution from all other sources of error is much smaller, ranging from 0.02 to 0.12 dex at z = 0 and from 0.07 to 0.16 dex at z = 1. On the other hand, this statement is only true when all contributing sources of scatter in stellar masses are considered. Models that do not account for scatter in stellar mass at fixed halo mass will overpredict stellar masses in 10^14.25 M_☉ halos by 0.13–0.19 dex, depending on the redshift. Models that do not account for scatter in calculated stellar mass at fixed true stellar mass will overpredict stellar masses in 10^14.25 M_☉ halos by 0.12 dex at z = 1. Hence, it is important to take both these effects into account when considering the SM–HM connection either at high masses or at high redshifts.

4.4. Comparison with Other Work

A comparison of our results with several results in the literature at z ∼ 0.1 is shown in Figure 11. Such comparison is not always straightforward, as other papers have often made different assumptions for the cosmological model, the definition of halo mass, or the measurement of stellar mass. In addition, some papers report the average stellar mass at a given halo mass (as we do), and others report the average halo mass at a given stellar mass. Given the scatter in stellar mass at fixed halo mass, the averaging method can affect the resulting stellar mass fractions, particularly for group- and cluster-scale halo masses. To facilitate comparison with both approaches, we plot our main results (labeled as 〈M_*/M_h|M_h〉) along with results for which the stellar mass fractions have been averaged at a given stellar mass (labeled as 〈M_*/M_h|M_*〉). In the comparisons below, we have not adjusted the assumptions used to derive stellar masses, because such adjustments can be complex and difficult to apply using simple conversions. Additionally, we have only corrected for differences in the underlying cosmology for those papers using a variant of abundance matching method (Moster et al. 2010; Guo et al. 2010; Wang & Jing 2010; Conroy et al. 2009) using the process described in Appendix A, as alternate methods require corrections which are much more complicated. We have, however, adjusted the IMF of all quoted stellar masses to that of Chabrier (2003), and we have converted all quoted halo masses to virial masses as defined in Section 3.2.2.

The closest comparison with our work, using a very similar method, is the result from Moster et al. (2010). This result is in excellent agreement with ours at the high-mass end and is within our systematic errors for all masses considered. However, their less flexible choice of functional form and their use of a different stellar mass function (estimated from spectroscopy using the results of Panter et al. 2007) result in a different value for the halo mass M_peak with peak stellar mass fraction and a shallower scaling of stellar mass with halo mass at the low-mass end. Their error estimates only account for statistical variations in galaxy number counts, and they do not include sample variance or variations in modeling assumptions. Guo et al. (2010) use a similar approach to Moster et al. (2010), using stellar masses from Li & White (2009), but they do not account for scatter in stellar mass at fixed halo mass. Consequently, their results match ours for 10¹² M_☉ and less-massive halos, but overpredict the stellar mass for larger halos.

Wang & Jing (2010) use a parameterization for the SM–HM relation for both satellites and centrals, and they attempt to simultaneously fit both the stellar mass function and the clustering constraints, including the effects of scatter in stellar mass at fixed halo mass. At z ∼ 0.1, their data source matches ours (Li & White 2009), but their approach finds a best-fit scatter in stellar mass at fixed halo mass of ξ = 0.2 dex, essentially the highest value allowed by the stellar mass function (Guo et al. 2010). As this is higher than our best-fit value for ξ, their SM–HM relation falls below ours for high-mass galaxies. Possibly because of the limited flexibility of their fitting form (they use only a four-parameter double power law), their SM–HM relation is in excess of ours for halo masses near 10¹² M_☉.

Zheng et al. (2007) used the galaxy clustering for luminosity-selected samples in the SDSS to constrain the halo occupation distribution (HOD). This gives a direct constraint on the r-band luminosity of central galaxies as a function of halo mass. Stellar masses for this sample were determined using the g − r color and the r-band luminosity as given by the Bell et al. (2003) relation, and a WMAP1 cosmology was assumed. This method allows for scatter in the luminosity at fixed halo mass to be constrained as a parameter in the model; results for this scatter are consistent with More et al. (2009), although they are less well constrained. According to Li & White (2009), stellar masses for the Bell et al. (2003) relation are systematically larger than those calculated using Blanton & Roweis (2007) by 0.1–0.3 dex. However, as Ω_m in WMAP1 is larger than in WMAP5, halo masses in WMAP1 will be higher at a given number density than in WMAP5, somewhat compensating for the higher stellar masses.

We next compare to constraints from direct measurements of halo masses from dynamics or gravitational lensing. Mandelbaum et al. (2006) have used weak lensing to measure the galaxy–mass correlation function for SDSS galaxies and derive a mean halo mass as a function of stellar mass. Mandelbaum et al. (2006) assume a WMAP1 cosmology and uses spectroscopic stellar masses, calculated per Kauffmann et al. (2003). A. Klypin et al. (2010, in preparation) have derived the mean halo mass as a function of stellar mass using satellite dynamics of SDSS galaxies (also see Prada et al. 2003; van den Bosch et al. 2004; Conroy et al. 2007). Their results are generally within our systematic errors but lower than others at the lowest masses and with a somewhat different shape. This may be due to selection effects, as their work uses only isolated galaxies, which may have somewhat lower average stellar masses. Gavazzi et al. (2007) use a set of strong lenses from the SLACS survey along with a model for simultaneously fitting the stellar and dark matter components of the stacked lens profiles. This result, at one mass scale, is a bit higher than our error range but within 1.5σ. The selection effects relevant to strong lenses are beyond the scope of this paper; however, within the effective radius, the stellar mass can easily contribute more to the lensing effect than the dark matter. Thus, at any given halo mass, the halos with less-massive galaxies are much less likely to be strong lenses, resulting in a bias toward higher stellar mass fractions in strong lenses as compared to halos selected at random.

At the high-mass end, one can directly identify clusters and groups corresponding to dark matter halos, and measure the stellar masses of their central galaxies. Yang et al. (2009a) use a group catalog matched to halos to determine halo masses (via an iteratively computed group luminosity–mass relation). Stellar masses in this work are determined using the Bell et al. (2003) relation between g−r color and M/L; a WMAP3 cosmology was assumed. Their results agree very well with ours for low-mass halos, but they begin to differ at higher masses. This may be partially due to scatter between their calculated halo masses (based on total stellar mass in the groups) and the true halo masses, resulting in additional scatter in their stellar masses at fixed halo mass. It could also be due to differences in stellar modeling; their results remain at all times within our systematic errors. We also compare to direct measurements of massive clusters by Hansen et al. (2009) and Lin & Mohr (2004). In order to convert luminosities to stellar masses, we assume M/L_i0.25 = 3.3 M_☉/L_☉,i0.25 and M/L_K = 0.83 M_☉/L_☉,K based on the population synthesis code of Conroy et al. (2009). These measurements are both somewhat higher than our results for massive clusters; the 1σ error estimates overlap. The discrepancies may be due to issues with cluster selection and with modeling scatter in the mass–observable relation; in each case the cluster mass is an average mass for the given observable (X-ray luminosity or cluster richness), and can result in a bias if central galaxies are correlated with this observable. More detailed modeling of the scatter and correlations will be required to determine whether this can account for the offsets.

A comparison of our results to others at z ∼ 1 is shown in Figure 12. As may be expected, it is much harder to directly measure the SM–HM relation at higher redshifts, resulting in relatively fewer published results with which we may compare. We first note that we have compared the impact of two independent measurements of the GSMF from different surveys. As discussed in Section 4.3.5, because we simultaneously fit our model with linear evolution to the GSMF at redshifts 0 < z < 1, our results are less sensitive to sample variance. In contrast to the conclusion of Drory et al. (2009), which fit their results to specific redshift bins, Figure 12 shows that the Drory et al. (2009) and Pérez-González et al. (2008) results are in agreement within statistical errors at z ∼ 1 when fitting the full redshift range. The results in Moster et al. (2010) are also very similar to ours at z ∼ 1. However, Moster et al. (2010) do not give fits for the SM–HM relation which include the effects of scatter in stellar mass at fixed halo mass except at z ∼ 0 and do not in general include scatter in measured stellar mass with respect to the true stellar mass. Hence, we include for comparison an SM–HM relation derived using our analysis but excluding both of these effects. The remaining deviance most likely stems from sample variance due to the much smaller survey volume on which their fit is based.

At z ∼ 1, Wang & Jing (2010) make use of clustering data and stellar mass functions from the VVDS survey (Meneux et al. 2008; Pozzetti et al. 2007) at z ∼ 0.8. At the same time, the high-mass and low-mass slopes of their power-law relation are not refit for the higher redshift, leading to similar deviations from our results as at z ∼ 0.1. The results in Zheng et al. (2007) lie slightly below our results at z ∼ 1. This is partially due to their use of a WMAP1 cosmology. However, it is difficult to say exactly how much, as their stellar masses at z ∼ 1 are a hybrid of K_s-derived masses from Bundy et al. (2006) and color-based masses derived in a manner analogous to Bell et al. (2003). Nonetheless, they remain well within our systematic error bars.

We also include a comparison to the z = 1 SM–HM relation presented in Conroy & Wechsler (2009), who also use an abundance matching technique to assign galaxies to halos. Unique to the work of Conroy & Wechsler (2009) is their attempt to jointly fit both the redshift-dependent stellar mass functions and the redshift-dependent SFR–stellar mass relations. In their model, halo growth is tracked through time using results derived from halo merger trees, allowing galaxies to be identified across epochs. The evolution of halos in conjunction with standard abundance matching provides model predictions for SFRs. The SM–HM relation from Conroy & Wechsler (2009) lies slightly above our best-fit relation, and it is within statistical error bars, except at the very highest halo masses. This is due to the different assumptions made about the GSMF in the two works and also to the absence of corrections for the scatter in stellar mass at fixed halo mass in Conroy & Wechsler (2009).

Finally, the strong lensing survey of Gavazzi et al. (2007) has been extended by Lagattuta et al. (2009) out to z ∼ 0.9 using lenses observed in the CASTLES program, as well as in COSMOS and in the EGS. While the same caveats about selection effects apply as for lower redshifts, Lagattuta et al. (2009) find that the evolution in the stellar mass fraction for M_h ∼ 10^13.5 M_☉ halos is within 0–0.3 dex greater at z ∼ 0.9 as compared to z ∼ 0, consistent with our limits of 0–0.15 dex for the allowed evolution over that redshift interval.

5. RESULTS BEYOND z = 1

5.1. Methodology and Data Limitations

As discussed in Section 2.1.2, published results for the GSMF beyond z = 1 suffer from the important caveat that integrated SFRs are inconsistent with GSMFs when both sets of observations are taken at face value with a constant IMF. Nonetheless, one may use similar methodology as in Section 3 to derive SM–HM relations at higher redshifts under the assumption that the observed stellar mass functions are correct. Here, we assume the GSMFs of Marchesini et al. (2009), which cover a redshift range of 1.3 < z < 4.

As there is no guarantee that the evolution of the SM–HM relation at high redshifts will have the same form as its evolution at low redshifts, we re-examine the assumptions affecting our evolution parameterization in Equation (23). As with all current high-redshift data, the results in Marchesini et al. (2009) are limited to luminous (massive) galaxies, so little information about the value of β (the faint-end slope of the galaxy–halo mass relation) is available. Hence, we continue to assume a linear functional form for its evolution; as the value of β evolves linearly with scale factor (a) in our fit, this means that it is largely constrained to be consistent with the evolution at lower redshifts (1 < z < 2). Naturally, if the evolution of β at high redshifts is significantly different than that for 1 < z < 2, then our error bars for β may underestimate the full uncertainties in the parameter.

Additionally, the systematics affecting high-mass galaxies at high redshifts are much more severe than for z < 1. Not only are the errors in stellar mass calculations significant (due to larger photometry errors, limited templates, etc.), but also any miscalibration in correcting photometric redshift errors will result in low-redshift galaxies masquerading as very bright high-redshift galaxies. These combined uncertainties result in poor constraints on high-mass galaxies. For that reason, we do not attempt to assume a more complicated functional form for the evolution of δ and γ, which means that, as before, their rates of evolution are largely constrained to be consistent with lower redshifts.

However, we find that individual fits at each redshift do suggest a possible evolution for the characteristic stellar mass which is nonlinear in the scale factor. Hence, we expand the form of the evolution of M_*,0 to include a quadratic dependence on scale factor:

$\begin{equation} M_{\ast,0}(a) = M_{\ast,0,0} + M_{\ast,0,a}(a - 1) + M_{\ast,0,a^2}(a-0.5)^2, \end{equation} \tag{ 25 }$

where (a − 0.5)² is used instead of (a − 1)² to minimize the degeneracy between M_*,0,a and $M_{\ast,0,a^2}$ . We parameterize the evolution of all other parameters as in Equation (23). All other methodology remains the same as for lower redshifts, as outlined in Section 3.6.

5.2. Results

To maintain some overlap with the z < 1 results, we evaluate the likelihood function for each SM–HM relation against GSMFs for 0.8 < z < 4. This data range includes two redshift bins (0.8–1.0, 1.0–1.3) from Pérez-González et al. (2008) and three redshift bins (1.3–2, 2–3, 3–4) from Marchesini et al. (2009). Including more redshift bins from Pérez-González et al. (2008) would improve the continuity of the fits to the low-redshift results; however, doing so would require a more complicated redshift parameterization than what we have assumed. The evolution of the best-fit stellar mass fraction for 0 < z < 4 is shown in Figure 13. All data points for Figure 13 are listed in Appendix D.

**Figure 13.** Evolution of the derived stellar mass fractions for central galaxies, from z = 4 to the present. The best-fit relations are shown only over the mass range where constraining data are available. At higher redshifts, certainty about the shape of the curves drops precipitously owing to a lack of constraining data beyond the knee of the stellar mass function. Combined systematic and statistical error bars are shown for three redshift bins only.
Download figure:
Standard image High-resolution image

As may be expected, uncertainties at high redshifts are substantially larger than at lower redshifts. The contribution of systematic errors in stellar masses to the error budget (0.25 dex) is still important, but it is no longer the only dominant factor. Statistical errors due to the comparatively small number of galaxy observations at high redshifts can contribute an equal uncertainty (up to 0.25 dex) to the derived SM–HM relation. The statistical errors are large not only for massive galaxies with low number counts, but also for halos below 10¹² M_☉, where magnitude limits on surveys make observations of the corresponding galaxies difficult.

The contribution from other sources of uncertainties (e.g., sample variance, cosmology uncertainties) is substantially smaller than the current statistical errors. The effects of sample variance on uncertainties at high redshift are less than for low redshifts because the volume probed in the high-redshift sample is 5 times larger than for the low-redshift sample (≈5 × 10⁶ Mpc³ versus 10⁶ Mpc³, respectively). While cosmology uncertainties are somewhat larger at high redshifts, their contribution to the overall levels of uncertainty are again much smaller than the statistical errors, as was true even by z = 1 for the low-redshift sample.

The statistical uncertainties at high redshifts mean that it is difficult to draw strong conclusions about the evolution of the SM–HM relation. However, the indication is that the mass corresponding to the peak efficiency for star formation evolves slowly, and is roughly a factor of 5 larger at z = 4. At all redshifts, the integrated star formation peaks at ∼10%–20% of the universal baryon fraction; the current data indicates that this value may start high at very high redshifts, shrink as halos grow faster than they form stars, and then start growing again after z = 2. However, with current uncertainties these results are tenuous. The single most effective way to reduce current uncertainties on both the SM–HM relation at individual redshifts and on its evolution is to conduct more high-redshift galaxy surveys, both to probe fainter galaxies to determine the shape of the GSMF and to get better statistics at the high-mass end.

6. DISCUSSION AND IMPLICATIONS

At z ∼ 0, the majority of published results are in accord within our full systematic error bars, regardless of the technique used. All reported results appear to be consistent with the principles necessary for abundance matching over a wide range of halo masses (10¹¹–10¹⁵ M_☉)—that each dark matter halo and subhalo above the masses we have considered host a galaxy with a reasonably tight relationship between their masses, and that average SM–HM relation increases monotonically with halo mass.

Because of the available statistics of halo and GSMFs, especially at z = 0, the technique of abundance matching offers the tightest constraints on the SM–HM relation currently available, and it is in agreement with results from a broad variety of additional techniques. Under the assumption that systematic errors in stellar mass calculations do not change substantially with redshift, abundance matching offers tight constraints on the evolution of the SM–HM relation from z = 1 to the present. These in turn will serve as important new tests for star formation prescriptions and recipes in both hydrodynamic simulations and semi-analytic models, as they will apply on the level of individual halos instead of on the simulated volume as a whole.

At the same time, abundance matching offers these constraints with a minimal number of parameters. The HOD technique requires modeling P(N|M_h), the probability distribution of the number of galaxies per halo as a function of halo mass, in several different luminosity bins. In the model proposed in Zheng et al. (2007), this results in 45 fitted parameters just to model the occupation at z = 0 (five parameters for nine luminosity bins). CLF modeling requires parameterizing a form for ϕ(L, M_h), the number density of galaxies as a function of luminosity and host halo mass, which results in approximately a dozen parameters to model occupation at z = 0 (Cooray 2006). Because of the additional constraints imposed by assuming that each halo hosts a galaxy, our approach uses fewer parameters. Abundance matching, as discussed in this paper, results in a model with only six independent parameters (five to empirically fit the derived SM–HM relation, and one to model the scatter in observed stellar masses at fixed halo mass) to describe the population of galaxies in halos.

The abundance matching approach to the SM–HM relation requires a few parameters in comparison to other methods because of the fairly small scatter (≈0.16 dex) between stellar mass and halo mass at high masses (the scatter has a negligible impact on the average SM–HM relation at lower masses), and the requirement that satellite galaxies live in satellite halos (subhalos). It may well be that a more complicated model must be adopted for satellites to quantitatively match the small-scale clustering observations (e.g., Wang et al. 2006). However, such changes will affect the clustering much more than the derived SM–HM relation, as suggested by the minimal changes in Figures 7 and 10 for mass scales (≲10^12.5 M_☉) where satellites are a non-negligible fraction of the total halo population.

The largest uncertainties in the SM–HM relation at z < 1 come from assumptions in converting galaxy luminosities into stellar masses, which amount to uncertainties on the order of 0.25 dex in the normalization of the relation. However, the systematic biases introduced by the combined sources of scatter between calculated stellar masses and halo masses can rise to equivalent significance for halos above 10^14.5 M_☉. Because the GSMF is monotonically decreasing, results which do not account for all sources of scatter in stellar mass will overpredict the average stellar mass in halos by 0.17–0.25 dex for these massive halos.

Using abundance matching to find confidence intervals for the SM–HM relation is an even more involved process, as each of the ways in which the systematics might vary must also be taken into account. While future work on constraining stellar masses will be the most valuable in terms of reducing uncertainties for the lowest redshift data, wider and deeper surveys and some resolution to the discrepancy between high-redshift cosmic star formation density and stellar mass functions must occur in order to improve constraints on the relation at high redshifts.

As mentioned in the Introduction, abundance matching may be used equally well to assign galaxy luminosities and colors to halos. In this case, the galaxy luminosity–halo mass relation may be derived using identical methodology to that presented in Section 3, with the exception that the systematics μ and κ may be neglected, leaving only σ(z) (effectively, the redshift scaling of photometry errors) and ξ (effectively, the scatter in luminosity at fixed halo mass). As these systematics are much better constrained than their stellar mass counterparts (simply as luminosities may be measured directly), this approach can yield very powerful constraints on the normalization as well as the evolution of the luminosity–mass relation. The higher accuracy possible compared to the stellar mass–mass relation will generally not remove uncertainties in comparing to galaxy formation models. Galaxy formation codes which calculate luminosities must include modeling for all the effects in Section 2.1.1, meaning that constraints on the underlying physics are subject to the same uncertainties. However, tighter constraints on the luminosity–mass relation will be nonetheless helpful for applications which are concerned with cosmological constraints from large luminosity-selected surveys.

7. CONCLUSIONS

We have performed an extensive exploration of the uncertainties relevant to determining the relationship between dark matter halos and galaxy stellar masses from the halo abundance matching technique. Errors related to the observed stellar mass function, the theoretical halo mass function, and the underlying technique of abundance matching are all considered. We focus on the mean stellar mass to halo mass ratio for central galaxies as a function of halo mass, and present results for this relationship at the present epoch and extending to z ∼ 4. We account separately for statistical errors and for systematic errors resulting from uncertainties in stellar mass estimation, and also investigate the relative contribution of various sources of error including cosmological uncertainty and the scatter between stellar mass and halo mass. An analytic model has been developed which can be used to constrain this connection in the absence of high-resolution simulations.

Our primary conclusions are as follows.

1.
The peak integrated star formation efficiency occurs at a halo mass near 10¹² M_☉, with a relatively low fraction—20% at z = 0—of baryons currently locked up in stars. This peak value declines to z ∼ 2 but remains between 10% and 20% for all redshifts between z=0 and 4. This implies that 30%–40% of baryons were converted into stars over the lifetime of a galaxy with current halo mass of 10¹² M_☉.
2.
At low masses, the SM–HM relation at z = 0 scales as M_* ∼ M^2.3_h. At high masses, around 10¹⁴ M_☉, stellar mass scales as M_* ∼ M^0.29_h. However, the high-mass scaling may not be a power law, as our model indicates that this slope decreases with increasing halo mass.
3.
Within statistical uncertainties, the stellar mass content of halos has increased by 0.3–0.45 dex for halos with mass less than 10¹² M_☉ since z ∼ 1. Systematic biases in stellar mass calculations between different redshifts could broaden the uncertainties in this number, but the conclusion that significant evolution has occurred for low-mass halos would remain robust. For halos with mass greater than 10¹³ M_☉, our best-fit results indicate more growth in halo masses than in stellar masses since z ∼ 1, but are consistent with no evolution in the stellar mass fractions over this time.
4.
Systematic, uniform offsets in the GSMF and its evolution are the dominant uncertainty in the SM–HM relation at low redshift. Statistical errors in the estimation of individual stellar masses impact the high-mass end of the GSMF, and at higher redshifts may result in an observed GSMF that deviates from a Schechter function.
5.
Current uncertainties in the underlying cosmological model are sub-dominant to the systematic errors, but are larger than other sources of statistical error for halos below M_h ∼ 10¹² M_☉ for low redshifts (z < 0.2).
6.
Given current constraints from other methods, uncertainty in the value of scatter between stellar mass and halo mass is important in the mean relation for masses above M_h ∼ 10^12.5 M_☉, although it is sub-dominant to systematic errors for all masses below M_h ∼ 10^14.5 M_☉.
7.
Other uncertainties in the galaxy–halo assignment, including different assumptions about the treatment of satellite galaxies, are sub-dominant when considering the mean relation for central galaxies.
8.
At higher redshifts (1 < z < 4), systematic uncertainties remain important, but statistical uncertainties reach equal significance. The shape of the relation is fairly unconstrained at z>2, where improved statistics and constraints on the GSMF below M_* are needed.

We have presented a best-fit galaxy SM–HM relation including an estimate of the total statistical and systematic errors using available data from z = 0–4, although caution should be used at redshifts higher than z ∼ 1. We also present an algorithm to generalize this relation for an arbitrary cosmological model or halo mass function. The fact that assignment errors are sub-dominant and scatter can be well constrained by other means gives increased confidence in using the simple abundance matching approach to constrain this relation. These results provide a powerful constraint on models of galaxy formation and evolution.

P.S.B. and R.H.W. received support from the U.S. Department of Energy under contract number DE-AC02-76SF00515 and from a Terman Fellowship at Stanford University. C.C. is supported by the Porter Ogden Jacobus Fellowship at Princeton University. We thank Michael Blanton, Niv Drory, Raphael Gavazzi, Qi Guo, Sarah Hansen, Anatoly Klypin, Cheng Li, Yen-Ting Lin, Pablo Pérez-González, Danilo Marchesini, Benjamin Moster, Lan Wang, Xiaohu Yang, Zheng Zheng, as well as their co-authors for the use of electronic versions of their data. We appreciate many helpful discussions and comments from Ivan Baldry, Michael Busha, Simon Driver, Niv Drory, Anatoly Klypin, Ari Maller, Danilo Marchesini, Phil Marshall, Pablo Pérez-González, Paolo Salucci, Jeremy Tinker, Frank van den Bosch, the Santa Cruz Galaxy Workshop, and the anonymous referee for this paper. The ART simulation (L80G) used here was run by Anatoly Klypin, and we thank him for allowing us access to these data. We are grateful to Michael Busha for providing the halo catalogs we used to estimate sample variance errors. These simulations were produced by the LasDamas project (http://lss.phy.vanderbilt.edu/lasdamas/); we thank the LasDamas collaboration for providing us with this data.

APPENDIX A: CONVERTING RESULTS TO OTHER HALO MASS FUNCTIONS

From Equation (14), it is possible to simply convert from our halo mass function ϕ_h to any halo mass function of choice (ϕ_h,r). In particular, the function M_h(M_*) is defined by the fact that the number density of halos with mass above M_h(M_*) must match the number density of galaxies with stellar mass above M_* (with the appropriate deconvolution steps applied). Recall from Equation (14) that

$\begin{equation} \int _{M_{h}(M_\ast)}^\infty \phi _{h}(M)d\log _{10} M = \int _{M_\ast }^\infty \phi _\mathrm{direct}(M_\ast)d\log _{10} M_\ast. \end{equation} \tag{ A1 }$

Naturally, the correct mass–stellar mass relation for the alternate halo mass function ϕ_h,r (which we will label as M_h,r(M_*)) must satisfy this same equation, with the result that

$\begin{equation} \int _{M_h(M_\ast)}^\infty \phi _{h}(M)d\log _{10} M = \int _{M_{h,r}(M_\ast)}^\infty \phi _{h,r}(M)d\log _{10} M. \end{equation} \tag{ A2 }$

To make the calculation even more explicit, let Φ_h(M) = ∫^∞_Mϕ_hdlog₁₀M be our cumulative halo mass function, and let Φ_h,r(M) be the corresponding cumulative halo mass function for ϕ_h,r. Then, we find

$\begin{equation} M_{h,r}(M_\ast) = \Phi _{h,r}^{-1}(\Phi _{h}(M_{h}(M_\ast))). \end{equation} \tag{ A3 }$

Mass functions from different cosmologies than those assumed in this paper will also require converting stellar masses if their choices of h differ from the WMAP5 best-fit value.

APPENDIX B: EFFECTS OF SCATTER ON THE STELLAR MASS FUNCTION

This section is intended to provide basic intuition for the effects of both ξ and σ(z), which may be modeled as convolutions. The classic example in this case is convolution of the GSMF with a log-normal distribution of some width ω. While the convolution (even of a Schechter function) with a Gaussian has no known analytical solution, we may approximate the result by considering the case where the logarithmic slope of the GSMF changes very little over the width of the Gaussian. Then, locally, the stellar mass function is proportional to a power function, say ϕ(M_*) ∝ (M_*)^α. Then, if we let x = log₁₀M_* (so that ϕ(10^x) ∝ 10^αx), finding the convolution is equivalent to calculating the following integral:

$\begin{eqnarray} \phi _\mathrm{conv}(10^x) &\propto & \int _{-\infty }^\infty \frac{10^{\alpha b}}{\sqrt{2\pi \omega ^2}} \exp \left(-\frac{(x-b)^2}{2\omega ^2}\right)db \nonumber \\ & = & 10^{\alpha x} \, 10^{\frac{1}{2}\alpha ^2 \omega ^2 \ln (10)}. \end{eqnarray} \tag{ B1 }$

That is to say, the stellar mass function is shifted upward by approximately 1.15(αω)² dex. Hence, for parts of the stellar mass function with shallow slopes, the shift is completely insignificant, as it is proportional to α². However, it matters much more in the steeper part of the stellar mass function to the point that for galaxies of mass 10¹² M_☉ the observed stellar mass function can be several orders of magnitude above the intrinsic GSMF.

APPENDIX C: A SAMPLE CALCULATION OF THE FUNCTIONAL FORM OF THE STELLAR MASS FUNCTION

Galaxy formation models typically assume at least two feedback mechanisms to limit star formation for low-mass galaxies and for high-mass galaxies. Thus, one of the simplest fiducial SFR as a function of halo mass (M_h) would assume a double power-law form:

$\begin{equation} \mbox{SFR}(M_h) \propto \left(\frac{M_h}{M_0}\right)^a + \left(\frac{M_h}{M_0}\right)^b. \end{equation} \tag{ C1 }$

We might expect the total stellar mass as a function of halo mass to take a similar form, except perhaps with a wider region of transition between galaxies whose histories are predominantly low-mass, and those with histories that are predominantly high-mass, for the reason that some galaxies' accretion histories may have caused them to be affected comparably by both feedback mechanisms.

Hence, assuming that the relation between halo mass and stellar mass follows a double power-law form, we adopt a simple functional form to convert from the stellar mass of a galaxy to the halo mass:

$\begin{equation} M_h(M_\ast) = M_1 \, \left[\left(\frac{M_\ast }{M_{\ast,0}}\right)^{\beta /\gamma } + \left(\frac{M_\ast }{M_{\ast,0}}\right)^{\delta /\gamma }\right]^\gamma. \end{equation} \tag{ C2 }$

Here, β may be thought of as the faint-end slope, δ as the massive-end slope (although β and δ are functionally interchangeable), and γ as the transition width (larger γ means a slower transition between the massive and faint-end slopes).

We first calculate $\frac{d\log (M_h)}{d\log (M_\ast)}$ :

$\begin{eqnarray} \log (M_h) & = & \log (M_1) + \gamma \log \left[\left(\frac{M_\ast }{M_{\ast,0}}\right)^{\beta /\gamma } + \left(\frac{M_\ast }{M_{\ast,0}}\right)^{\delta /\gamma }\right],\nonumber\\ \end{eqnarray} \tag{ C3 }$

$\begin{eqnarray} \frac{d\log (M_h)}{d\log (M_\ast)} & = & \frac{d\log (M_h)}{dM_\ast } \, \frac{dM_\ast }{d\log (M_\ast)}\nonumber\\ & = & M_\ast \, \ln (10) \, \frac{d\log (M_h)}{dM_\ast }\nonumber\\ & = & \frac{\beta \left(\frac{M_\ast }{M_{\ast,0}}\right)^{\beta /\gamma } + \delta \left(\frac{M_\ast }{M_{\ast,0}}\right)^{\delta /\gamma }}{\left(\frac{M_\ast }{M_{\ast,0}}\right)^{\beta /\gamma } + \left(\frac{M_\ast }{M_{\ast,0}}\right)^{\delta / \gamma }}\nonumber\\ & = & \beta + (\delta -\beta)\,\left(1+ \left(\frac{M_\ast }{M_{\ast,0}}\right)^\frac{\beta -\delta }{\gamma }\right)^{-1}. \end{eqnarray} \tag{ C4 }$

This justifies the earlier intuition that the functional form for M_h(M_*) transitions between slopes of β and δ with a width that increases with γ. Note that $\frac{d\log (M_h)}{d\log (M_\ast)}$ is always of order one, as the stellar mass is always assumed to increase with the halo mass and vice versa (namely, β>0 and δ>0).

Next, we approach $\frac{dN}{d\log {M_h}}$ . From analytical results, we expect a Schechter function for the halo mass function, namely

$\begin{equation} \frac{dN}{d\log (M_h)} = \phi _0 \, \ln (10) \, \left(\frac{M_h}{M_0}\right)^{1-\alpha }\, \exp \left(-\frac{M_h}{M_0}\right). \end{equation} \tag{ C5 }$

Substituting in the equation for M_h(M_*), we have

$\begin{eqnarray} \frac{dN}{d\log (M_h)} & = & \phi _0 \, \ln (10) \, \left[\left(\frac{M_\ast }{M_{\ast,0}}\right)^{\beta /\gamma } + \left(\frac{M_\ast }{M_{\ast,0}}\right)^{\delta /\gamma }\right]^{\gamma (1-\alpha)}\nonumber \\ &&\hspace{-10ex}\times \left(\frac{M_h}{M_0}\right)^{1-\alpha }\, \exp \left(-\frac{M_1}{M_0}\, \left[\left(\frac{M_\ast }{M_{\ast,0}}\right)^{\beta /\gamma } + \left(\frac{M_\ast }{M_{\ast,0}}\right)^{\delta /\gamma }\right]^{\gamma } \right).\nonumber\\ \end{eqnarray} \tag{ C6 }$

Already evident is the generic result that there will be separate faint-end and massive-end slopes in the stellar mass function, and that the falloff is not generically specified by an exponential. We may make one simplification in this model, namely, to note that M_h(M_*,0) corresponds to the halo mass at which the slope of M_h(M_*) begins to transit from β to δ. We expect this transition to correspond to the transition between supernova feedback and AGN feedback in semi-analytic models; namely, for a halo mass that is too large to be affected much by supernova feedback, but that is yet too small to host a large AGN. This implies that M_h(M_*,0) is expected to be around 10¹² M_☉ or less, meaning that M_h/M₀ is small until stellar masses well beyond M_*,0, meaning that we may neglect the faint-end slope of the M_h(M_*) relation in the exponential portion of the stellar mass function:

$\begin{eqnarray} \frac{dN}{d\log (M_h)} & = & \phi _0 \, \ln (10) \, \left(\frac{M_1}{M_0}\right)^{1-\alpha }\, \left[\left(\frac{M_\ast }{M_{\ast,0}}\right)^{\beta /\gamma } \right.\nonumber\\ &&\left. + \left(\frac{M_\ast }{M_{\ast,0}}\right)^{\delta /\gamma }\right]^{\gamma (1-\alpha)} \exp \left(-\frac{M_1}{M_0}\, \left(\frac{M_\ast }{M_{\ast,0}}\right)^{\delta } \right).\nonumber\\ \end{eqnarray} \tag{ C7 }$

Hence, we may combine these two equations to obtain the expression for the stellar mass function:

$\begin{eqnarray} \frac{dN}{d\log (M_\ast)} & = & \phi _0 \, \ln (10) \, \left(\frac{M_1}{M_0}\right)^{1-\alpha }\, \left[\left(\frac{M_\ast }{M_{\ast,0}}\right)^{\beta /\gamma }\right.\nonumber\\ &&\left. + \left(\frac{M_\ast }{M_{\ast,0}}\right)^{\delta /\gamma }\right]^{\gamma (1-\alpha)}\exp \left(-\frac{M_1}{M_0}\, \left(\frac{M_\ast }{M_{\ast,0}}\right)^{\delta } \right) \nonumber \\ && \times \left[ \beta + (\delta -\beta)\,\left(1+ \left(\frac{M_\ast }{M_{\ast,0}}\right)^\frac{\beta -\delta }{\gamma }\right)^{-1}\right].\nonumber\\ \end{eqnarray} \tag{ C8 }$

While this seems complicated, it may be intuitively deconstructed as

$\begin{eqnarray} \frac{dN}{d\log (M_\ast)} & = & [\mathrm{constant}] \, [\hbox{double power law}] \nonumber \\ && \times [\hbox{exponential drop-off}] \, \mathcal {O}(1). \end{eqnarray} \tag{ C9 }$

As mentioned previously, this functional form is equivalent to ϕ_direct. To convert to the true stellar mass function ϕ_true or the observed stellar mass function ϕ_meas, it must be convolved with the scatter in stellar mass at fixed halo mass and (for ϕ_meas) the scatter in calculated stellar mass at fixed true stellar mass. As such, it should be clear that—while the final form may be Schechter-like—there is certainly much more flexibility in the final shape of the GSMF than a Schechter function alone would allow, as evidenced by the five parameters required to fully specify Equation (C8).

APPENDIX D: DATA TABLES

We reproduce here listings of the data points in Figures 5, 6, and 13 in Tables 3–5, respectively. See Sections 4.2 and 5.2 for details on the data points in each table.

Table 3. Stellar Mass Fractions for 0 < z < 1 Including Full Systematics

log₁₀(M_h)	z = 0.1	z = 0.5	z = 1.0
	log₁₀(M_/M_h*)	log₁₀(M_/M_h*)	log₁₀(M_/M_h*)
11.00	−2.30^+0.26_−0.23
11.25	−1.96^+0.25_−0.23	−2.11^+0.22_−0.26
11.50	−1.67^+0.24_−0.24	−1.84^+0.22_−0.26	−2.01^+0.25_−0.24
11.75	−1.53^+0.23_−0.24	−1.70^+0.24_−0.24	−1.85^+0.26_−0.23
12.00	−1.54^+0.24_−0.24	−1.68^+0.26_−0.23	−1.77^+0.26_−0.23
12.25	−1.62^+0.24_−0.24	−1.72^+0.26_−0.23	−1.77^+0.25_−0.23
12.50	−1.74^+0.24_−0.24	−1.81^+0.25_−0.23	−1.81^+0.24_−0.24
12.75	−1.87^+0.23_−0.25	−1.92^+0.25_−0.23	−1.89^+0.23_−0.26
13.00	−2.02^+0.23_−0.25	−2.05^+0.24_−0.24	−1.99^+0.22_−0.27
13.25	−2.18^+0.22_−0.26	−2.19^+0.24_−0.25	−2.11^+0.21_−0.28
13.50	−2.35^+0.22_−0.26	−2.34^+0.23_−0.25	−2.25^+0.21_−0.29
13.75	−2.52^+0.22_−0.27	−2.51^+0.23_−0.26	−2.39^+0.21_−0.30
14.00	−2.70^+0.21_−0.28	−2.68^+0.23_−0.26	−2.55^+0.22_−0.30
14.25	−2.88^+0.21_−0.28	−2.86^+0.23_−0.26
14.50	−3.07^+0.20_−0.29	−3.04^+0.23_−0.27
14.75	−3.26^+0.20_−0.30
15.00	−3.45^+0.20_−0.30

Notes. Halo masses are in units of M_☉. Constraints are quoted over the mass range probed by the observed GSMF.

Download table as: ASCII Typeset image

Table 4. Stellar Mass Fractions without Systematic Errors (μ = κ = 0)

log₁₀(M_h)	z = 0.1	z = 0.5	z = 1.0
	log₁₀(M_/M_h*)	log₁₀(M_/M_h*)	log₁₀(M_/M_h*)
11.00	−2.30^+0.03_−0.02
11.25	−1.96^+0.04_−0.01	−2.10^+0.04_−0.08
11.50	−1.67^+0.03_−0.01	−1.83^+0.05_−0.06	−2.02^+0.10_−0.06
11.75	−1.53^+0.01_−0.01	−1.71^+0.06_−0.03	−1.85^+0.10_−0.04
12.00	−1.54^+0.01_−0.02	−1.68^+0.06_−0.02	−1.78^+0.09_−0.03
12.25	−1.62^+0.00_−0.02	−1.72^+0.06_−0.01	−1.77^+0.08_−0.03
12.50	−1.74^+0.01_−0.02	−1.81^+0.05_−0.02	−1.81^+0.06_−0.04
12.75	−1.87^+0.01_−0.03	−1.92^+0.05_−0.02	−1.88^+0.04_−0.06
13.00	−2.02^+0.01_−0.03	−2.05^+0.04_−0.03	−1.98^+0.03_−0.08
13.25	−2.18^+0.02_−0.04	−2.19^+0.04_−0.04	−2.10^+0.04_−0.10
13.50	−2.35^+0.02_−0.05	−2.34^+0.04_−0.05	−2.24^+0.04_−0.13
13.75	−2.52^+0.03_−0.06	−2.51^+0.05_−0.06	−2.38^+0.05_−0.15
14.00	−2.70^+0.03_−0.07	−2.68^+0.05_−0.07	−2.54^+0.06_−0.17
14.25	−2.88^+0.04_−0.08	−2.85^+0.06_−0.08
14.50	−3.07^+0.04_−0.09	−3.04^+0.06_−0.10
14.75	−3.25^+0.05_−0.10
15.00	−3.45^+0.05_−0.11

Note. Halo masses are in units of M_☉.

Download table as: ASCII Typeset image

Table 5. Stellar Mass Fractions for 0.8 < z < 4 Including Full Systematics

log₁₀(M_h)	z = 1.0	z = 2.0	z = 3.0	z = 4.0
	log₁₀(M_/M_h*)	log₁₀(M_/M_h*)	log₁₀(M_/M_h*)	log₁₀(M_/M_h*)
11.50	−2.01^+0.25_−0.24
11.75	−1.85^+0.26_−0.23
12.00	−1.77^+0.26_−0.23	−1.89^+0.22_−0.27	−1.89^+0.25_−0.27
12.25	−1.77^+0.25_−0.23	−1.76^+0.24_−0.25	−1.67^+0.21_−0.29	−1.58^+0.22_−0.32
12.50	−1.81^+0.24_−0.24	−1.78^+0.23_−0.26	−1.63^+0.21_−0.30	−1.50^+0.20_−0.35
12.75	−1.89^+0.23_−0.26	−1.87^+0.20_−0.30	−1.71^+0.19_−0.35	−1.56^+0.19_−0.40
13.00	−1.99^+0.22_−0.27	−2.00^+0.17_−0.35	−1.83^+0.17_−0.39	−1.68^+0.19_−0.44
13.25	−2.11^+0.21_−0.28	−2.14^+0.15_−0.39	−1.97^+0.16_−0.44	−1.82^+0.19_−0.49
13.50	−2.25^+0.21_−0.29	−2.30^+0.15_−0.42	−2.13^+0.17_−0.47	−1.98^+0.19_−0.52
13.75	−2.39^+0.21_−0.30	−2.47^+0.17_−0.45	−2.30^+0.19_−0.49
14.00	−2.55^+0.22_−0.30	−2.64^+0.20_−0.47

Note. Halo masses are in units of M_☉.

Download table as: ASCII Typeset image

A COMPREHENSIVE ANALYSIS OF UNCERTAINTIES AFFECTING THE STELLAR MASS–HALO MASS RELATION FOR 0 < z < 4

Article metrics

Permissions

Share this article

Author affiliations

Dates

ABSTRACT

1. INTRODUCTION

2. UNCERTAINTIES AFFECTING THE SM–HM RELATION

2.1. Uncertainties in the Stellar Mass Function

2.1.1. Principal Uncertainties in Stellar Mass Functions

2.1.2. Additional Systematics at z>1

2.2. Uncertainties in the Halo Mass Function

2.3. Uncertainties in Abundance Matching

3. METHODOLOGY

3.1. Modeling Stellar Mass Function Uncertainties

3.1.1. Modeling Systematic Shifts in Stellar Mass Estimates

3.1.2. Modeling Statistical Errors in Individual Stellar Mass Measurements

3.2. Halo Mass Functions

3.2.1. Simulations

3.2.2. Analytic Mass Functions

3.2.3. Modeling Uncertainties in Cosmological Parameters

3.2.4. Estimating Sample Variance Effects for the Stellar Mass Function

3.3. Uncertainties in Abundance Matching

3.3.1. Scatter in Stellar Mass at Fixed Halo Mass

3.3.2. The Treatment of Satellites

3.4. Functional Forms for the SM–HM Relation

3.4.1. Parameterizing the Stellar Mass Function

3.4.2. Deconvolving the Observed Stellar Mass Function

3.4.3. Fitting the Deconvolved Relation

3.5. Calculating Model Likelihoods

3.6. Methodology Summary

4. RESULTS FOR 0 < z < 1

4.1. GSMFs

4.2. The Best-fit SM–HM Relations

4.3. Impact of Uncertainties

4.3.1. Systematic Shifts in Stellar Mass Calculations

4.3.2. Scatter in Stellar Mass at Fixed Halo Mass

4.3.3. Statistical Errors in Stellar Mass Calculations

4.3.4. Cosmology Uncertainties

4.3.5. Sample Variance

4.3.6. Satellite Treatment

4.3.7. Summary of Most Important Uncertainties

4.4. Comparison with Other Work

5. RESULTS BEYOND z = 1

5.1. Methodology and Data Limitations

5.2. Results

6. DISCUSSION AND IMPLICATIONS

7. CONCLUSIONS

APPENDIX A: CONVERTING RESULTS TO OTHER HALO MASS FUNCTIONS

APPENDIX B: EFFECTS OF SCATTER ON THE STELLAR MASS FUNCTION

APPENDIX C: A SAMPLE CALCULATION OF THE FUNCTIONAL FORM OF THE STELLAR MASS FUNCTION

APPENDIX D: DATA TABLES

Footnotes