THE SPATIAL STRUCTURE OF MONO-ABUNDANCE SUB-POPULATIONS OF THE MILKY WAY DISK

Jo Bovy; Hans-Walter Rix; Chao Liu; David W. Hogg; Timothy C. Beers; Young Sun Lee

doi:10.1088/0004-637X/753/2/148

1. INTRODUCTION

The formation of galactic disks is a long-standing problem in galaxy formation. In numerical simulations, disks form through gas dissipation (Sandage et al. 1970; Larson 1976), and the formation of the outer regions of the disk happens on longer timescales than the inner disk (Gott & Thuan 1976; Larson 1976; Katz & Gunn 1991). The disks that form have exponential density profiles (Lake & Carlberg 1988), possibly due to the detailed conservation of angular momentum of an initially spherical cloud in solid-body rotation (Fall & Efstathiou 1980; Gunn 1982). Yet, direct observational evidence for this picture is scant (e.g., Dalcanton et al. 1997; Somerville et al. 2008; Wang et al. 2011), and forming realistic disks within the ΛCDM paradigm remains challenging (e.g., Abadi et al. 2003; Scannapieco et al. 2009; Guedes et al. 2011).

Central to the question as to how galactic disks form and evolve is the existence of "thick" disk components. First discovered in external galaxies (Tsikoudi 1979; Burstein 1979; van der Kruit & Searle 1981), thick-disk components represent excess light or stars beyond the canonical thin disk's exponential vertical profile. The Milky Way's thick-disk component (Yoshii 1982; Gilmore & Reid 1983; Reid & Majewski 1993; Majewski 1993; Jurić et al. 2008) provides us with a detailed look at this common galactic component. Generally, thick-disk components are found to be old (Bensby et al. 2005; Yoachim & Dalcanton 2008), kinematically hot (Chiba & Beers 2000; Soubiran et al. 2003; Gilmore et al. 2002; Yoachim & Dalcanton 2005), and metal-poor (compared to the thin-disk components), as well as enhanced in α-elements (Fuhrmann 1998; Prochaska et al. 2000; Tautvaišienė et al. 2001; Bensby et al. 2003, 2005; Feltzing et al. 2003; Mishenina et al. 2004; Reddy et al. 2006; Haywood 2008). Density decompositions of the stellar disk into thinner and thicker components of external galaxies and the Milky Way have found that thicker-disk components have larger scale heights and longer scale lengths than their corresponding thin disk (Robin et al. 1996; Buser et al. 1999; Chen et al. 2001; Ojha 2001; Neeser et al. 2002; Larsen & Humphreys 2003; Yoachim & Dalcanton 2006; Pohlen et al. 2007; Jurić et al. 2008). However, these decompositions are purely geometric, and do not take kinematics or abundance information into account when assigning thinner- or thicker-disk membership.

A number of qualitatively very different models have been proposed for the formation of thick-disk components. External mechanisms, such as the direct accretion of stars from a disrupted satellite galaxy (Abadi et al. 2003) or the heating of a pre-existing thin disk through a minor merger (Quinn et al. 1993; Wyse et al. 2006; Kazantzidis et al. 2008; Villalobos & Helmi 2008; Moster et al. 2010), can explain many of the observed properties of thick-disk components. Thick-disk components can also be formed internally through star formation following a gas-rich merger (e.g., Brook et al. 2004) or by quiescent internal dynamical evolution (Schönrich & Binney 2009a, 2009b; Loebman et al. 2011).

The idea that the thick-disk component could arise in good part through internal evolution is an intriguing possibility, as it explains a range of other observations. Significant redistribution of angular momentum without radially heating the disk ("radial migration") happens naturally if spiral structure is transient (Sellwood & Binney 2002; Roškar et al. 2008, 2011), and has also been shown to occur through bar–spiral structure interactions (Minchev & Famaey 2010; Minchev et al. 2011). It can also be induced by an orbiting satellite (Quillen et al. 2009; Bird et al. 2012). The transient nature of spiral structure is favored both theoretically (e.g., Sellwood & Carlberg 1984; Carlberg & Sellwood 1985; Sellwood & Lin 1989) and observationally from surveys of the solar neighborhood (e.g., Dehnen 1998; De Simone et al. 2004; Bovy et al. 2009b; Bovy & Hogg 2010; Sellwood 2011) and of external galaxies (e.g., Meidt et al. 2009; Foyle et al. 2011). Radial migration naturally explains the flatness and spread in the age–metallicity relation in the solar neighborhood, as the large-scale changes in the guiding radii of stars tend to flatten radial-abundance gradients. A thicker-disk component arises through radial migration when stars from the inner Galaxy migrate outward, where the gravitational attraction toward the mid-plane is smaller, such that they reach larger heights above the plane. However, to date, radial-migration models have essentially only been confronted with data at the solar radius, and observational tests to discriminate formation scenarios for thicker-disk components have not been conclusive (e.g., Dierickx et al. 2010).

Because radial migration is effectively a diffusion process, it complicates, if not erases, the link between the present-day chemo-orbital distribution and the orbital characteristics and abundance distribution at the time of a given star's birth. Without detailed modeling of the episodes of transient spiral structure (or the equivalent in other radial-migration scenarios), reconstructing the radial and azimuthal actions is also problematic. However, the vertical action is an adiabatic invariant during the slow change in the vertical potential that ensues from this migration. The overall (mass-weighted) radial structure of the disk is left relatively unchanged as radial migration proceeds (Sellwood & Binney 2002; Minchev et al. 2011)—essentially because any redistribution of the surface-mass density would provide energy to heat the disk, and to avoid this heating the overall surface-mass density profile needs to be conserved—but if different (age or abundance) components of the disk have a different initial structure, radial migration will work to bring the spatial distributions of different populations closer to the mean.

In this paper we implement for the first time an alternative approach to globally "dissecting" the Milky Way's stellar disk: we study the overall (vertical and radial) spatial structure of large samples of stars selected to be sub-populations in the elemental-abundance space spanned by metallicity [Fe/H] and α-abundance [α/Fe], as it is becoming increasingly clear that a characterization of the thicker-disk components based only on stellar abundances is superior to kinematic definitions (Navarro et al. 2011; Lee et al. 2011b). The [α/Fe] ratio in particular is a crucial parameter, as it can be used as a relative age indicator (Wyse & Gilmore 1988). At early times, the low-metallicity interstellar medium is enriched by Type II supernovae (SNeII). After about 2–3 Gyr, Type Ia SNe occur (e.g., Maoz et al. 2011), and the stellar yields shift toward Fe, leading to a decreasing [α/Fe] with time. Therefore, populations of stars with enhanced [α/Fe] ratios are chemically older than those with [α/Fe] closer to the solar ratio. By using the SDSS/SEGUE G-dwarf sample, we observe stars globally across the Milky Way, constraining their vertical distributions from 300 pc to 4 kpc from the mid-plane, and their radial densities from Galactocentric radii ranging from 5 to 12 kpc. We show that the scale length of the α-enhanced (and thus probably oldest) population is much shorter than that of the chemically more evolved stars with solar [α/Fe]. This is opposite to previous disk decompositions into thicker and thinner components that make use of geometric information alone (e.g., Jurić et al. 2008, and see above). Also, we do not detect any discontinuity in the vertical scale height as a function of [α/Fe] that might be expected if the thick-disk component was formed through a singular external or internal event, but instead observe a continuous increase in scale height with [α/Fe]. This casts doubt on how sensible or useful it is to think of distinct thin- and thick-disk components in the Milky Way.

The outline of this paper is as follows. In Section 2 we present the details of our data sample. Our density-fit methodology, accounting for the various aspects of the SEGUE selection function, is given in Section 3. We give the results of the density fits to the various abundance-selected samples in Section 4, and discuss these results in terms of disk formation and evolution models in Section 5. We summarize the main conclusions of the paper in Section 6. The appendices describe our model for the SEGUE selection function, some details of our fitting methodology, and detailed comparisons between our fits and the data. Modeling the spectroscopic SEGUE selection function is central to our analysis. It is described in Appendix A to aid the readability of the paper, as its implementation requires a detailed and hence extensive description that may not be of interest to all readers. Throughout this paper, we assume that the Sun's displacement from the mid-plane is 25 pc toward the north Galactic pole (Chen et al. 2001; Jurić et al. 2008), and that the Sun is located at 8 kpc from the Galactic center (e.g., Bovy et al. 2009a).

2. DATA

The Sloan Digital Sky Survey (SDSS; York et al. 2000) has obtained u, g, r, i, and z CCD imaging of ≈10⁴ deg² of the northern and southern Galactic sky (Gunn et al. 1998, 2006; Stoughton et al. 2002). All the data processing, including astrometry (Pier et al. 2003), source identification, deblending, and photometry (Lupton et al. 2001), calibration (Fukugita et al. 1996; Hogg et al. 2001; Smith et al. 2002; Ivezić et al. 2004; Padmanabhan et al. 2008), and spectroscopic fiber placement (Blanton et al. 2003) are performed with automated SDSS software. The SDSS spectroscopic survey uses two fiber-fed spectrographs that have 320 fibers each.

The Sloan Extension for Galactic Understanding and Exploration (SEGUE; Yanny et al. 2009) is a low-resolution (R ≈ 2000) spectroscopic sub-survey of the SDSS focused on Galactic science. We select a sample of G-type dwarfs from the SDSS/SEGUE Data Release 7 (DR7; Abazajian et al. 2009). G-type dwarfs are the most luminous tracers whose main-sequence lifetime is larger than the expected disk age at basically all metallicities. G-type stars are selected from the full DR7 SEGUE sample using a simple color–magnitude cut that corresponds to the SEGUE G-star target type: 0.48 ⩽ g − r ⩽ 0.55 and r < 20.2. All magnitudes and colors here and in what follows are absorption corrected and dereddened, respectively, using the reddening maps of Schlegel et al. (1998); as we only use lines of sight with relatively small extinction and we do not use the SDSS u band, using the improved reddening maps of Schlafly & Finkbeiner (2011) leads to insignificant differences for the purpose of our analysis. We further limit the spectroscopic sample to those lines of sight with E(B − V) < 0.3, to minimize effects due to uncertainty in extinction, to objects having spectra with signal-to-noise ratio (S/N) > 15, and to objects with valid metallicities, heliocentric line-of-sight velocities, and proper motions (even though the latter two are not used in the analysis). All of the selected objects have valid values for their stellar atmospheric parameters as determined by the SEGUE Stellar Parameter Pipeline (Lee et al. 2008a, 2008b, 2011a; Allende Prieto et al. 2008; Smolinski et al. 2011). Typical uncertainties in these parameters are 0.2 dex for the spectroscopic metallicity [Fe/H], 0.1 dex for [α/Fe], 0.25 dex for the surface gravity log g, and 180 K for the effective temperature (Schlesinger et al. 2010; Smolinski et al. 2011). In what follows we are primarily interested in the relative rankings of stars based on [α/Fe] and [Fe/H], such that random uncertainties are all that matter. Note that our signal-to-noise ratio cut of S/N > 15 is more inclusive than recommended by Lee et al. (2011a) (who recommend S/N > 20), but this does not increase the uncertainties in [α/Fe] by much. For dwarfs with [Fe/H] > −2, there is no significant correlation between the [Fe/H] and [α/Fe] estimates. We use this sample of G-type stars to determine the SEGUE G-star selection function in Appendix A below.

We select G-type dwarfs by requiring log g > 4.2, to eliminate giant stars. We perform no other cuts (e.g., other color cuts or distance cuts) beyond these basic cuts in order to preserve a relatively simple spatial selection function. This sample contains about 28,000 stars, 23,767 of which lie within the well-populated bins in the ([Fe/H], [α/Fe]) plane that we analyze below. Distances to individual stars are obtained from the Ivezić et al. (2008) photometric color–metallicity absolute magnitude relation (their Equation (A7)) applied to the g − r color, rather than the g − i color, using

$\begin{equation} r-i = \frac{(g-r-0.12)}{2.34}\,, \end{equation} \tag{ 1 }$

and employing the spectroscopic metallicity. These distances are about 10% larger than the distances obtained from the An et al. (2009) stellar isochrones, with little to no color or metallicity dependence over the color and metallicity ranges considered here (see Figure 1 and further discussion in Section 5). Individual distance uncertainties are typically ≲10%, and thus do not greatly smooth the underlying Galactic density, whose scales are much larger than this (for an illustration of this see Jurić et al. 2008, where much larger distance uncertainties of around 20% were shown to influence the inferred scale heights by less than 5%).
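The distance recipe described above can be sketched in a few lines. This is a minimal illustration, not the authors' code: the function names are hypothetical, and the absolute-magnitude relation of Ivezić et al. (2008, their Equation (A7)) is deliberately not reproduced here; it is passed in as a user-supplied callable `abs_mag_relation(g_i, feh)`. Only Equation (1) and the standard distance-modulus formula are implemented.

```python
import numpy as np

def r_minus_i_from_g_minus_r(g_r):
    """Equation (1): map a dereddened g-r color onto r-i."""
    return (np.asarray(g_r, dtype=float) - 0.12) / 2.34

def g_minus_i_from_g_minus_r(g_r):
    """g-i = (g-r) + (r-i), with r-i taken from Equation (1)."""
    g_r = np.asarray(g_r, dtype=float)
    return g_r + r_minus_i_from_g_minus_r(g_r)

def photometric_distance_pc(r_mag, g_r, feh, abs_mag_relation):
    """Distance in pc from the distance modulus r - M_r.

    abs_mag_relation(g_i, feh) must return M_r; it stands in for the
    published color-metallicity relation and is an assumption here.
    """
    g_i = g_minus_i_from_g_minus_r(g_r)
    M_r = abs_mag_relation(g_i, feh)
    return 10.0 ** ((np.asarray(r_mag, dtype=float) - M_r) / 5.0 + 1.0)

# Placeholder relation for illustration only (NOT the published one):
# a constant M_r = 5 turns r = 15 into a distance of 1000 pc.
d_example = photometric_distance_pc(15.0, 0.5, -0.5, lambda g_i, feh: 5.0)
```

Passing the relation as an argument keeps the color transformation and the distance modulus testable on their own, independent of which calibration is adopted.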

**Figure 1.** Comparison between distance moduli derived from the Ivezić et al. (2008) photometric-distance relation (their Equation (A7)), and those derived from the An et al. (2009) theoretical isochrones.

The distribution of the G-dwarf sample in the elemental-abundance space, made up of [Fe/H] and [α/Fe], is shown in Figure 2. This distribution is characterized by two modes, one a metal-poor, α-enhanced population that presumably represents the oldest part of the Galactic disk, and another that is metal-rich and has a solar [α/Fe] ratio. The two boxes delineated by dashed lines constitute our broad separation of these two populations, which we will refer to as α-old and α-young, respectively:

$\begin{eqnarray} \alpha \hbox{-}\mathrm{old\ sample}{:}\quad {-}1.5 < [\mathrm{Fe/H}]< -0.25, \quad 0.25 < [\alpha \mathrm{/Fe}]< 0.50\,, \end{eqnarray} \tag{ 2 }$

$\begin{eqnarray} \alpha \hbox{-}\mathrm{young\ sample}{:}\quad {-}0.3 < [\mathrm{Fe/H}]< 0.25, \quad 0.00 < [\alpha \mathrm{/Fe}]< 0.25\,. \end{eqnarray} \tag{ 3 }$
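As a small illustration, the two abundance boxes of Equations (2) and (3) can be applied as boolean masks. The dictionary layout and the function name `in_box` are hypothetical choices made for this sketch; only the numerical limits come from the text.

```python
import numpy as np

# Limits from Equations (2) and (3); strict inequalities as written there.
ALPHA_OLD = {"feh": (-1.5, -0.25), "afe": (0.25, 0.50)}
ALPHA_YOUNG = {"feh": (-0.3, 0.25), "afe": (0.00, 0.25)}

def in_box(feh, afe, box):
    """Boolean mask of stars falling inside an abundance box."""
    feh = np.asarray(feh, dtype=float)
    afe = np.asarray(afe, dtype=float)
    (feh_lo, feh_hi), (afe_lo, afe_hi) = box["feh"], box["afe"]
    return (feh > feh_lo) & (feh < feh_hi) & (afe > afe_lo) & (afe < afe_hi)

# Three illustrative stars (made-up values, not survey data).
feh = np.array([-0.8, 0.0, -0.28])
afe = np.array([0.35, 0.10, 0.30])
old_mask = in_box(feh, afe, ALPHA_OLD)
young_mask = in_box(feh, afe, ALPHA_YOUNG)
```

The two boxes overlap in [Fe/H] between −0.3 and −0.25, but they remain disjoint because they meet only at [α/Fe] = 0.25, which the strict inequalities exclude from both.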

**Figure 2.** Distribution of the spectroscopic sample of G dwarfs in elemental-abundance space. The density is linear, and the contours contain 68%, 95%, and 99% of the distribution. Outliers beyond 99% are individually shown. Our cuts to select α-old (top, left) and α-young (bottom, right) samples are shown as dashed boxes. The dotted lines indicate the median [Fe/H] for the α-old sample (rounded to the nearest 0.05 dex), used to split the α-old sample in [Fe/H], and for [α/Fe] < 0.25 the dotted box indicates the metal-poor α-young sample, used in Sections 4.1 and 4.2.

The spatial distributions of the α-old and α-young G-dwarf samples are shown in Figure 3, without accounting for the selection function. It is clear from this figure that the bright limit of the G-dwarf sample (r > 14.5, see below) is such that the effective minimum distance is approximately 600 pc. This means that for the thinner-disk components, stars within one scale height are not sampled by SDSS/SEGUE (this is also apparent in Figure 2, where most stars have sub-solar metallicities). However, because the thinner components also contain the most stars (see below and Bovy et al. 2012a), there are still sufficient stars above one scale height of these components such that the G-dwarf data set contains a large number of them.

Understanding and modeling the SEGUE selection function, i.e., the relation between the stars with successfully determined spectral parameters that enter our sample and their photometric or volume-complete parent population, is central to any analysis that involves the spatial structure of spectroscopically selected samples. It has not been worked out previously, and while in principle straightforward, it requires attention to a number of details. We describe our model for the SEGUE selection function in detail in Appendix A. The SEGUE G-star sample was obtained by uniformly sampling the dereddened color–magnitude boxes with color range 0.48 ⩽ g − r ⩽ 0.55 and a "bright" (14.5 ⩽ r ⩽ 17.8) and "faint" (17.8 ⩽ r ⩽ 20.2) apparent magnitude range along a set of ≈150 lines of sight. Due to our S/N cut, this uniform sampling is truncated at a brighter magnitude, where the cutoff is different for each line of sight. We determine the cutoff for each SEGUE plug-plate (which we refer to simply as "plates" in what follows) as the r-band magnitude of the faintest star in the color–magnitude box, and model the r-dependence of the selection function using a hyperbolic-tangent step around the cutoff. We obtain the overall selection fraction for each line of sight by comparing the size of the spectroscopic sample to that of the photometric sample in the targeted color–magnitude box for each individual line of sight.
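The per-plate ingredients described above (a selection fraction, a faint cutoff, and a hyperbolic-tangent step) can be sketched as follows. This is one possible reading of the description and not the model of Appendix A: the exact functional form is not given in this section, so the step shape, the `width` parameter, and all function names below are assumptions made for illustration.

```python
import numpy as np

def plate_selection(r_spec, r_phot, r_min, r_max):
    """Selection fraction and faint cutoff for a single plate.

    r_spec: r magnitudes of stars in the spectroscopic sample.
    r_phot: r magnitudes of photometric stars in the same color range.
    """
    r_spec = np.asarray(r_spec, dtype=float)
    r_phot = np.asarray(r_phot, dtype=float)
    n_phot = np.sum((r_phot >= r_min) & (r_phot <= r_max))
    fraction = r_spec.size / n_phot   # spectroscopic / photometric counts
    r_cut = r_spec.max()              # faintest spectroscopic star
    return fraction, r_cut

def selection_function(r, fraction, r_cut, width=0.1):
    """Smooth step in r: ~fraction brighter than r_cut, ~0 fainter.

    `width` (mag) is a hypothetical softening scale, not a published value.
    """
    r = np.asarray(r, dtype=float)
    return 0.5 * fraction * (1.0 - np.tanh((r - r_cut) / width))

# Toy plate: 3 spectroscopic stars out of 6 photometric candidates.
frac, r_cut = plate_selection([15.0, 16.0, 17.0],
                              [15.0, 15.5, 16.0, 16.5, 17.0, 17.5],
                              r_min=14.5, r_max=17.8)
```

With these toy numbers the step evaluates to roughly the full selection fraction well brightward of the cutoff, half of it at the cutoff, and essentially zero well faintward of it.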

3. DENSITY-FITTING METHODOLOGY

3.1. Generalities

Fitting the spatial-density profiles of various G-dwarf sub-samples must account for the fact that the observed star counts do not reflect the underlying stellar distribution, but are strongly shaped by (1) the strongly position-dependent selection fraction of stars with spectra (see Figure 11), (2) the need to use photometric distances that in turn depend on the color and metallicity distribution of the sample (as the magnitude-limited SEGUE sample corresponds to a color- and metallicity-dependent distance-limited sample), and (3) the pencil-beam nature of the SEGUE survey. To properly take all of these effects into account, we need to use forward modeling: in what follows we fit stellar-density models to the data by generating the expected observed distribution of stars in the spectroscopic sample, based on our model for the SEGUE selection function and the photometric-distance relation; this predicted distribution is then compared to the observed star counts. We show below how this can be expressed as a maximum likelihood problem. This general density-fitting methodology applies to any spectroscopic survey, with minor modifications, and needs to be applied to obtain selection-corrected distributions from spectroscopically selected stellar samples. In particular, this methodology needs to be applied to constrain the structural parameters of abundance-selected samples in the Milky Way.

As the photometric distance estimates depend on the g − r color, metallicity [Fe/H], and apparent r-band magnitude, and because the selection function is a function of position, r, and g − r, we need to model the observed density of stars in color–magnitude–metallicity–position space, λ(l, b, d, r, g − r, [Fe/H]). This density of stars can be written as

$\begin{eqnarray} &&\lambda (l,b,d,r,g-r,[\mathrm{Fe/H}])= \rho (r,g-r,[\mathrm{Fe/H}]|R,Z,\phi)\nonumber\\ &&\qquad\times \nu _*(R,Z,\phi) \times |J(R,Z,\phi;l,b,d)|\nonumber\\ &&\qquad \times S(\mathrm{plate},r,g-r).\quad \end{eqnarray} \tag{ 4 }$

Here, (R, Z, ϕ) are Galactocentric cylindrical coordinates corresponding to rectangular coordinates (X, Y, Z), which can be calculated from (l, b, d). The factor ρ(r, g − r, [Fe/H]|R, Z, ϕ) is the number density in magnitude–color–metallicity space as a function of position (see further discussion in Appendix B). The factor |J(R, Z, ϕ; l, b, d)| is the Jacobian of the (X, Y, Z) → (l, b, d) coordinate transformation, and the crucial factor S(plate, r, g − r) is the selection function as given in Equation (A2). Finally, ν_*(R, Z, ϕ) is the underlying spatial number density of the sample; we stress that this is a density as a function of rectangular coordinates (X, Y, Z) that we evaluate through (R, Z, ϕ), i.e., its dimension is 1/(spatial unit)³. In what follows we will assume that our models for this density (e.g., exponentials in the vertical and radial directions) are characterized by a set of parameters denoted as θ and that the density is axisymmetric, such that ν_* ≡ ν_*(R, Z|θ).
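For concreteness, the transformation from (l, b, d) to Galactocentric (R, Z) and the associated Jacobian can be sketched as below. The solar position (8 kpc from the Galactic center, 25 pc above the mid-plane) is taken from the Introduction; the sign convention for X and Y and the function names are assumptions of this sketch, and the convention does not matter for an axisymmetric density.

```python
import numpy as np

R0_KPC = 8.0        # Sun's distance from the Galactic center (from the text)
Z_SUN_KPC = 0.025   # Sun's height above the mid-plane (from the text)

def lbd_to_RZ(l_deg, b_deg, d_kpc):
    """Galactocentric cylindrical (R, Z) in kpc from Galactic (l, b, d)."""
    l = np.radians(l_deg)
    b = np.radians(b_deg)
    d = np.asarray(d_kpc, dtype=float)
    X = R0_KPC - d * np.cos(b) * np.cos(l)   # l = 0 points at the center
    Y = d * np.cos(b) * np.sin(l)
    Z = d * np.sin(b) + Z_SUN_KPC
    return np.hypot(X, Y), Z

def jacobian_lbd(b_deg, d_kpc):
    """|J| of (X, Y, Z) -> (l, b, d) with angles in radians: d^2 cos b."""
    return np.asarray(d_kpc, dtype=float) ** 2 * np.cos(np.radians(b_deg))

R_in, Z_in = lbd_to_RZ(0.0, 0.0, 2.0)      # 2 kpc toward the center
R_up, Z_up = lbd_to_RZ(0.0, 90.0, 1.0)     # 1 kpc toward the north pole
```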

The likelihood of a given model for the density ν_*(R, z|θ) is given by that of a Poisson process with rate parameter λ,

$\begin{eqnarray} \ln \mathcal {L} &=& \sum _i [ \ln \lambda (\lbrace l,b,d,r,g-r,[\mathrm{Fe/H}]\rbrace _i|\theta) ]\nonumber\\ && -\int {d}l \,{d}b\, {d}d\, {d}r\, {d}(g-r) \,{d}[\mathrm{Fe/H}]\nonumber\\ &&\times \lambda (l,b,d,r,g-r,[\mathrm{Fe/H}]|\theta), \end{eqnarray} \tag{ 5 }$

where the integral is over the domain surveyed and i indexes the observed objects. Because the Jacobian, the selection function, and the density in magnitude–color–metallicity space only enter λ multiplicatively (Equation (4)), their contribution to the first term (ln λ) in Equation (5) is a constant that does not depend on the density parameters. Thus, up to a term that does not depend on θ, the log likelihood is equivalent to

$\begin{eqnarray} \ln \mathcal {L} &=& \sum _i [ \ln \nu _*(R,z|\lbrace l,b,d\rbrace _i,\theta) ]-\int {d}l \,{d}b\, {d}d\, {d}r\, {d}(g-r) \,\nonumber\\ &&\times {d}[\mathrm{Fe/H}]\,\lambda (l,b,d,r,g-r,[\mathrm{Fe/H}]|\theta) \,. \end{eqnarray} \tag{ 6 }$

Note that the Jacobian, the density in the magnitude–color–metallicity space, and the selection function only enter through the second term, and do not need to be evaluated on a star-by-star basis. The second term in Equation (6)—the normalization integral—can be written as (assuming that the density does not depend on (l, b) over the area of a plate, although this can easily be relaxed)

$\begin{eqnarray} &&\int {d}l\, {d}b\,{d}d\, {d}r\, {d}(g-r) \,{d}[\mathrm{Fe/H}]\,\lambda (l,b,d,r,g-r,[\mathrm{Fe/H}]|\theta) \nonumber\\ &&= A_p\,\sum _{\mathrm{plates}\ p} \int {d}(g-r)\,{d}[\mathrm{Fe/H}]\,{d}r \, S(p,r,g-r) \nonumber\\ && \times\int {d}d\, \rho (r,g-r,[\mathrm{Fe/H}]|R,Z,\phi)\,d^2\,\nu _*(R,z|l,b,d,\theta)\,, \end{eqnarray} \tag{ 7 }$

where A_p is the area of a SEGUE plate (approximately 7 deg²).
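The innermost integral in Equation (7) is a line-of-sight integral of d² ν_* for each plate. A minimal numerical sketch follows, under a strong simplifying assumption that is not made in the paper: the selection function and the magnitude–color–metallicity factor are collapsed into a hard distance window [d_min, d_max]. The function name and the trapezoidal quadrature are illustrative choices.

```python
import numpy as np

R0_KPC = 8.0        # Sun's Galactocentric radius assumed in the text
Z_SUN_KPC = 0.025   # Sun's height above the mid-plane assumed in the text

def line_of_sight_integral(nu, l_deg, b_deg, d_min, d_max, n=2001):
    """Trapezoidal estimate of int dd d^2 nu(R, Z) along one line of sight.

    nu(R, Z) is any axisymmetric density callable; distances are in kpc.
    """
    l, b = np.radians(l_deg), np.radians(b_deg)
    d = np.linspace(d_min, d_max, n)
    X = R0_KPC - d * np.cos(b) * np.cos(l)
    Y = d * np.cos(b) * np.sin(l)
    Z = d * np.sin(b) + Z_SUN_KPC
    f = d ** 2 * nu(np.hypot(X, Y), Z)
    return np.sum(0.5 * (f[1:] + f[:-1]) * np.diff(d))

# Sanity check with a uniform density: int_0^3 d^2 dd = 9.
uniform = line_of_sight_integral(lambda R, Z: np.ones_like(R), 90.0, 45.0, 0.0, 3.0)
```

Multiplying such per-plate integrals by the plate area and summing over plates gives the structure of the right-hand side of Equation (7) in this simplified setting.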

In the following, we analytically marginalize over the amplitude of the rate λ with a logarithmically flat prior. In that case, the log likelihood becomes

$\begin{eqnarray} \ln \mathcal {L} &=& \sum _i [ \ln \nu _*(R,z|\lbrace l,b,d\rbrace _i,\theta) -\ln \int {d}l \,{d}b\, {d}d\, {d}r\, \nonumber\\ &&\times{d} (g-r) \,{d}[\mathrm{Fe/H}]\,\lambda (l,b,d,r,g-r,[\mathrm{Fe/H}]|\theta)]. \quad\ \ \ \end{eqnarray} \tag{ 8 }$

Note that the normalization integral is now moved inside of the logarithm.
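The marginalization can be made explicit with a short worked step. As a sketch, write the rate as λ = A λ̃, where A is the amplitude and λ̃ carries all of the dependence on θ; let I(θ) denote the integral of λ̃ over the surveyed domain and N the number of stars (the symbols A, λ̃, I, and N are introduced here only for illustration). With a prior p(A) ∝ 1/A, the Poisson likelihood of Equation (5) integrates to

$\begin{eqnarray} \int _0^\infty \frac{{d}A}{A}\, A^N \left[\prod _i \tilde{\lambda }_i\right] e^{-A\,I(\theta)} &=& \Gamma (N)\,\left[\prod _i \tilde{\lambda }_i\right] I(\theta)^{-N}\,, \nonumber\\ \ln \mathcal {L} &=& \sum _i \left[\ln \tilde{\lambda }_i - \ln I(\theta)\right] + \mathrm{constant}\,, \end{eqnarray}$

which has the form of Equation (8) once the factors of λ̃_i that do not depend on θ are dropped.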

In Appendix B, we discuss how we include the magnitude–color–metallicity factor ρ(r, g − r, [Fe/H]|R, Z, ϕ) in the likelihood.

3.2. Stellar Number-density Models

We fit number-density models for the various abundance sub-populations, consisting of a disk with an exponential profile in both the vertical and radial directions, plus a constant density

$\begin{eqnarray} \nu _*(R,Z) &=& N(R_0)\bigg[\frac{1}{2\,h_z}\, \exp \left(-\frac{R-R_0}{h_R}\right)\,\nonumber\\ &&\times \exp \left(-\frac{|Z|}{h_z}\right) + \frac{\beta _c}{24}\bigg]\,, \end{eqnarray} \tag{ 9 }$

where N(R₀) is the vertically integrated number density at R₀. We refer to this model below as a single-exponential disk fit, as in all cases the data imply β_c ≪ 1. We also fit combinations of exponential disks as

$\begin{eqnarray} \nu _* (R,Z) &=& N(R_0)\,\left[\frac{1-\beta _2}{2\,h_z}\,\exp \left(-\frac{R-R_0}{h_R}\right)\,\exp \left(-\frac{|Z|}{h_z}\right)\right. \nonumber\\ &&\left.+\, \frac{\beta _2}{2\,h_{z,2}}\,\exp \left(-\frac{R-R_0}{h_{R,2}}\right)\,\exp \left(-\frac{|Z|}{h_{z,2}}\right)\right]\,. \end{eqnarray} \tag{ 10 }$

In particular, in the Z-direction, this is analogous to traditional density fits based on photometric data, which require (at least) two exponential components. We do not fit for the overall normalization, N(R₀), as we are interested primarily in the shape of the stellar-density profile.
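The two density models of Equations (9) and (10) translate directly into code. The sketch below omits the overall normalization N(R₀), which is not fitted, and assumes that all lengths are expressed in kpc with R₀ = 8 kpc; the function names are hypothetical.

```python
import numpy as np

def single_exponential_disk(R, Z, h_z, h_R, beta_c=0.0, R0=8.0):
    """Equation (9) without N(R0): exponential disk plus constant density."""
    R = np.asarray(R, dtype=float)
    Z = np.asarray(Z, dtype=float)
    disk = np.exp(-(R - R0) / h_R - np.abs(Z) / h_z) / (2.0 * h_z)
    return disk + beta_c / 24.0

def two_exponential_disks(R, Z, h_z, h_R, h_z2, h_R2, beta_2, R0=8.0):
    """Equation (10) without N(R0): mixture of two exponential disks."""
    first = single_exponential_disk(R, Z, h_z, h_R, R0=R0)
    second = single_exponential_disk(R, Z, h_z2, h_R2, R0=R0)
    return (1.0 - beta_2) * first + beta_2 * second

# Density at the solar radius in the mid-plane for h_z = 0.7 kpc, h_R = 2 kpc.
nu_sun = single_exponential_disk(8.0, 0.0, 0.7, 2.0)
```

The factor 1/(2 h_z) normalizes each vertical profile to unit integral over Z, so that β₂ is the fraction of the vertically integrated density at R₀ contributed by the second component.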

To determine the best-fit parameters and their uncertainties we use Powell's method for minimization (Press et al. 2007), and then MCMC-sample the posterior distribution function, obtained by multiplying the likelihood in Equation (8) with flat logarithmic priors for the scale parameters (h_z, h_R, h_{z, 2}, h_{R, 2}) and flat priors on the contamination-fraction parameters (β_c, β₂), using an ensemble MCMC sampler (Goodman & Weare 2010; Foreman-Mackey et al. 2012).
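A toy version of the optimization step is sketched below. It is not the authors' pipeline: the problem is reduced to a one-dimensional vertical exponential observed in a hard window in |Z| (an assumed stand-in for the selection function), the data are simulated, and only the Powell optimization is shown; the subsequent MCMC sampling is omitted. The fit is carried out in ln h_z, the variable in which a flat logarithmic prior on h_z is uniform. All names and numbers are illustrative.

```python
import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(0)
H_TRUE, Z_MIN, Z_MAX, N_STARS = 0.7, 0.3, 4.0, 5000   # kpc; toy values

# Draw |Z| from an exponential truncated to [Z_MIN, Z_MAX] (inverse CDF).
u = rng.uniform(size=N_STARS)
ea, eb = np.exp(-Z_MIN / H_TRUE), np.exp(-Z_MAX / H_TRUE)
z_obs = -H_TRUE * np.log(ea - u * (ea - eb))

def neg_log_likelihood(params):
    """Minus the Equation (8)-style log likelihood for the toy model.

    Each star contributes ln nu(z_i) - ln(normalization integral); the
    terms are combined analytically for numerical stability.
    """
    ln_h = np.atleast_1d(params)[0]
    h = np.exp(ln_h)
    x = (Z_MAX - Z_MIN) / h
    ln_like = (-np.sum(z_obs - Z_MIN) / h
               - N_STARS * ln_h
               - N_STARS * np.log(-np.expm1(-x)))
    return -ln_like

result = minimize(neg_log_likelihood, x0=[np.log(0.3)], method="Powell")
h_fit = float(np.exp(np.atleast_1d(result.x)[0]))
```

Because the normalization integral is evaluated over the same window that truncates the data, the recovered scale height is not biased by the missing stars at small and large |Z|, which is the essential point of the forward-modeling approach.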

3.3. Tests on Mock Data

In Appendix D, we discuss tests of the fitting methodology on mock data sets made up of single-exponential disk components observed using the SEGUE sampling. These tests show that we can recover the input density structure to within the MCMC-determined uncertainties over the range of inferred scale heights, scale lengths, and sample sizes found below.

4. DENSITY STRUCTURE

First, we briefly discuss the result of fitting the broad bins in abundance as defined in Equations (2) and (3), in order to explore the broad trends in spatial structure with elemental abundance. In Section 4.3, we then split the sample finely in elemental-abundance space and map the structure of mono-abundance populations.

4.1. The α-old Stars

For the α-old sample, the fit results for single-exponential profiles in R and Z, and for a combination of two exponential profiles for both R and Z, are given in Table 1. The model with two exponentials in both R and Z is preferred, but the parameters of the dominant double-exponential disk are similar for both fits. That is, even when we give the model the additional freedom of two vertical scale heights, the data lead us to employ only a single-exponential scale height. There is no evidence for a thinner component in the α-old abundance range. We see that the α-old sample is dominated by a population of stars with a scale height of 686 ± 11 pc, and a short scale length of 2.01 ± 0.05 kpc (consistent with the rough estimate of 2 kpc based on a handful of stars by Bensby et al. 2011 and the indirect dynamical estimate of 2.2 ± 0.35 kpc of Carollo et al. 2010).

Table 1. Results for the α-old G-dwarf Sample (−1.5 < [Fe/H] <−0.25, 0.25 < [α/Fe] <0.50)

| Sample | h_z (pc) | h_R (kpc) | h_{z, 2} (pc) | h_{R, 2} (kpc) | β₂ | β_c |
| --- | --- | --- | --- | --- | --- | --- |
| All plates | 701 ± 5 | 2.06 ± 0.03 | ... | ... | ... | 0.0000 ± 0.0009 |
| Bright plates | 769 ± 14 | 1.79 ± 0.05 | ... | ... | ... | 0.004 ± 0.009 |
| Faint plates | 714 ± 11 | 2.25 ± 0.05 | ... | ... | ... | 0.001 ± 0.001 |
| b < 0° | 694 ± 9 | 2.02 ± 0.05 | ... | ... | ... | 0.0000 ± 0.0010 |
| b > 0° | 699 ± 8 | 2.10 ± 0.04 | ... | ... | ... | 0.000 ± 0.001 |
| \|b\| > 45° | 696 ± 6 | 2.23 ± 0.06 | ... | ... | ... | 0.0000 ± 0.0009 |
| \|b\| < 45° | 640 ± 10 | 2.05 ± 0.04 | ... | ... | ... | 0.002 ± 0.002 |
| All plates | 686 ± 11 | 2.01 ± 0.05 | 933 ± 49 | 3.0 ± 0.4 | 0.04 ± 0.03 | ... |
| Bright plates | 764 ± 20 | 1.78 ± 0.04 | 3126 ± 271 | >64 | 0.01 ± 0.02 | ... |
| Faint plates | 688 ± 40 | 2.2 ± 0.1 | 1311 ± 189 | >3.0 (5 ± 1) | 0.03 ± 0.04 | ... |
| b < 0° | 671 ± 22 | 1.97 ± 0.08 | 993 ± 169 | 3.7 ± 0.4 | 0.05 ± 0.05 | ... |
| b > 0° | 687 ± 11 | 2.06 ± 0.07 | $886^{+350}_{-708}$ | 3 ± 1 | 0.04 ± 0.04 | ... |
| \|b\| > 45° | 692 ± 11 | 2.2 ± 0.1 | 800 ± 88 | 4.3 ± 0.4 | 0.01 ± 0.07 | ... |
| \|b\| < 45° | 639 ± 17 | 2.03 ± 0.07 | 1142 ± 99 | >5 | 0.01 ± 0.02 | ... |
| [Fe/H] < −0.7 | 856 ± 20 | 2.06 ± 0.08 | 865 ± 108 | 2.1 ± 0.3 | 0.07 ± 0.08 | ... |
| [Fe/H] > −0.7 | 583 ± 16 | 1.97 ± 0.08 | 873 ± 62 | 4.0 ± 0.5 | 0.03 ± 0.04 | ... |
| 0.25 ⩽ [α/Fe] < 0.35 | 627 ± 18 | 2.23 ± 0.10 | 802 ± 104 | 3.5 ± 0.3 | 0.03 ± 0.06 | ... |
| 0.35 ⩽ [α/Fe] < 0.5 | 765 ± 15 | 1.89 ± 0.04 | 826 ± 45 | 2.0 ± 0.1 | 0.03 ± 0.06 | ... |

Notes. Lower limits are at 99% posterior confidence. Lower limits are given when the best-fit value is larger than 4.5 kpc. The best-fit value is not given if it is larger than 6 kpc.


We have split the α-old sample into more metal-poor and more metal-rich sub-samples by cutting the sample at [Fe/H] = −0.7. This is close to the median [Fe/H] of the α-old sample. The metal-poor sub-sample may be identified with the metal-weak thick-disk (MWTD) population discussed by Carollo et al. (2010), which they argue covers the metallicity range −1.8 ⩽ [Fe/H] ⩽ −0.8. The resulting fits for the spatial structure of these sub-samples are given in Table 1. The inferred scale lengths for these sub-samples are equal to within the uncertainties. However, the scale height of the more metal-poor sample is 856 ± 20 pc while that of the more metal-rich sample is 583 ± 16 pc. The radial scale length of the MWTD determined from the indirect dynamical analysis of Carollo et al. (2010) is roughly 2 kpc, while the scale height is 1.36 ± 0.13 kpc.

We have also split the α-old sample into two bins in [α/Fe] by splitting the sample at [α/Fe] = 0.35. The best-fit density profiles, given at the bottom of Table 1, again have similar scale lengths, around 2 kpc, and different scale heights. The stars that are most enhanced in α-elements have the largest scale height (765 ± 15 pc) and the shortest scale length (1.89 ± 0.04 kpc), while the less α-enhanced stars have a smaller scale height (627 ± 18 pc) and longer scale length (2.23 ± 0.1 kpc). As the latter dominate the full α-old sample, their scale height is very similar to that inferred for the full sample. We explore the dependence of the disk parameters on [Fe/H] and [α/Fe] in more detail in Section 4.3 below.

4.2. The α-young Sample

The results for single-exponential disk fits and double-exponential disk fits for the α-young sample are given in Table 2. The double-exponential disk fit model is formally preferred, but the parameters of the dominant double-exponential disk are again similar for both fits. We see that the α-young sample is dominated by a population of stars with a low scale height of 256 ± 4 pc and a long scale length of 3.6 ± 0.2 kpc.

Table 2. Results for the α-young G-dwarf Sample (−0.3 < [Fe/H] <0.25, 0.00 < [α/Fe] <0.25)

| Sample | h_z (pc) | h_R (kpc) | h_{z, 2} (pc) | h_{R, 2} (kpc) | β₂ | β_c |
| --- | --- | --- | --- | --- | --- | --- |
| All plates | 270 ± 3 | 3.8 ± 0.2 | ... | ... | ... | 0.0005 ± 0.0010 |
| Bright plates | 267 ± 3 | 3.6 ± 0.2 | ... | ... | ... | 0.0009 ± 0.0003 |
| Faint plates | 329 ± 14 | >3.8 (5.1 ± 1.0) | ... | ... | ... | 0.0010 ± 0.0003 |
| b < 0° | 264 ± 4 | 3.6 ± 0.2 | ... | ... | ... | 0.0008 ± 0.0009 |
| b > 0° | 271 ± 4 | 3.80 ± 0.10 | ... | ... | ... | 0.000 ± 0.001 |
| \|b\| > 45° | 270 ± 5 | 4.2 ± 0.8 | ... | ... | ... | 0.0004 ± 0.0008 |
| \|b\| < 45° | 264 ± 3 | 4.0 ± 0.2 | ... | ... | ... | 0.0006 ± 0.0007 |
| All plates | 256 ± 4 | 3.6 ± 0.2 | 664 ± 132 | >5 | 0.012 ± 0.004 | ... |
| Bright plates | 260 ± 5 | 3.5 ± 0.3 | 491 ± 83 | >2 | 0.02 ± 0.02 | ... |
| Faint plates | 268 ± 23 | >3.8 (5.0 ± 0.8) | 910 ± 152 | >2.9 (6 ± 2) | 0.014 ± 0.008 | ... |
| b < 0° | 242 ± 8 | 3.2 ± 0.2 | 639 ± 81 | >5 | 0.017 ± 0.010 | ... |
| b > 0° | 263 ± 6 | 3.7 ± 0.2 | 834 ± 70 | >4 | 0.004 ± 0.002 | ... |
| \|b\| > 45° | 249 ± 6 | 3.8 ± 0.8 | 631 ± 142 | >6 | 0.015 ± 0.005 | ... |
| \|b\| < 45° | 252 ± 5 | 3.9 ± 0.3 | 656 ± 65 | >5 | 0.012 ± 0.005 | ... |
| −1.5 < [Fe/H] < −0.6^a | 689 ± 25 | >37 | $1431^{+704}_{-1916}$ | $1.1^{+0.6}_{-1.0}$ | 0.03 ± 0.07 | ... |
| −0.6 < [Fe/H] < −0.3^a | 360 ± 9 | >16 | 946 ± 92 | >14 | 0.018 ± 0.009 | ... |
| 0.00 < [α/Fe] < 0.15 | 239 ± 4 | 4.3 ± 0.2 | 647 ± 53 | >7 | 0.010 ± 0.003 | ... |
| 0.15 ⩽ [α/Fe] < 0.25 | 348 ± 13 | 2.3 ± 0.2 | 959 ± 335 | >2.0 (5 ± 2) | 0.018 ± 0.009 | ... |

Notes. Lower limits are at 99% posterior confidence. Lower limits are given when the best-fit value is larger than 4.5 kpc. The best-fit value is not given if it is larger than 6 kpc. ^aThese samples have the same [α/Fe] range as the nominal α-young sample.


The second double-exponential disk in the best-fit model for the α-young sample has a scale height of 664 ± 132 pc, which is consistent with the scale-height measurement of the α-old sample above. However, the fraction of stars in this secondary component is too small to constrain its scale length, and the component is conceivably simply a result of "abundance contamination" of the sample.

Density fits for α-young samples with the same [α/Fe] limits as the nominal α-young sample shown in the top panel, but that are more metal-poor, are also given in Table 2. We do not measure any radial density decline for these more metal-poor α-young samples, and short scale lengths for these samples are ruled out by the data. We consider this further in Section 4.3 and in the discussion section below.

When we split the α-young sample into two pieces, by cutting at [α/Fe] = 0.15, we find that the more α-enhanced sample has the shorter scale length (2.3 ± 0.2 kpc) and the larger scale height (348 ± 13 pc). The sample with [α/Fe] closer to solar has a longer scale length of 4.3 ± 0.2 kpc and a smaller scale height of 239 ± 4 pc.

4.3. The Spatial Structure of Mono-abundance Sub-populations

In the previous two sections, we found that sub-samples of stars defined by their element abundances appear to have a simple spatial structure, approximated by a single exponential in the radial and vertical directions. The scale lengths and heights of these sub-sets seem to vary systematically with the abundances: the α-old sample has a shorter scale length than the α-young sample, and if we split those two samples further in [α/Fe], the part of the α-young sample that has the closest to the solar [α/Fe] ratio has the longest scale length and the smallest scale height. We also noticed that populations with [α/Fe] <0.25 have longer scale lengths and scale heights with decreasing [Fe/H].

To further investigate these trends, we have fitted disk models with single-exponential profiles in R and Z to sub-populations of stars with narrow bins in [Fe/H] and [α/Fe]. We divide stars into bins of width 0.1 dex in [Fe/H] and 0.05 dex in [α/Fe], and only fit those bins with more than 100 stars. The results from these fits are shown in Figure 4. The populations in the lower left part of the [α/Fe]–[Fe/H] diagram all have best-fit scale lengths in excess of 4.5 kpc.
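
As an illustration of the binning step described above, the following minimal Python sketch counts stars in ([Fe/H], [α/Fe]) bins of the stated widths and flags the bins that pass the 100-star threshold. The abundance ranges spanned by the bin edges, the function name `select_bins`, and the use of NumPy are assumptions made for illustration; they are not taken from the analysis code used for this paper.

```python
import numpy as np

# Assumed abundance ranges; only the bin widths (0.1 and 0.05 dex) come from the text.
FEH_EDGES = np.linspace(-1.5, 0.5, 21)  # 20 bins of width 0.1 dex in [Fe/H]
AFE_EDGES = np.linspace(0.0, 0.5, 11)   # 10 bins of width 0.05 dex in [alpha/Fe]


def select_bins(feh, afe, min_stars=100):
    """Count stars per abundance bin and flag bins with more than min_stars stars.

    Returns (counts, keep): counts[i, j] is the number of stars in [Fe/H] bin i
    and [alpha/Fe] bin j; keep[i, j] is True for bins that would be fitted.
    """
    counts, _, _ = np.histogram2d(feh, afe, bins=[FEH_EDGES, AFE_EDGES])
    counts = counts.astype(int)
    keep = counts > min_stars  # "more than 100 stars", so exactly 100 is excluded
    return counts, keep
```

Each flagged bin would then be passed to the single-exponential density fit; stars outside the assumed edge ranges are simply not counted.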

We also fitted two-component models, i.e., two exponential disks, to each of the bins, but found that these led to overfitting, and only marginal improvements in the likelihood for the best fit. Thus, for narrow bins in elemental-abundance space, the sub-populations are very well described by single-exponential profiles in the R- and Z-directions.

A different view of the results in Figure 4 is given in Figure 5. The results in the different [Fe/H]–[α/Fe] bins are shown as a function of scale length and scale height; the points are color-coded by their [α/Fe] or [Fe/H] dependence, and the size of the points corresponds to the total stellar surface-mass density—corrected for mass and sample selection effects—in each population (calculated in Bovy et al. 2012a). Figure 5 also shows the uncertainties in the inferred parameters; the formal uncertainty in the scale height for some points is so small that it cannot be seen. The bins with dashed error bars lie in a part of the abundance plane where abundance contamination is likely to be the most severe, where the [α/Fe]-based age ranking is least reliable, and where the spatial properties change most rapidly. They contain <5% of the disk surface mass.

We see that these fits for mono-abundance sub-components flesh out the main trends we noted in the broader [Fe/H] and [α/Fe] ranges above. At any given metallicity [Fe/H], the scale length increases and the scale height decreases when moving from α-old to α-young populations. At any given α-age, the scale length and the scale height increase for the more metal-poor components, implying an outward metallicity gradient. And, as Figure 5 shows most clearly, increasing scale lengths are correlated with decreasing scale heights (except for a few bins on the boundary between the very long scale lengths at low metallicity and solar α-enhancement and the shorter scale lengths of the α-old populations; see further discussion in Section 5.5). From Figure 4 it is clear that neither [α/Fe] nor [Fe/H], on its own, accounts for the trends in scale height and scale length. We discuss what this implies for disk formation and evolution in Sections 5.4 and 5.5, respectively.

Figure 6 shows the results of fitting two components with exponential profiles in both R and Z to each abundance bin. The scale height of the dominant component is shown against the best-fit scale height, when fitting a single-exponential profile in R and Z. We see that these scale heights are strongly clustered around the one-to-one correspondence line. Thus, for each bin, a single vertical exponential suffices to explain the observed number counts. The fact that the two measurements agree better than would be expected, given the uncertainties shown, is due to the fact that the scale heights for each bin are strongly correlated when fitting a single- or a double-exponential profile in R and Z. Overall, Figure 6 confirms that a single-exponential model in Z and R is a good model for the spatial structure of mono-abundance sub-populations.

In Appendix D, we perform a test to determine whether abundance uncertainties can plausibly lead us to find spurious disk components between a "thin" and a "thick" component. That is, we ask whether it is plausible that an underlying density dominated by distinct thin- and thick-disk components can be smoothed by abundance errors into the density structure we inferred in Figures 4–6. This test shows that if this were the case, every bin would be preferentially fitted with two components, corresponding to the input thin and thick components. The equivalent of Figure 6, shown in the bottom right panel of Figure 19, is qualitatively different, with a distinct difference between the single-component scale height and the scale height of the dominant component in the two-component fit.

To test whether the analysis in this section is influenced by our signal-to-noise ratio cut of S/N >15, we have repeated the analysis with a cut of S/N >30, as also used by Lee et al. (2011b). The equivalents of Figures 4–6 look qualitatively the same, albeit with larger uncertainties for each bin, and the dependence of h_z and h_R on elemental abundance is the same as that inferred from the sample with the S/N >15 cut. The number of ([Fe/H], [α/Fe]) bins with more than 100 stars is smaller, but the inferred (h_z, h_R) for those bins with more than 100 stars when using the S/N >30 cut are consistent within the uncertainties with those found with the less restrictive S/N cut. We stress that even when selecting stars with S/N >30, the equivalent of Figure 6 does not show any sign of a second component in the mono-abundance bins.

To perform the binning in this section, we used narrow bins of 0.1 dex in [Fe/H] and 0.05 dex in [α/Fe]. These bins are somewhat narrower than the total typical uncertainty (≈0.15 dex in [Fe/H], ≈0.07 dex in [α/Fe]; Bovy et al. 2012b), but we prefer to oversample, rather than undersample, to avoid smoothing out underlying structure. The analysis in each bin holds irrespective of the bin size. What matters for the analysis is that the data in each bin are disjoint, such that the bins are statistically independent.

5. DISCUSSION

Our basic result is that various stellar disk sub-components, when defined purely through stellar abundances, are simple, i.e., can be described by a single exponential in R and Z, and exhibit distinctive trends of the scale height and scale length with chemical abundance. This suggests that dissecting the Milky Way's disk on the basis of chemical abundances alone is a useful approach. In this section we go through a number of practical issues pertaining to these estimates, before discussing possible implications for galactic disk formation and evolution.

5.1. Distance Systematics

The absolute values of the distance scales measured in this paper are subject to distance systematics, which we discuss in this subsection. We have used the data-driven photometric-distance relation from Ivezić et al. (2008) to infer the spatial structure of the various samples of stars, but an alternative photometric-distance relation can be obtained by using the An et al. (2009) stellar isochrones in the SDSS passbands. These isochrones depend on [Fe/H] as well as on [α/Fe], although in practice a linear relation between [α/Fe] and [Fe/H] is assumed, and the spectroscopically measured [α/Fe] is not used directly to estimate the photometric distance. In the top panel of Figure 1, we compared the distance moduli derived using the An et al. (2009) stellar isochrones with those obtained using the Ivezić et al. (2008) relation for a few values of [Fe/H]. We see that, for the values of [Fe/H] that span most of our sample, the distance modulus difference is −0.2 mag, corresponding to a systematic difference in the inferred distances of about 9%, nearly independent of color. Thus, if we had used the An et al. (2009) photometric distances, we would have obtained scale lengths and scale heights that were 9% shorter.
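
The conversion from a distance-modulus offset to a fractional change in distance (and hence in the inferred scale length and scale height) follows from the definition of the distance modulus, μ = 5 log₁₀(d / 10 pc). A minimal sketch, in which the function name is our own choice for illustration:

```python
def distance_ratio(delta_mu):
    """Ratio d_alt / d_ref for a distance-modulus offset delta_mu = mu_alt - mu_ref (mag)."""
    return 10.0 ** (delta_mu / 5.0)


# Offset quoted in the text between the two photometric-distance relations.
ratio = distance_ratio(-0.2)
print(f"distances (and fitted scales) change by {(ratio - 1.0) * 100:.1f}%")
```

For an offset of −0.2 mag the ratio is close to 0.91, i.e., distances that are shorter by roughly 9%, and all fitted length scales shrink by the same factor.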

A second distance systematic that could influence our results is the Malmquist bias (Malmquist 1920, 1922), i.e., the fact that brighter stars are overrepresented in a magnitude-limited survey. For our relatively bright sample, this is dominated by the finite width of the photometric-distance relation. The Malmquist bias in absolute magnitude is apparent-magnitude dependent and approximately equal to −σ² d ln A(r)/dr, where A(r) is the differential number count as a function of apparent magnitude r and σ is the dispersion in the absolute magnitudes (either due to photometric uncertainties or due to intrinsic scatter in the photometric-distance relation). Conservatively assuming that the combination of the finite width of the photometric-distance relation and the photometric uncertainties is 0.2 mag, and that the underlying density is constant, the Malmquist bias would be of order 2.5%. However, due to the exponential falloff of the density in both the R- and Z-directions, the differential number counts are flat near the peak induced by the vertical exponential, and |d ln A(r)/dr| is less than 1 for most apparent magnitudes. Therefore, the Malmquist bias is at most about 2%, and will not strongly affect the measurement of the vertical scale height in particular.
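
The two numbers quoted above can be reproduced with a short calculation. The sketch below assumes that the bias in absolute magnitude is converted to a fractional distance bias through the distance modulus, and that a constant underlying density gives the classical Euclidean slope d ln A/dr = 0.6 ln 10; the function and constant names are ours.

```python
import math

# Slope of the log differential counts with apparent magnitude for a uniform density.
UNIFORM_SLOPE = 0.6 * math.log(10.0)


def malmquist_distance_bias(sigma_mag, dlnA_dm):
    """Fractional amount by which true distances exceed the inferred ones.

    The bias in absolute magnitude is delta_M = -sigma^2 * dlnA/dm: the selected
    stars are intrinsically brighter than assumed, so their distances are
    underestimated by a factor 10**(-delta_M / 5).
    """
    delta_m_abs = -sigma_mag**2 * dlnA_dm
    return 10.0 ** (-delta_m_abs / 5.0) - 1.0


print(f"uniform density: {malmquist_distance_bias(0.2, UNIFORM_SLOPE) * 100:.1f}%")
print(f"slope of 1:      {malmquist_distance_bias(0.2, 1.0) * 100:.1f}%")
```

With σ = 0.2 mag, the uniform-density case gives a bias between 2.5% and 2.7%, and a slope of 1 gives slightly under 2%, in line with the estimates in the text.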

We have assumed throughout our analysis that all of the stars in our sample are single. The presence of unresolved binaries will lead us to underestimate scales, as these binaries will appear to us as brighter, and thus closer, single stars. The binary fraction and companion-mass distribution for G-type dwarfs remain controversial, but it appears that the overall binary fraction for G dwarfs is approximately 40% (Abt & Levy 1976; Duquennoy & Mayor 1991; Raghavan et al. 2010), similar to but slightly larger than that of M dwarfs (Fischer & Marcy 1992; Raghavan et al. 2010). The distribution of companion masses is poorly known, and could range from being peaked around 20% of the primary's mass (Duquennoy & Mayor 1991), to being relatively flat between 20% and 100% of the primary's mass (Raghavan et al. 2010), with numerical simulations indicating that multiple-star systems form preferentially with approximately equal-mass members (Bate 2005), and an overall multiplicity fraction of around 40% (Bate et al. 2003). Lower-metallicity stars most likely have a higher binary fraction (Machida et al. 2009), which could reach 100% for [Fe/H] <−0.8 (Raghavan et al. 2010).

For a likely scenario where 40% of our α-young sample is made up of binary stars (ignoring higher-order multiplicities) with a flat distribution of companion masses between 20% and 100% of the primary's mass, the magnitude would be overestimated on average by 0.12 mag, such that the scale height and scale length would be underestimated by about 6%. If 70% of the α-old sample were to consist of binary systems (taking into account the rising binary fraction with decreasing metallicity), the magnitudes would be overestimated by approximately 0.21 mag, and the α-old scale heights and scale lengths would be underestimated by 10%. These biases are somewhat larger than the statistical uncertainties on our results, but they are similar to the overall distance-scale uncertainty (see above), and they do not change the conclusion that the α-old scale length is much shorter than that of the α-young sample. Even in a worst-case scenario, where all binary systems have equal-mass companions and where 100% of the α-old stars are in binaries, the α-old scale length would still be ≲ 2.8 kpc (40% up from 2 kpc), which is shorter than the scale length measured for the α-young sample in Table 2 and Figure 5, and the α-young scale lengths themselves would also increase by about 15% in this scenario. In principle, a careful spectral analysis of the SEGUE spectra themselves could provide direct constraints on the (unresolved) binary contamination in this sample.
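
The percentages in this paragraph follow from converting a mean magnitude offset into a distance factor. The sketch below takes the 0.12 mag and 0.21 mag offsets directly from the text and computes the equal-mass case from the doubling of the flux (2.5 log₁₀ 2 ≈ 0.75 mag). Averaging the magnitude offset over the population by multiplying with the binary fraction is a simplifying assumption of this sketch, and the names are ours.

```python
import math

# Magnitude offset of an unresolved equal-mass binary: twice the flux of a single star.
EQUAL_MASS_DMAG = 2.5 * math.log10(2.0)


def scale_correction(mean_dmag):
    """Factor by which true distances (and scales) exceed those inferred
    when the sample is on average mean_dmag magnitudes too bright."""
    return 10.0 ** (mean_dmag / 5.0)


cases = {
    "alpha-young, 40% binaries, flat mass ratios": 0.12,
    "alpha-old, 70% binaries, flat mass ratios": 0.21,
    "alpha-old worst case, 100% equal-mass binaries": 1.0 * EQUAL_MASS_DMAG,
    "alpha-young worst case, 40% equal-mass binaries": 0.4 * EQUAL_MASS_DMAG,
}
for label, dmag in cases.items():
    print(f"{label}: scales underestimated by {(scale_correction(dmag) - 1) * 100:.0f}%")
```

The worst-case factor for the α-old sample is √2, which turns a 2 kpc scale length into roughly 2.8 kpc.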

5.2. Halo Contamination

In our density fits we have mostly fitted disk components to the data, except for the single-exponential disk model where we added a uniform density (Equation (9)). We thus assumed that the stellar halo does not influence our disk fits, beyond what can be described by a uniform density across our survey volume. We can estimate the expected number of halo stars in our sample using the Bell et al. (2008) density fits to the smooth stellar halo. We run the Bell et al. (2008) stellar-halo density through the G-star SEGUE selection function, and marginalize over g − r color using a flat distribution over 0.48 ⩽ g − r ⩽ 0.55, and over [Fe/H] using the Ivezić et al. (2008) halo metallicity distribution (mean [Fe/H] = −1.52, width = 0.32). We then find that for ≈10⁸ G-type stars between 1 and 40 Galactocentric kpc in the stellar halo, there should be about 100 halo stars in our sample, compared to the total sample size of 30,353 G-type dwarfs. Hence, the halo contamination is very small and does not influence the fits. Additionally, halo contamination will be most severe for the α-old sub-populations, and this contamination should work to increase the inferred scales (length and height). Therefore, the result that the radial scale length of α-old sub-populations is shorter than that of α-young sub-populations is robust against any halo contamination.

5.3. Comparison to Traditional Geometric Disk Decompositions

The density fits in this paper are the first to constrain the vertical scale height and radial scale length of numerous disk sub-components, defined using elemental abundances alone, from a large sample of stars. Our results show that the vertically thicker-disk sub-components—when chemically defined—have a much shorter scale length than the thinner-disk sub-components, which is opposite to traditional purely geometric disk decompositions (e.g., Robin et al. 1996; Ojha 2001; Larsen & Humphreys 2003), which typically find that the thick-disk component has a longer scale length than the thin disk, and that this scale length is ≳ 3.5 kpc (e.g., Jurić et al. 2008).

When we fit the spatial structure in our approach, taking stars of all metallicities (specifically, the combination of our α-old ("thick") and α-young ("thin") samples), we can recover the result of purely geometric decompositions: the thin-disk component—i.e., the component with the lowest scale height, ≈300 pc—gets paired with the shortest scale length (≈2 kpc), while the thicker-disk component gets assigned both the largest scale height and scale length (for our particular sample fit with a combination of three double-exponential disks these are ≈600 pc and ≈2.4 kpc, with a small component with an even larger scale height and scale length). Thus, it seems that purely geometric decompositions naturally associate the longest scale length with the largest scale height. That both geometrically determined scale lengths are shorter than the scale length of the α-young sample is due to the fact that the metallicity distribution for the entire sample extends down to [Fe/H] = −1.5, such that the model "expects" many low-metallicity stars in the "thin" component at large distances (as the model does not contain the information that the thin component has higher metallicities), which are not observed. Therefore, metallicity and α-element abundances, which are manifestly quantities that can identify sub-samples of stars independent of their spatial structure and kinematics, lead to a qualitatively different decomposition into two (or more) sub-components than the purely geometrical approach, with its inherent risk of circular reasoning.

5.4. Implications for Disk Formation

The distinctive changes of the global disk structure with abundance, especially with the age proxy [α/Fe], should provide valuable clues to the formation of the Milky Way's disk. While a concrete and quantitative model comparison is beyond the scope of this paper, we discuss some of the qualitative implications here. As mentioned in Section 1, the overall radial-density profile of the stellar disk is expected to be conserved even in the face of large-scale radial migration, but the radial profile of sub-components will tend to relax to the mass-weighted mean radial profile. Thus, a difference in the radial distribution of various populations of stars today is a less-pronounced version of more different initial radial distributions (at formation). Assuming that the [α/Fe] ratio is an adequate proxy for age (e.g., Schönrich & Binney 2009b), our results then imply that the α-enhanced, hence oldest, populations are more centrally concentrated—have a shorter scale length—than populations with α-abundances that are closer to solar, and therefore younger. This is direct observational evidence for inside-out formation of galactic disks across the presumed age range of our sample, 1–10 Gyr, where the inner parts of the disk form before the outer part of the disk. A similar age dependence of the exponential scale length has been found in several external galaxies (de Jong et al. 2007; Radburn-Smith et al. 2012).

Second, our analysis shows that our Milky Way has not only a metallicity gradient among its youngest stars, but that it has always had one (Cheng et al. 2012): at a given [α/Fe], standing in for age, sub-populations with lower [Fe/H] have a longer scale length than more metal-rich stars. This picture is confirmed by looking at the orbital properties of the stars when integrating our sample of G-type dwarfs in a simple model for the Milky Way's potential, made up of a Miyamoto–Nagai disk with a radial scale of 4 kpc and vertical scale of 300 pc contributing 60% of the radial force at the solar radius, a Hernquist bulge with a scale radius of 600 pc contributing 5% of the radial force, and a Navarro–Frenk–White halo with a scale radius of 36 kpc that contributes 35% of the rotational support at the Sun's position. The median of the mean orbital radii as a function of elemental abundance is shown in Figure 7 (see also Lee et al. 2011b; Liu & van de Ven 2012 for similar figures of the eccentricity and rotational velocity). We see that stars with [α/Fe] <0.25 and lower [Fe/H] are thin-disk stars that live, on average, farther out than more metal-rich stars. Thus, the longer scale length for outer-disk stars, combined with the fact that, for solar [α/Fe], decreasing [Fe/H] is correlated with decreasing age (e.g., Schönrich & Binney 2009b), implies that the outer part of the disk formed later than the inner part.
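
A rough sketch of how a three-component potential of this kind can be normalized by its force fractions at the solar radius is given below. It assumes that the quoted radial and vertical scales correspond to the Miyamoto–Nagai parameters a and b, that all forces are evaluated in the midplane, and that the solar radius and local circular velocity are 8 kpc and 220 km s⁻¹; none of these choices is stated in the text, and the function names are ours.

```python
import numpy as np

R0, VC0 = 8.0, 220.0  # kpc, km/s: assumed solar radius and circular velocity


def mn_force(R, a=4.0, b=0.3):
    """Midplane radial-force shape of a Miyamoto-Nagai disk (arbitrary units)."""
    return R / (R**2 + (a + b) ** 2) ** 1.5


def hernquist_force(R, a=0.6):
    """Radial-force shape of a Hernquist bulge (arbitrary units)."""
    return 1.0 / (R + a) ** 2


def nfw_force(R, rs=36.0):
    """Radial-force shape of a Navarro-Frenk-White halo (arbitrary units)."""
    x = R / rs
    return (np.log1p(x) - x / (1.0 + x)) / R**2


# (force shape, fraction of the radial force at R0), as quoted in the text.
COMPONENTS = [(mn_force, 0.60), (hernquist_force, 0.05), (nfw_force, 0.35)]


def circular_velocity(R):
    """Circular velocity in km/s at Galactocentric radius R (kpc) in the midplane."""
    R = np.asarray(R, dtype=float)
    # Each component is rescaled so that it supplies its fraction of VC0^2 / R0 at R0.
    force = sum(frac * f(R) / f(R0) for f, frac in COMPONENTS)
    return VC0 * np.sqrt(force * R / R0)
```

By construction the model returns the assumed circular velocity at R0; orbit integration in such a potential then yields the mean orbital radii discussed here.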

**Figure 7.** Mean orbital radii of the G-dwarf sample. Median values of the mean orbital radii are shown, in bins of width 0.05 dex in [Fe/H] and 0.025 dex in [α/Fe]. Only bins with at least 20 stars are shown.

We have assumed that [α/Fe] is an adequate proxy for age, such that the mono-abundance populations that are more [α/Fe]-enhanced are older than the populations with solar [α/Fe]. This is typically the case in standard scenarios for the star formation history of the Milky Way disk, in which [α/Fe] steeply drops around 2–3 Gyr due to the onset of Type Ia supernovae (Dahlen et al. 2008; Maoz et al. 2011), and then stays roughly constant, although the value of [Fe/H] at which the [α/Fe] downturn happens depends on the star formation history (Matteucci & Recchi 2001). Only if the local star formation was characterized by bursts of star formation can younger populations of stars have similar levels of [α/Fe] as older stars (Gilmore & Wyse 1991). Most current fits of the local star formation history prefer a smooth history (e.g., Aumer & Binney 2009), although it is difficult to rule out epochs of enhanced star formation (e.g., Rocha-Pinto et al. 2000).

5.5. Implications for Disk Evolution

The spatial structure inferred for mono-abundance sub-populations (Figures 4 and 5) shows two important results: first, there is a tight anti-correlation between the scale heights and scale lengths of the sub-components. Second, there is a continuous distribution in scale height when moving from α-enhanced, metal-poor populations to stars with solar [α/Fe] and [Fe/H]. This suggests that the α-old, "thick," and the α-younger, "thin," regimes of the stellar disk are not two separate entities, but merely opposite ends of the disk evolution spectrum (suggested before in Norris 1987, but never directly measured as we do here). This issue, which requires proper stellar-mass weighting of the sub-components, is worked out in a separate paper (Bovy et al. 2012a). Taken together, these findings suggest a continuous evolutionary mechanism created the observed scale-height distribution, rather than a discrete external heating or accretion event. Radial migration is an obvious candidate for this internal evolution mechanism. That the most centrally concentrated component of the disk is not only the (α-)oldest part, but also has the largest scale height, is a nearly inevitable condition, and hence a natural prediction, of any scenario where much of the disk scale-height distribution is created through radial migration. The α-old sub-population not only had the most time to evolve, but its centrally concentrated parent population implies that stars at 6 kpc < R < 12 kpc have migrated out by the largest factor.

A different internal explanation for the thicker-disk components in the Milky Way is that, rather than being thickened over the history of the Galactic disk, thick components were created thick during an early, turbulent phase in the formation of the disk (e.g., Bournaud et al. 2009; Förster Schreiber et al. 2009). If such a scenario is combined with an inside-out growth of the disk, and the disk remains turbulent over a significant fraction of its history, this formation scenario could plausibly explain the continuous dependence of disk structure on elemental abundance found in this paper.

Our result that the transition between the α-young, "thin," components and the α-old, "thick," components is smooth, rather than showing a clear separation between thin and thick components, may appear to be in conflict with local, high-resolution spectroscopic samples of stars (e.g., Reddy et al. 2006; Fuhrmann 2011; Navarro et al. 2011) or other analyses of the SEGUE data (e.g., Lee et al. 2011b). A detailed comparison between these and our results requires careful accounting for the spectroscopic volume sampling, which has not been done in the Lee et al. (2011b) analysis or for the high-resolution samples, except for the sample of Fuhrmann (2011), which is volume complete out to 25 pc. Without taking the volume selection into account, the sample used here also displays a bi-modality in the [Fe/H]–[α/Fe] plane (see Figure 2). We discuss this issue in more detail in Bovy et al. (2012a), but we note here that the apparent bi-modality in the observed number density of stars disappears when properly correcting for the spectroscopic sampling. Furthermore, the local, high-resolution analyses cannot directly measure the spatial distribution of stars of different elemental abundances (e.g., Fuhrmann 2011, which only has 15 high-[α/Fe] stars out to 25 pc; Reddy et al. 2006) and therefore rely on kinematics to argue that the vertical distribution of stars in the solar neighborhood is characterized by a bi-modal "thin"–"thick" disk dichotomy. This interpretation is driven by the selection of stars that are disjoint in [α/Fe], which leads to disjoint kinematics because the kinematics is a strong—and smooth—function of abundance as well (Bovy et al. 2012b). While the stellar content of different survey volumes can (and should) be connected by dynamics, we note that the effective volumes sampled by, e.g., the Fuhrmann (2011) survey and by our analysis differ by a factor of about 10⁵; hence, the extrapolation from one to the other is enormous. 
The analysis of the vertical kinematics of stars in our sample confirms the existence of the intermediate populations with scale heights between 400 and 600 pc and vertical-velocity dispersions of 30–35 km s⁻¹ (Bovy et al. 2012b).

Our finding that the scale length does not behave as smoothly as the scale height, as a function of [α/Fe], is presumably a consequence of the disk's formation history: here the increasing metallicity as a function of time (i.e., youth) and the radial metallicity gradient compete. As the mapping between [α/Fe] and age is not linear, but rather [α/Fe] steeply drops around 2–3 Gyr due to the onset of Type Ia supernovae (Matteucci & Recchi 2001; Dahlen et al. 2008; Maoz et al. 2011) and then stays roughly constant, the scale length should change similarly rapidly with [α/Fe]. The scale height, however, is determined by subsequent evolution, where radial migration transports stars to larger Galactocentric radii, where the lower disk density allows them to travel farther from the plane. Since this evolution is continuous, rather than sudden, and includes additional contributions from heating, trends in scale height versus elemental abundance should be expected to be smoother, even if radial migration is not the disk's dominant evolutionary mode. Our results are therefore consistent with a scenario where the thick-disk component is the inner part of the disk that formed at the earliest time, and either by having formed thick or through the effect of radial migration, has a large scale height at the present time.

A gas-rich merger, followed by intense star formation at an early time, could have affected the formation of the early disk (Brook et al. 2004), as seems consistent with the observed distribution of eccentricities of the thick-disk component (Sales et al. 2009; Dierickx et al. 2010; Wilson et al. 2011). However, it would lead to a scale length for the thicker component that is larger than that of the thinner component (Qu et al. 2011). It therefore seems that internal mechanisms must have played an important role during the evolution of the disk. We caution, however, that the radial and vertical consequences of neither radial migration nor turbulent disk evolution, nor of satellite thickening, have been worked out in quantitative detail, and, in particular, resonant coupling between satellites and the disk might induce some similar observational signatures to radial migration.

The rapid change in the mean stellar population in an [α/Fe]–[Fe/H] abundance bin at the onset of Type Ia supernovae is also likely the explanation for the presence of the few points of intermediate [α/Fe] and [Fe/H] in Figure 5 that do not follow the anti-correlation between scale height and scale length; these bins, which do not contribute significantly to the total stellar mass (indicated by the size of the symbols in Figure 5), are also the bins that fall short of the one-to-one correlation between single- and two-disk fits in Figure 6. This provides further evidence of the fact that at the rapid [α/Fe] (age) transition our bins do not adequately resolve single components.

6. CONCLUSIONS

The main conclusions of this paper are as follows.

1. An assessment of the global (R, Z) structure of the Milky Way's stellar disk for sub-components selected solely by their elemental abundances is now feasible, e.g., with spectroscopic surveys such as SEGUE, but requires a thorough accounting for the effective selection function of the spectroscopic sample.
2. A decomposition of the Galactic disk, based on SDSS/SEGUE data for G-type dwarfs, into mono-abundance sub-populations in the [Fe/H]–[α/Fe] plane, reveals that each such component has a simple spatial structure that can be described by single-exponential profiles in both the vertical and the radial direction.
3. Adopting increasing levels of [α/Fe] enhancement as a proxy for the increasing age of the stellar population, the disk dissection into narrow mono-abundance populations in the space of [Fe/H] and [α/Fe] exhibits a continuous trend of increasing scale height and decreasing scale length, when moving from younger to older populations of stars.
4. We find that the oldest (most α-enhanced) part of the disk is both the thickest and the most centrally concentrated. If we split the sample in only two broad abundance regimes we can make a precise determination of the α-old scale length, 2.01 ± 0.05 kpc, and scale height, 686 ± 11 pc. The α-younger disk has a scale length of around 3.5 kpc (3.6 ± 0.2 kpc for our nominal α-young sample) and is far thinner, with a vertical scale height of 256 ± 4 pc.
5. These observations show quite directly that the bulk of the Galactic disk has formed from the inside out.
6. The tight (anti-) correlations between population age, vertical scale height, and radial scale length strongly suggest that the disk's subsequent evolution must have been heavily influenced by internal mechanisms, such as radial migration or turbulent, gravitationally unstable disk evolution, as this naturally explains the continuous increase of scale height with decreasing scale length. At first sight, external mechanisms to form the Milky Way's thick-disk component through external heating or accretion appear to be inconsistent with our results, but a thorough model comparison is warranted.

While, at face value, our results emphasize the importance of evolutionary processes that could be purely internal to the Milky Way (radial migration, turbulent disk formation), the overall ΛCDM cosmogony makes it likely that external processes must also have played some role. In the end, it is likely that the Milky Way disk's formation history may be more complex than inferred here, especially once not only the spatial distribution but also the orbital distribution of the mono-abundance sub-populations are fully analyzed.

It is a pleasure to thank the anonymous referee, James Binney, Doug Finkbeiner, Dan Foreman-Mackey, Patrick Hall, Juna Kollmeier, George Lake, Rok Roškar, Scott Tremaine, Glenn van de Ven, and Lan Zhang for helpful comments and assistance. We thank the SEGUE team for their efforts in producing the SEGUE data set, and Connie Rockosi and Katie Schlesinger in particular for help with the SEGUE selection function. Support for program number HST-HF-51285.01 was provided by NASA through a Hubble Fellowship grant from the Space Telescope Science Institute, which is operated by the Association of Universities for Research in Astronomy, Incorporated, under NASA contract NAS5-26555. J.B. and D.W.H. were partially supported by NASA (grant NNX08AJ48G) and the NSF (grant AST-0908357). D.W.H. is a research fellow of the Alexander von Humboldt Foundation of Germany. J.B. and H.W.R. acknowledge partial support from SFB 881 funded by the German Research Foundation DFG. Y.S.L. and T.C.B. acknowledge partial funding of this work from grants PHY 02-16783 and PHY 08-22648: Physics Frontier Center / Joint Institute for Nuclear Astrophysics (JINA), awarded by the National Science Foundation.

Funding for the SDSS and SDSS-II has been provided by the Alfred P. Sloan Foundation, the Participating Institutions, the National Science Foundation, the U.S. Department of Energy, the National Aeronautics and Space Administration, the Japanese Monbukagakusho, the Max Planck Society, and the Higher Education Funding Council for England. The SDSS Web site is http://www.sdss.org/.

APPENDIX A: THE SEGUE G-STAR SELECTION FUNCTION

To determine the spatial distribution of the G dwarfs, we require a good understanding of the SEGUE G-star selection function, i.e., the fraction of stars that has been targeted by SEGUE and produced good enough spectra to derive the parameters needed in the present (or any other) analysis (e.g., S/N >15), and we need this selection fraction as a function of position, color, and apparent magnitude. The observed density of G-type stars is simply the product of the underlying density with the sampling selection function, suggesting that one constrains this underlying density by forward modeling of the observations.

The spectroscopic G-star target type was selected uniformly from the set of objects in the G-star color–magnitude box in the area and apparent magnitude range of the spectroscopic plug-plates (simply "plates" hereafter); thus the selection function can be reconstructed. The SEGUE survey implementation distinguishes between "bright" and "faint" plates, with bright plates containing stars with r ⩽ 17.8 mag and faint plates containing stars with r > 17.8 mag. For the purposes of the selection function, we assume that this separation at 17.8 mag is a hard cut, even though in reality some stars were observed on both bright and faint plates for calibration purposes, and some "bright" stars are part of faint plates, and vice versa, because of changes between the photometry used for target selection and that released as part of the SDSS DR7, which we employ here. Duplicates are resolved in favor of the higher S/N observation (typically on the faint plate as this has a longer integration time). We retain stars with r ⩾ 17.8 mag when they were observed on a bright plate, and we keep objects with r < 17.8 mag when they were observed on a faint plate, even though this should not happen in our model for the SEGUE selection function below. A total of 586 stars in the α-old sample and 47 stars in the α-young sample fall into this category; they do not influence any of the fits or conclusions in this paper.

We select the superset of targets by querying the SDSS DR7 imaging CAS⁸ for all potential targets in the color–magnitude box of the G-star target type in the area of a SEGUE plate (Yanny et al. 2009). These objects are primary⁹ detections (removing duplicates and objects from overlapping imaging scans) with stellar point-spread functions (PSFs) (type equal to 6). Objects must not be saturated, nor be close to the edge, nor have an interpolated PSF (interp_psf), and must not have an inconsistent flux count (badcounts). Furthermore, if the center is interpolated (interp_center), there should not be a cosmic ray indicated (cr). See Stoughton et al. (2002) for a description of the SDSS photometric flags. Using the superset of targets we determine for each plate the fraction of stars that were observed spectroscopically of all available targets.

To infer the dependence on color and apparent magnitude of the selection function, we look at the distribution of the potential G-star targets in color–magnitude space. This is shown in Figure 8. The distribution of the spectroscopic sample is overlaid. This shows that the spectroscopic sampling is relatively fair in g − r color, with some frayed edges because of changes between target and current photometry, and that the selection as a function of r-band magnitude tapers at the faint end, as should be expected when using an S/N cut. If all SEGUE plates were integrated to the same depth, the S/N cut should be a clean cut in r, but it is clear from Figure 8 that this is not the case. To distinguish between relatively shallow and relatively deep plates, we introduce the overall plate signal-to-noise ratio plateSN_r,

$\begin{equation} \tt{plateSN\_r} = (\mathrm{sn1\_1}+\mathrm{sn2\_1})/2\,, \end{equation} \tag{ A1 }$

where sn1_1 and sn2_1 are the r-band plate S/N for the two SDSS spectrographs (see Table 17 in Stoughton et al. 2002). The faintest spectroscopic G-type star per plate as a function of plateSN_r is shown in Figure 9 for the faint plates. This figure shows that there is a clear difference in the faintest object that could have been successfully observed at S/N > 15 between relatively shallow and relatively deep plates. The bottom panel of Figure 9 shows the S/N of stars on four plates chosen to cover a range in the overall plate S/N. This shows that the S/N > 15 cut for the entire sample translates into a fairly sharp r-band cut for each individual plate.

**Figure 8.** Distribution of the photometric sample of G-type stars (linear density gray scale; black curves) and the spectroscopic sample (white contours, dashed histograms) after the signal-to-noise ratio cut of S/N >15. The contours contain 68%, 95%, and 99% of the distribution.

**Figure 9.** Maximum apparent r-band magnitude per plate vs. overall plate signal-to-noise ratio `plateSN_r` for the G-star sample with S/N >15 for faint plates (top panel). Signal-to-noise ratio, S/N, for stars on four typical SEGUE plates as a function of the apparent r-band magnitude (bottom panel). The four plates have been chosen to show a range in overall plate signal-to-noise ratio `plateSN_r`.

Our model for the SEGUE G-star selection function is then the following: for each plate we find the faintest targeted object in r-band magnitude with S/N larger than our S/N cut, with apparent magnitude r_cut (if this object is fainter than the nominal limit r_max for bright or faint plates, we set r_cut equal to this limit; r_max = 17.8 mag for bright plates and 20.2 mag for faint plates), and then assume that the selection function for that plate is given by a hyperbolic-tangent cutoff, centered on r_cut − 0.1 mag, and with a width parameter whose natural logarithm is −3 (≈0.05 mag), such that the total width of the cutoff is about 0.2 mag and the faintest object on the plate is about 2 widths from the center of the cutoff. The function value at the bright end is equal to the number of spectroscopic objects brighter than r_cut divided by the total number of targets brighter than r_cut. Thus, the plate-dependent selection function is given by

$\begin{eqnarray} &&S(\mathrm{plate},r,g-r)\nonumber\\ &&\quad= \frac{\rm{No.\ of }\ \mathrm{spectroscopic\ objects\ with}\ {\it r}_{\mathrm{min}} \le {\it r} \le {\it r}_{\mathrm{cut}}}{\rm{No.\ of\ } \mathrm{targets\ with}\ {\it r}_{\mathrm{min}} \le {\it r} \le {\it r}_{\mathrm{cut}}} \nonumber\\ &&\qquad\times \left[1-\tanh \left(\frac{{\it r}-{\it r}_{\mathrm{cut}}+0.1}{\exp \left(-3\right)}\right)\right]\Big / 2\,, \end{eqnarray} \tag{ A2 }$

where the numbers of objects are evaluated within the ≈7 deg² area of the plate in question and in the 0.48 ⩽ g − r ⩽ 0.55 G-star color range; r_min is 14.5 mag for bright plates and 17.8 mag for faint plates. The selection function is zero outside of the apparent r-band magnitude range of the plate ([14.5, 17.8] for bright plates and [17.8, 20.2] for faint plates).
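As an illustration, Equation (A2) can be sketched in a few lines of Python. This is a minimal sketch and not the posted segueSelect code: the function name `selection_fraction`, its arguments, and the dictionary layout are hypothetical, and the target and spectroscopic counts are assumed to have been tabulated beforehand for the plate and the G-star color range.

```python
import math

# Magnitude limits quoted in the text (r band, mag).
R_MIN = {"bright": 14.5, "faint": 17.8}
R_MAX = {"bright": 17.8, "faint": 20.2}
WIDTH = math.exp(-3.0)  # tanh width parameter, about 0.05 mag
OFFSET = 0.1            # the cutoff is centered on r_cut - 0.1 mag


def selection_fraction(r, r_cut, n_spec, n_targets, plate_type):
    """Sketch of Equation (A2) for a single plate.

    n_spec and n_targets are the numbers of spectroscopic objects and of
    targets with r_min <= r <= r_cut (hypothetical, pre-tabulated inputs).
    """
    # The selection function is zero outside the plate's magnitude range.
    if not (R_MIN[plate_type] <= r <= R_MAX[plate_type]):
        return 0.0
    # r_cut is capped at the nominal faint limit of the plate type.
    r_cut = min(r_cut, R_MAX[plate_type])
    bright_end = n_spec / n_targets
    taper = 0.5 * (1.0 - math.tanh((r - r_cut + OFFSET) / WIDTH))
    return bright_end * taper
```

With these choices the taper equals one half at r_cut − 0.1 mag and has dropped to roughly 2% at r_cut itself, which corresponds to the statement that the faintest object lies about two widths from the center of the cutoff.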

We use this model both for the bright plates and the faint plates, although most bright plates are in fact consistent with being complete up to 17.8 mag. Figure 10 shows the distribution of Kolmogorov–Smirnov (K-S) probabilities that the spectroscopic sample for any given plate was selected from the target sample with this model for the selection function. All but seven plates have probabilities larger than 0.001 and the distribution of probabilities is relatively flat, as expected.

**Figure 10.** Distribution of the probability that a plate's spectroscopic sample was drawn from the photometric sample combined with the model selection function. The leftmost bin contains the plates with probability <0.001. For the correct model of the SEGUE selection function, the distribution should be approximately flat between zero and one.

Rather than using a smooth hyperbolic-tangent cutoff, we also tried a sharp cut at r_cut. With this model for the selection function, 79 plates have a K-S probability <0.05 (≈25% of the number of plates), as opposed to 30 plates in the hyperbolic-tangent-cutoff model (≈9% of the sample). Therefore, the smooth cutoff is necessary to fully model the selection function. The fact that the distribution of K-S probabilities in Figure 10 is not entirely flat is due to remaining details in the faint cutoff of the selection function, as we know that the selection function is flat at brighter magnitudes. This does not impact our analysis greatly, as most stars are much brighter than the cutoff (as compared to the scale over which the selection function changes near the cutoff).
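The per-plate comparison described above can be sketched as follows. This is only an illustration under stated assumptions: the model distribution is taken to be the target magnitudes weighted by the hyperbolic-tangent taper, the K-S distance is the largest difference between that weighted cumulative distribution and the empirical one of the spectroscopic sample, and the probability uses the standard asymptotic Kolmogorov series with a common small-sample correction. The function names are hypothetical, and the exact procedure used for Figure 10 may differ.

```python
import bisect
import math


def tanh_selection(r, r_cut, width=math.exp(-3.0), offset=0.1):
    """Magnitude-dependent taper with the shape of Equation (A2)."""
    return 0.5 * (1.0 - math.tanh((r - r_cut + offset) / width))


def ks_statistic(target_mags, spec_mags, r_cut):
    """Largest distance between the spectroscopic empirical CDF and the
    selection-weighted CDF of the photometric targets."""
    targets = sorted(target_mags)
    spec = sorted(spec_mags)
    cumulative, running = [], 0.0
    for r in targets:
        running += tanh_selection(r, r_cut)
        cumulative.append(running)
    d_max = 0.0
    # Both CDFs are step functions, so checking every jump point suffices.
    for x in targets + spec:
        k = bisect.bisect_right(targets, x)
        model = cumulative[k - 1] / running if k > 0 else 0.0
        empirical = bisect.bisect_right(spec, x) / len(spec)
        d_max = max(d_max, abs(empirical - model))
    return d_max


def ks_probability(d, n):
    """Asymptotic probability of a K-S distance of at least d for n points."""
    lam = (math.sqrt(n) + 0.12 + 0.11 / math.sqrt(n)) * d
    if lam < 0.2:  # the series converges poorly here; the value is ~1
        return 1.0
    total = sum((-1) ** (j - 1) * math.exp(-2.0 * j * j * lam * lam)
                for j in range(1, 101))
    return min(1.0, max(0.0, 2.0 * total))
```

A sharp-cut model corresponds to replacing `tanh_selection` with a step function at r_cut, so the same two functions can be used to compare the two cutoff models plate by plate.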

The selection function is simplest in its native coordinates: survey plate and r-band magnitude. For each value of g − r and [Fe/H], the r-dependent selection function above translates into a (different) spatial selection function through the use of the photometric-distance relation. The selection function projected into spatial coordinates for a typical value of g − r and [Fe/H] is shown in Figure 11. Near |b| = 90° the spectroscopic sample is relatively complete, whereas near the Galactic plane the selection is much less complete.

**Figure 11.** The SEGUE selection function—the fraction of objects successfully observed spectroscopically with S/N >15—for the G-star sample, as a function of Galactic coordinates X and Y (left panel), and of Galactocentric radius R and vertical height Z (right panel). The r-dependent SEGUE selection function is here transformed into spatial coordinates using the photometric-distance relation applied to a color g − r = 0.515 mag and [Fe/H] = −0.5.

We have posted a Python code that implements this model for the SEGUE selection function. It is publicly available at https://github.com/jobovy/segueSelect .

APPENDIX B: THE MAGNITUDE–COLOR–METALLICITY DENSITY AND ESTIMATES OF THE EFFECTIVE SURVEY VOLUME

The density in magnitude–color–metallicity space needs to be included in the likelihood in Equation (6), because it forms the basis of the photometric-distance relation used to translate observed colors, metallicities, and apparent magnitudes into distances, which ultimately relate to the effective search volume. We assume here for simplicity that stars of a given g − r and [Fe/H] follow a single stellar isochrone given by the Ivezić et al. (2008) photometric-distance relation, expressed in terms of g − r by using Equation (1) to translate g − r into the g − i color used by that relation. The reason for expressing the Ivezić et al. (2008) g − i–metallicity–magnitude relation in terms of g − r is to keep the integration in Equation (7) simple; if we had chosen to use the g − i relation we would have to include the r − i color as well, and model and integrate over the full g − r, r − i plane. As the stellar locus is very narrow (≲ 0.1 mag), this adds less (random) scatter than is intrinsic to the photometric-distance relation.

In the single-isochrone model, ρ(r, g − r, [Fe/H]|R, Z, ϕ) becomes the product of a delta function with the density in the color–metallicity plane,

$\begin{eqnarray} && \rho (r,g-r,[\mathrm{Fe/H}]|R,Z,\phi)\nonumber\\ && = \delta (r-r[g-r,[\mathrm{Fe/H}],d]|R, Z,\phi)\rho (g-r,[\mathrm{Fe/H}]|R,Z),\nonumber\\ \end{eqnarray} \tag{ B1 }$

where r[g − r, [Fe/H], d] is the apparent magnitude derived from the photometric-distance relation combined with the distance, and by a slight abuse of notation we have used the same symbol to denote the density in the color–metallicity plane. We assume that this density is independent of Galactocentric azimuth ϕ, but for now allow it to depend on R and Z. Using this, the normalization integral in Equation (7) simplifies to

$\begin{eqnarray} &&\int {d}l\,{d}b\,{d}d\, {d}r\, {d}(g-r) \,{d}[\mathrm{Fe/H}]\,\lambda (l,b,d,r,g-r,[\mathrm{Fe/H}]|\theta) \nonumber\\ &&= A_p\,\sum _{\mathrm{plates}\ p} \int {d}(g-r)\,{d}[\mathrm{Fe/H}]\, \nonumber\\ && \quad \times \int _{d[r_{\mathrm{min}},g-r,[\mathrm{Fe/H}]]}^{d[r_{\mathrm{max}},g-r,[\mathrm{Fe/H}]]} {d}d\, S(p,r[g-r,[\mathrm{Fe/H}],d],g-r)\,\nonumber\\ &&\quad \times \rho (g-r,[\mathrm{Fe/H}]|R,Z)\,d^2\,\nu _*(R,z|l,b,d,\theta), \end{eqnarray} \tag{ B2 }$

where r_min and r_max are the minimum and maximum apparent magnitude of plate p, and the functions d[ · ] and r[ · ] use the photometric-distance relation.

The color–metallicity distribution for the α-young and α-old sample is shown in Figures 12 and 13, respectively. The top-left panel shows the distribution for the entire sample; the remaining panels show the color–metallicity distribution as a function of Galactocentric radius (including all vertical heights) and as a function of vertical height (including all Galactocentric radii). For both samples, the color–metallicity distribution separates into the product of one-dimensional color and metallicity distributions; thus we assume that ρ(g − r, [Fe/H]|R, Z) = ρ^c(g − r|R, Z) ρ^[Fe/H]([Fe/H]|R, Z). The g − r distribution is independent of R and Z for both the α-young and the α-old sample; we use a spline interpolation of the color distribution for the full sample for ρ^c(g − r|R, Z), independent of R and Z. This interpolation is shown in the top histogram in all panels of Figures 12 and 13. The metallicity distribution of the α-old sample is also mostly independent of R and Z, with only a hint of a trend toward a more metal-poor distribution at large distances from the plane. The [Fe/H] distribution of the α-young sample shows expected trends with R and Z: the peak of the metallicity distribution goes from more metal-rich closer to the Galactic center and closer to the plane, to more metal-poor at larger Galactocentric radii and at larger Z. These shifts are modest (≲ 0.1 dex), which is partly due to the fact that farther from the solar radius we preferentially see stars at larger distances from the plane. We stress that these metallicity distributions are the observed distributions uncorrected for selection effects, but selection effects play a minor role and merely shift the overall distribution by ≈0.1 dex (Schlesinger et al. 2012). We investigate the effect of systematically shifting the metallicity distribution below.

**Figure 12.** Distribution of [Fe/H] and g − r for the full α-young G-dwarf sample (top, left panel) and split into ranges in Galactocentric radius R and vertical height |Z| (other panels). Linear binned densities with contours containing 68%, 95%, and 99% of the distribution and individual outliers beyond 99% are shown in spatial bins with more than 1500 stars. A smooth interpolation of the one-dimensional distributions in the top left panel is shown in all panels. The bottom right panel shows the absolute r-band magnitude as a function of g − r and [Fe/H] from the Ivezić et al. (2008) color–metallicity–magnitude relation (their Equation (A7)).

**Figure 13.** Same as Figure 12, but for the α-old G-dwarf sample.

The effect of metallicity and color on the absolute magnitude using the Ivezić et al. (2008) color–metallicity–magnitude relation is shown in the bottom right panel of Figures 12 and 13, for the ranges in color and metallicity considered for both samples. From the blue and metal-rich to the red and metal-poor end the shift in absolute magnitude is about 1 mag, or a factor of about 1.6 in distance.
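The conversion between a shift in absolute magnitude and a ratio of distances follows directly from the distance modulus. The short sketch below illustrates this; the function names are hypothetical and extinction is ignored for simplicity.

```python
def photometric_distance_pc(apparent_mag, absolute_mag):
    """Distance in pc from the distance modulus m - M = 5 log10(d / 10 pc)."""
    return 10.0 ** ((apparent_mag - absolute_mag) / 5.0 + 1.0)


def distance_ratio(delta_abs_mag):
    """Ratio of inferred distances for two stars of equal apparent magnitude
    whose absolute magnitudes differ by delta_abs_mag."""
    return 10.0 ** (0.2 * delta_abs_mag)


print(round(distance_ratio(1.0), 2))  # 1.58
```

A 1 mag shift in absolute magnitude therefore changes the inferred distance by a factor of 10^0.2 ≈ 1.58, which is the factor of about 1.6 quoted above.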

As the α-old metallicity distribution depends only weakly on R and Z, we will assume that it is constant, and use a spline interpolation of the [Fe/H] distribution of the full sample as our model for ρ^[Fe/H]([Fe/H]|R, Z). We do the same for the α-young sample, even though there are slight trends with R and Z. These models are shown in the right histograms of all panels in Figures 12 and 13. We can then simplify the normalization integral in Equation (B2) further to

$\begin{eqnarray} &&\!\int \!\!{d}l\,{d}b\, {d}d\, {d}r\, {d}(g\,{-}\,r) \,{d}[\mathrm{Fe/H}]\,\lambda (l,b,d,r,g\,{-}\,r, [\mathrm{Fe/H}]|\theta)\nonumber\\ &&\quad = A_p\,\sum _{\mathrm{plates}\ p} \int {d}(g-r)\,\rho ^c(g-r|R,Z)\nonumber\\ &&\qquad\!\times \int {d}[\mathrm{Fe/H}]\,\rho ^{[\mathrm{Fe/H}]}([\mathrm{Fe/H}]|R,Z)\nonumber\\ &&\qquad\! \times \int _{d[r_{\mathrm{min}},g-r,[\mathrm{Fe/H}]]}^{d[r_{\mathrm{max}},g-r,[\mathrm{Fe/H}]]} {d}d S(p,r[g-r,[\mathrm{Fe/H}],d],g-r)\nonumber\\ &&\qquad\!\times d^2\,\nu _*(R,z|l, b,d,\theta). \end{eqnarray} \tag{ B3 }$

If we then determine the overall minimum and maximum heliocentric distances at which we can observe stars in both samples, we can calculate the inner integral between these limits, with the understanding that the selection function is zero outside of the apparent-magnitude range of the plate in question (since bluer or more metal-rich stars can only be observed at distances starting at a value that is larger than the overall minimum distance, and redder and more metal-poor stars can only be seen out to distances that fall short of the overall maximum distance, because of the color and metallicity dependence of the photometric-distance method). We can then calculate the integral by summation on a regular grid as

$\begin{eqnarray} &&\int {d}l\,{d}b\,{d}r\,{d}d\, {d}(g-r) \,{d}[\mathrm{Fe/H}]\,\lambda (l,b,d,r,g-r,[\mathrm{Fe/H}]|\theta) \nonumber\\ &&= A_p\,\sum _{\mathrm{plates}\ p} \sum _{d}\,d^2\,\nu _*(R,z|l,b,d,\theta)\,\sum _{g-r} \,\sum _{[\mathrm{Fe/H}]}\,\rho ^c(g-r)\,\nonumber\\ &&\quad\times \rho ^{[\mathrm{Fe/H}]}([\mathrm{Fe/H}])\, S(p,r[g-r, [\mathrm{Fe/H}],d],g-r), \end{eqnarray} \tag{ B4 }$

where the distance summation is between the overall minimum and maximum distances. We dropped integration factors Δd, Δ(g − r), and Δ[Fe/H], as these only contribute terms that do not depend on the parameters θ in the log likelihood in Equation (8) (note that they do contribute when we do not marginalize over the amplitude of the density in Equation (6)). Written in this way, this normalization integral can be computed efficiently, as all of the necessary coordinate transformations, selection function evaluations, and color–metallicity-distribution function calls can be pre-computed on a dense grid.
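The pre-computation described here can be sketched as follows. This is a minimal illustration and not the analysis code used for the fits: all function and argument names are hypothetical, the color and metallicity distributions, the absolute-magnitude relation, and the selection function are supplied by the caller, the Sun is assumed to lie at 8 kpc from the Galactic center in the plane, and the density is a single exponential disk. The constant factor A_p and the grid spacings are dropped, as in Equation (B4).

```python
import math

R0_KPC = 8.0  # assumed distance from the Sun to the Galactic center


def galactocentric(l_deg, b_deg, d_kpc):
    """Return (R, Z) in kpc for Galactic (l, b) and heliocentric distance d."""
    l, b = math.radians(l_deg), math.radians(b_deg)
    x = R0_KPC - d_kpc * math.cos(b) * math.cos(l)
    y = d_kpc * math.cos(b) * math.sin(l)
    return math.hypot(x, y), d_kpc * math.sin(b)


def precompute(plates, d_grid, colors, fehs, rho_c, rho_feh, abs_mag, selection):
    """Tabulate every factor of Equation (B4) that does not depend on theta.

    Returns one (weight, R, Z) tuple per plate and distance grid point.
    """
    table = []
    for plate in plates:
        for d in d_grid:
            dist_mod = 5.0 * math.log10(d) + 10.0  # distance modulus, d in kpc
            weight = sum(
                rho_c(c) * rho_feh(f) * selection(plate, abs_mag(c, f) + dist_mod, c)
                for c in colors
                for f in fehs
            )
            R, Z = galactocentric(plate["l"], plate["b"], d)
            table.append((d * d * weight, R, Z))
    return table


def normalization(table, h_R, h_z):
    """Grid sum of Equation (B4) for an exponential disk with scale length
    h_R and scale height h_z (both in kpc)."""
    return sum(
        w * math.exp(-(R - R0_KPC) / h_R - abs(Z) / h_z)
        for w, R, Z in table
    )
```

Because `precompute` is called once, each likelihood evaluation reduces to the single weighted sum in `normalization`, which is the reason the integral is cheap to evaluate repeatedly during a fit.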

APPENDIX C: DETAILED DATA VERSUS MODEL COMPARISONS

In this appendix we present detailed comparisons of our best-fit density models with the observed data, as ultimately the best-fit density parameters are constrained through the quality of the fit in the natural coordinates of the spectroscopic data (l, b, r, g − r, [Fe/H]). We also show that the results we obtain for different sub-samples of our nominal samples are consistent with the best fits for the full samples. As we fit density models by forward modeling the underlying density model, i.e., by taking the spatial density and running it through the SEGUE selection function and the photometric-distance relation, we cannot show direct maps of the density in any meaningful way without massaging the data excessively. Therefore, we compare the observed star counts with the best-fit model by running the underlying star counts model through the selection function and photometric magnitude–color–metallicity relation, and then comparing it with the observed star counts. This has the added advantage that it shows that the entire framework of (1) the underlying density, (2) the photometric magnitude–color–metallicity relation, and (3) our model of the SEGUE selection function provides a valid description of the observed data.

C.1. The α-old Disk Stars

Figure 14 compares the observed distribution of vertical heights |Z| of the α-old G-dwarf sample to that predicted by the best-fit model. This prediction is obtained by running the best-fit density model integrated over the color–metallicity distribution through our model for the SEGUE selection function. For this comparison we cut from the data sample the 35 stars with magnitudes that should put them on faint plates but that were observed on bright plates, as well as the 551 stars in the opposite situation. The comparison between the data and the model is shown for all plates and for the bright and faint plates separately. The agreement between the model and observed distribution is excellent for all of these. Figure 15 shows a similar comparison for the distribution of Galactocentric radii of the data and in the model. The model correctly predicts the observed star counts for most Galactocentric radii, except the smallest around 5 kpc, where the model slightly overpredicts the number of stars (here, and in further comparisons below, the model around 8 kpc behaves somewhat erratically, as this is the boundary between 90° ⩽ l ⩽ 270° and −90° < l < 90° plates, and we do not use the finite extent of the plate in our model distributions). Also shown in this figure and in Figure 14 are models that only differ from the best-fit model in their radial scale length: a model with a scale length of 3 kpc and one with a scale length of 4 kpc. It is clear that these longer scale lengths are strongly ruled out by the data, as they overpredict the star counts at large Galactocentric radii.

**Figure 15.** Same as Figure 14, but for the distribution of Galactocentric radii. The dashed and dotted lines show the same model but with a scale length of 3 and 4 kpc, respectively.

Table 1 lists the best-fit parameters for fits that only use (1) bright or faint plates, (2) b > 0° or b < 0° plates, or (3) |b| > 45° or |b| < 45° plates. The results from all of these different samples are roughly consistent with each other; we note that we can even measure the radial scale length with high-latitude plates (|b| > 45°) alone.

Figures 14 and 15 show comparisons between the observed star counts and the model, when we split the α-old sample into more metal-poor and more metal-rich sub-samples by cutting the sample at [Fe/H] = −0.7. Comparisons for when we split the α-old sample into two bins in [α/Fe], by cutting the sample at [α/Fe] = 0.35, are shown in Figure 16.

**Figure 16.** Structural parameters of the G-dwarf sample as a function of [α/Fe]. The top two rows show the α-old sample split at [α/Fe] = 0.35; the bottom two rows contain the α-young sample split at [α/Fe] = 0.15. The left column compares the observed distribution of vertical heights to the model distribution. The right column does the same for the distribution of Galactocentric radii. Each sample is fitted with a mixture of two double-exponential disks. The best-fit parameters of the dominant disk component are shown in the top left of each panel. In each case the secondary component only contributes a few percent of the mass at the solar circle (see Tables 1 and 2 for detailed results).

C.2. The α-young Disk Sample

Figure 17 compares the best-fit model to the observed star counts as a function of vertical height and Figure 18 shows this comparison as a function of Galactocentric radius, again removing 4 stars with magnitudes that should put them on faint plates but that were observed on bright plates and removing 43 stars in the opposite situation. We also show models whose parameters are the same as those of the best-fit model, but with shorter scale lengths of 2 and 3 kpc. The faint plates, which only contain 6% of the α-young sample, rule out a short scale length of 2 kpc for the α-young disk. The best-fit model provides a good fit to the observed star counts.

We again also list the best-fit parameters for fits that only use (1) bright or faint plates, (2) b > 0° or b < 0° plates, or (3) |b| > 45° or |b| < 45° plates in Table 2. The results from all of these different samples are again roughly consistent, except for the faint plates fit, which prefers even longer radial scale lengths, but faint plates only contain 6% of the α-young sample. We have also run fits for the α-young sample where we (1) employ a more conservative S/N cut of S/N > 30, (2) enlarge our sample with a less conservative S/N cut of S/N > 10, (3) remove stars on plates whose K-S probability for the spectroscopic sample to have been drawn from the underlying photometric sample combined with our model for the SEGUE selection function (see Figure 10) is smaller than 0.1, (4) use stars from the SEGUE database that were explicitly targeted as G-type stars (with all other log g, S/N, E(B − V) cuts), (5) remove stars with magnitudes that should put them on SEGUE bright plates, but that were observed as part of a faint plate and vice versa, and (6) artificially shift the metallicity distribution 0.1 dex toward the more metal-rich end. The results from these fits are all consistent with those obtained for our nominal sample with fiducial cuts.

Figures 17 and 18 show comparisons between the observed and predicted star counts for the α-young samples that are more metal-poor than the nominal α-young sample. The fit for the −0.6 < [Fe/H] <−0.3 sample is good, while the fit for the most metal-poor α-young sub-sample is not entirely satisfactory. Comparisons between the observed star counts and the model when we split the α-young sample into two, by cutting at [α/Fe] = 0.15, are shown in the lower two rows of Figure 16.

APPENDIX D: ANALYSIS TEST ON MOCK DATA SAMPLES

In order to test the methodology for fitting the density discussed in Section 3, and as a check on the code, we create mock data samples selected in exactly the same way as the SEGUE G-dwarf sample and fit them using our algorithm. We also use this framework to test whether the results we obtain can plausibly be the result of abundance errors smoothing out an underlying two-component thin–thick disk structure.

We create mock data sampled from a model underlying density by calculating, for each line of sight, (1) the fraction of stars in the sample that lies along that line of sight and (2) the distribution in r-band magnitude as a function of color g − r and metallicity [Fe/H]. For calculating both of these, we take the SEGUE selection function, described in Appendix A, into account. Thus, we can obtain a sample that is equivalent to what SEGUE would have observed for a particular density model.

To test the methodology and code, we populate each mono-abundance bin in the ([Fe/H], [α/Fe]) plane with a sample drawn from a thin-disk component with h_z = 300 pc and h_R = 3.5 kpc, keeping the abundances and number of stars in each bin the same as in the observed sample. We then run the same analysis code on this sample as is run to produce the real data results in Figures 4–6. We find results that are consistent with the input model within the uncertainties for each bin. The uncertainties are similar to those found for the real data near h_z = 300 pc and h_R = 3.5 kpc. We repeat this for an input "thick" disk model with h_z = 850 pc and h_R = 2 kpc, and again find results that are consistent with the input model within the uncertainties.

We use a similar procedure to investigate whether abundance errors can smooth out an underlying disk model made up of a thin- and a thick-disk component without showing up in our analysis. Assuming SEGUE abundance uncertainties of 0.2 dex in [Fe/H] and 0.15 dex in [α/Fe], we first model the underlying abundance distribution using two Gaussian components, and fit this model to the observed distribution with the assumed abundance uncertainties using the extreme-deconvolution technique (Bovy et al. 2011). We use a Gaussian mixture model for the underlying distribution solely as a convenient way of decomposing the observed distribution for the purpose of this test. The two-Gaussian mixture model adequately represents the observed distribution after convolving again with the uncertainties. The best-fit mixture has a Gaussian centered near solar abundances (40% of the sample; [Fe/H] = −0.3 dex, [α/Fe] = 0.1 dex) and one at metal-poor and α-enhanced abundances (60% of the sample; [Fe/H] = −0.7 dex, [α/Fe] = 0.35 dex). To reproduce the observed distribution, these components both need a dispersion of 0.2 dex in [Fe/H] and 0.07 dex in [α/Fe], with a correlation of −0.85 and −0.6, respectively. We then assign stars to these two components with probabilities computed from their posterior probability of being drawn from either component, based on their abundances and assumed abundance uncertainties. We sample new r-band magnitudes and coordinates for these stars based on the component they are assigned to: we draw the stars assigned to the solar-abundances component from a thin-disk density with h_z = 300 pc and h_R = 3.5 kpc, and stars assigned to the α-enhanced component from a thick-disk distribution with h_z = 850 pc and h_R = 2 kpc. We then run the same analysis code on this sample as is run on the real data.
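The assignment step can be illustrated with a short sketch. It uses the mixture parameters quoted above, adds the assumed abundance uncertainties to each component's covariance, and draws a component according to the resulting posterior probabilities. The names `COMPONENTS`, `posterior`, and `assign` are hypothetical, and the measurement errors in [Fe/H] and [α/Fe] are assumed to be uncorrelated, which the text does not specify.

```python
import math
import random

# Mixture parameters quoted in the text: amplitude, mean ([Fe/H], [alpha/Fe]),
# intrinsic dispersions, and correlation coefficient of each component.
COMPONENTS = [
    {"name": "thin", "amp": 0.4, "mean": (-0.3, 0.10), "sig": (0.2, 0.07), "corr": -0.85},
    {"name": "thick", "amp": 0.6, "mean": (-0.7, 0.35), "sig": (0.2, 0.07), "corr": -0.60},
]
ERRORS = (0.2, 0.15)  # assumed uncertainties in [Fe/H] and [alpha/Fe] (dex)


def gaussian2d(x, mean, var_x, cov_xy, var_y):
    """Density of a bivariate Gaussian with the given covariance entries."""
    det = var_x * var_y - cov_xy * cov_xy
    dx, dy = x[0] - mean[0], x[1] - mean[1]
    chi2 = (var_y * dx * dx - 2.0 * cov_xy * dx * dy + var_x * dy * dy) / det
    return math.exp(-0.5 * chi2) / (2.0 * math.pi * math.sqrt(det))


def posterior(feh, afe):
    """Posterior probability of each component for one observed star."""
    likes = []
    for comp in COMPONENTS:
        s_feh, s_afe = comp["sig"]
        # Observed covariance = intrinsic covariance + diagonal error covariance.
        var_x = s_feh ** 2 + ERRORS[0] ** 2
        var_y = s_afe ** 2 + ERRORS[1] ** 2
        cov_xy = comp["corr"] * s_feh * s_afe
        likes.append(comp["amp"] * gaussian2d((feh, afe), comp["mean"], var_x, cov_xy, var_y))
    total = sum(likes)
    return [like / total for like in likes]


def assign(feh, afe, rng=random):
    """Randomly assign a star to a component with its posterior probability."""
    p_thin = posterior(feh, afe)[0]
    return "thin" if rng.random() < p_thin else "thick"
```

Stars assigned to the first component would then receive positions drawn from the thin-disk density and the others from the thick-disk density, as described above.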

The results from this test are shown in Figure 19. Although in certain respects they are similar to the results for the real data, they are different in a few crucial ways. Most importantly, when fitting a mixture of two exponential models to each bin we find unambiguous evidence in many bins for two components. This is shown in the lower right panel of Figure 19, where the scale height of the dominant component when fitting the mixture is shown versus the scale height when fitting a single exponential. For most bins with single-exponential h_z ≲ 800 pc, the dominant component is the h_z = 300 pc input thin-disk component. Therefore, even though the abundance pattern of the single-exponential scale height in the lower left panel of Figure 19 is smooth between thin and thick components, most bins are actually resolved into the two input components. This is a major difference with the real data, for which the equivalent comparison, shown in Figure 6, shows a striking one-to-one correlation between the single-exponential and the mixture scale height, with no evidence for a second component for the vast majority of the mono-abundance bins.

In addition to the fact that our analysis correctly identifies two components in each mono-abundance bin in the mock data, the abundance dependence of the single-exponential scale height and scale length inferred for the mock data is also quite different from that of the real data. The inferred scale length for the mock data is short for most abundance bins and only reaches h_R ≳ 3 kpc for those abundance bins that are farthest from the center of the metal-poor and α-enhanced abundance component. Thus, the contamination from the thick-disk component with its short scale length drives the inferred scale length for most abundance bins to small values. This behavior is not observed in the real data (Figure 4). The abundance dependence of the single-exponential scale height for the mock data is also much steeper than observed in the real data, with values of h_z ≳ 600 pc as metal-rich as [Fe/H] = −0.3.

From these tests we conclude that abundance errors cannot explain the single-exponential components we observe in each mono-abundance bin in the real data or the abundance behavior of the scale height and scale length. Based on an entirely different argument that uses the observed isothermality of the vertical kinematics of the same mono-abundance populations, Bovy et al. (2012b) infer that the internal SEGUE abundance uncertainties are likely somewhat smaller than the values reported by SEGUE, with likely uncertainties of 0.15 dex in [Fe/H] and 0.07 dex in [α/Fe]. Thus, abundance uncertainties do not influence the main conclusions of this paper.
