The following article is Open access

HALO7D. III. Chemical Abundances of Milky Way Halo Stars from Medium-resolution Spectra

, , , , , , and

Published 2023 July 3 © 2023. The Author(s). Published by the American Astronomical Society.
, , Citation Kevin A. McKinnon et al 2023 ApJ 951 43 DOI 10.3847/1538-4357/acd33d

Download Article PDF
DownloadArticle ePub

You need an eReader or compatible software to experience the benefits of the ePub3 file format.

0004-637X/951/1/43

Abstract

The Halo Assembly in Lambda Cold Dark Matter: Observations in 7 Dimensions (HALO7D) survey measures the kinematics and chemical properties of stars in the Milky Way (MW) stellar halo to learn about the formation of our Galaxy. HALO7D consists of Keck II/DEIMOS spectroscopy and Hubble Space Telescope–measured proper motions of MW halo main-sequence turnoff stars in the four Cosmic Assembly Near-infrared Deep Extragalactic Legacy Survey fields. HALO7D consists of deep pencil beams, making it complementary to other contemporary wide-field surveys. We present the [Fe/H] and [α/Fe] abundances for 113 HALO7D stars in the Galactocentric radial range of ∼10–40 kpc along four separate pointings. Using the full 7D chemodynamical data (3D positions, 3D velocities, and abundances) of HALO7D, we measure the velocity anisotropy, β, of the halo velocity ellipsoid for each field and for different metallicity-binned subsamples. We find that two of the four fields have stars on very radial orbits, while the remaining two have stars on more isotropic orbits. Separating the stars into high-, mid-, and low-[Fe/H] bins at −2.2 and −1.1 dex for each field separately, we find differences in the anisotropies between the fields and between the bins; some fields appear dominated by radial orbits in all bins, while other fields show variation between the [Fe/H] bins. These chemodynamical differences are evidence that the HALO7D fields have different fractional contributions from the progenitors that built up the MW stellar halo. Our results highlight the additional information available on smaller spatial scales compared to results from a spherical average of the stellar halo.

Export citation and abstract BibTeX RIS

Original content from this work may be used under the terms of the Creative Commons Attribution 4.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

1. Introduction

Owing to their long dynamical timescales, galactic stellar halos are long-lived structures that preserve information about their origins. Within a ΛCDM cosmology, galaxies are built up by merger events, each of which can contribute stars to the halo. The positions, kinematics, and chemical properties of halo stars thus reveal a galaxy's mass assembly history and information about the dwarf galaxy progenitors that contributed to it. By studying the stellar halo of our home Galaxy, we seek to place the Milky Way (MW) in its cosmological context.

The chemodynamical properties of halo stars are powerful for constraining masses, star formation rates and efficiencies, and accretion times of progenitors, as well as the total mass and shape of the MW gravitational potential (e.g., Eggen et al. 1962; Searle & Zinn 1978; Bullock & Johnston 2005; Helmi 2008). While the positional clustering of stars from an accreted satellite is eventually washed out, the kinematic coherence of accreted debris persists for much longer periods of time. Stellar atmospheric chemical abundances are relatively stable over a star's lifetime, and they are a direct result of the environment in which it was formed; interstellar medium enrichment levels, star formation rates, and formation lifetimes of a galaxy all impact the chemical makeup of star-forming gas as a function of time.

As α elements (i.e., O, Ne, Mg, Si, S, Ar, Ca, and Ti) can be produced at early times in core-collapse events at relatively constant rates with iron, high-[α/Fe] stars tend to be formed at early times in a galaxy's star-forming life. After enough time (e.g., ∼330 Myr, as measured by Maoz & Badenes 2010), and assuming sufficient star formation, Type Ia supernovae "turn on" and create much of the iron we see in the universe. This causes the [α/Fe] ratio to drop as [Fe/H] increases (Wallerstein 1962; Tinsley 1980). The mass–metallicity relation of local dwarf galaxies (Kirby et al. 2013, 2017,2020) reveals that the metallicity distribution function (MDF) of a galaxy is tied to its mass; this arises because more massive systems have deeper potential wells that are able to retain a greater fraction of their enriched gas from supernovae. While this relationship was determined for local dwarf galaxies that are observed today, recent work (e.g., Leethochawalit et al. 2018, 2019; Naidu et al. 2022) has explored how these mass–metallicity relations were different at earlier cosmic times. The star formation and quenching time of a progenitor system are impacted during its accretion, but the chemical properties of the accreted stars are linked to the mass, star formation rate, and formation lifetime of their birth environment; this interrelatedness enables the inferences of progenitor properties from stellar halo chemical abundances (e.g., Lee et al. 2015; Hasselquist et al. 2021; Cunningham et al. 2022; Horta et al. 2023).

At the present time, a stellar halo is comprised of stars from many different progenitor systems. Simulations of purely accreted stellar halos (e.g., Bullock & Johnston 2005; Robertson et al. 2005; Font et al. 2006a, 2006b, 2006c) have average stellar halo abundances driven by their merger histories (e.g., Johnston et al. 2008), with average [α/Fe] tracking accretion time of infalling dwarf galaxies and [Fe/H] tracking the mass/luminosity of those dwarfs. Recent work with state-of-the-art simulations (e.g., Horta et al. 2023, using the Latte suite of FIRE-2 simulations; Hopkins 2015; Wetzel et al. 2016; Hopkins et al. 2018) has further refined our understanding of how the distributions of present-day chemodynamical observables of stellar halo stars are dictated by the accretion times and masses of disrupted dwarfs.

In addition to stars accreted from dwarf galaxy mergers, in situ stars—those formed in the potential well of the host galaxy—can be heated onto orbits similar to accreted halo stars during merger events. As a result, the halo population is a combination of in situ stars and those accreted during the many mergers a galaxy experiences throughout its history. Zolotov et al. (2009), for example, find evidence from simulations that the in situ halo fraction of the inner regions traces how quiescent the recent merger history has been; more recent mergers tend to cause the inner halo in situ fraction to decrease. 10 Using data from the H3 survey (Conroy et al. 2019a), Naidu et al. (2020) find that the in situ halo fraction drops from 60% to 5% of the total halo mass when going from Galactocentric radii of 8–20 kpc.

For this paper, "in situ halo" refers to the progenitor high-α disk that was kinematically heated through early merger events (Nissen & Schuster 2010; Bonaca et al. 2017, 2020; Haywood et al. 2018; Di Matteo et al. 2019; Amarante et al. 2020). This is the so-called "Splash" of Belokurov et al. (2020). It is thus chemically similar to the thick disk—that is, peaked toward relatively metal-rich [Fe/H] around −0.5 dex (Naidu et al. 2020)—but consists of stars on more isotropic orbits instead of predominantly circular ones. Belokurov et al. (2020) show that their "Splash" sample has less net prograde rotation and larger scatters in all velocity components compared to their "Thick Disk" sample.

Thanks in large part to the Gaia survey (Gaia Collaboration et al. 2018), our current picture of the MW's formation history is becoming clearer: aside from the recent interactions with Sgr and the LMC/SMC, the MW halo seems to have had a relatively quiescent recent history. Recent work has shown that the inner ∼25 kpc of the stellar halo is dominated by a single massive progenitor called Gaia–Sausage–Enceladus (GSE; Belokurov et al. 2018; Haywood et al. 2018; Helmi et al. 2018). The GSE is relatively metal-rich, with a peak [Fe/H] of ∼ −1.2 dex (Naidu et al. 2020) and radially biased orbits. The GSE has a net rotation, 〈vϕ 〉, that is consistent with 0 km s−1 (Belokurov et al. 2020), and is estimated to have come from a 4:1 mass ratio merger approximately 10 Gyr ago (Helmi et al. 2018). While the GSE and in situ halo dominate the bulk of the inner regions of the stellar halo, there have been many other substructures identified over the last 5 yr (e.g., Koppelman et al. 2019; Myeong et al. 2019; Belokurov et al. 2020; Naidu et al. 2020; Yuan et al. 2020). There is growing evidence that 92%–99% of the MW stellar halo stars can be associated with one of the currently known progenitors (Naidu et al. 2020).

These detailed inventories of the MW stellar halo have helped constrain answers to questions about our Galaxy, such as the approximate formation history, fractional contributions from different progenitors, and range of progenitor properties. Many of these studies rely on Gaia-based parallaxes and proper motions. Because of Gaia's apparent magnitude limit of G ∼ 20 mag, this means that these samples are either focused on nearby main-sequence (MS) halo stars when studying the local halo or more distant evolved stars when studying the distant halo. These evolved stars are intrinsically less numerous/spatially dense than their MS counterparts, which means that much of the distant halo work has focused on average properties over large areas of the sky.

The Halo Assembly in Lambda Cold Dark Matter: Observations in 7 Dimensions (HALO7D) survey (Cunningham et al. 2016, 2019a, 2019b) is complementary to contemporary MW stellar halo surveys because it targets 3D positions, 3D velocities, and abundances of MS turnoff (MSTO) stars at moderate halo radii (10 < r < 40 kpc). The HALO7D sample consists of Keck II/DEIMOS spectroscopy and Hubble Space Telescope (HST)-measured proper motions for stars in the apparent magnitude range 19 < mv < 24.5 mag, making it a deep complement to Gaia-based surveys. This paper, the third in the series, measures the seventh and final dimension of HALO7D stars: chemical abundances. The high spatial density of MS halo stars allows us to compare average chemodynamical properties along different lines of sight (LOSs). Cunningham et al. (2019b), for example, measure the halo velocity anisotropy along the four HALO7D LOSs and find variations between the different fields.

In this paper, we describe the HALO7D data set in Section 2. We create a Bayesian spectrophotometric pipeline to measure chemical abundances ([Fe/H] and [α/Fe]) and stellar parameters (Teff, $\mathrm{log}g$, age, and distance) for MSTO stars without known distances in Section 3 and present the resulting abundance measurements for the HALO7D sample in Section 4. In Section 5, we combine our abundances with previously measured LOS velocities and proper motions from HALO7D (Cunningham et al. 2019a, 2019b) to measure the variation in average chemodynamical properties—such as net halo rotation, 〈vϕ 〉, and the velocity anisotropy parameter, β—using different subsamples of the HALO7D stars. Our findings are summarized in Section 6. The detailed tests on the outputs of our chemical abundance pipeline are described in Appendix A, and then validated against well-studied globular clusters (GCs) in Appendix B. We show the statistical significance of our kinematic measurements in Appendix C.

2. Data

The HALO7D data set consists of HST-measured proper motions and Keck II/ DEIMOS spectroscopy for 199 MSTO MW halo stars in four Cosmic Assembly Near-infrared Deep Extragalactic Legacy Survey (CANDELS; Grogin et al. 2011; Koekemoer et al. 2011; PIs: S. Faber and H. Ferguson) fields: COSMOS, GOODSN, GOODSS, and EGS. These fields are located at high Galactic latitudes—meaning they have minimal foreground contamination from the MW disk—and they are not located in regions of previously known streams or substructure (e.g., Sagittarius). While the stars that make up the HALO7D sample lie within the same footprints as the CANDELS fields (see Figure 1 of Cunningham et al. 2019a)—and we refer to them using the same field names—it should be noted that the HALO7D data set does not include every MSTO star found in the CANDELS fields. The first paper in this series (Cunningham et al. 2019a) presents the DEIMOS spectroscopic data set (see their Section 2) and measures LOS velocities with a hierarchical Bayesian pipeline called velociraptor. The second paper (Cunningham et al. 2019b) presents proper motions measured from multiepoch HST imaging (see their Section 2 and Table 2 for a description of the HST Programs) and characterizes the halo velocity ellipsoid.

A detailed description of the HALO7D sample and the selection process are presented in the first two papers in this series. To summarize the relevant information, the fields were chosen because of their deep, multiepoch HST astrometry and photometry, which enables proper-motion measurements to much fainter magnitudes than Gaia. HALO7D's velocity sample consists of stars in the 19 < mv < 24.5 mag range and are generally blue to minimize the impact of contamination by foreground disk stars; to see the full color–magnitude diagram (CMD) selection criteria, please see Section 2.2 and Figures 2 and 3 of Cunningham et al. (2019a). HALO7D is thus complementary to previous studies because it is able to measure kinematics and chemical abundances for individual MS stars at halo distances with high enough spatial sampling density to measure halo properties along individual LOSs instead of measuring sky-averaged properties versus Galactocentric radius.

The Keck II/DEIMOS spectroscopic observations were collected between 2014 March and 2017 April using the 600 line mm−1 grating with a 7200 Å central wavelength configuration and 1'' slitwidth. These medium-resolution spectra (R ∼ 2000) consist of 8192 pixels, with a pixel spacing of ∼0.65 Å pixel−1, and cover a typical wavelength range of ∼5000–10000 Å. Each target was typically observed for ∼5–6 hr over the course of this time period, with a minimum of two and a maximum of 33 individual visits per target. The raw spectroscopic data were reduced with the spec2d pipeline produced by DEEP2 at UC Berkeley (Cooper et al. 2012). Table 1 shows a summary of the observations in each field, and Figure 1 shows a CMD for the 199 stars in HALO7D, with the colored points denoting the 113 stars for which we are able to measure abundances. Figure 2 shows cumulative histograms of the combined spectral signal-to-noise ratio (S/N) for the stars with abundances in each field, where we define the combined S/N using a quadrature sum of the S/Ns of the individual observations:

The median combined spectral S/Ns are 62.2, 64.8, 95.8, and 71.3 Å−1 for COSMOS, EGS, GOODSN, and GOODSS, respectively, and 67.2 Å−1 for the complete sample.

Figure 1.

Figure 1. CMD of the 113 HALO7D stars with measured LOS velocities and chemical abundances in HST filters (STMAG). Three MIST isochrones at a distance of 20 kpc with typical MW halo properties are shown in black/gray to guide the eye in this region of color–magnitude space; an increase (decrease) in distance causes the isochrones to move down (up) vertically to fainter (brighter) apparent magnitudes. The faint gray dots show the 86 HALO7D stars that do not have measured abundances.

Standard image High-resolution image
Figure 2.

Figure 2. Cumulative histograms of the combined spectral S/Ns of stars in the chemistry sample of HALO7D (colored points in Figure 1). The black line represents the total sample. By design, GOODSN had more observations than the other fields and thus shows a higher combined S/N. The median S/Ns for each field and the total sample are found in the text.

Standard image High-resolution image

Table 1. Summary of Targets in the CANDELS Fields with LOS Velocities and Proper Motions from Cunningham et al. (2019a, 2019b)

  l b Area a N Halo StarsCatalog References
Field(deg)(deg)(arcmin2)with v3D
COSMOS236.842.128881Muzzin et al. (2013); Nayyeri et al. (2017)
GOODSN125.954.816632Barro et al. (2019)
GOODSS223.6−54.416020Guo et al. (2013)
EGS96.460.438466Barro et al. (2011); Stefanon et al. (2017)
Total998199

Note.

a The listed field area is the area covered with HST multiepoch imaging.

Download table as:  ASCIITypeset image

3. Measuring Stellar Parameters and Abundances

We create a Bayesian pipeline that uses stellar photometry and spectroscopy to measure chemical abundances ([Fe/H], [α/Fe]), stellar parameters (Teff, $\mathrm{log}g$, age), and distances for our HALO7D stars. This technique relies on the library of MIST isochrones 11 (Paxton et al. 2011, 2013, 2015; Choi et al. 2016; Dotter 2016) and a set of synthetic model spectra (described by Escala et al. 2019 in the blue, 4100–6300 Å, and Kirby et al. 2008; Kirby 2011 in the red, 6300–9100 Å). The model spectra are generated with the MOOG spectral synthesis software (Sneden 1973), using the ATLAS9 model atmospheres (Kurucz 1993; Kirby 2011). Each synthetic spectrum is defined by a set of (${T}_{\mathrm{eff}},\,\mathrm{log}g,\,[\mathrm{Fe}/{\rm{H}}],\,[\alpha /\mathrm{Fe}]$) values. For these spectra, [α/Fe] is the total α-element abundance of a stellar atmosphere, where O, Ne, Mg, Si, S, Ar, Ca, and Ti are the α-elements we consider.

Many techniques that measure chemical abundances and stellar parameters from spectra using synthetic model spectra typically study populations of stars that have photometrically constrained distances (Kirby et al. 2010; Escala et al. 2019, for example) or parallax measurements (Conroy et al. 2019a; Cargile et al. 2020). This means that distance-degenerate parameters, such as $\mathrm{log}g$, are able to be fixed and a smaller region of parameter space needs to be explored. For the HALO7D sample, however, we deal with MSTO stars with unknown distances. The absolute magnitudes of stars near the MSTO vary much more than their colors, which means that the stars have large distance uncertainties. As a result, we must explore larger regions of parameter space and consider the relationships between distance and the possible stellar parameters. To help constrain our abundance fits, we use the MIST isochrones, measured properties of the MW stellar halo, and photometric observations of the stars in our sample to create multidimensional, multimodal prior distributions on the parameters of interest for each star.

3.1. Generating Prior Distributions of Stellar Parameters

In building our priors, we use MIST isochrones in the range of 0.1 < age < 14.6 Gyr and −4.0 < [Fe/H] < 0 dex in steps of 0.2 Gyr and 0.02 dex, respectively. The current version of the publicly available MIST isochrones is only for [α/Fe] = 0 dex, so we are not able to include this parameter in the isochrone-derived prior distributions; we assume a flat prior in the range of −0.8 < [α/Fe] < + 1.2 dex. These isochrones are then assigned weights based on average properties of the MW stellar halo, such as an assumed MDF and age distribution. Our particular choices of parameters for the halo priors are shown in Table 2. The [Fe/H] distribution is chosen to have a moderately metal-rich peak and a width that is not overly constraining that is consistent with recent halo studies (e.g., Belokurov et al. 2018; Helmi et al. 2018; Conroy et al. 2019b; Mackereth et al. 2019). Similarly, the age distribution comes from previous studies such as Kalirai (2012) and Bonaca et al. (2020). As we show in Appendix B, the abundances we measure are fairly insensitive to our choice of priors in the spectral S/N range of our data.

Table 2. MW Halo Prior Distributions for Abundance Pipeline

DistributionFunctional Form
p([Fe/H]) ${ \mathcal N }(-1.5\,\mathrm{dex},1.0\,\mathrm{dex})$
p(age) ${ \mathcal N }(12\,\mathrm{Gyr},2.0\,\mathrm{Gyr})$
p([α/Fe])∝1, for −0.8 < [α/Fe] < 1.2
p(mass∣[Fe/H], age)Kroupa (2001) IMF, $\propto k{\left(\tfrac{\mathrm{mass}}{{M}_{\odot }}\right)}^{-\alpha }$ with $\left\{\begin{array}{ll}k=25,\,\alpha =0.3, & \mathrm{mass}\lt 0.08{M}_{\odot }\\ k=2,\,\alpha =1.3, & \mathrm{mass}\lt 0.5{M}_{\odot }\\ k=1,\,\alpha =2.3, & \mathrm{mass}\gt 0.5{M}_{\odot }\end{array}\right.$
p(μ) $\propto {D}^{3}{\left(\tfrac{{R}_{q}}{27\,\mathrm{kpc}}\right)}^{-\alpha }$ with $\left\{\begin{array}{ll}\alpha =2.3, & {R}_{q}\lt 27\,\mathrm{kpc}\\ \alpha =4.6, & {R}_{q}\geqslant 27\,\mathrm{kpc}\end{array}\right.$

Download table as:  ASCIITypeset image

The points within a particular isochrone each have an initial mass, MF606WMF814W color, MF814W absolute magnitude, Teff, and $\mathrm{log}g$. The isochrone points are weighted by integrating a Kroupa (2001) initial mass function (IMF) over the range of masses within an isochrone to account for uneven mass spacing.

Until now, the isochrones have been weighted only by properties that are generic to a halo population, but we now create unique prior distributions for the stellar parameters of each star. We apply additional weights to the isochrone points for each star separately using the observed photometry. Each star has observed apparent HST magnitudes, mF606W and m814W, and corresponding uncertainties, σF606W and σF814W, which we have dereddened using the dust maps of Schlegel et al. (1998) through the dustmaps package in Python (Green 2018).

To give the isochrone points weights based on distances/absolute magnitudes, we use the MW stellar halo density profile of Deason et al. (2011):

where ${R}_{q}^{2}={x}^{2}+{y}^{2}+{(z/q)}^{2}$, with q = 0.59. Accounting for volume elements and the Jacobian between distance and distance modulus, the distance modulus prior is thus $p(\mu )\propto {D}^{3}\cdot \tfrac{{dN}}{{dV}}$. We marginalize over the distance modulus by taking equally spaced values in μ that correspond to distances between 0.1 and 500 kpc and comparing the observed stellar magnitudes to the implied apparent magnitude of the isochrone point at a given distance modulus. Overall, the weight of a given isochrone point ends up as

where pi is the ith point in an isochrone defined by ([Fe/H], age) that has absolute magnitudes (Mi,F606W, Mi,F814W).

When comparing the MIST isochrones to HST photometry of stars in a handful of nearby GCs, we noticed that many of the isochrones required color offsets as large as ∼0.02 mag to have better agreement with the data. To allow for a potential mismatch between the MIST isochrones and the halo stars, we also marginalize over color offsets between −0.02 and 0.02 mag. This is much larger than the typical photometric uncertainty of stars in the HALO7D sample (median uncertainty in mF606WmF814W of 0.006 mag), so this marginalization has a relatively large effect in increasing the width of the prior distributions of stellar parameters.

For each star, the total weighting for a given isochrone point is thus a product of its isochrone [Fe/H] and age weighting, its mass weighting, and the weighting from marginalizing over the distance modulus while comparing the photometry.

At a given color in an isochrone, there are generally three possible distances corresponding to a star being on the MS, the subgiant branch, or the horizontal branch. This results in our prior distributions having three local maxima: one peak for each of the possible phases in a star's evolution. Because it is generally more efficient to sample posterior distributions that are singly peaked, we break each isochrone up into these three phases. The prior probability of a particular phase in any given isochrone is the fraction of the total weight of the isochrone in that phase. In this work, phase = 0 corresponds to MS stars, phase = 1 corresponds to the subgiant branch, and phase = 2 corresponds to the horizontal branch. 12

Finally, for each phase of each isochrone, we generate a 3D prior distribution on (Teff, $\mathrm{log}g$, and MF814W) by passing the weighted isochrone points to a kernel density estimator (KDE) with a Gaussian kernel. We compute the KDE width by measuring a standard deviation and mean of each of the parameters (Teff, $\mathrm{log}g$, and MF814W) in the set of weighted isochrones, and normalize the measurements so that each individual parameter's distribution corresponds to a unit Gaussian. The Gaussian width used in the KDE is chosen to be 0.1, such that the width is 10% of the standard deviation in each parameter. This KDE approach has the benefit of allowing combinations of Teff, $\mathrm{log}g$, and MF814W that do not fall perfectly on the isochrone, meaning we are less reliant on the isochrones perfectly capturing the relationships between stellar parameters.

Overall, the prior probabilities of the stellar parameters are described by

Equation (1)

where $p({T}_{\mathrm{eff}},\mathrm{log}g,{M}_{814{\rm{W}}}| [\mathrm{Fe}/{\rm{H}}],\mathrm{age},\mathrm{phase})$ is calculated by evaluating the KDE generated from the isochrone points with that particular [Fe/H], age, and phase:${{KDE}}_{[\mathrm{Fe}/{\rm{H}}],\mathrm{age},\mathrm{phase}}({T}_{\mathrm{eff}},\mathrm{log}g,{M}_{814{\rm{W}}})$.

3.2. Preprocessing of Spectra

Before the spectral observations can be used in our abundance pipeline, we must shift each of the spectra to the rest frame, continuum normalize, identify useful wavelength regions, and characterize a few quality-of-observation parameters such as the line-spread function (LSF). For our chemical abundance pipeline, we do not coadd the multiple spectral observations of a given star into a single observation. Instead, we model each observation simultaneously. We choose this approach because the spectral observations were taken over the course of years and can have vastly different observing conditions (e.g., seeing, LSFs, and wavelength solution offsets), which causes their coadded spectrum to have washed-out/hard-to-model features. This section is quite technical, so readers who are more interested in the big-picture steps of the abundance pipeline may choose to skip ahead to Section 3.3 or to the results in Section 4.

3.2.1. Measuring LSFs

As a star's light passes through the atmosphere to the telescope, its light is spread out by the seeing we measure during data collection. It is adequate for our purposes to assume the resulting shape is a Gaussian with a width that is the FWHMseeing,i /2.355, where i ∈ {1,...,nobs} refers to the spectral observation number. Accounting for the pixel scale and anamorphic factor allows us to convert this into a width in units of Angstroms in the spectral dimension. Because of the 1'' width of the slits in the DEIMOS mask, any light outside of this width is truncated. The star's truncated Gaussian light is further smoothed out as it passes through the instrument and lands on the CCD. To characterize this additional amount of instrument smoothing as a function of wavelength, we use arc lamp exposures. 13

We identify peaks in the arc lamp spectrum and then fit those peaks with a top-hat function convolved by a Gaussian, where the Gaussian width is unknown and the top-hat width is set by the size of the slit. This is because the arc lamp light fully illuminates each slit, producing a top-hat shape, before it passes through the instrument. In particular, we are interested in measuring the smoothing Gaussian widths. The resulting widths as a function of wavelength are approximately quadratic with a minimum near the chip gap. For each individual observation, we measure this quadratic function on the blue and red chips separately, giving the additional smoothing of the instrument as a function of wavelength. The final LSFi (λ) for the ith observation is then the seeing-defined truncated Gaussian convolved with the wavelength-dependent instrument smoothing. In cases where the seeing is very good, such that virtually all of the star's light is completely inside of the slit (e.g., 3σseeing < slitwidth/2, which implies a seeing FWHM of ∼0farcs39 for our slitwidth of 1''), the resulting LSFi (λ) is a Gaussian with a total width that is the quadrature sum of the seeing Gaussian and the instrument smoothing. In most cases, however, the seeing is large enough relative to the slitwidth that we find it better to use the truncated-Gaussian-then-smoothed model instead. During the fitting process, we allow for the instrument smoothing function to change by a multiplicative factor between 0.5 and 1.5, to not be overly constraining. We place a prior on this multiplicative factor that is a Gaussian centered at 1 and has a width of 0.1 for each observation and allow the blue and red data to have different multiplicative factors:

3.2.2. Shift to Stellar Rest Frame and Pixel Masking

Next, we shift each of the nobs observations of a single star to stellar rest frame using the velociraptor-measured LOS velocities from Cunningham et al. (2019a). Specifically, we use the median LOS velocity from each spectral observation's posterior distribution. The data wavelengths are then restricted to 4100–9100 Å because that is the coverage of the synthetic model spectra we will compare to. We also mask out a few additional wavelength regions, some before shifting to rest frame and others after, as listed in Table 3; these include telluric features, poorly modeled (or unmodeled) features in the synthetic model spectra, and regions where the wavelength solutions of the spectral reductions are unstable.

Table 3. Masked Spectral Regions

Masked before Shift to Rest Frame
NameRegion Masked (Å)
Unstable wavelength solution a <5000
B band(6864, 7020)
A band(7570, 7713)
Telluric feature(7157, 7325)
Telluric feature(8123, 8356)
Telluric feature(8235, 8275)
Telluric feature>8933
Chip gap±20 at gap
Masked after Shift to Rest Frame
NameRegion Masked (Å)
MgH b (5115, 5125)
Na D1, D2 c (5885, 5910)
Ca i (λ6343)(6341, 6346)
Ca i (λ6362)(6356, 6365)
Hα (6559.797, 6565.797)
K i (λ7665)(7662, 7668)
V i (λ8116, λ8119),(8113, 8123)
Poor Arcturus model d (8317, 8330)
Ca ii (λ8498)(8488.023, 8508.023)
Ca ii (λ8542)(8525.091, 8561.091)
Ca ii (λ8662)(8645.141, 8679.141)
Mg i (λ8807)(8804.756, 8809.756)

Notes.

a Very blue DEIMOS wavelengths tend to be extrapolations of a small number of arc lamp lines and are not easily fixed with our correction prescription described in Section 3.2.4. b This molecular absorption feature is masked for stars that have more than 5% prior probability of Teff < 5300 K because the synthetic models currently do not include this feature. c This absorption feature is masked because of potential contamination from interstellar medium absorption. d From Kirby et al. (2008); this is a region where the model spectra were showed to poorly reproduce the spectral features of Arcturus.

Download table as:  ASCIITypeset image

For our spectroscopic setup with DEIMOS, there are not many arc lamp lines at particularly blue wavelengths, which makes anchoring the wavelength solution difficult in this region. We find that the wavelengths on the red chip (i.e., λ > 7200 Å) are robust and that the blue-chip wavelengths are quite reliable down to λ ∼ 5500 Å, but they are often unstable below λ ∼ 5000 Å. Using a handful of stars with previously measured stellar parameters and high-S/N spectra, we cross-correlate with the best-fit synthetic model spectrum at different wavelength locations to remeasure the wavelength solution and compare it to the output of spec2d. Because the wavelength solution we measure is using the stellar features at all wavelengths, this process is able to anchor the solution using lines at the bluest wavelength where arc lines are not available. We find that the offset as a function of wavelength is approximately linear for λ > 5000 Å on the blue chip, with the size of the maximum offset being ∼3 Å.

With these lessons in mind, we decide to mask out wavelengths less than 5000 Å for all spectra because the wavelength offset function tends to become nonlinear in this region. During the stellar parameter measuring process that we will discuss in Section 3.3, we fit each spectral observation with a linear wavelength offset function on the blue-chip wavelengths to correct for these issues. Ideally, we would use more terms in the wavelength offset function. However, we must balance the number of parameters being fit per observation with computation time; we find that a linear correction function is a good compromise. For spectra with S/N less than 20 Å−1, we mask out wavelengths less than 5500 Å, because this is where it becomes difficult to assess how well the wavelength offset has been measured. For the same reason, we drop individual spectral observations with S/Ns less than 3 Å−1.

3.2.3. Continuum Normalization

We then measure an initial continuum estimate for each spectrum and for each possible stellar evolution phase, as defined in Section 3.1. We take the synthetic model defined by the median stellar parameters of the corresponding prior distribution, degrade it to the data quality using the LSFi (λ), and then divide the result from the observed spectrum. This gives an approximate continuum assuming a particular phase; we then smooth the resulting spectrum using a median boxcar of width 6.5 Å (i.e., approximately 10 pixels) to limit the impact of outliers, such as poorly removed skylines. This median-binned spectrum is then fit with a cubic B-spline whose knots are spaced by 100 Å, yielding an estimate of a given observation's continuum assuming a particular phase, cphase,i (λ). This process is repeated for each spectral observation and for each phase.

3.2.4. Identifying Useful Wavelength Regions and Corrections to Wavelength Solutions

We also use these degraded model spectra to build prior distributions for the parameters of the linear wavelength offset function, Δλi (λ) = mλ,i · λ + bλ,i . We cross-correlate the synthetic model with the spectral data in steps of ∼ 100 Å. This gives measures of the wavelength offset as a function of wavelength for a particular observation. We then fit a line to these offsets, taking the resulting fit's mean, θ λ,i , and covariance matrix, V λ,i , to define the prior distribution as a multivariate normal distribution,

where we inflate the covariance matrix by a factor of 10 so as not to be overly constraining.

Finally, we identify wavelength regions of the spectra that are particularly useful for measuring stellar parameters. Using the prior distributions to define a useful region of parameter space (e.g., 95% prior probability), we are able to define a grid of equally spaced Teff, $\mathrm{log}g$, [Fe/H], and [α/Fe] values for each phase. For each set of parameters on the prior grid, we degrade the corresponding synthetic model to the data quality and compare the model spectra to each other as well as to the continuum-normalized spectral observations. This is done for two purposes. The first reason is to determine wavelength regions that do not change significantly across the parameter grid, which means they provide the least power for our likelihood measurements. We choose to mask the 10% least useful pixels of each observation. This has the additional benefit of speeding up computations because we consider a smaller number of pixels; in total, the masking of poorly modeled, telluric, and low-likelihood information regions leaves between 35% and 55% of the data pixels (∼3000–4500 of the original 8129 pixels) for the likelihood measurements. The other reason is to better define the continuum for each observation given a particular phase. For each set of parameters in our prior grid, we divide the continuum-normalized data by the current synthetic model and then fit a B-spline with 100 Å spaced knots to the result, which defines the continuum adjustment required to have the best agreement between that model and the data. The new continuum for each phase is taken to be the initial continuum multiplied by the new continuum adjustment of the model that had the minimum χ2 comparison. The parameters that define the best model (i.e., minimum χ2) for each phase are where we choose to begin our search of parameter space during the fitting process.

3.3. Fitting Spectra with Synthetic Models

With our prior distributions and rest-frame-shifted, continuum-normalized spectra in hand for each star, we begin the fitting process. The synthetic model spectra we compare to are defined by a set of ${T}_{\mathrm{eff}},\mathrm{log}g,[\mathrm{Fe}/{\rm{H}}],$ and [α/Fe] values according to the model grid in Table 4. In cases where we draw a set of parameters that do not lie directly on the model grid, we linearly interpolate from the nearest neighboring models using between 2 and 24 nearest neighbors in (Teff, $\mathrm{log}g$, [Fe/H], [α/Fe]) space. For each observation of a given star, the synthetic model is smoothed with the corresponding LSFi (λ) and resampled onto the data wavelengths.

Table 4. Model Spectrum Grid Spacing by Parameter Value from Escala et al. (2019, Table 4)

ParameterMin. ValueMax. ValueStep
Teff (K)35005600100
 56008000200
$\mathrm{log}g$ (cm s−2)0.0 (Teff < 7000 K)5.00.5
 0.5 (Teff > 7000 K)5.00.5
[Fe/H]−4.5 (Teff ≤ 4100 K)0.00.1
 −5 (Teff ≤ 4100 K)0.00.1
[α/Fe]−0.8+1.20.1

Download table as:  ASCIITypeset image

The posterior probability for our parameters of interest is

Equation (2)

where i corresponds to the spectral observation number, j corresponds to the pixel number within a spectrum, ${\boldsymbol{F}}=({{\boldsymbol{f}}}_{1},{{\boldsymbol{f}}}_{2},...,{{\boldsymbol{f}}}_{{n}_{\mathrm{obs}}})$ are the fluxes of the measured spectra with corresponding uncertainties of ${\boldsymbol{\Sigma }}=({{\boldsymbol{\sigma }}}_{1},...,{{\boldsymbol{\sigma }}}_{{n}_{\mathrm{obs}}})$, ${{\boldsymbol{\theta }}}_{\mathrm{spec}}=({{\boldsymbol{\theta }}}_{\mathrm{spec},1},...,{{\boldsymbol{\theta }}}_{\mathrm{spec},{n}_{\mathrm{obs}}})$ are the set of spectral parameters for all observations, and θ* is the set of stellar parameters with p(θ*) defined in Equation (1). Because the spectra for a given star are on a common wavelength array, λj represents the wavelength of a given pixel and fi,j is the flux measured in that pixel for spectrum i, with corresponding flux uncertainty σi,j .

The θ spec,i parameters include a blue- and red-side multiplier to the LSFi (λ), and the slope and intercept for the linear wavelength solution correction of that observation. Thus, the prior on the spectral parameters for each observation is

where p(blue multi ) and p(red multi ) are defined in Section 3.2.1 and p(mλ,i , bλ,i ) is defined in Section 3.2.4.

The likelihood of a particular spectrum's flux measurement at a particular pixel, p(fi,j θ *, θ spec,i ) comes from a comparison with the synthetic model that has had measured continuum applied. While the continuum we have measured in the previous section does a good job of normalizing each spectrum, we allow for one final continuum fit before evaluating the likelihood, to have an optimal comparison. This is particularly important in cases where there is significant line blanketing at blue wavelengths because the original continuum definition may have removed the effects of line blanketing while trying to normalize the fluxes. To this end, for every draw of parameters, we smooth the corresponding synthetic model to the appropriate data quality, then divide it from the continuum-normalized data; we then coadd the remaining noise spectra from the different observations and fit the result with a final B-spline with 100 Å spaced knots. This process captures any remaining large-scale variations that are required to have good agreement between a particular model and the data.

In plain language, the parameter measurement proceeds as follows:

  • 1.  
    Draw stellar parameters (${T}_{\mathrm{eff}},\,\mathrm{log}g,\,[\mathrm{Fe}/{\rm{H}}]\,,[\alpha /\mathrm{Fe}],{M}_{814{\rm{W}}},\,\mathrm{age},\,\mathrm{phase}$) and spectral correction parameters (blue multi , red multi , mλ,i , and bλ,i for each of the i spectral observations) using multivariate normal proposal distributions.
  • 2.  
    Read in the high-resolution synthetic model defined by the current stellar parameters.
  • 3.  
    For each observation, smooth the synthetic model by LSFi (λ < λchipgap) · bluemultphase,i on the blue side and LSFi (λ > λchipgap) · redmultphase,i on the red side to get the data quality model fluxes.
  • 4.  
    Define the new wavelength vector for each observation using the current linear wavelength offset correction, Δλi (λ) = mλ,i · λ + bλ,i , applied only to the blue side (i.e., Δλ(λ > λchipgap) = 0).
  • 5.  
    Resample the corresponding smoothed model onto this new wavelength array, giving m i .
  • 6.  
    For each observation, divide the continuum-normalized observation, f i / c phase,i , by the resampled smoothed model, m i , to get a noise spectrum that is centered at 1.
  • 7.  
    Coadd the noise spectra together and fit the result with a B-spline to account for any missing continuum; this is the continuum adjustment vector a .
  • 8.  
    For each observation, the likelihood is then;$p({{\boldsymbol{f}}}_{i}| {{\boldsymbol{\theta }}}_{* },{{\boldsymbol{\theta }}}_{\mathrm{spec},i})={\prod }_{j=1}^{{n}_{\mathrm{pix}}}{ \mathcal N }({f}_{i,j}| {a}_{j}\cdot {c}_{\mathrm{phase},i,j}\cdot {m}_{i,j},{\sigma }_{i,j});$
  • 9.  
    Measure the posterior probability for a current draw using Equation (2).
  • 10.  
    Use the Metropolis–Hastings criterion to accept or reject the drawn parameters.
  • 11.  
    Repeat until the parameter samples have converged.

We use 500 Markov Chain Monte Carlo walkers that are initialized with parameters drawn from the prior distributions. These walkers are updated to new parameter values at each iteration of the fitting process using the emcee package (Foreman-Mackey et al. 2013). We generally require ∼500 iterations to reach convergence, so we choose to sample for 1500 iterations for each star. We use a conservative burn-in period that throws out the first 70% of samples, keeping the most recent 30% as our posterior samples. Because we need to consider multiple spectral correction parameters for each observation and we consider each observation separately, this process can be quite computationally expensive in both RAM and time. An example of the pipeline's output posterior distributions and model comparison is shown in Figure 3 for a star in the COSMOS field; in this case, we see good agreement between the synthetic model and the data. 14

Figure 3.

Figure 3. Posterior distributions and model comparison for a star in the COSMOS field. The black points, lines, and contours in the corner plot on the left show the posterior samples, and the orange histograms show 1D projections of the prior for a given parameter. The titles above each histogram show the median and the 68% confidence interval in that parameter. In the upper right corner, the star's spectrum shows close agreement with the best-fit model (red line) as defined by the median parameters of the corner plot. As described in Section 3, various wavelength regions of the data have been masked out for the fitting process, including the central region of Hα. The uncertainty-scaled flux residuals are shown in the lower panel, and the distribution of these residuals shows good agreement with the expected unit normal in the rightmost panel.

Standard image High-resolution image

One limitation of our process comes from the assumption that the flux measurements at each pixel within an observation and between observations are uncorrelated, though we know this not to be true because of our preprocessing steps. While a more rigorous fitting approach would incorporate these correlations, we use the uncorrelated assumption because it simplifies the calculations and increases computation speed.

To validate the results of our pipeline, we generate synthetic, HALO7D-like spectral observations with known stellar parameters. The results of analyzing those synthetic observations are shown in Appendix A. The main takeaway is that the abundances agree with the known parameters for stars with combined spectral S/N > 20 Å−1. For S/N < 20 Å−1, we begin to see a bias in the abundances, so we omit any stars with combined spectral S/N below this limit for the following analyses. We also find that our posterior abundance distributions are slightly too narrow, requiring an inflation of the posterior covariance by a factor of 1.312. This factor implies that our posterior abundance distributions have widths that are approximately 1.31 times smaller than needed to explain the disagreement with the expected values; this implies that our systematic uncertainties in abundances are approximately 31% of the uncertainty reported by the pipeline. All abundance uncertainties shown or reported in this paper have been inflated by this factor. Finally, we analyze MSTO stars in the well-studied GCs of M2 and M92 in Appendix B and show that the pipeline is able to recover results that are consistent with the literature.

4. Chemical Abundances of HALO7D Stars

In this section, we present our chemical abundance measurements for the HALO7D data set. Of the 199 stars from HALO7D with measured 3D velocities (which we hereafter refer to as the "Velocity" sample), 113 had converged posterior results for their chemical abundances (hereafter, the "Chemistry" sample). The numbers per field are summarized in Table 5 and the abundances are displayed in Figure 4. There are several reasons why a star did not converge in the abundance pipeline, but the two main factors are (1) the resulting posterior distribution peaking too close to the edge of the model grid in at least one of the stellar parameters (i.e., Teff, $\mathrm{log}g$, [Fe/H], or α); and (2) having a combined spectral S/N < 20 Å−1, which makes constraining the wavelength solution difficult and generally leads to unconstrained posterior distributions. The latter reason effectively acts as a magnitude cut, which makes the faint magnitude limit mF606W = 23 mag for EGS and GOODSN, 22.5 mag for COSMOS, and 22 mag for GOODSS, instead of the mF606W = 24.5 mag of the original/complete HALO7D sample. As a result, we expect that COSMOS and GOODSS cover slightly nearer distances when compared to EGS and GOODSN.

Figure 4.

Figure 4. Posterior chemical abundances of HALO7D stars colored by the field to which they belong. The error bars are perpendicular vectors that show the 1σ eigenvectors of the posterior covariance matrices.

Standard image High-resolution image

Table 5. Summary of HALO7D Targets in Each Field for Different Measurements

Field ${N}_{{v}_{3{\rm{D}}}}$ a Nabundances b
COSMOS8136
GOODSN3221
GOODSS2016
EGS6640
TOTAL199113

Notes.

a Targets that have measured vLOS and proper motions from Cunningham et al. (2019a, 2019b). b Targets that have measured [Fe/H] and [α/Fe].

Download table as:  ASCIITypeset image

Figure 4 shows the median [Fe/H] and [α/Fe] and corresponding posterior uncertainty for individual stars colored by the field they belong to. The error bars are perpendicular lines that show the eigenvectors of the posterior covariance of the [Fe/H] and [α/Fe] distribution, with the length corresponding to 1σ in those eigenvectors. The bulk of the stars lie in the −2 < [Fe/H] < −1 region, and the vast majority of stars are supersolar in [α/Fe] as we would expect for old halo populations, and as seen by previous studies (e.g., the Sloan Digital Sky Survey with Carollo et al. 2007, 2010; H3 with Conroy et al. 2019a, 2019b; Naidu et al. 2020; and APOGEE with Helmi et al. 2018; Mackereth et al. 2019). GOODSN (in green) has abundance distributions with smaller dispersions compared to the other fields; this is partly because GOODSN stars generally have higher average S/N spectra than the other fields (as seen in Figure 2), and thus have smaller posterior abundance distributions, though it could also be that the GOODSN field is less chemically diverse than the other fields. As we will discuss in Section 5, the kinematics and abundances of the GOODSN stars are consistent with originating almost exclusively from the GSE progenitor, whereas the other fields appear to have multiple contributions. Excluding the clustering of GOODSNs around [Fe/H] = −1.5 dex, the stars of each field occupy the same regions of abundance space, and there is not an immediately obvious difference between the fields when considering chemistry alone.

To compare the distributions of [Fe/H] and [α/Fe] in each field more quantitatively, we fit 2D multivariate normal distributions in a hierarchical model to each field's abundances independently. The abundances in each sample are assumed to follow a 2D Gaussian distribution whose mean and covariance we are measuring, and each star's posterior abundances are assumed to be draws from that population distribution. These results are shown in Table 6 (labeled as the "Chemistry" sample) and Figure 5. The field-colored data points show the median of the 2D distributions, the crosses show the 1σ uncertainties on the median, and the approximately elliptical shapes show the median region that the model predicts to contain 68% of data. These 2D fits show that all the fields have similar median abundances and uncertainties. For the total sample, we find 〈[Fe/H]〉 = −1.65 ± 0.06 dex, σ[Fe/H] = 0.64 ± 0.05 dex, 〈[α/Fe]〉 = +0.28 ± 0.03 dex, and σ[α/Fe] = 0.24 ± 0.02 dex. As we show in Section 5, the fraction of disk contamination in each field and for the total sample is relatively small, so the impact of foreground disk stars on these abundance distributions is minimal.

Figure 5.

Figure 5. Resulting population distribution of abundances for the HALO7D fields and total Chemistry sample after fitting 2D multivariate Gaussians. The colored data points show the median abundances, and the lines coming off of those points are perpendicular 1σ eigenvectors showing the uncertainty in that median. The approximately elliptical shapes show the region that the model predicts to contain 68% of the population data; these shapes are generated by taking samples of the population medians and covariance matrices from the 2D Gaussian fitting process.

Standard image High-resolution image

Table 6. Summary of Abundances for Each Field, and for Different Subsamples of the Data

FieldSample Nstars 〈[Fe/H]〉 σ[Fe/H] 〈[α/Fe]〉 σ[α/Fe]
COSMOSChemistry36 $-{1.70}_{-0.09}^{+0.11}$ ${0.61}_{-0.07}^{+0.08}$ +0.26 ± 0.04 ${0.23}_{-0.03}^{+0.04}$
 High [Fe/H]6−0.72 ± 0.15 ${0.35}_{-0.10}^{+0.22}$ +0.08 ± 0.15 ${0.34}_{-0.10}^{+0.20}$
 Mid [Fe/H]16 $-{1.52}_{-0.07}^{+0.06}$ ${0.24}_{-0.05}^{+0.07}$ $+{0.17}_{-0.07}^{+0.06}$ ${0.20}_{-0.05}^{+0.07}$
 Low [Fe/H]14−2.28 ± 0.05 ${0.17}_{-0.04}^{+0.05}$ +0.44 ± 0.04 ${0.09}_{-0.04}^{+0.05}$
GOODSNChemistry21 $-{1.59}_{-0.09}^{+0.10}$ ${0.42}_{-0.07}^{+0.09}$ +0.26 ± 0.020.07 ± 0.02
 Mid [Fe/H]19−1.48 ± 0.05 ${0.21}_{-0.03}^{+0.05}$ +0.25 ± 0.020.07 ± 0.02
GOODSSChemistry16 $-{1.43}_{-0.24}^{+0.22}$ ${0.91}_{-0.13}^{+0.20}$ +0.24 ± 0.09 ${0.35}_{-0.06}^{+0.08}$
 High [Fe/H]6−0.60 ± 0.20 ${0.47}_{-0.15}^{+0.25}$ $+{0.05}_{-0.21}^{+0.23}$ ${0.52}_{-0.15}^{+0.32}$
 Mid [Fe/H]7 $-{1.61}_{-0.13}^{+0.14}$ ${0.33}_{-0.09}^{+0.15}$ $+{0.26}_{-0.05}^{+0.06}$ ${0.08}_{-0.05}^{+0.10}$
EGSChemistry40 $-{1.72}_{-0.11}^{+0.10}$ 0.67 ± 0.08+0.33 ± 0.050.29 ± 0.04
 High [Fe/H]6 $-{0.78}_{-0.21}^{+0.18}$ ${0.45}_{-0.14}^{+0.23}$ $+{0.33}_{-0.23}^{+0.25}$ ${0.54}_{-0.19}^{+0.36}$
 Mid [Fe/H]20 $-{1.50}_{-0.05}^{+0.06}$ ${0.23}_{-0.04}^{+0.05}$ +0.30 ± 0.07 ${0.30}_{-0.05}^{+0.08}$
 Low [Fe/H]14 $-{2.45}_{-0.09}^{+0.08}$ ${0.30}_{-0.07}^{+0.10}$ $+{0.43}_{-0.10}^{+0.07}$ ${0.23}_{-0.08}^{+0.11}$
TOTALChemistry113−1.65 ± 0.06 ${0.64}_{-0.04}^{+0.05}$ $+{0.28}_{-0.03}^{+0.02}$ 0.24 ± 0.02
 High [Fe/H]18−0.70 ± 0.08 ${0.35}_{-0.06}^{+0.07}$ $+{0.15}_{-0.09}^{+0.10}$ ${0.39}_{-0.07}^{+0.09}$
 Mid [Fe/H]62−1.51 ± 0.030.22 ± 0.02+0.24 ± 0.03 ${0.19}_{-0.02}^{+0.03}$
 Low [Fe/H]33 $-{2.43}_{-0.06}^{+0.05}$ ${0.30}_{-0.04}^{+0.05}$ +0.46 ± 0.04 ${0.15}_{-0.03}^{+0.04}$

Notes. Measurements report the 16th, 50th, and 84th percentiles. The [Fe/H] bins are split at −2.0 and −1.1 dex based on the median posterior [Fe/H]. The missing subsamples of GOODSS Low [Fe/H], GOODSN High [Fe/H], and GOODSN LOW [Fe/H] are omitted because they have too few stars (N ≤ 3) for a useful analysis.

Download table as:  ASCIITypeset image

Recent studies, such as the H3 survey (Conroy et al. 2019a, 2019b), have found a flat halo MDF (with respect to Galactocentric radius) that is peaked at −1.2 dex, has an approximate scatter of 0.5 dex, and has a fairly significant tail toward low [Fe/H]. The H3-measured MDF is generally more metal-rich than previous studies have measured. As Conroy et al. (2019b) point out, many of the previous studies imposed [Fe/H] cuts or targeted blue stars to minimize contamination from nearby MS stars, which biases the MDF toward the metal-poor end (e.g., Carollo et al. 2007; Xue et al. 2015; Das & Binney 2016), while H3 does not select targets based on color. The HALO7D survey also targets blue stars, which may explain why we find a more metal-poor average of −1.65 dex for our sample.

We estimate the impact of our selection function on the HALO7D results in Figure 6. For the "No Selection" model, we take MIST isochrones and weight them by

which produces the purple histogram. 15 We then take those weighted isochrones and apply the observed HALO7D selection functions for each of the fields. This includes masking points outside of the model grid in Table 4, as well as ensuring that the resulting distributions in mf606W and mF814W are exactly what is observed in the HALO7D Chemistry sample, producing the other histograms. The dashed vertical lines show the locations of each distribution's median [Fe/H]. From this, we see that the HALO7D cuts on color, magnitude, and stellar parameters bias the measured MDF toward lower [Fe/H] compared to the underlying model population, and the median is shifted by ∼ −0.3 dex. When accounting for this bias, our median [Fe/H] of −1.65 ± 0.06 dex with a scatter of 0.64 ± 0.05 dex is consistent with the H3 mean halo metallicity of −1.2 dex.

Figure 6.

Figure 6. Resulting changes to the underlying MDF (purple) when the effective selection functions in each HALO7D field are used. The vertical dashed lines show the median of each distribution. HALO7D, like other surveys that target blue stars, results in a bias toward lower [Fe/H].

Standard image High-resolution image

5. Chemodynamics with HALO7D

5.1. Anisotropy Parameter, β

The power of a survey like HALO7D comes from being able to consider multiple dimensions together. We do this by computing field-averaged kinematics in the form of the anisotropy parameter:

as was used in the analysis of Cunningham et al. (2019b). With this definition, β = 1 implies that a population is on entirely radial orbits, β = 0 is for a population with isotropic orbits, and β → − for a population with entirely circular orbits. We calculate anisotropies, net halo rotation (〈vϕ 〉), and the fraction of disk contamination (fdisk) for our four fields and the total sample using the HALO7D stars that have chemical abundance measurements. This is done by modeling the spatial densities, MDFs, age distributions, and velocity component distributions for both the disk and the halo.

We first generate a "Thick Disk" and a "Halo" absolute magnitude probability distribution for each star. As in Section 3.1, we do this by weighting MIST isochrones using a Kroupa IMF, an age distribution, an [Fe/H] distribution, each star's dereddened color, and a density profile for either the Thick Disk or MW stellar halo. These absolute magnitude distributions give distributions on distances that we use in combination with the previously measured proper motions and LOS velocities to calculate the kinematics for each field. For both the disk and halo models, we use the posterior distributions in [Fe/H] and age for each star as measured from the abundance pipeline; because of this, the isochrone weighting—and therefore the distance distribution and corresponding velocity measurements—for each star depends on the abundances we have measured. For the disk isochrone weighting, we use the distributions shown in Table 7; the density profile is chosen to match that of Cunningham et al. (2019b). For the halo isochrone weighting, we use the same priors as in Table 2.

Table 7. Disk Model Distributions

DistributionFunctional Form
p(mass∣[Fe/H], age)Kroupa (2001) IMF
p(μ = mM) $\propto {D}^{3}\exp \left(-\tfrac{{R}_{D}}{3\,\mathrm{kpc}}-\tfrac{z}{1\,\mathrm{kpc}}\right)$, where ${R}_{D}^{2}={x}^{2}+{y}^{2}$
p( M ∣[Fe/H], age, μ) = ∏f p(Mf ∣[Fe/H], age, μ) ${\prod }_{f}{ \mathcal N }({M}_{f,[\mathrm{Fe}/{\rm{H}}],\mathrm{age}}| {m}_{f}-\mu ,{\sigma }_{{m}_{f}})$

Notes. For measuring kinematics from the real HALO7D data, we use the posterior [Fe/H] and age distributions as measured using the chemistry pipeline. The disk's density profile distribution is chosen to be the same as used in the analysis of Cunningham et al. (2019b).

Download table as:  ASCIITypeset image

When these absolute magnitude distributions are combined with the LOS velocities and proper-motion measurements of Cunningham et al. (2019a, 2019b), we have 3D velocities and positions for each star when belonging to the disk or halo, which allow us to measure the properties of the halo's velocity ellipsoid. For the velocity components, we assume the distributions shown in Table 8, where ${ \mathcal S }{ \mathcal K }{ \mathcal N }$ is a skew-normal distribution with shape parameter a. As before, these distributions are chosen to match those of Cunningham et al. (2019b). To transform between the observer frame and a Galactocentric one, we use r = 8.5 kpc, assume a circular speed of 235 km s−1, and solar peculiar motion (U, V, W) = (11.1, 12.24, 7.25) km s−1 (Schönrich et al. 2010). Our calculations use a right-handed coordinate system, such that 〈vϕ 〉 < 0 corresponds to prograde motion.

Table 8. Velocity Prior Distributions for Disk and Halo Model

ComponentDistributionFunctional Form
Halo p(vr ) ${ \mathcal N }(0,{\sigma }_{r})$
  p(vϕ ) ${ \mathcal N }(\langle {v}_{\phi }\rangle ,{\sigma }_{\phi })$
  p(vθ ) ${ \mathcal N }(0,{\sigma }_{\theta })$
Disk $p({v}_{{R}_{D}})$ ${ \mathcal N }(0,45\,\mathrm{km}\ {{\rm{s}}}^{-1})$
  p(vz ) ${ \mathcal N }(0,20\,\mathrm{km}\ {{\rm{s}}}^{-1})$
  p(vT ) ${ \mathcal S }{ \mathcal K }{ \mathcal N }(\mu =242\,\mathrm{km}\ {{\rm{s}}}^{-1},\sigma =46.2\,\mathrm{km}\ {{\rm{s}}}^{-1},a=-2)$

Note. These are the same distributions used in the analysis of Cunningham et al. (2019b).

Download table as:  ASCIITypeset image

We fit for the unknown halo velocity components of 〈vϕ 〉, ${\sigma }_{{v}_{r}}$, ${\sigma }_{{v}_{\theta }}$, and ${\sigma }_{{v}_{\phi }}$, which are used to calculate the anisotropy parameter β, choosing a uniform prior on 〈vϕ 〉 and noninformative priors that are proportional to 1/σr,ϕ,θ for the dispersion components. Because each star has the possibility of belonging to the disk or the halo, we fit the population as a mixture model using fdisk, which is the fraction of contamination by foreground disk stars. We note that the velocity ellipsoid parameters (i.e., 〈vϕ 〉, ${\sigma }_{{v}_{r}}$, ${\sigma }_{{v}_{\theta }}$, and ${\sigma }_{{v}_{\phi }}$) mentioned here and shown in the following tables and figures refer only to the halo population; we fix the parameters of the disk model velocity distributions for each LOS to those shown in the bottom of Table 7, based on the Besançon disk model (Robin et al. 2003). During the fitting process, we marginalize over the absolute magnitude distributions of each star to account for uncertainties in distance. The anisotropy results for various HALO7D subsets are listed in Table 9. One key takeaway is that the disk contamination is quite low in all the HALO7D fields for the chemistry sample, so we are measuring the properties of a fairly clean halo sample. The small possible number of disk stars in our sample is thus unlikely to bias the average abundances we present in Section 4.

Table 9. Summary of Anisotropy, Distance, and Galactocentric Radius for Each Field

FieldSamplevϕ σϕ σθ σr D D Ranger r Range fdisk β
  (km s−1)(km s−1)(km s−1)(km s−1)(kpc)(kpc)(kpc)(kpc)  
COSMOSChemistry $-{66}_{-13}^{+12}$ ${71}_{-8}^{+10}$ ${69}_{-7}^{+9}$ ${106}_{-12}^{+14}$ 7.3 ± 0.33.4 −13.3 ${13.3}_{-0.2}^{+0.3}$ 10.4−18.5 ${0.02}_{-0.02}^{+0.03}$ $+{0.36}_{-0.21}^{+0.17}$
 High [Fe/H] $-{86}_{-48}^{+56}$ ${96}_{-31}^{+57}$ ${44}_{-16}^{+22}$ ${98}_{-26}^{+49}$ ${9.6}_{-0.7}^{+1.2}$ 5.0 −17.1 ${15.2}_{-0.6}^{+1.1}$ 11.5−21.9 ${0.18}_{-0.13}^{+0.24}$ $-{0.13}_{-1.73}^{+0.75}$
 Mid [Fe/H]−37 ± 17 ${58}_{-10}^{+14}$ ${54}_{-9}^{+11}$ ${116}_{-18}^{+25}$ ${8.6}_{-0.6}^{+0.8}$ 4.6 −13.7 ${14.4}_{-0.5}^{+0.6}$ 11.2−18.8 ${0.04}_{-0.03}^{+0.06}$ $+{0.69}_{-0.18}^{+0.12}$
 Low [Fe/H]−81 ± 19 ${71}_{-13}^{+18}$ ${91}_{-15}^{+22}$ ${106}_{-17}^{+23}$ ${5.7}_{-0.2}^{+0.3}$ 2.4 −8.812.0 ± 0.29.7−14.5 ${0.04}_{-0.03}^{+0.07}$ $+{0.07}_{-0.53}^{+0.35}$
GOODSNChemistry−28 ± 14 ${55}_{-10}^{+12}$ ${75}_{-13}^{+16}$ ${144}_{-21}^{+27}$ ${6.4}_{-0.3}^{+0.4}$ 3.8 −17.6 ${12.2}_{-0.2}^{+0.3}$ 10.4−22.0 ${0.10}_{-0.05}^{+0.08}$ $+{0.76}_{-0.13}^{+0.09}$
 Mid [Fe/H]−34 ± 14 ${53}_{-10}^{+13}$ ${59}_{-13}^{+17}$ ${152}_{-22}^{+31}$ ${6.4}_{-0.3}^{+0.4}$ 3.6 −14.3 ${12.2}_{-0.2}^{+0.3}$ 10.3−18.9 ${0.10}_{-0.06}^{+0.08}$ $+{0.83}_{-0.11}^{+0.07}$
GOODSSChemistry $-{68}_{-13}^{+14}$ ${47}_{-9}^{+13}$ ${136}_{-23}^{+31}$ ${121}_{-20}^{+28}$ ${6.0}_{-0.3}^{+0.4}$ 2.3 −13.1 ${12.3}_{-0.2}^{+0.3}$ 9.7−18.4 ${0.15}_{-0.07}^{+0.10}$ $+{0.11}_{-0.55}^{+0.34}$
 High [Fe/H] $-{96}_{-25}^{+28}$ ${46}_{-19}^{+38}$ ${223}_{-68}^{+126}$ ${124}_{-36}^{+64}$ 8.3 ± 0.51.4 −14.614.4 ± 0.49.2−19.7 ${0.36}_{-0.16}^{+0.18}$ $-{1.20}_{-3.63}^{+1.42}$
 Mid [Fe/H]−36 ± 20 ${42}_{-14}^{+20}$ ${55}_{-13}^{+21}$ ${124}_{-28}^{+44}$ ${7.5}_{-0.5}^{+0.8}$ 4.7 −11.6 ${13.5}_{-0.4}^{+0.6}$ 11.3−17.1 ${0.10}_{-0.08}^{+0.14}$ $+{0.77}_{-0.24}^{+0.12}$
EGSChemistry−31 ± 13 ${78}_{-9}^{+10}$ ${69}_{-8}^{+9}$ ${137}_{-14}^{+17}$ 8.7 ± 0.34.6 −17.1 ${12.5}_{-0.2}^{+0.3}$ 9.9−19.5 ${0.04}_{-0.02}^{+0.04}$ $+{0.68}_{-0.11}^{+0.08}$
 High [Fe/H] $+{5}_{-16}^{+17}$ ${35}_{-14}^{+20}$ ${35}_{-10}^{+16}$ ${99}_{-24}^{+40}$ ${11.3}_{-0.9}^{+1.0}$ 7.6 −20.914.6 ± 0.811.7−23.0 ${0.09}_{-0.07}^{+0.14}$ $+{0.85}_{-0.22}^{+0.09}$
 Mid [Fe/H]−32 ± 9 ${34}_{-7}^{+9}$ ${49}_{-8}^{+11}$ ${132}_{-19}^{+24}$ ${8.8}_{-0.5}^{+0.4}$ 5.6 −13.912.6 ± 0.310.4−16.7 ${0.03}_{-0.02}^{+0.05}$ $+{0.86}_{-0.07}^{+0.05}$
 Low [Fe/H] $-{46}_{-35}^{+38}$ ${126}_{-23}^{+32}$ ${102}_{-19}^{+26}$ ${165}_{-28}^{+40}$ ${5.7}_{-0.2}^{+0.3}$ 2.9 −18.5 ${10.5}_{-0.1}^{+0.2}$ 9.1−20.8 ${0.13}_{-0.08}^{+0.11}$ $+{0.43}_{-0.38}^{+0.23}$
TOTALChemistry−47 ± 7 ${70}_{-5}^{+6}$ ${80}_{-5}^{+6}$ ${125}_{-8}^{+9}$ ${7.6}_{-0.2}^{+0.3}$ 3.5 −14.812.7 ± 0.210.0−19.00.04 ± 0.02 $+{0.57}_{-0.08}^{+0.07}$
 High [Fe/H] $-{53}_{-22}^{+23}$ ${78}_{-15}^{+20}$ ${106}_{-18}^{+25}$ ${101}_{-16}^{+23}$ ${10.5}_{-0.7}^{+1.0}$ 4.6 −16.1 ${14.9}_{-0.7}^{+1.0}$ 11.0−20.8 ${0.18}_{-0.09}^{+0.11}$ $-{0.05}_{-0.62}^{+0.41}$
 Mid [Fe/H]−35 ± 747 ± 5 ${52}_{-5}^{+6}$ ${130}_{-11}^{+13}$ ${8.2}_{-0.3}^{+0.4}$ 4.8 −13.7 ${13.0}_{-0.2}^{+0.3}$ 10.6−18.3 ${0.02}_{-0.02}^{+0.03}$ $+{0.82}_{-0.05}^{+0.04}$
 Low [Fe/H]−63 ± 17 ${94}_{-11}^{+14}$ ${104}_{-12}^{+14}$ ${132}_{-15}^{+19}$ ${5.0}_{-0.2}^{+0.3}$ 2.6 −16.411.1 ± 0.39.2−19.2 ${0.05}_{-0.03}^{+0.05}$ $+{0.30}_{-0.25}^{+0.19}$

Notes. Measurements report the 16th, 50th, and 84th percentiles, and we use the convention that vϕ < 0 corresponds to prograde rotation. The subsamples are the same as listed in Table 6.

Download table as:  ASCIITypeset image

5.2. Anisotropy and Abundances

Figure 7 reveals that the HALO7D fields have significant differences in their kinematics, as was first seen in Cunningham et al. (2019b). We note that we are considering a smaller subset of the HALO7D stars than was used in Cunningham et al. (2019b), so we will not directly compare our measured kinematics. With the chemistry sample of HALO7D, we find that GOODSN and EGS are more radially biased than GOODSS and COSMOS.

Figure 7.

Figure 7. Summaries of field distributions in [Fe/H], [α/Fe], net halo rotation 〈vϕ 〉, fraction of disk contamination fdisk, and Galactocentric radii compared to anisotropy. The abundance distributions are those fit with a 2D multivariate normal, as summarized in Figure 5 and explained in the text. The error bars in these panels show the median and 68% confidence interval for each distribution being considered. These plots show that the fields begin to separate when we consider kinematics. As described in the text of Section 5.1, we model both the disk and halo velocity component distributions, so a large fdisk measurement does not directly cause the large prograde rotations we measure in some subsamples. EGS and GOODSN both have very radial anisotropies (β → 1), while GOODSS is the field with the most isotropic (β ∼ 0) orbits. All fields cover the same approximate radial extent. EGS and GOODSN have the smallest net rotation and the most radial anisotropies, both of which are signatures of the GSE. COSMOS and GOODSS both have large and negative net rotations and show less radial anisotropies, suggesting that these fields may have a larger contribution from the in situ halo.

Standard image High-resolution image

We also notice that all of the fields have significant net halo rotation, different than the small or nearly zero net rotation measured by Cunningham et al. (2019b) and others (e.g., Belokurov et al. 2020; Bird et al. 2021). We also note a small apparent correlation between fdisk and anisotropy. These may be due to distance systematics (Schönrich et al. 2011) or our choice to model the velocity ellipsoid with a single Gaussian. Lancaster et al. (2019) find the halo velocity ellipsoid is well described by two components, one nonrotating and radially biased and a second nonrotating, more isotropic population. We use a single component as a more robust choice for our smaller sample size. As a consequence, we focus our analysis on the trends in 〈vϕ 〉 between different subsamples.

To assess the statistical significance of the differences we see between the fields, we assume a null hypothesis that the fields have the same halo velocity ellipsoid parameters that we find for the total population. After generating many simulations of HALO7D-like data and measuring their anisotropies, we indeed find the differences we see between the four fields are not likely to be explained by chance alone. A description and summary of these tests are presented in Appendix C.

These anisotropy differences are also not likely explained by differences in β as a function of average Galactocentric radius; between Galactocentric radii of 10 and 20 kpc, Loebman et al. (2018) find an increase in median β from ∼0.5 to ∼0.6 using simulated MW-like galaxies. This approximate trend of Δβr = 0.01 kpc−1 is not likely to explain the large anisotropy differences between the HALO7D fields. Similarly, various studies of different stellar populations in the MW stellar halo have found that the anisotropy profile does not change significantly in the 10–20 kpc radial range (e.g., Bird et al. 2019, 2021; Lancaster et al. 2019; Liu et al. 2022; Wu et al. 2022) after stars associated with Sagittarius have been removed. In these studies, however, the β measurements come from averages over the sky, whereas the HALO7D sample is able to compare along different LOSs.

Anisotropy variations at different positions in the MW stellar halo have been previously observed. Iorio & Belokurov (2021), for example, measure anisotropies of RR Lyrae in bins of Galactic Z and R (see their Figures 3 and 7). While many of their bins have high anisotropy (β ∼ 0.8), there are also a handful of regions with isotropic measurements (β ∼ 0). It is possible that the more isotropic HALO7D fields may intersect with some of these regions.

One possible explanation for the variation in anisotropy among the fields is the difference in fractional contribution from different MW halo progenitors. For instance, the highly radial EGS and GOODSN samples could indicate that these fields are dominated by stars from the GSE—a progenitor that is marked by stars on strongly radial orbits with no net rotation (e.g., Belokurov et al. 2020 find 〈vϕ 〉 ∼ 0 km s−1)—while GOODSS and COSMOS have more significant contributions from non-GSE sources. Naidu et al. (2020), for instance, found that the halo fraction contributed by the GSE peaks at rgal ∼ 20 kpc and that the in situ halo contribution becomes minimal around the same radius. Many other studies have found similar peaks in the fractional contribution from a radial halo population in the rgal range of 10–20 kpc (e.g., Deason et al. 2018; Lancaster et al. 2019; Iorio & Belokurov 2021; Liu et al. 2022). Our HALO7D fields cover the range of radii where the transition between the dominance of the in situ halo decreases and the dominance of the GSE peaks, so differences in their average kinematics may be particularly sensitive to variation in the fractional contribution from these structures. Because all the fields have similar average chemical abundance patterns, we expect that all the fields have relatively large contributions from the GSE, and that contributions from non-GSE sources likely cover similar regions of abundance space, such as Wukong (Naidu et al. 2020; Yuan et al. 2020), Nereus (Donlon et al. 2022; Donlon & Newberg 2023), and the in situ halo/Splash.

5.3. Anisotropy in [Fe/H] Bins

To explore the relationship between kinematics and chemistry, we split the sample up into different [Fe/H] bins. Based on previous MW halo inventory studies, the high-[Fe/H] bin ([Fe/H] > −1.1 dex) is where we expect the largest fraction of in situ halo/disk stars and the mid-[Fe/H] bin (−2 < [Fe/H] < −1.1 dex) is where we expect stars from the GSE to dominate. The binned abundances and kinematics are summarized in Tables 6 and 9 and Figure 8.

Figure 8.

Figure 8. Same as Figure 7, but for different [Fe/H] bins that are split at −2.0 and −1.1 dex. Triangles that point up correspond to the high-[Fe/H] bin, squares are the middle [Fe/H] bin, and triangles that point down are the low-[Fe/H] bin. The missing [Fe/H] bins for GOODSN and GOODSS are omitted for having too few stars for useful calculations. All of the mid-[Fe/H] bins have very radial anisotropies; the stars in this bin have metallicities and kinematics similar to the GSE. COSMOS changes in anisotropy and halo rotation more than EGS does for each of the bins. This suggests that EGS might be dominated by stars from a single progenitor, while COSMOS has a higher fractional contribution from multiple progenitors. The GOODSS high-[Fe/H] bin has the most circular orbits compared to the other fields, suggesting that this subsample may contain the largest fraction of in situ halo stars.

Standard image High-resolution image

All fields in the mid-[Fe/H] bin have highly radial anisotropies and a small net halo rotation (〈vϕ 〉), which suggests that stars in this bin are chemodynamically similar and thus may originate from a single progenitor. In the low- and high-[Fe/H] bins, we notice differences in the kinematics between the fields. In particular, the high-[Fe/H] bin shows the largest differences in anisotropy between the fields, with EGS having the most radial stars and GOODSS having the most circular stars. As before, we test the probability of observing these results by chance alone and find that the anisotropy differences in the high-[Fe/H] bin are likely statistically significant; these tests are summarized in Appendix C. These anisotropy differences between the fields may be caused by variations in the fractional contributions of different progenitors, which change the average kinematics we measure. These differences highlight the additional information available on smaller spatial scales that a survey like HALO7D is able to capture.

The mid-[Fe/H] bin covers −2 < [Fe/H] < −1.1 dex in metallicity and spans Galactocentric radii of ∼10–20 kpc in all fields. The GSE has been measured to dominate the halo in this radial and metallicity range (e.g., Naidu et al. 2020). These facts, combined with the low net rotation and highly radial anisotropy we measure in the mid-[Fe/H] bins, are evidence that all the fields have a dominant fractional contribution from the GSE in the −2 < [Fe/H] < −1.1 dex range. Because we measure a nonzero 〈vϕ 〉 of ∼ −30 km s−1 in this bin, the GSE is almost certainly not the only progenitor present in this bin; however, the relative contributions from prograde structures must be relatively consistent between the different fields to produce a similar net rotation. The high-[Fe/H] bin has the largest range of anisotropies and vϕ values compared to the other bins. This is the bin that we expect to contain a significant number of stars from the GSE, the in situ halo, and a few other known MW progenitors, because its higher metallicity intersects with the MDFs of multiple structures. As a consequence, this large number of progenitor options is likely causing the increased variation we see in kinematics in this bin. Like the mid-[Fe/H] bin, the low-[Fe/H] bin does not show much variation between the chemodynamics of the fields.

The results from EGS and COSMOS are particularly useful to compare because they have approximately the same number of stars in each of the [Fe/H] bins, which limits the impact that sample size has on these comparisons. EGS has anisotropies in each [Fe/H] bin that are quite radial and fairly consistent with each other, which suggests that EGS is dominated by stars from a single progenitor in all [Fe/H] bins. As explained in the previous paragraph, the origin of these stars may be the GSE, as the EGS stars show the characteristic radial bias and low net rotation previously measured for the GSE. For COSMOS, the anisotropy and net halo rotation change more noticeably between the [Fe/H] bins. At high and low [Fe/H], the COSMOS anisotropies are isotropic and the net rotations are prograde, which suggests that these bins have significant contributions from sources other than that which produced the kinematics in the mid-[Fe/H] bin. Like the other fields, the COSMOS mid-[Fe/H] bin has kinematics and chemistry that are consistent with the GSE. A possible origin for the high-[Fe/H], prograde stars in COSMOS is the in situ halo, while the low-[Fe/H], prograde stars may originate from the Wukong, Nereus, and/or the metal-poor tail of the in situ halo. Because COSMOS is closer to the disk plane than any of the other HALO7D fields and EGS is the farthest, the larger contribution of in situ halo stars in COSMOS and a smaller contribution in EGS is not unexpected. All the HALO7D fields are located far away from any Sagittarius Stream debris, so this is not likely to explain the difference in chemodynamics we observe between the fields.

In summary, the HALO7D fields show significant differences when considering chemistry and kinematics together. Looking at different [Fe/H] bins reveals that all the fields may have a significant fraction of stars from the GSE in the middle [Fe/H] bin of −2 < [Fe/H] < −1.1 dex. EGS has similar kinematic properties in its high- and low-[Fe/H] bins, and these are again consistent with debris from the GSE. COSMOS, on the other hand, has more variation in its kinematics across the [Fe/H] bins, which suggests it has larger fractional contributions from non-GSE sources. The high-[Fe/H] bin has properties consistent with a large fraction of in situ halo stars, the mid-[Fe/H] bin has properties like the GSE, and the low-[Fe/H] bin has properties like the metal-poor tail of the in situ halo, the Wukong progenitor, and/or the Nereus progenitor. This spatial nonuniformity is compelling evidence that the MW stellar halo is not uniformly mixed in its chemodynamical distributions.

6. Summary

We have measured [Fe/H] and [α/Fe] abundances for 113 MSTO MW stellar halo stars across four CANDELS fields in the HALO7D survey. By focusing on MSTO stars in the stellar halo at Galactocentric radii in the range 10–40 kpc, HALO7D is able to measure MW halo properties on smaller spatial scales than other contemporary surveys. Our abundances are combined with the previously measured 3D positions and 3D velocities from HALO7D (Cunningham et al. 2019a, 2019b) to measure the variation in average chemodynamical properties along each LOS; these properties include the net halo rotation, anisotropy, and average abundances. To measure our abundances for HALO7D, we have created a Bayesian pipeline that uses photometric and spectroscopic information to determine stellar parameters (${T}_{\mathrm{eff}},\mathrm{log}g$, age, and distance modulus) and chemical composition ([Fe/H], [α/Fe]) for MSTO stars without known distances (Section 3). Our key results include:

  • 1.  
    The abundance patterns in each of the HALO7D fields agree with each other. The average [Fe/H] of the full 113 star HALO7D Chemistry sample is −1.65 dex with a scatter of 0.61 dex, which is more metal-poor than some recent contemporary surveys (e.g., 〈[Fe/H]〉 ∼ −1.2 dex for Conroy et al. 2019b; Naidu et al. 2020), but this is almost certainly because of our blue selection function. The average [α/Fe] for the HALO7D Chemistry sample is +0.28 dex with a scatter of 0.24 dex, which is in agreement with what we would expect for a sample drawn from a population of old, metal-poor halo stars (Section 4, Figures 4 and 5).
  • 2.  
    The HALO7D fields separate in kinematic space when we measure average properties like the anisotropy parameter β. EGS and GOODSN show more radially biased orbits and near-zero halo rotation compared to GOODSS and COSMOS, which have more isotropic orbits and fairly negative halo rotation (Section 5.2, Figure 7).
  • 3.  
    Breaking the HALO7D fields into low-, mid-, and high-[Fe/H] bins at −2.0 and −1.1 dex shows differences in the chemodynamic makeup of the fields. All the fields have similar anisotropy in the mid-[Fe/H] bin, but the high- and low-[Fe/H] bins show differences between the fields. EGS has relatively similar anisotropy and net halo rotation between the [Fe/H] bins, all of which are similar to the properties of the GSE. COSMOS, on the other hand, has a mid-[Fe/H] bin with kinematics that are similar to the GSE, but its low- and high-[Fe/H] bins have kinematics that suggests these bins have larger contributions from prograde structures, such as the kicked-up disk/in situ halo, the metal-poor progenitors of Nereus and Wukong. These chemodynamical differences between the fields, even at the same Galactocentric radii, suggest that the MW stellar halo is not uniformly mixed along different LOSs (Section 5.3, Figure 8).

Future work will focus on studying the full 7D chemodynamic relationship of our stars, such as using an MW potential model to estimate individual orbits, which will allow us to better quantify the relative contributions from different progenitors in each field. We are also in the process of expanding the HALO7D survey to include more LOSs and more stars along each LOS, which will allow us to measure how the chemodynamical distributions change as a function of 3D position, with the goal of contributing to a more complete picture of our Galaxy's accretion history.

Acknowledgments

The authors thank the anonymous referee for comments that helped improve the clarity of this paper. K.M., C.M.R., and P.G. were supported by NSF grant No. AST-1616540 and by NASA through grant No. HST-AR-16625.005-A, awarded by the Space Telescope Science Institute. E.C.C. acknowledges support for this work provided by NASA through the NASA Hubble Fellowship Program grant No. HST-HF2-51502.001-A, awarded by the Space Telescope Science Institute, which is operated by the Association of Universities for Research in Astronomy, Inc., for NASA, under contract NAS5-26555. I.E. acknowledges generous support from a Carnegie–Princeton Fellowship through Princeton University. E.N.K. acknowledges support from the National Science Foundation under grant No. AST-2233781.

The authors recognize and acknowledge the very significant cultural role and reverence that the summit of Maunakea has always had within the indigenous Hawaiian community. We are most fortunate to have the opportunity to conduct observations from this mountain.

K.M. thanks Avesta Rastan for helpful conversations that improved the presentation of scientific figures. K.M. also thanks Brian DiGiorgio for insightful conversations that led to significant reductions in computation time.

Facilities: HST - Hubble Space Telescope satellite, Keck II (DEIMOS) - .

Software: Astropy (Astropy Collaboration et al. 2013,2018, 2022), corner (Foreman-Mackey 2016), dustmaps (Green 2018), emcee (Foreman-Mackey et al. 2013), IPython (Pérez & Granger 2007), jupyter (Kluyver et al. 2016), matplotlib (Hunter 2007), numpy (Harris et al. 2020), scipy (Virtanen et al. 2020), scikit-learn (Pedregosa et al. 2011), spec2d (Cooper et al. 2012).

Appendix A: Chemical Abundance Pipeline Internal Error with Fake Stars

We test the ability of our pipeline to recover input chemical abundances across a range of stellar parameters and spectral S/Ns by generating fake stars with known photometry and stellar parameters. We create two types of fake stars; those with single spectral observations that model our real observations of GC stars in M2 and M92 (see Appendix B), and those with multiple spectroscopic visits that model real HALO7D data. In each case, a fake star consists of (mF606W, mF814W) apparent magnitudes and between one and 10 separate spectral observations at a chosen S/N.

In the case of the GC analog fake stars, we use the literature-defined MIST isochrone to define the photometry and stellar parameters. We do this by taking a real GC star and finding the closest point on the MIST isochrone in terms of photometry; the fake star then inherits that point's absolute magnitudes (MF606W, MF814W) and stellar parameters (${T}_{\mathrm{eff}},\,\mathrm{log}g$), as well as the cluster distance, age, and [Fe/H]. The [α/Fe] is randomly drawn from a uniform distribution in the range of −0.2 to +0.6 dex. With the fake star's stellar parameters and abundances defined, we can smooth the corresponding synthetic model spectrum by the real GC star's observed seeing and LSF, and then rebin onto the observed DEIMOS wavelength array. We next multiply the smoothed, continuum-normalized fake spectrum by an estimate of the real star's continuum (as described in Section 3.2.3) to obtain DEIMOS-like observations for the fake star. Finally, the chosen S/N of the fake star's spectrum is used to apply flux noise and to define the flux uncertainty in each pixel. With the fake star having apparent magnitudes and a spectral observation at a chosen S/N, we are able to feed it to the chemical abundance pipeline by following the process listed in Section 3.

For the HALO7D-like fake star analogs, we follow a similar process as with the GC stars. To define the input parameters of the fake star, we use the real HALO7D star's posterior distribution from its analysis using the chemical abundance pipeline; the fake star's parameters are chosen to be the median of the posterior distribution, and the apparent magnitudes are exactly those of the HALO7D star. The corresponding synthetic model spectrum is repeated so that the fake star is given the same number of spectroscopic visits/observations as the real HALO7D star that it is based on. The seeing, LSF, and continuum of each real spectroscopic observation are used to define the smoothing and continuum of the different fake spectral visits. Next, a chosen S/N defines the resulting combined S/N of the different spectroscopic visits, where the S/Ns of the individual visits are chosen to follow the same ratio of the real HALO7D star; the S/N of each spectroscopic visit sets the flux noise and the flux uncertainty in each pixel. As with HALO7D stars, the apparent magnitudes and multiple spectroscopic visits of the fake stars are processed through the chemistry pipeline.

For each real star that defines a set of stellar parameters and abundances, we create fake stars with varying S/Ns from 5 to 200 Å−1. These fake stars are processed identically to the real stars, following the methods of Section 3, to assess the pipeline's ability to recover the input parameters as a function of spectral S/N. An example posterior distribution for one fake star with spectral S/N of 40 Å−1 and properties that place it near the MSTO of M2 (i.e., distance of 11 kpc, age of 13 Gyr, [Fe/H] of −1.65 dex) is shown in Figure 9. The black points and lines show the posterior samples that have good agreement with the blue lines, which show the values of the input parameters. In cases where the pipeline incorrectly measures $\mathrm{log}g$—that is, when the star is assigned the incorrect phase, placing it on one branch when it belongs to another—we find that the chemistry is unchanged; specifically, the posterior [Fe/H] and [α/Fe] samples are in agreement for each of the possible phases. Since we are most interested in measuring abundances, we include stars with incorrectly measured posterior phase/$\mathrm{log}g$.

Figure 9.

Figure 9. Posterior distributions for a fake star that has a single spectral observation with median S/N of 40 Å−1 with stellar parameters that are near the MSTO of M2. The blue lines show the locations of the fake star's true parameters, the black points/lines/contours show the posterior samples, and the orange histograms show 1D projections of the prior for a given parameter. The titles above each histogram show the median and the 68% confidence interval in that parameter.

Standard image High-resolution image

The posterior chemical abundance distributions of the fake stars are then used to assess the internal errors of our chemistry pipeline and the reasonableness of the posterior uncertainties. The 2D abundance distance between the input truth and the posterior median is measured using the covariance of the posterior [Fe/H] and [α/Fe] samples; these distances should follow a chi-squared distribution with 2 degrees of freedom (dof) and a scale of 1 if the posterior uncertainties are reasonable. While the distribution we measure does indeed follow a chi-square with 2 dof, we instead measure a scale factor of 1.31, meaning that our posterior widths need to be inflated by this amount (i.e., the covariance matrix needs to be inflated by 1.312) to capture the pipeline's true uncertainty. All remaining stellar parameter measurements in this paper, including Figure 10 and all other figures, have had their posterior uncertainties increased by this factor. After applying this inflation, we still noticed a slight bias in the [Fe/H] and [α/Fe] results for fake stars with spectral S/N < 20 Å−1. As a result, we ignored real and fake stars with spectral S/N < 20 Å−1 from all analyses.

Figure 10.

Figure 10. Comparison of measured posterior [Fe/H], [α/Fe], and input truth for fake spectra as a function of spectral S/N. The blue error bar points show the 68% confidence interval of the posterior around the median. The orange lines in the top panels show the median and 68% scatter within S/N bins of 50 stars and the orange lines in the bottom panels show the median 68% width of the posterior samples (and corresponding 1σ uncertainty) in those same S/N bins. We remove observations with spectral S/N < 20 Å−1 because the abundance uncertainties begin to show systematic biases at this level.

Standard image High-resolution image

Plotting the difference between the posterior abundances and the input truth versus input spectral S/N, as in Figure 10, shows that the pipeline recovers useful (posterior disagreement <0.25 dex) [Fe/H] above S/N ∼ 25 Å−1 and [α/Fe] above S/N ∼ 45 Å−1. This is similar performance to other spectroscopic analysis pipelines of DEIMOS data (e.g., Kirby et al. 2008; Escala et al. 2019). The uncertainty-scaled difference between truth and posterior abundance is shown in Figure 11. These distributions show good agreement with the unit Gaussian (shown in orange), implying that the posterior abundances and uncertainties are reasonable. Our choice of a uniform prior on [α/Fe] that has no correlation with the other stellar parameters can be improved in the future with access to MIST isochrones that contain different values of [α/Fe]; this flat [α/Fe] prior currently plays a part in causing the relatively large scatter on our ability to recover [α/Fe] as compared to [Fe/H].

Figure 11.

Figure 11. Uncertainty-scaled comparison of measured posterior [Fe/H], [α/Fe], and input truth for fake spectra. The unit Gaussian (orange line) shows that the pipeline is returning reasonable abundances and corresponding uncertainties.

Standard image High-resolution image

Our posterior uncertainties are relatively large compared to other pipelines because of our large prior uncertainties in $\mathrm{log}g$, which come from unknown distances. Many other investigations that measure stellar abundances are able to use previously measured parallaxes or distances to tightly constrain the possible $\mathrm{log}g$ values, which helps to return better-measured abundances from the same spectral S/N and to push the limiting S/N to much lower values. Our necessarily diffuse distance priors cause the pipeline to consider a wider range of models and thus return more uncertain abundances.

While a given star's posterior [Fe/H] and [α/Fe] measurements are correlated (as seen in Figure 9, as well as our other posterior distributions), we want to ensure that the pipeline does not produce a correlation in any population statistics (e.g., group means and spreads) measured from a collection of stars. To test this, we fit a hierarchical model using the Δ[Fe/H] and Δ[α/Fe] 16 posterior distributions of all fake stars to measure a pooled population mean and covariance matrix of a 2D multivariate normal. The posterior mean of the population distribution was centered at (Δ[Fe/H], Δ[α/Fe]) = (0, 0) dex, corresponding spreads of (0.02, 0.01) dex, and a correlation between Δ[Fe/H] and Δ[α/Fe] that is consistent with 0. Finally, we also perform a prior sensitivity analysis by changing the parameters of the [Fe/H] and age distributions in Table 2 while re-analyzing the fake stars; we find that the posterior chemical abundances are largely unchanged for fake stars with spectral S/N > 20 Å−1 when using reasonable choices of the [Fe/H] and age distribution parameters. Therefore, the pipeline does not induce artificial correlations in the population statistics.

Appendix B: Chemical Abundance Pipeline External Validation with GCs

In Appendix A, we used fake stars with well-behaved data to test our chemical abundance pipeline. In this appendix, we assess the pipeline's ability to recover chemical abundances using real data from well-studied GCs: M2 and M92. M2 is at a heliocentric distance of 11 kpc and has an iron abundance of −1.65 dex (Harris 1996, 2010 edition), which is relatively typical of MW halo stars. M92 is at a heliocentric distance of 9 kpc and is quite metal-poor, with an average [Fe/H] of −2.35 dex (Harris 1996, 2010 edition), so it is a fairly extreme test of our pipeline's ability to measure abundances with hot, metal-poor stars on the MSTO. A CMD of the stars used in our analysis is shown in Figure 12 to highlight that this sample includes many stars around the MSTO, which makes this a good test for the pipeline's accuracy for HALO7D-like data. The photometry comes from Stetson et al. (2019). The spectral observations have 20 < S/N < 100 Å and were observed with Keck II/DEIMOS in a similar configuration to the HALO7D data; see Escala et al. (2019) for details. The main differences between the DEIMOS configurations are that the GC data used a 0farcs8 slitwidth and a central wavelength of ∼7500 Å instead of ∼7200 Å.

Figure 12.

Figure 12. CMD of the M2 and M92 GC stars in HST filters (STMAG) used in our validation sample. The HST photometry was calculated by transforming ground-based photometry, which is why the uncertainties are quite large. Both clusters have stars that are around the MSTO.

Standard image High-resolution image

To transform the Johnson–Cousins photometry of Stetson et al. (2019), we use the mu magnitudes and mu mv colors of each cluster and align the literature MIST isochrone on the CMD data using the appropriate filters. With this, we find the closest matching point on the isochrone to each star, which assigns an estimated mass to each star. Using this mass estimate and the same isochrone in HST filters gives each star's approximate mF606W and mF814W magnitudes.

We use the same prior-building process for the GC stars as for the MW halo stars. That is, the known distances, ages, and abundances of the clusters are not used; instead, we assume that all the cluster stars are distributed like MW halo stars. While this leads to larger posterior uncertainties than we could achieve using the known GC parameters, it gives a better measurement of how well the pipeline will perform with MW halo stars. These GC comparisons also function to show the insensitivity of the pipeline to our choice of priors in Table 2; we know the prior distributions do not describe the GC stars very well, so our ability to recover reasonable posterior abundances for stars with spectral S/N > 20 Å−1 is not overly dependent on our priors.

For each cluster, we measure a population average and scatter in [Fe/H] and [α/Fe] using the posterior distributions of each star. For M92, we measure an average [Fe/H] of −2.31 dex with a scatter of 0.08 dex and an average [α/Fe] of 0.16 dex with a scatter of 0.10 dex. For M2, we measure an average [Fe/H] of −1.64 dex with a scatter of 0.10 dex and an average [α/Fe] of 0.19 dex with a scatter of 0.09 dex. These means in [Fe/H] are consistent with the literature values quoted above. A significant contribution to the scatter is that we force the priors to contain MW halo-like properties instead of the properties of each cluster. This leads to stars with lower-S/N spectra relying on incorrect prior information, thereby increasing the scatter.

We are most interested in assessing whether the pipeline returns abundance measurements that are unbiased over the HALO7D range of spectral S/N and whether the posterior uncertainties are reasonable. To that end, we compare the chemical abundances to our population averages of [Fe/H] and [α/Fe] for the GCs instead of to literature values. This is shown in Figure 13, where the uncertainty-scaled [Fe/H] and [α/Fe] distributions for both clusters agree with the unit Gaussian. This is evidence that the pipeline is giving reasonable abundances for real stars with spectra like those of HALO7D.

Figure 13.

Figure 13. Uncertainty-scaled comparison of measured [Fe/H] and [α/Fe] with population averages for stars in M92 and M2. The green lines show that a unit Gaussian has good agreement with our results.

Standard image High-resolution image

In addition to not being MW halo stars, and therefore not truly following the assumptions that go into the priors, the cluster data are dissimilar to HALO7D in a few other ways. First, all the GC data consist of a single spectroscopic observation per star instead of multiple, which means that any systematic errors in an individual spectrum are more likely to impact the results (e.g., poor wavelength solution, bad skyline removal, and vignetting). Second, the GC photometry comes from ground-based observations that have significantly larger uncertainties than our HALO7D photometry measured with HST; this large color uncertainty leads to significantly more diffuse priors for the GC stars as compared to HALO7D stars. Third, the GC spectra have wavelength solutions that are generally less well behaved than HALO7D spectra, because the observations were calibrated with a single set of relatively red arc lamp exposures and did not use a second set of bluer arc lamps, as was done for the HALO7D observations. Finally, the DEIMOS configuration for the GC stars is centered on ∼7500 Å instead of the ∼7200 Å used in HALO7D, which means that many of the GC spectra do not extend to wavelengths as blue as ∼5000 Å, where many of the strongest [Fe/H] and [α/Fe] features exist.

Appendix C: Statistical Significance of HALO7D Anisotropy Differences

To test the statistical significance of the differences we see in the anisotropy measurements of the four HALO7D fields, we generate simulated HALO7D-like surveys and measure their kinematics. For each HALO7D-like realization, we assume that each of the four fields has the same number of stars as we observed: 40 in EGS, 36 in COSMOS, 21 in GOODSN, and 16 in GOODSS. For the fraction of disk contamination, we again use the values measured in our HALO7D analysis, such that two of the EGS stars are disk contaminants, as are two of the GOODSN and GOODSS stars, and one of the COSMOS stars; the remaining 106 of the 113 stars are halo stars.

We generate realizations of HALO7D that follow the same magnitude and color limits in each of the four fields. For the disk stars, the [Fe/H]s are drawn from

Equation (C1)

the ages are drawn from

Equation (C2)

and the masses and distance moduli are drawn from the prior distributions shown in Table 7. The [Fe/H] distribution comes from an analytical approximation of the "high-alpha disk" and "metal-weak thick disk" populations of Naidu et al. (2020), while the age distribution comes from another analytical approximation of the "high-alpha disk," from Bonaca et al. (2020). The resulting prior MDFs for the Halo and Disk models are compared in Figure 14.

Figure 14.

Figure 14. Comparison of the chosen prior Halo and Disk model MDFs. The Halo model is a simple Gaussian centered at −1.5 dex with a scatter of 1.0 dex, while the Disk model is a mixture of a Gaussian and skew-normal distribution, as described in Equation (C1). The Disk's distribution comes from an analytical approximation of the "high-alpha disk" and "metal-weak thick disk" populations of Naidu et al. (2020).

Standard image High-resolution image

The velocity components for the disk stars are drawn from the prior distributions in the bottom half of Table 8. For the halo stars, the [Fe/H], age, mass, and distance modulus are drawn from the prior distributions shown in Table 2. For the kinematics of each halo star, we assume that all four fields have the same halo velocity ellipsoid that matches the values we measure for the total HALO7D population. Specifically, 〈vr 〉 = 〈vθ 〉 = 0 km s−1, 〈vϕ 〉 = −47 km s−1, and (σr , σϕ , σθ ) = (125, 70, 80) km s−1, which implies an anisotropy of β = 0.57. The velocity components for each halo star are drawn from the distributions in the top half of Table 8.

With each star having a defined [Fe/H], age, mass, and distance modulus, we use the corresponding MIST isochrone to get the apparent mF606W and mF814W magnitudes, ensuring that the colors and magnitudes are within the limits that we observe for each HALO7D field. The distance to each star is used to transform the velocity components into proper motions and LOS velocities, which means that each fake star has a set of observables that make them comparable to the HALO7D sample. Each proper-motion component is given an uncertainty equal to the median uncertainty from the HALO7D sample: ${\sigma }_{{\mu }_{l}\cdot \cos b}=0.17\,\mathrm{mas}\ {\mathrm{yr}}^{-1}$, ${\sigma }_{{\mu }_{b}}=0.16\,\mathrm{mas}\ {\mathrm{yr}}^{-1}$. For the LOS velocities, we use Figure 7 of Cunningham et al. (2019a), which shows the relationship between ${\sigma }_{{V}_{\mathrm{LOS}}}$ and a star's apparent m606W magnitude for the HALO7D sample; we capture this relationship as

which gives LOS velocity uncertainties of 1.6 km s−1 for m606W = 19 mag, 5.5 km s−1 for m606W = 22 mag, and 41.5 km s−1 for m606W = 24.5 mag.

As with the HALO7D sample, we generate absolute magnitude prior distributions assuming each star belongs to the Thick Disk and the Halo. With the absolute magnitude distributions giving corresponding distance distributions, we are able to follow the same steps as outlined in Section 5, fitting for the halo velocity components (〈vϕ 〉, σr , σϕ , σθ ) and the thick disk fraction, fdisk. We repeat this process for each realization of HALO7D and measure the median β for each of the simulated fields. We keep results from the realizations that had a measured β for the total sample that is within the 68% region of the real data's total sample (i.e., only cases where the fake star total sample's median β agrees closely with the data's ${\beta }_{\mathrm{TOTAL}}={0.57}_{-0.08}^{+0.07}$); for our 200 realizations, we find 129 of the medians fall within this region, which agrees with the expectation from a binomial distribution with p = 0.68 and n = 200. These results are summarized in Figure 15, where the black points show the median β measured for each of the 129 realizations. The dashed colored lines show the input anisotropy, β = 0.57, while the shaded gray regions show the 68% region of the results we measure for the real HALO7D data. The dashed gray lines and gray X's show the median β we measure for the real HALO7D data.

Figure 15.

Figure 15. Distributions of median anisotropy β for 129 HALO7D-like realizations for each field. The black points show the posterior median β for each realization, and the histograms on the diagonal show their 1D distribution. The dashed vertical colored lines show the input β that was used to generate the data. The gray dashed lines in the histograms and the gray X's in the scatter plots show the posterior median β we measure for each field in the real HALO7D data. The gray-shaded regions show the 68% region of the β distribution for the real data. There are only three black points that are within the 4D hypercube defined by the gray-shaded region. This implies that the differences in β that we measure for the real data are not likely explained by random chance.

Standard image High-resolution image

From the 129 simulated HALO7D realizations with total β in agreement with the input value, we find only three cases that are similar to the results we measure from the real HALO7D data. In particular, there are three realizations where the measured β is within the 68% shaded regions in all four fields. The fractions of medians within each field's 68% region separately are approximately 31%, 26%, 48%, and 38% for GOODSS, GOODSN, COSMOS, and EGS, respectively. This works out to a 1.5% probability of seeing data in all four shaded regions at the same time from chance alone, which is similar to the 3/129 realizations we measure in this region. From this analysis, we reject the null hypothesis that the anisotropy differences we see between the four HALO7D fields are likely to be explained by statistical chance.

We repeat this analysis technique for the different [Fe/H] bins. We do this by again assuming that the input velocity components are those measured using the total population in the real HALO7D data in that [Fe/H] bin (i.e., the values reported in Table 9). We again create realizations of HALO7D data, this time drawing the expected number of stars in each of the [Fe/H] bins so that the numbers of stars in each field and each bin match the totals shown in Table 6. As before, we keep only the β medians from the realizations that agree with the 68% region of the βTOTAL for that [Fe/H] bin; this works out to 140, 139, and 130 of the 200 total realizations for the high-, mid-, and low-[Fe/H] bins, respectively. For the high-[Fe/H] bin case, the β medians of the realizations are shown in Figure 16. We find that β medians fall in all the gray-shaded regions only 3.3%, 18%, and 35% of the time for the high-, mid-, and low-[Fe/H] bins, respectively; as with the real HALO7D data, we omit the GOODSS low-[Fe/H] bin measurements as well as the GOODSN high- and low-[Fe/H] bin measurements for having too few stars. From these probabilities, we can see that the anisotropy distributions we measure in the high-[Fe/H] bin are not likely to be produced by random chance alone, meaning the GOODSS, COSMOS, and EGS fields likely have different average halo properties in this [Fe/H] bin. For the mid- and low-[Fe/H] bins, these differences are not as statistically significant. In the case of the mid-[Fe/H] bin, this is in line with our expectations, because the fields have similar β measurements in the HALO7D data, suggesting that the velocity distributions of the stars in this [Fe/H] bin are more similar between the fields. For the low-[Fe/H] bin, these results tell us that the differences in anisotropy we see between COSMOS and EGS in this bin could be produced by chance alone approximately ∼two-fifths of the time.

Figure 16.

Figure 16. Same as Figure 15, but for 140 realizations in the high-[Fe/H] bin. The GOODSN panels are omitted because they had too few stars in this [Fe/H] bin for a useful analysis. There are only six black points that are within the 3D hypercube defined by the gray-shaded region in this figure. This implies that the differences in β that we measure for the real data are not likely explained by random chance.

Standard image High-resolution image

Footnotes

  • 10  

    While their definition of "in situ stars" is different than ours, the dependence on fractional contribution of in situ halo stars as a function of merger history should be similar.

  • 11  
  • 12  

    In the MIST parlance, our phase = 0 is also their phase = 0, our phase = 1 is their phase = 2, and our phase = 2 is their phase = 3 and phase = 4.

  • 13  

    We use Kr, Xe, Ar, and Ne for our red arcs, and the same elements plus Hg, Cd, and Zn for our blue arcs.

  • 14  

    The data and model in the "Normalized Flux" panel have been smoothed slightly for the purpose of visual comparison. The data used by the chemistry pipeline and shown in the "Flux Residual" spectrum and histogram are the original, unsmoothed data.

  • 15  

    While our choice of σ = 1.0 dex in the [Fe/H] distribution is larger than the scatter measured by Conroy et al. (2019b), we choose to use this simplified model as a reasonable approximation that produces a significant number of low-[Fe/H] stars.

  • 16  

    Here, ΔX = XposteriorXinput.

Please wait… references are loading.
10.3847/1538-4357/acd33d