Brought to you by:

Photometric Redshifts with the LSST. II. The Impact of Near-infrared and Near-ultraviolet Photometry

, , , , , , , , , , , , and

Published 2020 May 13 © 2020. The American Astronomical Society. All rights reserved.
, , Citation Melissa L. Graham et al 2020 AJ 159 258 DOI 10.3847/1538-3881/ab8a43

Download Article PDF
DownloadArticle ePub

You need an eReader or compatible software to experience the benefits of the ePub3 file format.

1538-3881/159/6/258

Abstract

Accurate photometric redshift (photo-z) estimates are essential to the cosmological science goals of the Vera C. Rubin Observatory Legacy Survey of Space and Time (LSST). In this work, we use simulated photometry for mock galaxy catalogs to explore how LSST photo-z estimates can be improved by the addition of near-infrared (NIR) and/or ultraviolet (UV) photometry from the Euclid, Wide-Field InfrarRed Survey Telescope (WFIRST), and/or Cosmological Advanced Survey Telescope for Optical and ultraviolet Research (CASTOR) space telescopes. Generally, we find that deeper optical photometry can reduce the standard deviation of the photo-z estimates more than adding NIR or UV filters but that additional filters are the only way to significantly lower the fraction of galaxies with catastrophically under- or overestimated photo-z. For Euclid, we find that the addition of JH 5σ photometric detections can reduce the standard deviation for galaxies with z > 1 (z > 0.3) by ∼20% (∼10%), and the fraction of outliers by ∼40% (∼25%). For WFIRST, we show how the addition of deep YJHK photometry could reduce the standard deviation by ≳50% at z > 1.5 and drastically reduce the fraction of outliers to just ∼2% overall. For CASTOR, we find that the addition of its UV- and u-band photometry could reduce the standard deviation by ∼30% and the fraction of outliers by ∼50% for galaxies with z < 0.5. We also evaluate the photo-z results within sky areas that overlap with both the NIR and UV surveys and when spectroscopic training sets built from the surveys' small-area deep fields are used.

Export citation and abstract BibTeX RIS

1. Introduction

Photometric redshifts (photo-z's; for a broad review, see, e.g., Salvato et al. 2019) are a key component to achieving the cosmological science goals of the Vera C. Rubin Observatory Legacy Survey of Space and Time (LSST), including weak lensing, galaxy clusters, and supernova host galaxies (e.g., Zhan & Tyson 2018; Ansari et al. 2019; Ivezić et al. 2019). Establishing how to obtain the most accurate photo-z possible from the LSST data set is currently an active and urgent area of study. Toward this goal, (Graham et al. 2018, Paper I in this series) presented and characterized the color-matched nearest neighbors (CMNNs) photometric redshift estimator and demonstrated its use as an efficient tool to evaluate how potential changes to LSST survey parameters directly impact the LSST photo-z results. Schmidt et al. (2020) provides a comparison of the CMNN technique to 11 other modern photo-z algorithms using a variety of metrics and mock LSST galaxy catalogs.

Increasing the wavelength coverage of a galaxy catalog's photometric data can improve photometric redshift estimates (e.g., Banerji et al. 2008; Hildebrandt et al. 2010). This is because doing so provides additional information about the galaxy's spectral energy distribution (SED). For example, when only optical filters are available, galaxies at redshifts of z ∼ 0.1 have similar colors to galaxies at z ∼ 2. This is because the change in color due to the passage of the Balmer break (at 4000 Å) through the u-band filter as a function of redshift is nearly indistinguishable from the change in color due to the Lyα break (at 912 Å). This degeneracy causes catastrophic outliers, as photo-z estimators have trouble distinguishing between very low- and very high-z galaxies. This issue is mitigated if we can jointly track the Balmer and Lyman breaks as their observed wavelength moves with redshift by extending the filter coverage into the UV and NIR.

Graham et al. (2018) used the CMNN estimator to illustrate how the LSST photo-z quality would deteriorate in regions without u- or y-band coverage; for example, the standard deviation in ztrue − zphot/(1 + zphot) increases by ∼50% at intermediate redshifts (zphot ≈ 1.5). In this work, we use the CMNN estimator to evaluate the benefit of adding near-infrared (NIR) and ultraviolet (UV) photometry to the LSST optical data when estimating galaxy photometric redshifts. We consider two potential sources of NIR photometry: the European Space Agency (ESA) Euclid mission and the National Aeronautics and Space Administration (NASA) Wide-Field InfrarRed Survey Telescope (WFIRST). We also consider one potential source of UV photometry, a proposed satellite led by the Canadian Space Agency (CSA): the Cosmological Advanced Survey Telescope for Optical and ultraviolet Research (CASTOR4 ).

ESA's Euclid mission (Laureijs et al. 2011; Joachimi 2016; Racca et al. 2016) is a planned 7 yr NIR survey, which (at the time of this analysis) is set to launch in 2021, ∼2 yr before the start of LSST operations. Euclid will cover 15,000 square degrees of sky: nearly the entire extragalactic sky with Galactic latitude $\gt | 30| $, and an avoidance zone around the ecliptic as well. Its step-and-stare survey strategy will cover the full area to a depth of YJH ∼ 24 mag (AB magnitudes, for a 5σ point source; Joachimi 2016). Revisits to one or more ∼40 square degree "deep field" will extend this limit by ∼2 mag. By 2028, approximately 7000 square degrees (∼40%) of the LSST's "wide-fast-deep" (WFD) main survey area will have overlapping NIR coverage from Euclid (see, e.g., Figure 6 of Rhodes et al. 2017). The addition of Euclid's J and H filters are expected to improve photo-z estimates at and above redshift ∼1.5, where the 4000 Å Balmer break is redshifted to 10,000 Å and begins to influence the Y − J color.

NASA's WFIRST (Spergel et al. 2015) will produce a sky survey that has ∼2200 square degrees of overlap (3.8 × 108 galaxies) with the LSST WFD main survey and 5σ point-source detection limits down to YJH ≳ 26.5 mag. WFIRST will also survey in a redder band (F184, 1.7–2.0 μm; referred to as K for simplicity in this work) to a shallower depth (≳25.5). WFIRST is currently planned for a mid-2020s launch date. The addition of WFIRST's YJH filters is expected to improve photo-z estimates for the same reason as Euclid, sampling the Balmer break at intermediate redshifts. Additionally, the redder filter at 1.7–2.0 μm should improve photo-z estimates above redshift ∼2.5, where the Balmer break influences the H − K color.

The scientific benefits of combining data from LSST, Euclid, and WFIRST has been amply demonstrated in the literature. For example, Rhodes et al. (2017) investigate a variety of science goals that benefit from the combination of LSST and Euclid data at either the pixel or catalog level. They show that improvements to the photo-z accuracy will improve the weak lensing signal in particular but will also be of benefit to classifying high-z supernovae and making cluster mass estimates. As another example, Jain et al. (2015) shows that when WFIRST NIR photometry is included in LSST photo-z estimates, the most significant improvements are reductions in the scatter between photometric (zphot) and true (ztrue) redshifts for galaxies with zphot > 1.5 and in the catastrophic outliers rate across the full range of 0.0 < zphot < 3.5. They discuss how this improvement is likely to have an impact on a wide variety of extragalactic and cosmological science goals.

The proposed CASTOR mission (Côté et al. 2012; Capak et al. 2019), led by the CSA with international partners, aims to carry out wide-field imaging in three passbands located in the UV/blue-optical region (0.15–0.55 μm). The depths and areal coverage of these surveys are currently under investigation, but for the purposes of our analysis, we consider two notional surveys: (1) the Primary Survey, a $\sim 7200$ square degree field in the north that overlaps with the Euclid-Wide survey (but possibly has a limited overlap with LSST); and (2) the Cadence Survey, ∼20 square degrees in the south that overlaps with WFIRST and LSST, in which depth is built up over many repeated visits. The 5σ limiting magnitudes for CASTOR Primary would be mUV ∼ 27.4, mu ∼ 27.4, and mg ∼ 27.1 mag, and for CASTOR Cadence, they would be mUV ∼ 29.25, mu ∼ 28.95, and mg ∼ 28.45 mag (Côté & Scott 2014). There are currently two proposed CASTOR passband pairs, UV and u, and UV-dark and u-wide; we consider both pairs in our analysis. The addition of CASTOR UV and deep u-band photometry is expected to improve the photo-z estimates for low-redshift galaxies (z < 1.5) by breaking the color degeneracy between low- and high-redshift galaxies and also by sampling the UV-upturn from the blackbody emission of evolved hot stars at ≳1000 Å (e.g., Greggio & Renzini 1990, 1999) and the Lyα break at 912 Å.

In this work, we focus on how the combination of LSST, Euclid, WFIRST, and/or CASTOR photometry at the catalog level will impact photo-z estimates at low (0.3 < z ≲ 0.6), intermediate (z ∼ 1), and high (z > 2) redshifts. The LSST Science Requirements Document (Ivezić & the LSST Science Collaboration 2011) defines the minimum deliverables for statistical measures of the accuracy of photometric redshifts derived from LSST photometry. First, the rms error in photo-z must be < 0.02(1 + zphot); second, the 3σ outlier fraction must be <10%; and third, the average bias must be < 0.003(1 + zphot). These target values apply to an i < 25 mag sample of 4 × 109 galaxies from 0.3 < z < 3.0, and they serve as suitable reference targets for this work. We therefore limit our analysis to a simulated set of galaxies with i < 25 mag and 0.3 < z < 3.0, and we include these target values as a reference benchmark in our analyses.5

In Section 2, we describe our simulation design: the simulated galaxy catalog, its optical, NIR, and UV photometry, our photo-z estimator, and how we analyze the results. In Sections 3 and 4, we evaluate the impact of including NIR photometry from Euclid and WFIRST, respectively, and in Section 5, we demonstrate the impact of including UV-, u-, and g-band photometry from CASTOR. We summarize and discuss our results in Section 6 and present several ideas for additional ways in which the space-based imaging of these missions can provide further improvements to the LSST photo-z estimates.

2. Simulation Design

In order to evaluate the impact of adding NIR and/or UV to the LSST optical photometry to estimated photometric redshifts, we use the CMNN photo-z estimator (Section 2.1) with a simulated galaxy catalog with realistic observed photometric characteristics (Section 2.2). Section 2.3 describes the statistical measures and evaluation tools that we will use in our analysis.

2.1. The CMNN Photo-z Estimator

The CMNN photometric redshift estimator that we use for this work was first introduced and described by Graham et al. (2018).6 It is not intended to provide the "official" or "best" LSST photo-z but instead to provide sets of photo-z results in which the accuracy and precision of the photo-z estimates are directly related to the precision of the input photometry. In this way, it is particularly useful for comparing the relative photo-z quality from different survey strategies that affect the LSST photometric quality of the LSST.

The CMNN photo-z estimator requires "training" and "test" galaxy catalogs: the former is equivalent to a set of galaxies with "known" or spectroscopic redshifts, and the latter is equivalent to a sample of galaxies for which photo-z will be estimated. The full catalog of simulated galaxies that we use for this work is described in Section 2.2. For this work, instead of splitting the simulated catalog into test and training sets, we employ a leave-one-out strategy: every catalog galaxy is considered as the test galaxy in turn, and the training set is composed of all of the other galaxies in the simulated catalog (Section 2.2). For the test galaxy, the estimator identifies a subset of training galaxies that are its nearest neighbors in color-space using the Mahalanobis distance7

Equation (1)

where ctrain is the color of a training-set galaxy, ctest is the color of a test set galaxy, and δctest is the measurement error in the color of the test set galaxy (Section 2.2.2). If a galaxy is undetected in a given filter, the associated color does not contribute to DM. We enforce the number of colors to be Ncolors ≥ 3 (i.e., a galaxy must be detected with at least three colors to obtain a photo-z estimate).

Once DM is measured for all training-set galaxies, we use a percent-point function (PPF) value of PPF = 0.68 and the degrees of freedom, Ncolors, to define a threshold value, ${T}_{{D}_{M}}$. Training-set galaxies with DM values below ${T}_{{D}_{M}}$ are identified as the set of CMNNs. For example, for ${N}_{\mathrm{colors}}=5$, PPF = 68% of all training galaxies consistent with the test galaxy will have DM < 5.86. The CMNN subset galaxies are then subjected to a weighted (${D}_{M}^{-1}$)8 random selection, and the redshift of the chosen training-set galaxy is adopted as the photo-z for the test galaxy. For test galaxies with less than 10 training-set galaxies in their CMNN subset (${N}_{\mathrm{CM}}\lt 10$), the 10 nearest neighbors are used. If NCM ≥ 10, the photo-z uncertainty (δz) is the standard deviation in the true redshifts of the color-matched subset; if NCM < 10, we calculate the PPF threshold value associated with the 10th nearest neighbor, and we inflate the uncertainty by $\times {\mathrm{PPF}}_{10}/0.68$.

Note that this is different from the application in Graham et al. (2018), which also used a PPF = 0.68 but chose the nearest neighbor in color-space and did not have a modification for galaxies with NCM < 10 matches (such that galaxies with zero matches simply failed to obtain a photo-z). We furthermore do not apply an optical magnitude or color "pseudo-prior" to our photo-z estimates, as was done in Graham et al. (2018), so as to not give the optical an additional influence on the photo-z results in this paper, which focuses on the addition of UV and NIR photometry. The other major difference compared to Graham et al. (2018) is that, here, the entire simulated set of galaxies is used as both a test and training set instead of assigning each galaxy to one or the other before simulating photo-z results. This has allowed us to use much larger test sets and achieve more accurate statistical measures for the photo-z results in high-redshift bins.

Incorporating the NIR filter(s) and/or UV passbands causes an increase in ${N}_{\mathrm{colors}}$, which increases the number of degrees of freedom in Equation (1). This can also increase the size of the CMNN subset of training galaxies and, at times, degrade the overall photo-z quality. This is an important aspect of the CMNN estimator to keep in mind as we explore the impact of adding NIR and UV photometry. The photometric quality of the data must be good enough to overcome the increase in degrees of freedom in the estimator in order to see a beneficial impact on the photo-z results. In this work, we encounter several instances where the photometric quality for some simulated galaxy populations is insufficient to benefit from the addition of NIR or UV filters, and the photo-z results are deteriorated. Generally, we will mitigate this issue by identifying these populations using their observed characteristics (i.e., not using true catalog redshift), and then excluding the additional filters from the photo-z estimates. Each of these instances is discussed in detail in the relevant analysis section.

2.2. Simulated Galaxy Catalog

As in Graham et al. (2018), we use a galaxy catalog based on the Millennium simulation (Springel et al. 2005)—specifically, a catalog based on the galaxy formation models of Gonzalez-Perez et al. (2014) and fabricated using the lightcone construction techniques described by Merson et al. (2013).9 This simulated galaxy catalog was designed to model the optical and NIR properties of galaxies, and with appropriate limits, it also serves as a realistic representation of future LSST catalogs. "True" catalog apparent magnitudes for optical and NIR filters ugrizy and YJHK are included in this simulated galaxy catalog (for simplicity, we refer to WFIRST's F184 filter as K in this work because it fulfills a similar role), but we must synthesize magnitudes to represent the proposed CASTOR passbands.

2.2.1. CASTOR Passbands and Simulated Photometry

To simulate "true" catalog photometry for the CASTOR passbands, we start by using the same process as in Graham et al. (2018, their Section 5.1) to assign each galaxy an SED type from the catalog of Brown et al. (2014). First, we find the SED template that best matches each catalog galaxy's rest-frame g − r color. The SEDs are sequenced from early- to late-type, numbered from 0 to 128, and we include a modest amount of intrinsic scatter by simply adding a small uniform-random integer to the assigned SED number (within ±2). Next, we flux-scale the SED to match the catalog's LSST u- and g-band apparent magnitudes and then apply the four CASTOR bandpasses10 (see Figure 1) to calculate "true" catalog magnitudes in uc, u-wide (uw), UV, and UV-dark (UVd). We do not recalculate a CASTOR gc-band true catalog magnitude because the passband shape is quite similar to the LSST g filter, as shown in the top panel of Figure 1. Instead, we will simply apply the CASTOR magnitude limits to the catalog's LSST g-band apparent magnitudes.

Figure 1.

Figure 1. Top panel: the full system transmission functions for the CASTOR UV, UVd, uc, uw, and gc bandpasses considered in this work, compared to the LSST u and g filters. Bottom panel: observed photometric colors for our simulated galaxies: CASTOR UV − LSST u vs. LSST u − g. Points are colored by true catalog redshift (as in the legend). Galaxies are clearly grouped by redshift in this 2D plane of UV-color vs. optical color, which confirms to us that the UV photometry we have synthesized will provide some redshift information to the CMNN photo-z estimates. A random 50% of catalog galaxies are included in this plot, for clarity; the banding is due to the finite number of SED templates.

Standard image High-resolution image

To validate that our synthesized CASTOR UV magnitudes are providing independent information about redshift, we plot the simulated galaxies in the color–color plane of CASTOR UV − LSST u versus the LSST u − g in the bottom panel of Figure 1. Plotted point color represents the true catalog redshift for each galaxy, and we can see that redshift is correlated with the UV − u color in a way that is unique from the u − g color. This confirms that our synthesized CASTOR UV photometry should have an influence on the estimated photometric redshifts. Since the UV magnitudes are the only passbands being synthesized in this work, we do not show similar color–color plots for Euclid or WFIRST NIR. The impact of CASTOR photometry on LSST photo-z is presented in Section 5.

2.2.2. Photometric Errors

To incorporate an LSST-like observational uncertainty into our catalog, we simulate observed apparent magnitudes from the true catalog magnitudes by adding a normal random scatter11 with a standard deviation equal to the predicted magnitude error for each galaxy. Predicted magnitude errors for the LSST, as described in Section 3.2.1 of Ivezić et al. (2019), depend on the galaxy's magnitude and the total survey exposure time elapsed in a given filter (for this work, we assume no additional components from, e.g., deblending, or the different image resolutions of the four facilities). We assume standard observing conditions, an airmass of 1.2, and a uniform survey progression that accumulates a total of 56, 80, 184, 184, 160, and 160 visits in filters ugrizy by year 10 (where each visit is 30 s of integration time). We use the LSST simulations software package described by Connolly et al. (2014) to calculate the magnitude errors; it includes a systematic floor of 0.005 in all filters. After 10 yr of the LSST survey, these accumulated visits result in 5σ detection limits for each filter of u < 26.1, g < 27.4, r < 27.5, i < 26.8, z < 26.1, and y < 24.9 (as also listed in Table 1). In the bottom panel of Figure 2, we show the magnitude error as a function of observed apparent magnitude, with magnitude cuts applied at the 5σ limiting magnitude in filters ugrzy and at i ≤ 25 mag.

Figure 2.

Figure 2. Top panel: the cumulative distribution of true catalog apparent magnitudes for the set of simulated galaxies, with the imposed constraints of i < 25 and 0.3 < z < 3.0, for each of the six LSST filters, ugrizy, the four NIR filters considered in this work, YJHK, and the CASTOR UV passband. The dashed portions of the lines mark the range of magnitudes that are below the LSST, Euclid, and CASTOR surveys' 5σ detection limits (Table 1). Note that this set of galaxies would be completely detected by WFIRST. Bottom panel: predicted magnitude error vs. the catalog apparent magnitude for the six LSST bands ugrizy, the Euclid photometry of NIR bands YJH (dashed), the WFIRST photometry for NIR bands YJHK, and the CASTOR UV band, each to the surveys' 5σ limiting magnitudes.

Standard image High-resolution image

Table 1.  Signal-to-noise Ratio = 5 Limiting Magnitudes

Filter LSSTa Euclidb WFIRSTc CASTORd
UV ... ... ... 27.4
u 26.1 ... ... 27.4
g 27.4 ... ... 27.1
r 27.5 ... ... ...
i 26.8 ... ... ...
z 26.1 ... ... ...
y/Y 24.9 24.0 26.7 ...
J ... 24.2 26.9 ...
H ... 23.9 26.0 ...
K ... ... 25.8 ...

Notes.

aLimits for a nominal 10 yr survey. bLimits from J.-C. Cuillandre, private conversation. cLimits from B. Jain, private conversation. dLimits for the primary (wide-area) survey.

Download table as:  ASCIITypeset image

To predict the photometric errors for the NIR filters of Euclid and WFIRST, and the UV/blue passbands of CASTOR, we use the expected 5σ limiting magnitudes listed in Table 1. We note that the predicted WFIRST limiting magnitudes have slightly evolved since our simulations were run. Doré et al. (2019) reports that the expected 5σ point-source imaging depths in Y, J, H, and K (F184) are 26.9, 26.95, 26.9, and 26.25, respectively—not too far off from the values we have used (except for the H band). For all three future facilities, we use the same prescription for error as described in Section 3.2.1 of Ivezić et al. (2019), namely,

Equation (2)

where $x={10}^{0.4(m-{m}_{5})}$, m5 is the 5σ limiting magnitude, m is the magnitude of the galaxy, and we use the fiducial value of γ = 0.04, which sets the impact of, e.g., sky brightness, to be zero. As with the LSST photometry, we impose a systematic floor of σrand ≥ 0.005 mag. The magnitude error as a function of apparent magnitude, down to the 5σ detection limit of each filter for the surveys considered in this work, is shown in the bottom panel of Figure 2.

The uncertainty in the apparent observed color of a galaxy is calculated as the root of the sum of the squares of the magnitude uncertainties in the two filters (i.e., the magnitude uncertainties from the two filters are added in quadrature, under the assumption of uncorrelated errors). In some parts of our analysis, we extend the cut on apparent magnitude from the 5σ to the 1σ limiting magnitude, and in all cases, we use ${m}_{1\sigma }={m}_{5\sigma }+1.75$ mag. In practice, obtaining matched aperture photometry for images with very different resolutions can lead to additional uncertainties in the combined catalogs' photometry (e.g., see Section 5.1 in Spergel et al. 2015). For this work, we must assume that this potential issue has been resolved by a sophisticated joint pixel analysis.

2.2.3. Catalog Demographics

In the top panel of Figure 2, we show the cumulative distribution of true apparent magnitudes in our simulated catalog. The line style switches from solid to dashed when the distribution passes the 5σ limiting magnitude for detection in the filter. From this, we can see that for the LSST photometry, ∼25% of the galaxies in our sample remain undetected in the u band, but all galaxies are detected in the grz bands and almost all in the y band. All galaxies are detected in the i band by design,12 since we impose a cut13 of i < 25 mag, which is brighter than the 5σ detection limit of 26.8 mag.

In Figure 2, we have included the Euclid NIR photometry in our plots of true catalog magnitude distributions (top panel) and error versus magnitude (bottom panel). We can see that the cumulative distributions of true magnitudes are very similar for Euclid Y and LSST y (top panel) but that the 10 yr LSST photometry has significantly smaller errors than Euclid's (bottom panel). Our treatment of these two y-bands in our photo-z estimates is discussed in Section 2.2.4. The photometric quality offered by WFIRST is also illustrated in the lower panel of Figure 2. The WFIRST limiting magnitudes are sufficiently deep that all catalog galaxies are detected in all four WFIRST filters.

In Figure 3, we show the redshift distributions of the simulated galaxy catalog. Since we use each catalog galaxy as the "test" galaxy in turn, the "test" and "training" sets have the same redshift distributions. This similarity helps to avoid redshift bias in the photo-z results but is not a likely property of future spectroscopic training sets, as discussed below. In the top panel of Figure 3, we show the true redshift distribution of our simulated catalog, with applied cuts of i ≤ 25 and ≤26.8 mag (most of our simulations use the former).

Figure 3.

Figure 3. Top panel: the simulated galaxy catalog's redshift distribution for galaxies with i < 25 mag (thick gray line) and i < 26.8 mag (thin black line), the LSST's 10 yr 5σ and 1σ limiting magnitudes. Middle panel: fraction of i < 25 mag galaxies detected in the CASTOR UV (pink lines) and uc (purple lines) bandpasses compared to the LSST u-band (dashed purple lines), for 1σ and 5σ detection limits (thin and thick lines, respectively). Bottom panel: fraction of i < 25 mag galaxies detected in the Euclid J (solid salmon line) and H (dashed teal line) filters, for 1σ and 5σ detection limits (thin and thick lines, respectively). Note that the thin lines are at y = 1 because catalog galaxies with i < 25 mag are completely detected by Euclid at 1σ.

Standard image High-resolution image

In the middle panel of Figure 3, we show the fraction of catalog galaxies detected in the CASTOR uc and UV passbands as a function of true redshift, compared to LSST u, for detection limits of 1σ and 5σ. We can see that the fraction of galaxies detected in the CASTOR u passband at 5σ is approximately equivalent to that detected by the LSST u filter at 1σ (thick solid and thin dashed purple lines, respectively). The sharp drop in the fraction of galaxies detected in CASTOR UV at 5σ indicates that adding CASTOR UV photometry will probably only be able to improve photometric redshift estimates for z ≲ 1.7.

In the bottom panel of Figure 3, we show the fraction of catalog galaxies detected in the Euclid J and H filters as a function of true redshift, for detection limits of 1σ and 5σ. Like the CASTOR UV passband, the impact of including Euclid 5σ photometry in the photo-z estimates will drop for z ≳ 1.7. WFIRST detection fractions are not shown in any panel because, as with Euclid 1σ photometry, all LSST galaxies with i < 25 mag are completely detected in the four WFIRST filters.

Throughout most of this work, we will use a training set that has the same distributions of redshift and magnitude as the test set, and we emphasize that this is an ideal case (but appropriate for this work, because our focus is on the impact of photometry, exclusively, on the photo-z results). In reality, spectroscopic catalogs have some inherent bias and are not perfectly representative of the test set (which is an extremely difficult data product to build; e.g., Newman et al. 2015). In some of our simulations, we consider a training set that has better or deeper photometry than a test set from a wide-field imaging campaign, and in some cases, we also change the limiting magnitudes such that the test and training sets are no longer matched—but only when the proposed surveys seem likely to provide such a data product.

2.2.4. LSST y versus Euclid Y

The LSST y- and Euclid Y-band filters will not have exactly the same transmission function (the Euclid Y-band filter will be somewhat shifted redward and have sharper edges14 ), but for the purposes of this work, we consider the two filters to contribute equivalent information about a galaxy's SED to our photo-z estimator, and we use only one or the other. This is the most appropriate choice for our particular photo-z estimator, because including colors z − y, z − Y, y − J, and Y − J when calculating the Mahalanobis distance in color-space would give that region of the SED twice as much influence in Equation (1). We leave a study of whether two nearly superimposed filters could provide additional SED information, e.g., emission line galaxies, for future work.

In Figure 4, we plot the apparent magnitude error versus apparent magnitude for Euclid Y-band and the LSST y-band at survey years 1, 2, 5, and 10. It is clear that LSST's y-band photometric quality will be approximately equivalent to Euclid's Y-band with the LSST 2 yr data release. In this work, we do not simulate the LSST photo-z results at phases <2 yr, so in all cases, our simulations use the LSST y-band instead of the Euclid Y-band.

Figure 4.

Figure 4. Magnitude error vs. apparent magnitude for the Euclid Y-band (black solid line) and the LSST y-band at years 1, 2, 5, and 10 (black to light gray dashed lines). All lines are cut off at the 5σ detection limit. It is evident that by year 2 of LSST, its y-band photometry will have a better accuracy than that of the Euclid Y band.

Standard image High-resolution image

As a final note, the y-band filter of any ground-based facility like LSST will be impacted by a variable water absorption feature. However, it should be possible to correct the y-band colors to a fiducial filter system to within acceptable uncertainties, given a measure of the precipitable water vapor and a rough SED shape determined from the other five LSST bands (S. Schmidt 2020, private communication). Thus, we have not explicitly included the effects of water vapor on the y-band photometry in this work.

2.3. Evaluation Methodology

To the results of our simulations, we apply a similar evaluation methodology as used in Graham et al. (2018). Since we are using the CMNN estimator to predict the relative improvement in photo-z due to changes in the input photometry (and not to predict absolute qualities), we do not extend our analysis to include predictions for cosmological parameters but suggest that readers consult Nakajima et al. (2012) and/or Mandelbaum (2018) for discussions of how photo-z errors impact cosmological analyses. Below, we describe first our statistical measures of photo-z quality and then the common plot styles that we will use to represent our results in this work.

2.3.1. Statistical Measures

In our statistical analysis, we use the photo-z error ${\rm{\Delta }}{z}_{(1+z)}=({z}_{\mathrm{true}}-{z}_{\mathrm{phot}})/(1+{z}_{\mathrm{phot}})$, where ztrue is the "true" catalog redshift, and zphot is the photo-z. Including a factor of (1 + z) in the denominator compensates for larger uncertainties at high-z and provides a comparable fractional error across our redshift range. For all of our results, we calculate the robust standard deviation in ${\rm{\Delta }}{z}_{(1+z)}$ as the FWHM of the interquartile range (IQR) divided by 1.349 (σIQR) and the robust bias as the mean value of ${\rm{\Delta }}{z}_{(1+z)}$ in the IQR ($\overline{{\rm{\Delta }}{z}_{(1+z),\mathrm{IQR}}}$). We bootstrap our uncertainties onto these statistical measures by randomly drawing galaxy subsets and recalculating the statistics 1000 times, and we then use the standard deviation of all recalculations as the error in the statistical measure.

Outlier galaxies are identified as those with ${\rm{\Delta }}{z}_{(1+z)}\gt 3{\sigma }_{\mathrm{IQR}}$ or ${\rm{\Delta }}{z}_{(1+z)}\gt 0.06$, whichever is larger, where σIQR is calculated from all galaxies in 0.3 ≤ zphot ≤ 3.0, i.e., outliers are defined globally. Since σ changes with redshift, outliers could instead be defined locally, as is done in other photo-z analyses; however, we have adopted this global definition because it is equivalent to the LSST SRD's 3σ outlier, as mentioned in Section 1. Catastrophic outliers with $| {z}_{\mathrm{phot}}-{z}_{\mathrm{spec}}| \geqslant 2$ are removed from the calculation of standard deviation and bias to keep these two statistical measures as representative of galaxies with "good" photo-z estimates. This removal is appropriate because catastrophic outliers are generally recognizable by the uncertainty in their photo-z error returned by the CMNN estimator (and their photometric colors) and could be removed or flagged in a sample of real LSST photometric redshifts as well. Catastrophic outliers are included in the fraction of outliers statistical measure, though.

Generally, we discuss our results in three main redshift ranges: low (0.3 < z < 0.6), intermediate (z ∼ 1), and high (z > 2). We do not include galaxies with z < 0.3 in our statistical evaluations, as mentioned in Section 1, but also—for interest's sake—we do not exclude the low-z galaxies from our plots.

2.3.2. Plot Styles

To visualize our photo-z results, we create plots that compare the true versus photometric redshifts. We use a two-dimensional histogram with log-normalized shading (such that black is always the most populated bin) in the densely sampled areas of ztrue versus zphot space, and we over-plot outlier galaxies with transparent red dots (see, e.g., Figure 5). We also draw a solid line representing ztrue = zphot to guide the eye. These plots are useful for obtaining a global sense of the photo-z quality and structure in the outliers positions, especially the features that are perpendicular to the ztrue = zphot, which represent photo-z degeneracies caused by the Balmer break passing between filters.

Figure 5.

Figure 5. True vs. photometric redshifts when photometry from a 10 yr LSST survey is used alone (top panel), and when Euclid NIR JH filters' 5σ and 1σ photometry are included (middle and bottom panels, respectively). The two-dimensional histogram's scale is log-normalized, and the most populous bin is black. The red points are outlier galaxies.

Standard image High-resolution image

To directly compare the statistical measures of photo-z quality for our different simulations, we generate plots of the robust standard deviation, robust bias, and fraction of outliers as a function of binned zphot (see, e.g., Figure 6). We use a bin width of 0.3 and a bin spacing of 0.15 in zphot, such that bins overlap. Small vertical error bars mark the error in the statistical measurements, and wide horizontal bars mark the value of the statistic over the full range of 0.3 ≤ zphot ≤ 3.0. Dashed lines are often used to represent the SRD's target values in these plots.

Figure 6.

Figure 6. Statistical measures for LSST photo-z results show the impact of including Euclid NIR photometry with 5σ and 1σ detection limits (left and right column panels, respectively). Panel rows, from top to bottom: the robust standard deviation, robust bias, and fraction of outliers in bins of zphot. The green and blue lines represent photo-z results based only on LSST photometry with 5σ limiting magnitudes equivalent to a baseline survey strategy at 2 and 10 yr, respectively. The red and orange lines represent the photo-z results at LSST years 2 and 10 when Euclid JH photometry is added (thin lines are the results when NIR photometry is included for all galaxies even if it does not help the photo-z estimate, as described in the text). The horizontal solid lines represent the statistics over the full range of 0.3 ≤ zphot ≤ 3.0, and the dashed lines show the LSST's SRD target values as a reference point.

Standard image High-resolution image

3. ESA's Euclid Space Telescope

Euclid's launch is planned to be in 2021, and LSST is set to start operations in late 2022. In 2024, the 1 yr LSST photometry will be available to be combined with the 3 yr Euclid data set, the 2 yr LSST data with the 4 yr Euclid data, and so forth. Assuming that the LSST adopts a survey that uniformly progresses the depth of each annual data release for the WFD Main Survey of 18,000 square degrees, we list the 5σ i-band limiting magnitude at survey years 1, 2, 5, and 10 in Table 2. Euclid will adopt a step-and-stare strategy that surveys a new area to full depth every year, as illustrated in Figure 6.11 of Laureijs et al. (2011). We list the Euclid-Wide survey area that will be complete after 3, 4, and 7 yr in Table 2. Once the Euclid-Wide survey is complete, it will have an overlap area of ∼40% of the LSST's 18,000 square degree main survey (Rhodes et al. 2017). Since Graham et al. (2018) demonstrated that the LSST y-band data primarily helps improve the photo-z quality for galaxies with 1 < zphot < 1.5, we expect the addition of Euclid data to improve the results for at least zphot > 1.

Table 2.  Anticipated Progression of LSST and Euclid

Calendar LSST Main Survey Euclid-Wide Survey
Year Year Depth (i, 5σ) Year Area Covered
    (mag)   $({\deg }^{2})$
2024 1 25.6 3 ∼2300
2025 2 26.2 4 ∼4300
2027 5 26.5 7 ∼7200
2032 10 26.8 7 ∼7200

Download table as:  ASCIITypeset image

We estimate photometric redshifts for simulated galaxies within the 7 yr Euclid footprint and evaluate how the impact of adding Euclid photometry changes as the LSST survey adds depth. For the training set, we assume that a field of galaxies with spectroscopic redshifts has been observed by both surveys. We simulate the observed apparent magnitudes and their uncertainties using the LSST 10 yr 5σ limiting magnitudes and Euclid's 5σ limiting magnitudes for JH, as listed in Table 1. We apply detection threshold cuts in observed apparent magnitude of i < 25 mag, the LSST 10 yr 5σ limiting magnitude for ugrzy, and Euclid's 5σ limiting magnitudes for JH.

For the test sets, we simulate the observed apparent magnitudes and their uncertainties using the 2 and 10 yr 5σ LSST limiting magnitudes and the Euclid 5σ limiting magnitudes. The LSST's 2 yr depths are 25.2, 26.5, 26.7, 25.9, 25.2, and 24.0 mag in filters ugrizy, respectively. We apply detection threshold cuts in observed apparent magnitude of i < 25 mag, the LSST 2 or 10 yr 5σ limiting magnitude for ugrzy, and Euclid's 5σ or 1σ limiting magnitudes for JH. The Euclid 1σ limits are 25.95 and 25.65 mag in J and H, respectively. Including galaxies detected at 1σ is the best way to incorporate Euclid "non-detections" with the CMNN estimator, because there is not a simple way to include upper limits in the Mahalanobis distance (Section 2.1).15 Recall from Figure 3 that our set of simulated galaxies with i < 25 mag is completely detected in the Euclid NIR filters at 1σ but >50% incomplete at redshifts ≳1.7 with a 5σ cut.

In Figure 5, we plot the true versus photometric redshift results for a 10 yr LSST survey alone and when Euclid detections at 5σ and 1σ are included. These plots show how the addition of Euclid data reduces the number of outliers (as defined in Section 2.3.1), especially the catastrophic outliers with overestimated photo-z (the cloud of red points in the lower-right corner). The fact that Euclid also reduces the scatter around ztrue = zphot for non-outlier galaxies is almost imperceptible in these plots, but in Figure 6, we show the standard deviation, bias, and fraction of outliers as a function of binned photo-z for LSST year 2 and 10, with and without the addition of 5σ or 1σ Euclid photometry. When considering Figure 6, it is important to remember that the definition of an outlier depends on the standard deviation and includes catastrophic outliers ($| {z}_{\mathrm{true}}-{z}_{\mathrm{phot}}| \geqslant 2$), whereas the robust standard deviation and bias exclude catastrophic outliers. This leads to the standard deviation and fraction of outliers being inversely correlated in some of our results.

With the CMNN photo-z estimator, there can be instances when additional filters increase the number of degrees of freedom in Equation (1), but the photometric quality is insufficient to produce an improvement in the photo-z results (as discussed in Section 2.1). We find that the addition of Euclid data to the 2 yr LSST photometry is one of those cases. In Figure 6, the thinner red lines represent the photo-z quality when the Euclid photometry is added to the 2 yr LSST photometry for all galaxies, and we see that this leads to a larger standard deviation in the redshift bins 0.5 < zphot < 1. The simplest way to mitigate this issue with the CMNN estimator is to not include additional photometry if it is unlikely to improve the photo-z estimate. We explored our simulated data and found that the photo-z error ${\rm{\Delta }}{z}_{1+z}$ was likely to be larger (i.e., the photo-z was less accurate) if (1) the photo-z uncertainty was larger when NIR data was included and (2) the galaxy had a bluer color in the bluer filters. Based on this, we imposed a restriction that the NIR photometry be excluded from the photo-z estimate for galaxies with $u-g\lt 0.5$ or $g-r\lt 0.4$ unless it produced a smaller photo-z uncertainty than the LSST optical filters alone. We find that with this restriction, the apparent deterioration of the results is mitigated, as shown by the thicker red lines in Figure 6. This restriction still allows the NIR photometry to contribute to the photo-z estimates of most simulated galaxies, as shown in Figure 7. There remains a small deterioration in the standard deviation in redshift bins $0.6\lt {z}_{\mathrm{phot}}\lt 0.8$ (thick green versus red line in the top right plot), which suggests that our restriction could be further optimized, but we do not make any more complicated restrictions on the addition of NIR photometry at this time. We find that this restriction is not as necessary for the 10 yr LSST photometry (i.e., the thin and thick orange lines in Figure 6 show only minor differences).

Figure 7.

Figure 7. The fraction of galaxies in each redshift bin that were detected by Euclid and for which Euclid NIR photometry was deemed to be worth including in the photo-z estimate for a given simulation (as labeled in the legend). Galaxies were considered detected by Euclid if they had J < 24.2 or J < 25.95 mag for the 5σ and 1σ detection limits (all simulations), and/or if they had Y < 24.0 mag (northern survey simulations).

Standard image High-resolution image

Figure 6 demonstrates that incorporating the 5σ Euclid photometry will improve both the 2 and 10 yr LSST photo-z results at intermediate and high redshifts, zphot > 1. This improvement is larger and extends to higher redshifts when the Euclid detection limit is extended to 1σ. This improvement was expected because intermediate redshifts are where the 4000 Å Balmer break becomes redshifted beyond the optical filters. The NIR colors provide the location of the break and prevent truly low-z galaxies from being assigned a high-zphot. Figure 6 also shows that the 1σ Euclid photometry improves the results at low redshifts, zphot < 0.5, for the same reason.

Quantitatively, Figure 6 shows that including Euclid 5σ JH photometry could provide a $\sim 20 \% $ reduction in standard deviation (at LSST years 2 and 10) for redshifts zphot > 1, and ∼6% (year 2) and ∼10% (year 10) reductions over the full redshift range (0.3 < zphot < 3.0). For the bias, Euclid offers a ∼25% reduction at LSST year 2 for zphot > 1, but this has little impact in the later years of the survey. Most remarkably, Euclid would provide a ∼25% (year 2) and a ∼40% (year 10) reduction in the fraction of outliers for zphot > 1, and a ∼6% (year 2) and a ∼25% (year 10) reduction over the full redshift range. If Euclid 1σ detections are included, the reduction in the fraction of outliers increases to ≳50% for zphot > 1, and ∼40% over the full redshift range for LSST year 10.

Given the clear and beneficial impact of Euclid photometry on LSST photo-z estimates, the question arises of which has more of an impact on the photo-z estimates: adding Euclid photometry for more LSST galaxies as the overlap area increases over time or improving the LSST ugrizy photometry with the yearly progression in imaging depth? With respect to the bias and standard deviation, we find that improving the optical depth typically has a greater impact. There is one notable exception to this: for LSST year 5 (not shown in our plots), the addition of Euclid 1σ photometry could bring the standard deviation down to the 10 yr values in a limited range of redshift bins (1.3 ≲ zphot ≲ 1.6 and zphot > 2.7). With respect to the fraction of outliers, however, for LSST at year 5, we find that the addition of Euclid 1σ photometry brings the fraction of outliers down to the 10 yr values for all galaxies with zphot > 1. Generally, we find that reducing the outliers at low and high redshifts is better accomplished with Euclid than by obtaining deeper LSST imaging (e.g., as seen in the bottom plots of Figure 6).

As a final note, we consider whether the photo-z improvements offered by Euclid NIR photometry might instead be achievable by obtaining more images with LSST in just the z and y filters. We simulate LSST-only photo-z results with deeper z and y photometry and find that in order to decrease the standard deviation over the full redshift range of 0.3 < zphot < 3 to a value below that offered by combining LSST and Euclid 5σ photometry, the LSST would have to more than double the total amount of integration time in both z and y. This would require another ∼2 yr of survey to cover 15,000 square degrees in the two filters and still would not fully replicate the benefits of Euclid NIR photometry: additional depth in LSST z and y mainly improves the standard deviation for zphot < 1.5, whereas Euclid improves it in the higher-zphot bins and provides a more drastic reduction in the fraction of outliers.

3.1. The Impact of a Deeper Training Set

Both the LSST and the Euclid surveys will include smaller areas in which a significantly larger number of images are obtained and deeper imaging stacks will be created. For the LSST deep-drilling fields (DDFs), Ivezić et al. (2019) describes a potential observing strategy that would obtain ∼25 30 s consecutive visits in the griz filters every other night for four months in order to find high-z supernovae. This would generate up to an additional 1500 visits in each filter compared to the WFD main survey. There will be at least four LSST DDFs, totaling ∼40 deg2. Since Graham et al. (2018) demonstrated that the u- and y-band filters are important for the quality of photometric redshifts, we assume that the observing strategy for any DDF to be used as a photo-z spectroscopic training set would include, e.g., 500 visits in each of the filters u and y, and 1000 visits in each of the filters griz. The 5σ limiting magnitudes of such a DDF deep stack would be 27.3, 28.8, 28.4, 27.8, 27.1, and 25.5 in filters ugrizy, respectively. Euclid is projected to have at least two deep fields that total ∼ 40 deg2 in which the detection limit extends by ∼2 mag (e.g., Joachimi 2016). The 5σ limiting magnitudes of the Euclid deep fields would be 26.0, 26.2, and 25.9 mag in each of the filters YJH.

We simulate a deep training set with apparent observed magnitudes using these 5σ limiting magnitudes and apply cuts in apparent magnitude at these 5σ limits in all filters except the i band, where we retain the cut of i < 25 mag because obtaining a representative set of spectroscopic redshifts for galaxies fainter than this is difficult and not guaranteed.16 The i < 25 mag restriction also keeps this deep training set approximately matched to the test set in terms of its i-band magnitude and redshift distribution, which helps to avoid introducing a bias in the CMNN photometric redshift results. The photometric quality of the training set is thus improved in two ways: (1) the photometric errors and observational scatter are reduced and (2) more galaxies are detected in more filters. We use a test set of galaxies with simulated photometric quality equivalent to a 10 yr LSST survey and the main Euclid survey. For both test and training sets, the LSST y-band filter is used instead of the Euclid Y band.

Figure 8 demonstrates the impact of using a deep training set on the photo-z results by plotting the statistical measures of standard deviation, bias, and fraction of outliers as a function of redshift. As in the previous section, the thin lines represent the results when the NIR photometry is included for all galaxies without a restriction, and the fraction of galaxies for which the NIR photometry is included in the photo-z estimates for this simulation is shown in the bottom panel of Figure 7. As expected, the standard deviation, bias, and fraction of outliers are all improved by the use of a deeper training set and the inclusion of Euclid photometry. The greatest impact is seen in the intermediate- to high-z galaxy bins (zphot > 1), where the standard deviation is improved by ∼25% with the use of a deep training set and then an additional ∼25% by the inclusion of Euclid photometry—but there is also significant improvement over the full redshift range of 0.3 < zphot < 3.0. Note that the bias in the high-z bins switches from a slight overestimate (bias < 0) to a slight underestimate (bias > 0) when a deep training set is used. The alleviation of overestimates with a deeper training set is likely because the CMNN subsets of training galaxies will contain fewer high-z interlopers when the photometric quality is improved.

Figure 8.

Figure 8. Statistical measures of photo-z results for a 10 yr LSST survey without (blue) and with (orange) Euclid photometry included and for a deeper training set derived from well-covered LSST (green) and Euclid areas (red). The thin and horizontal lines have the same meaning as those in Figure 6.

Standard image High-resolution image

To explore the potential impact of a deeper training set on the photo-z results earlier in the LSST survey, we repeat the simulation with test sets of a photometric quality equivalent to 2 and 5 yr, with and without Euclid JH photometry. We find that the deeper training set alone (no Euclid included) mitigates the bias at low- and intermediate redshifts at years 2 and 5 (the unmitigated bias can be seen in the middle plots of Figure 6). The use of a deeper training set at earlier times also helps to lower the fraction of outliers, but not as much as the addition of Euclid data. The biggest impact of the deep training set at early times is in the standard deviation: the year 2 results are significantly reduced across the full redshift range, approaching the year 5 results for zphot > 1.7, and they are better than the year 5 results for zphot < 0.6. When we use both a deep training set and Euclid photometry, the photo-z standard deviation at year 2 (5) is equivalent to or better than the results at year 5 (10) across the full redshift range (0.3 < zphot < 3.0).

3.2. LSST+Euclid Results for a Shallow Northern Field

Rhodes et al. (2017) propose that LSST cover an additional northern survey area of ∼ 3000 square degrees in order to increase the amount of overlap with Euclid. This proposed northern survey is shallower than the LSST main survey, with a total of 43 visits in filters griz. We test what the photo-z results would be for this northern survey by simulating photometry with 6, 13, 13, and 11 visits in each of filters g, r, i, and z, respectively (i.e., distributing the 43 visits using the same relative fractions for each filter as the WFD survey). The LSST 5σ detection limits in griz for this shallow survey would be 26.0, 26.1, 25.4, and 24.6 mag, respectively. This is significantly deeper than PanSTARRS, which goes to 23.3, 23.2, 23.1, and 22.3 mag in its griz filters (Chambers et al. 2016). For now we ignore the degradation of the photometry due to the fact that most of these visits will be at higher airmass (all photometric errors used in this work are for airmass 1.2, as described in Section 2.2.2). This is acceptable here because our main goal is to evaluate how NIR photometry affects the photo-z in this shallow northern survey, and not the absolute quality of photo-z results for such a survey.

We simulate a test set with apparent observed magnitudes using the LSST 5σ limiting magnitudes quoted above and the Euclid 5σ limiting magnitudes for filters YJH. Note that Euclid Y-band is used here because there would be no LSST y-filter coverage in this shallow northern survey. We apply cuts in apparent magnitude of i < 25 mag and at the 5σ detection limit for all other LSST and Euclid filters. For the training set we simulate photometry using the LSST 10 yr and Euclid main survey 5σ limiting magnitudes, and apply cuts in apparent magnitude of i < 25 mag and at the 5σ limits for the other filters. Note that in this simulation the training set is deeper than the test set, which could lead to a bias in the photo-z results.

In Figure 9 we show the statistical measures of standard deviation, bias, and fraction of outliers for photometric redshifts based on the photometry of this proposed shallow northern survey (and for context we also show the LSST 10 yr equivalent results). As in previous sections, the thin lines represent the results when the NIR photometry is included for all galaxies, regardless of whether it might deteriorate the photo-z estimate. We find that the shallow northern survey is similar in depth to the LSST year 1 results (not shown in this work, but for reference see the year 2 results in Figure 6). Since plenty of science goals will be achievable with the LSST 1 yr data release photometric redshifts (e.g., as demonstrated by Graham et al. 2018), the same applies to a shallow northern LSST survey area. As also seen for the LSST year 2 results, the photo-z results for galaxies with zphot > 1 are all improved when Euclid NIR is added to LSST photometry from a shallow northern survey. In particular, at redshifts 1 < z < 2.5 the addition of Euclid photometry to a shallow northern LSST survey significantly improves the photo-z results, decreasing the standard deviation, fraction of outliers, and absolute bias by up to ∼30% (Figure 6).

Figure 9.

Figure 9. Statistical measures of photo-z results for a 10 yr LSST survey without (blue) and with (orange) Euclid photometry included, compared to the results for a shallow northern LSST survey area (green) that overlaps with Euclid (red) and a CFHT u-band (purple) survey. The thin and the horizontal lines have the same meaning as in Figure 6.

Standard image High-resolution image

Another option for this region will be to include photometry from the the Canada–France Imaging Survey (CFIS), which has been covering this area of the northern sky in the u-band with the Canada–France–Hawaii Telescope (CFHT) as a continuation of the Legacy for the U-band All-sky Universe17 program, for a wide variety of scientific applications. Ibata et al. (2017) present the first results from the CFIS u-band component, which has a 5σ limiting magnitude of ∼24.4 mag (i.e., approximately equal to 2 LSST u-band visits). To evaluate the impact of CFIS-u coverage for this shallow northern survey, we simulated photo-z results and include them in Figure 9. We find a small decrease in the standard deviation in the lowest redshift bin (zphot ∼ 0.5) compared to using LSST and Euclid photometry only, and a more significant improvement at zphot > 1.7 where the u-band data helps to resolve degeneracies between low- and high-z galaxies. Similar results for the impact on photo-z of adding CFHT u-band photometry to grizy survey data—in this case from the Hyper Suprime-Cam Subaru Strategic Program—were presented by Sawicki et al. (2019). They demonstrated that the photo-z were most improved at z < 0.75 and z > 2, e.g., with ∼25% and ∼40% reductions in standard deviation, respectively.

As a final note, if this shallow northern survey was done as an extension of the WFD survey area (and not, e.g., as a separate mini-survey), then it would remove approximately 1.2 visits per filter from all fields within the main survey's 18,000 square degrees. This is a small enough fraction that we do not simulate the overall impact of this loss of depth on the photo-z for the LSST main survey.

4. NASA's WFIRST Mission

The NASA WFIRST survey will have 2200 square degrees of overlap with the LSST main survey, produce significantly deeper catalogs of YJH photometry than Euclid (Table 1), and will also have a redder (F184) filter, which we will refer to as K. The LSST-WFIRST overlap region is projected to be ∼12% of the total LSST main survey of 18,000 deg2, and all galaxies detected by LSST with i < 25 mag within 0.3 < z < 3.0 would be detected by WFIRST in all four filters, YJHK (as described in Section 2.2.3). WFIRST is thus expected to provide significant improvements to the photometric redshifts for the LSST galaxy catalog. For this simulation we use the same test and training-set catalogs as in Section 3, with three main differences: (1) the addition of a redder filter to the set of colors included in the Mahalanobis distance in color-space (F184, referred to here for simplicity as K); (2) the use of WFIRST Y instead of LSST y because the former is significantly deeper; and (3) the use of the WFIRST 5σ limiting magnitudes to simulate the NIR photometry. Cuts in apparent magnitude are applied using the 5σ limiting magnitudes for all filters except i-band, for which we use i < 25 mag. Recall from Section 2.2.2 that the predicted limiting magnitudes of the WFIRST image depths were recently updated to be deeper (Doré et al. 2019) than we have used for this simulation, so our results should be interpreted as an assessment of WFIRST's minimum impact on LSST photo-z estimates.

For this simulation we do not show a true- versus photometric-redshift plot (as in Figure 5) because the addition of WFIRST photometry tightens up the scatter around the ztrue = zphot locus so much—except for the spurs at z ≲ 0.5 that are also seen in Figure 5—that the results are better demonstrated by the statistical measures. In Figure 10 we show how the standard deviation, bias, and fraction of outliers are all drastically improved when WFIRST photometry is included. The standard deviation decreases across all redshift bins, and by ∼60% for redshifts zphot ≳ 1.5, and the absolute bias is lowered to be ≲0.001 across all bins. The catastrophic outlier galaxies at low (zphot < 1) and high (zphot > 2.5) redshift, which are caused by degeneracies between optical color and redshift, are almost completely eliminated.

Figure 10.

Figure 10. The statistical measures of standard deviation (top panel), bias (middle panel), and fraction of outliers (bottom panel) for our photo-z results with: LSST photometry only (blue); Euclid 1σ (orange; with the restriction described in Section 3); WFIRST YJH (green); and YJHK (red) 5σ photometry; and WFIRST YJHK shallower by 1.2 mag (gray). Style conventions are the same as those in Figure 6.

Standard image High-resolution image

The primary reason for the improved photo-z statistics is the extreme depth of the WFIRST photometry, but the additional redder filter also makes a unique contribution. In Figure 10, we show separately the results when WFIRST filters YJH only, or YJHK, are included. We find that the additional redder filter lowers the standard deviation in the highest-redshift bins (z > 2.5) by ∼13%, a small but not insignificant amount (i.e., the change is larger than the error bars in Figure 10). The K filter has an impact at z > 2.5 because this is where the 4000 Å Balmer break would start to influence the H − K color. For comparison, we also show the results when Euclid photometry is included. Euclid will be significantly shallower than WFIRST, and so, it does not improve the photo-z estimates as much, but Euclid will begin operations much sooner and cover a wider area than WFIRST. Each survey will make unique contributions to cosmological analyses involving LSST photo-z.

The planned WFIRST survey area (∼2200 deg2) is significantly smaller than that of Euclid and LSST (∼7200 and ∼18,000 deg2) but could be made wider at the expense of depth. To investigate this, we also simulate photo-z estimates that include photometry from a shallow WFIRST survey for which the limiting magnitudes (Table 1) are all reduced by 1.2 mag, to 25.5, 25.7, 24.8, and 24.6 in filters YJHK, respectively, in order to extend the WFIRST survey area to cover the entire LSST footprint. The results are represented by the gray lines in Figure 10. We find that the standard deviation, bias, and fraction of outliers for galaxies with z < 0.5 would be just slightly (∼5%–10%) lower than the results with Euclid JH. However, around z ∼ 1, the standard deviation and fraction of outliers would be nearly equivalent to the quality of a deep WFIRST survey (as is the bias for z > 1). At high redshifts, a shallow WFIRST would reduce these statistics by at least twice as much as Euclid and, most notably, reduce the fraction of outliers to <3% out to z ∼ 2.5.

Given the considerable depths of the WFIRST photometry, we investigated the quality of photo-z estimates based solely on WFIRST filters YJHK. We concluded that they are not scientifically usable, and so, we have not shown the results in a plot. We found that the statistical measures for zphot < 1.5 would be very large (standard deviation ∼0.2 and absolute bias ∼0.05), and only for 2.0 < zphot < 2.5 do the accuracy and precision approach those of photo-z from optical and NIR photometry combined (standard deviation ∼0.05 and absolute bias ∼0.01).

4.1. A Deeper Test Set

The survey area of overlap between LSST and WFIRST may only be 2200 square degrees, but this area will be observed to an unprecedented depth, which might enable alternative science goals. Here, we explore the impact on photo-z of adding WFIRST photometry to a deeper set of LSST galaxies that extends to i < 26.8 mag, the LSST i-band 5σ detection limit, instead of i < 25 mag as applied in the rest of this work. In this simulation, we keep the i < 25 mag cut applied to the training set because it is difficult to obtain large representative samples of galaxy spectra much deeper than that. We expect that this mismatch in depth between test and training set might induce a bias in the photo-z results: that the photo-z's are biased toward low-zphot (under-predicted, on average) because the brighter training set has a lower average redshift.

In all of our simulations in this work, we impose the constraint that a galaxy must have three colors in order to obtain a photo-z estimate, but here, we furthermore constrain that those colors must be optical. This constraint prevents the situation where the photo-z results are deteriorated by a large number of faint, i > 25 mag galaxies that are detected only (or mainly) in the WFIRST filters—a population that would be cut from all of our other simulated test sets. This optical colors constraint ensures we are "comparing apples to apples" in this simulation and evaluating only the impact of adding WFIRST photometry to an LSST galaxy catalog (and not adding WFIRST-detected galaxies to an LSST catalog).

In Figure 11, we show the standard deviation and bias for a test set of i < 26.8 mag galaxies with LSST photometry only and when WFIRST YJHK photometry is included. For comparison, we also show the results for a test catalog with i < 25 mag. As expected, the overall photo-z results are poorer when fainter galaxies are included, but the addition of WFIRST photometry reduces the standard deviation by ≳60% for 1 < zphot < 2. As also expected, using an i < 25 mag training set for this i < 26.8 mag test set induced a bias in the photo-z results at zphot ∼ 1.5 and in the highest-z bins, but we find that the inclusion of the WFIRST photometry alleviates the bias. We do not show the plot for the fraction of outliers but find that, as expected, the addition of WFIRST photometry to a faint test set reduces the fraction of outliers by ∼60% at intermediate to high redshifts (zphot > 1.5).

Figure 11.

Figure 11. The standard deviation (top panel) and bias (bottom panel) for photo-z results for a simulated test set of galaxies with i < 25 mag (blue), i < 26.8 mag (green), and i < 26.8 mag but with WFIRST YJHK photometry included (red).

Standard image High-resolution image

In Section 3, we found that the additional degrees of freedom from adding NIR photometry could deteriorate the photo-z results, especially for test sets with poorer photometric quality. Despite this simulation including fainter i < 26.8 mag galaxies, we find no such deterioration when adding WFIRST photometry, and so, we do not need to apply any restrictions as was done in Section 3.

5. The Proposed CASTOR Mission

The proposed CASTOR mission would perform an imaging survey that would overlap with the LSST survey area and provide an additional passband and deeper photometry that could be included in photo-z estimates. CASTOR currently has two proposed UV/u bandpass pairs, UV/uc and UVd/uw, and a gc band (as shown in the top-left panel of Figure 1). There are several proposed surveys being considered for CASTOR, but we focus on two: (1) a primary survey that would cover the 7200 square degree region defined by the overlap of the LSST WFD and Euclid-Wide surveys (which also includes the WFIRST High Latitude Survey), and (2) the CASTOR-Cadence survey that would provide multi-epoch, much deeper imaging in a ∼20 square degree region that overlaps with the LSST Deep Fields and WFIRST in the southern skies. The schedule for the CASTOR mission is not yet set, but it is not likely to launch sooner than 2027. In this analysis, we only evaluate the final combined surveys' photometry. Since Graham et al. (2018) demonstrated that LSST u-band data primarily helps improve the photo-z quality for galaxies with zphot < 0.6 and zphot > 1.5, we might expect the addition of CASTOR data to have the largest impacts at low and high z.

5.1. Testing CASTOR Passbands for Photo-z

To explore which UV/u bandpass pair would be best for photometric redshifts, we simulate photo-z results with the CASTOR-Primary survey limits (Table 1), imposing i < 25 mag for the test and training set as usual. We show the standard deviation as a function of binned zphot in Figure 12. As we found with the addition of Euclid photometry (Section 3), including NUV photometry for all galaxies caused a slight deterioration in the standard deviation for galaxies with 1.5 < zphot < 2 (thin lines in Figure 12). To mitigate this, we only include the CASTOR photometry if it lowers the estimated photo-z uncertainty and is thus likely to improve the photo-z estimate. The statistical results after this restriction has been imposed are represented by the thicker lines in Figure 12. Figure 13 shows the fraction of galaxies that are detected by CASTOR and have their CASTOR photometry included in the photo-z estimate. We find that the CASTOR UV passband is useful for up to ∼40% of all simulated galaxies, and the deeper u band is useful for ∼40%–60% of simulated galaxies.

Figure 12.

Figure 12. The standard deviation as a function of redshift bin for the photo-z results of using the CASTOR UV passband (green) or the two proposed CASTOR bandpass pairs—UV and uc (hot pink) or UVd and uw (purple)—compared to the LSST ugrizy filter set only (blue). The thin lines represent the results when NUV photometry is included for all galaxies regardless of its impact on the photo-z estimate.

Standard image High-resolution image
Figure 13.

Figure 13. The fraction of galaxies in each redshift bin that were brighter than CASTOR's 5σ limiting magnitude, i.e., ${UV}\leqslant 27.4$ mag (green) and/or u ≤ 27.4 mag (pink and purple), and for which the CASTOR photometry was included in the photo-z estimate because it lowered the estimated photo-z uncertainty. Shown for each of our simulations in Figure 12, as labeled in the legend.

Standard image High-resolution image

The statistical measure of the standard deviation shown in Figure 12 demonstrates that all of the proposed CASTOR bandpasses could provide an improvement of ∼10% across the full redshift range. At low redshifts, z < 0.6, we see that the deeper u-band photometry provides a larger reduction in the standard deviation than the addition of the UV passband alone. This is because for low-z galaxies, information about the Balmer break at 4000 Å is conveyed via the u − g color, and the UV − u color is less indicative of redshift.

We find that, surprisingly, our restriction to only include CASTOR photometry when it decreases the photo-z uncertainty actually causes a small increase in the standard deviation in the lowest-z bins, when either the CASTOR UV/uc or UVd/uw filter pair is used. This indicates that the CASTOR photometry can improve the reported precision of some photo-z estimates while degrading their accuracy (this issue is also discussed in Section 5.2). This could be mitigated by applying further conditions to our restriction, such as always including CASTOR photometry when it indicates zphot < 0.5, but since the effect is minor, we avoid such additional complications at this time.

At intermediate redshifts (1 < z < 1.5), we see that all CASTOR filters provide an equivalent improvement to the standard deviation, a reduction of ∼15% compared to using the LSST filters alone, and in the highest-redshift bins, the UV/uc performs a little better than UVd/uw. Since the UV/uc pair provides a slightly lower standard deviation across the full redshift range of 0.3 < z < 3.0 compared to the UVd/uw pair, we continue the rest of this analysis with UV/uc.

5.2. CASTOR Primary

To further investigate the potential photo-z results in the CASTOR-Primary survey area with the UV/uc passband pair, we add the CASTOR UV photometry and replace the LSST u-band with the CASTOR uc photometry for all test- and training-set galaxies. As in the previous section, for all filters, we use the surveys' 5σ limiting magnitudes to simulate apparent observed magnitudes and errors, and we apply cuts at i < 25 mag and the 5σ limits. We do not replace the LSST g band with the CASTOR gc band because, as is shown in Table 1, the 10 yr LSST main survey is predicted to be slightly deeper than the CASTOR-Primary survey in the g band.

In Figure 14, we show three panels representing the true versus photometric redshifts when: (1) only LSST ugrizy photometry is used, (2) the CASTOR-Primary UV and uc photometry is included for all galaxies, and (3) the CASTOR photometry is included with the restriction that it must decrease the photo-z uncertainty (as described in Section 5.1). By comparing the bottom two panels with the top panel, we can immediately see how adding CASTOR data resolves degeneracies in galaxy color-space that cause outliers in certain regions of the ztruezphot plane (brown boxes). By comparing just the bottom two panels, we can see that our restriction inhibits the reduction of outliers in the low-z spurs at z < 0.5, indicating that in some cases, the CASTOR photometry improves the photo-z accuracy but not its precision, as discussed above.

Figure 14.

Figure 14. True ${\rm{vs}}.$ photometric redshifts for simulated galaxies with i < 25 mag based on the LSST 10 yr 5σ detection limits only (top panel), when CASTOR-Primary UV/uc photometry is included for all galaxies (middle panel), and when that CASTOR data is excluded if it increases the photo-z uncertainty (bottom panel). The brown boxes guide the eye to sensitive outlier populations.

Standard image High-resolution image

The statistical measures of photo-z quality are shown in Figure 15, which plots the standard deviation and fraction of outliers in bins of photo-z. The bias is not significantly impacted by CASTOR photometry and so is not shown. As described in the previous section, the thin lines represent results when the CASTOR photometry is included for all galaxies, without the restriction that the addition must result in a smaller photo-z uncertainty.

Figure 15.

Figure 15. The standard deviation (top panel) and fraction of outliers (bottom panel) of the photo-z results with LSST photometry alone (blue), LSST and Euclid JH (orange), LSST and CASTOR-Primary UV and uc (pink), and all three surveys combined (brown).

Standard image High-resolution image

The top panel of Figure 15 illustrates how adding CASTOR to LSST photometry reduces the standard deviation by ∼10% for galaxies with 0.3 < zphot < 3.0. At low redshifts (zphot ≲ 0.6), we find that the impact of CASTOR is equivalent to that of Euclid but that applying CASTOR photometry without restriction is more impactful and could reduce the standard deviation by ∼30%. At intermediate redshifts, 1 < zphot < 2, we see that CASTOR provides a reduction of ∼15% in standard deviation compared to using the LSST filters alone. In the redshift range 1 < z < 1.5, the UV filter is passing over the UV-upturn from evolved hot stars (e.g., Greggio & Renzini 1990, 1999), and the flux from this UV-upturn provides redshift information via the UV − u color. By redshift z ∼ 1.5, the Lyα break has entered the UV bandpass, but the UV − u color continues to provide redshift information until z ∼ 2. At intermediate redshifts (1.4 < zphot < 2), we find that the addition of Euclid is more effective at reducing the standard deviation.

In even the highest-redshift bins, we continue to see an improvement to the standard deviation when CASTOR UV photometry is included, despite the fact that Figure 13 shows that the UV only contributes to the photo-z estimate $\sim 10 \% $ of the time for galaxies with true redshifts ztrue > 2. This is the result of the CASTOR photometry assisting with the identification of truly lower-z galaxies that have optical colors that are degenerate with higher-z galaxies. These galaxies would otherwise be assigned a high photo-z based on their optical colors alone, where they increase the scatter in the high-zphot bins. In the bottom panel of Figure 15, we show that the improvements to the fraction of outliers are generally of a similar scale, except that CASTOR could reduce this metric by ∼50% for z < 0.5, and Euclid is more impactful for all galaxies with z > 1.

Generally, for both statistical measures and across all redshift bins, Figure 15 demonstrates how the addition of both CASTOR and Euclid could provide further improvements than either could on its own. This indicates that the correlations between galaxy UV and NIR photometry and redshift, which are put to use when we add CASTOR and Euclid data to the photo-z estimates, are independent of each other. The fact that each survey provides a similar impact on the photo-z results when added individually is just a coincidence, and CASTOR and Euclid would each deliver complementary information to the photo-z estimates.

5.3. CASTOR Cadence

The proposed CASTOR-Cadence survey would cover 20 square degrees and might overlap with the LSST DDFs, Euclid's deep fields, and/or the WFIRST survey area. Similar to Section 3.1, we evaluate the impact of using this region to build a deeper spectroscopic training set. For this simulation, we use a test set of galaxies limited to i < 25 mag, with photometric quality based on the 5σ limiting magnitudes of a 10 yr LSST survey, the CASTOR-Primary survey, and/or the Euclid main survey (all as quoted in Table 1). We use a training set of galaxies also limited to i < 25 mag, with photometric quality based on the 5σ limiting magnitudes of an LSST DDF (Section 3.1), the CASTOR-Cadence deep field (mUV ∼ 29.25, mu ∼ 28.95, and mg ∼ 28.45 mag; Section 1), and the WFIRST survey (Table 1). Two aspects that are different about this simulation from all of those preceding it are that (1) the CASTOR gc is replacing the LSST g band for the first time because the Cadence survey would be deeper than a stacked LSST deep-drilling field, and (2) the two NIR surveys' photometries are being mixed, with WFIRST used for the training set and Euclid for the test set.

In Figure 16, we demonstrate the impact of using a deeper training set based on the LSST DDFs only, when the CASTOR-Cadence photometry is included and when the WFIRST photometry is also included. We can see that a deeper training set that includes CASTOR photometry can significantly decrease the standard deviation and fraction of outliers at low (${z}_{\mathrm{phot}}\lt 1.5$) and high (zphot > 2) redshifts. As expected, including the WFIRST photometry in this deep training set provides further improvements across all redshifts.

Figure 16.

Figure 16. The standard deviation (top panel) and fraction of outliers (bottom panel) of the photo-z results when a deep training set that includes photometry from LSST (green), CASTOR (pink), or both CASTOR and WFIRST (red) is used.

Standard image High-resolution image

6. Discussion and Conclusions

In this study, we have investigated the impact of including photometry from the Euclid, WFIRST, and/or CASTOR space telescopes surveys' in the photometric redshift estimates for LSST galaxies. Using mock galaxy catalogs with simulated photometry, we have measured the robust standard deviation, robust bias, and fraction of outliers for our results. In Table 3, we summarize the change in these statistics for three main redshift ranges: low (0.3 < zphot < 0.6), intermediate ($0.8\,\lt {z}_{\mathrm{phot}}\lt 1.2$), and high ($2.0\lt {z}_{\mathrm{phot}}\lt 2.5$). The table's values are the ratio of the statistical results with NIR or NUV data to the results when the LSST 10 yr photometry alone is used to estimate photometric redshifts. For example, the factor representing the decrease in the robust standard deviation within the redshift range $0.3\lt {z}_{\mathrm{phot}}\lt 0.6$ when Euclid photometry is included is ${\sigma }_{\mathrm{LSST}+\mathrm{Euclid}}/{\sigma }_{\mathrm{LSST}}=0.87\pm 0.01$. The values quoted in Table 3 are based on simulations in which the inclusion of NIR and NUV photometry for Euclid and CASTOR is subject to the restrictions discussed in Sections 3 and 5.

Table 3.  Summary of Statistical Results

  Fractional Changea from Using LSSTb Photometry Only When External Survey Data is Added
zphot
Range Euclidc WFIRST CASTORd
  Robust Standard Deviation:
0.3–0.6 0.87 ± 0.01 0.74 ± 0.01 0.86 ± 0.01
0.8–1.2 0.87 ± 0.01 0.75 ± 0.01 0.93 ± 0.01
2.0–2.5 0.81 ± 0.02 0.32 ± 0.01 0.83 ± 0.02
  Robust Bias:
0.3–0.6 1.45 ± 0.12 0.88 ± 0.08 0.89 ± 0.09
0.8–1.2 0.97 ± 0.31 0.63 ± 0.23 0.30 ± 0.22
2.0–2.5 0.29 ± 0.12 0.09 ± 0.05 0.73 ± 0.16
  Fraction of Outliers:
0.3–0.6 0.83 0.47 0.79
0.8–1.2 0.46 0.15 0.68
2.0–2.5 0.57 0.01 0.62

Notes.

aExempli gratia ${\sigma }_{\mathrm{LSST}+\mathrm{Euclid}}/{\sigma }_{\mathrm{LSST}}$. bA 10 yr LSST survey. cUsing the Euclid 1σ detection limit. dUsing CASTOR-Primary UV and uc filters.

Download table as:  ASCIITypeset image

For almost every statistic and every redshift range, we find a significant improvement to the photo-z results when the Euclid, WFIRST, or CASTOR photometry is included, and the improvements tend to be greater in the highest-redshift bins (i.e., the factors are smaller). In the one and only case where this factor is greater than 1, the robust bias when Euclid is added in the lowest-z bin has a very low absolute value, 0.0025, which is well within the LSST SRD's targeted range (see Figure 6).

In this study, we have also explored the relative photo-z results in four main sky-area zones of overlap between LSST, Euclid, WFIRST, and CASTOR: (0) the ∼18,000 square degrees of the LSST WFD survey at 10 yr with filters ugrizy; (1) the ∼7200 square degrees of overlap between LSST grizy, Euclid-Wide JH, and CASTOR-Primary UV uc; (2) the ∼2000 square degrees of overlap between LSST griz, WFIRST YJHK, and CASTOR-Primary UV uc; and (3) the northern area of ∼3000 square degrees with shallow coverage from LSST griz, Euclid YJH, and CFHT-u. In Table 4, we summarize the fractional changes in each statistical measure, in three redshift ranges, for galaxies in sky-area zones 1, 2, and 3 compared to zone 0 (i.e., areas of overlap with Euclid, WFIRST, and/or CASTOR compared to areas with LSST alone).

Table 4.  Summary of Results by Sky Area

  Fractional Changea from Results for LSST Zone 0b for the Other Sky Regions
zphot
Range Zone 1c Zone 2d Zone 3e
  Robust Standard Deviation:
0.3–0.6 0.72 ± 0.01 0.63 ± 0.01 1.57 ± 0.02
0.8–1.2 0.83 ± 0.01 0.74 ± 0.01 2.12 ± 0.02
2.0–2.5 0.57 ± 0.01 0.62 ± 0.02 1.82 ± 0.05
  Robust Bias:
0.3–0.6 0.99 ± 0.09 0.02 ± 0.04 0.99 ± 0.14
0.8–1.2 0.33 ± 0.21 0.92 ± 0.27 1.23 ± 0.57
2.0–2.5 0.56 ± 0.11 0.27 ± 0.10 3.01 ± 0.53
  Fraction of Outliers:
0.3–0.6 0.36 0.30 0.90
0.8–1.2 0.29 0.22 2.00
2.0–2.5 0.23 0.31 1.22

Notes.

aExempli gratia ${\sigma }_{\mathrm{Zone}1}/{\sigma }_{\mathrm{Zone}0}$. bZone 0: LSST only (∼18,000 deg${}^{2}$). cZone 1: LSST, Euclid, and CASTOR (∼7200 deg${}^{2}$). dZone 2: LSST, WFIRST, and CASTOR (∼2000 deg2). eZone 3: Shallow LSST, Euclid, and CFHT-u (∼3000 deg2).

Download table as:  ASCIITypeset image

We find that the overall improvements to the standard deviation and fraction of outliers in zones 1 and 2 are quite similar, while the improvements to the bias appear to be quite different in the low and intermediate redshift ranges for zones 1 and 2. For example, it seems that the bias at low z is barely improved in zone 1 but improved by almost two orders of magnitude in zone 2. However, the bias in photo-z results for zones 0, 1, and 2 is low, within the SRD's targeted ranges, and so, the fractional improvements in Table 4 represent small changes to already-small values. As expected, the photo-z results in zone 3—the potential shallow northern LSST survey extension discussed in Section 3.2—are mostly of a poorer quality compared to zone 0. The exception is in the lowest-z bin, where the the standard deviation and fraction of outliers are not deteriorated, because the addition of Euclid and CFHT photometry has mitigated the impact of shallower LSST photometry.

Below, we summarize the results for each survey in turn and then discuss some ideas for future work regarding ways in which space-based imaging could further improve the LSST photo-z results.

6.1. Euclid

We have simulated photometric redshift results for LSST galaxies that overlap with the Euclid-Wide survey area. We found that Euclid mainly improves the photo-z estimates at redshifts z > 1, as expected, because this is where the Balmer break is redshifted beyond the optical filters. Quantitatively, we found that the addition of Euclid 5σ detections to the LSST 10 yr catalog can reduce the standard deviation for galaxies with z > 1 (or within the full redshift range of 0.3 < z < 3.0) by ∼20% (∼10%), and the fraction of outliers by ∼40% (∼25%). We showed that Euclid would have a relatively larger positive impact on the photo-z estimates when added to the 10 yr LSST photometry than at earlier years of the survey and/or when Euclid detections down to 1σ are included (instead of 5σ). We conclude that reducing the standard deviation in the photo-z results is more efficiently done with deeper LSST ugrizy photometry than with Euclid data, but we emphasize that Euclid does offer the unique benefits of reducing the standard deviation at z > 1 and especially of reducing the fraction of outliers. Furthermore, we point out that observing for a longer time period with the LSST z and y filters would not recreate the benefits of including Euclid photometry. We demonstrated how a spectroscopic training set from overlapping LSST and Euclid DDFs can significantly improve the photo-z results and how the photo-z for a shallow northern LSST extension that overlaps with the Euclid-Wide survey would be scientifically useful (i.e., it would have an average standard deviation just ∼2 times higher than the LSST year 10 survey).

6.2. WFIRST

Compared to Euclid, WFIRST will provide deeper NIR photometric catalogs and one additional, redder filter (F184, which we have referred to as K), but it will have a significantly smaller overlap area with LSST. Within the overlapping area, we have demonstrated that the addition of WFIRST NIR photometry would provide a drastically improved photo-z: the standard deviation is reduced by ≳50% (∼25%) for galaxies with z > 1.5 (0.3 < z < 3), the bias becomes negligible for z > 1 (and is reduced by ∼30% for z < 0.5), and the fraction of outliers is reduced to just ∼2% for galaxies within 0.3 < z < 3.0. In particular, the catastrophic outliers caused by a degeneracy between optical colors of galaxies at low and high redshifts are almost entirely removed. We also considered the impact of adding WFIRST to a deeper LSST catalog (i < 26.8 instead of i < 25 mag) and found that WFIRST would provide critical improvements to the standard deviation, lowering it by 50% for intermediate redshifts.

6.3. CASTOR

The proposed CASTOR mission might provide deep UV-, u-, and g-band photometry that overlaps with the LSST main survey. We found that including the CASTOR-Primary survey photometry improves the LSST photo-z estimates at all redshifts, as expected, because detections in the bluer passbands help to break degeneracies in the optical colors of low- and high-redshift galaxies by identifying galaxies that are truly low z. In particular, we showed that including CASTOR photometry could reduce the standard deviation by ∼30% and the fraction of outliers by ∼50% at z < 0.5. We also demonstrate how further improvements to the photo-z estimates might be attained by building a deeper spectroscopic training set from a combination of the CASTOR-Cadence and LSST DDFs. Although we illustrated how both of CASTOR's potential UV- and u-band pairs could provide similar benefits to the LSST photo-z, at the time of this work's writing, the CASTOR passbands and surveys were still in the proposal stage, and they might continue to evolve from what we have assumed in this work.

6.4. Summary

Generally, we have found that increased photometric depth provides the largest potential reduction in the standard deviation for predicted LSST photo-z estimates, compared to including additional filters beyond the optical. However, we have also found that the addition of UV and/or NIR filters can significantly reduce the standard deviation, in particular, redshift ranges and, more importantly, that additional filters are absolutely necessary for reducing the fraction of 3σ and catastrophic outliers. We have demonstrated how the addition of NIR or UV photometry from Euclid or CASTOR, respectively, would each result in similar reductions for the standard deviation and fraction of outliers when included in LSST photo-z estimates individually. In addition, we have shown how Euclid and CASTOR each provide complementary information about a galaxy's redshift and, thus, how additional significant improvement to the photo-z estimates could be obtained by including data from both surveys. As a final note, the deeper NIR photometry from WFIRST should deliver superb photo-z estimates for LSST galaxies, albeit in the smaller area of overlap between the LSST and WFIRST surveys.

6.5. Future Work

The Euclid, WFIRST, and CASTOR missions are all space telescopes, and there are additional merits to their data aside from just the improved photometric quality and expanded spectral range (e.g., spatial resolution). These aspects can also be used to improve the LSST photo-z, over and above what we have demonstrated (e.g., Pasquet et al. 2019). Doing so requires a photo-z estimator that has been constructed to ingest, and properly apply, the additional features of the data. Here, we suggest several options for obtaining even greater improvements for the LSST photo-z from the Euclid, WIFRST, and/or CASTOR surveys, which are beyond the scope of this work.

  • 1.  
    With the CMNN estimator, all galaxies must have a photometric detection with appropriate errors in order to simulate photometric redshifts. A photo-z estimator that fully utilized non-detections might be better able to quantify the improvements offered by surveys that are significantly shallower than the LSST.
  • 2.  
    With the CMNN estimator, adding extra filters can deteriorate the photo-z quality if the photometry is not constraining enough to balance the additional degrees of freedom. In this work, we have mitigated this issue by only adding extra filters when they are likely to improve the photo-z estimate. A photo-z estimator that takes a more sophisticated approach to the provisional addition of photometry might be able to show an even greater improvement in the photo-z quality.
  • 3.  
    Photometric differences between filters that are similar, such as LSST's y and the Euclid or WFIRST Y, could be used to constrain the redshift of strong emission lines and, thus, provide a more precise photo-z for a subset of galaxy types.
  • 4.  
    Euclid and WFIRST will have grism data, which we have not considered in this work but which could be used to further improve the photo-z estimates—either by providing additional SED information for test galaxies, or perhaps enhancing the training-set photometry.
  • 5.  
    All space-based imaging surveys offer significantly better spatial resolution than ground-based imaging, and this could improve the photo-z by, for example, providing galaxy size and shape priors and/or improving the treatment of blended objects. This may require pixel-level analysis, whereas, here, we have worked only with mock catalogs.

This material is based upon work supported in part by the National Science Foundation through Cooperative Agreement 1258333 managed by the Association of Universities for Research in Astronomy (AURA) and the Department of Energy under Contract No. DE-AC02-76SF00515 with the SLAC National Accelerator Laboratory. Additional funding for the Rubin Observatory comes from private donations, grants to universities, and in-kind support from LSSTC Institutional Members. A.J.C. acknowledges support by the U.S. Department of Energy, Office of Science, under Award No. DE-SC-0011635. S.J.S. acknowledges support from DOE grant DE-SC0009999 and NSF/AURA grant N56981C. M.L.G., A.J.C., C.B.M., Z.I., S.F.D., R.L.J., M.J., P.Y., and J.B.K. acknowledge support from the DiRAC Institute in the Department of Astronomy at the University of Washington. The DiRAC Institute is supported through generous gifts from the Charles and Lisa Simonyi Fund for Arts and Sciences, and the Washington Research Foundation. M.J. wishes to acknowledge the support of the Washington Research Foundation Data Science Term Chair fund, and the University of Washington Provost's Initiative in Data-Intensive Discovery.

Footnotes

  • "Castor" is the genus of, and French word for, "beaver." The North American beaver, Castor canadensis, is the national animal of Canada.

  • Some readers might notice that the definition of an LSST "Gold Sample" has evolved from i < 25.3 mag (Ivezić & the LSST Science Collaboration 2011).

  • A demonstrative version of the CMNN estimator is available at https://github.com/dirac-institute/CMNN_Photoz_Estimator.

  • This is an approximation of the Mahalanobis Distance, which is typically expressed as ${D}_{M}^{2}={(\bar{x}-\bar{u})}^{T}{S}^{-1}(\bar{x}-\bar{u})$ where S is the covariance error matrix. For this work, we verified that using the approximation does not significantly change the photo-z estimates and has no impact on our overall conclusions.

  • Since the test galaxy is left out of its own training set, there are no instances of DM = 0 and no infinite weights.

  • Documentation for this catalog can be found at http://galaxy-catalogue.dur.ac.uk.

  • 10 

    The four potential CASTOR transmission functions that we use are the latest and most realistic estimates as of 2019 December.

  • 11 

    The normal random selection of the amount of scatter to apply, which is based on the simulated photometric error, is done in magnitude-space for all galaxies.

  • 12 

    However, we note that this does assume a perfect observing efficiency, and in reality, some small population of low surface-brightness galaxies that have an integrated flux brighter than i ∼ 25 mag might be undetected.

  • 13 

    Recall that the i < 25 mag cut is imposed in order to compare the photo-z results with the LSST science requirements, as described in Section 1.

  • 14 

    As shown in the bottom-middle figure at https://www.euclid-ec.org/?page_id=2490.

  • 15 

    One alternative method to include non-detections in the CMNN photo-z estimates—which we do not explore in this work—is to use them as a prior or a weight when constructing the CMNN subset of training galaxies.

  • 16 

    Although, with dedicated time on 10–30 m telescopes, it might be possible to obtain a small, but very deep, training set, we leave further exploration of training-set variations to future work and just consider a simple extension here.

  • 17 
Please wait… references are loading.
10.3847/1538-3881/ab8a43