Abstract
Some recent literature has claimed there to be an evolution in galaxies' dust temperatures toward warmer (or colder) spectral energy distributions (SEDs) between low and high redshift. These conclusions are driven by both theoretical models and empirical measurement. Such claims sometimes contradict one another and are prone to biases in samples or SED fitting techniques. What has made direct comparisons difficult is that there is no uniform approach to fitting galaxies' infrared/millimeter SEDs. Here we aim to standardize the measurement of galaxies' dust temperatures with a python-based SED fitting procedure, MCIRSED.1We draw on reference data sets observed by Infrared Astronomical Satellite, Herschel, and Scuba-2 to test for redshift evolution out to z ∼ 2. We anchor our work to the LIR–λpeak plane, where there is an empirically observed anticorrelation between IR luminosity and rest-frame peak wavelength (an observational proxy for luminosity-weighted dust temperature) such that where η = −0.09 ± 0.01, Lt = 1012 L⊙, and λt = 92 ± 2 μm. We find no evidence for redshift evolution of galaxies' temperatures, or λpeak, at fixed LIR from 0 < z < 2 with >99.99% confidence. Our finding does not preclude evolution in dust temperatures at fixed stellar mass, which is expected from a nonevolving LIR–λpeak relation and a strongly evolving SFR–M⋆ relation. The breadth of dust temperatures () at a given LIR is likely driven by variation in galaxies' dust geometries and sizes, and it does not evolve. Testing for LIR–λpeak evolution toward higher redshift (z ∼ 5−6) requires better sampling of galaxies' dust SEDs near their peaks (observed ∼200–600 μm) with ≲1 mJy sensitivity. This poses a significant challenge to current instrumentation.
Export citation and abstract BibTeX RIS
Original content from this work may be used under the terms of the Creative Commons Attribution 4.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.
1. Introduction
Astrophysical dust is a key component of the interstellar medium (ISM) in galaxies, despite comprising a negligible proportion of their total mass (≲1% of the baryonic budget; e.g., Rémy-Ruyer et al. 2014). Dust is an efficient absorber of ultraviolet and visible light emitted from both young stars and active galactic nuclei (AGNs). It re-radiates the absorbed energy thermally between infrared (IR) and millimeter wavelengths (e.g., Wolfire et al. 1995). The total energy of re-radiated emission, LIR (formally defined as the integral of the SED between 8 and 1000 μm), is often used as a fundamental proxy for a galaxy's star formation rate (e.g., Sanders & Mirabel 1996; Kennicutt 1998). Thus, lack of knowledge of a galaxies' re-radiated dust emission can result in catastrophic misinterpretation of galaxies' underlying physics.
While rough estimates of a galaxies' infrared luminosity can be made even with fairly crude IR/millimeter constraints (e.g., Sanders et al. 2003), the accurate measurement of galaxies' dust SEDs can unlock a more detailed understanding of the systems' physical configuration, chemical enrichment, and mass build-up (see reviews by Casey et al. 2014; Galliano et al.2018; Hodge & da Cunha 2020). Beyond IR luminosity, the temperature of thermal dust emission can be roughly inferred using Wien's Displacement Law (Wien 1896), whereby the rest-frame peak wavelength, λpeak, of the SED in Sν is inversely proportional to a blackbody's temperature. Other characteristics may also be measured—the optical depth of dust as a function of wavelength, the dust mass surface density, the emissivity spectral index β, the presence of warm dust heated by nuclear processes, the dust extinction curve, and the abundance of polycyclic aromatic hydrocarbons (PAHs; Galliano et al. 2018). However, a dearth of photometric constraints in the IR/millimeter regime for the majority of galaxies severely limits tight constraints on most of these quantities, save LIR and λpeak. Additionally, the degeneracies between some physical quantities such as dust emissivities and temperatures further limits our ability to constrain them (e.g., Spilker et al. 2016). Observations that would break these degeneracies do not exist for the majority of galaxies. In the face of this, the most reasonable choice for deciphering trends in the underlying population is to focus on IR luminosities and the wavelengths where the dust emission peaks, which is a proxy for dust temperature—the two parameters encapsulating dust SEDs for which large samples of galaxies have observations.
Measurements of galaxies' dust temperatures in the literature are plentiful, yet different studies employ different methods for converting a set of IR/millimeter photometry to a temperature, ranging from single modified blackbody fits (Dunne & Eales 2001), to a superposition of blackbody fits (Dale et al. 2001; Younger et al. 2007; Casey 2012), the use of empirical templates (Chary & Elbaz 2001; Draine & Li 2007; Rieke et al.2009), or theoretical models (Silva et al. 1998; Burgarella et al. 2005; Siebenmorgen & Krügel 2007; da Cunha et al. 2015). This conversion naturally neglects the complexity of galaxies' ISM, where dust cannot simply be characterized as having one temperature. In reality, ISM dust exists in several phases, from warm direct photon-heated dust in close proximity to young OB associations or AGNs to cold, diffuse ISM dust heated only by the ambient radiation field. In practice, what is measured from galaxies' SEDs is the luminosity-weighted dust temperature averaged over the entirety of the galaxy. Though this reduces the complexity of galaxies' ISM to a single parameter, that parameter is, in principle, directly measurable for a large sample of galaxies spanning cosmic time.
In the simple case of a modified blackbody, the same set of photometry could be described as having very different dust temperatures depending on the underlying dust opacity model. For example, Cortzen et al. (2020) found that an optically thin blackbody fit to GN20 gives a dust temperature of 33 K while an optically thick model gives 52 K. For this reason, some works (Casey et al. 2018a, 2018b) have advocated for use of λpeak to quantify the dust temperature. λpeak is a direct, observable quantity that is model independent and thus a direct reflection of data constraints. Note that converting λpeak to a dust temperature still requires assumptions about the dust opacity in the absence of direct constraints.
Given the nuances in both deriving and comparing dust temperatures, it is not necessarily surprising that the literature has reported several seemingly contradictory temperature trends with redshift. The first set of claims argues that higher-redshift galaxies display hotter dust temperatures on average, in comparison to galaxies at z ∼ 0. Such conclusions are reached using both observational data at z ∼ 1–2 (e.g., Magnelli et al. 2014; Béthermin et al. 2015; Schreiber et al. 2018) out to z ∼ 4–5 (e.g., Magdis et al. 2012; Faisst et al. 2017, 2020; see also Bakx et al. 2021 at z = 7.13) and via theoretical modeling, largely at the earliest epochs, z ≳ 5 (e.g., De Rossi et al. 2018; Arata et al. 2019; Liang et al. 2019; Ma et al. 2019; Sommovigo et al. 2020). In theory, the higher dust temperatures at earlier epochs are attributed to either a lower relative dust abundance, higher specific star formation rates (sSFR), or higher star formation surface densities (ΣSFR) in high-redshift galaxies. These conditions may only predominate at z ≳ 5, at which point the dust reservoirs of even massive galaxies may not have had sufficient time after the Big Bang to reach an enriched chemical equilibrium. Observationally, higher dust temperatures for high-redshift galaxies have been measured in "main-sequence" galaxies (e.g., Magnelli et al. 2014; Béthermin et al. 2015; Schreiber et al. 2018) while galaxies elevated above the main sequence are observed to be uniformly "hot." Observational work at z > 3 has focused on galaxies lying far off expectation (Capak et al. 2015) in the relationship between their rest-frame UV color and the LIR/LUV ratio in the relationship between the Infrared excess (IRX) and slope of the UV spectrum (β; Meurer et al. 1999) potentially indicative of different dust compositions. Invoking hotter dust temperatures (hotter than is often assumed for lower-redshift sources) may bring observation back in line with expectation (e.g., Bouwens et al. 2016; Faisst et al. 2017).
The second set of claims posit that galaxies' dust SEDs evolve toward colder temperatures at higher z (e.g., Chapman et al. 2002; Pope et al. 2006; Symeonidis et al. 2009; Hwang et al. 2010; Kirkpatrick et al. 2012; Symeonidis et al. 2013; Magnelli et al. 2014; Kirkpatrick et al. 2017). They find this to be caused by higher dust masses (e.g., Kirkpatrick et al. 2017), higher dust opacities or emissivities, or more extended spatial distributions of dust at higher redshift (e.g., Symeonidis et al. 2009; Elbaz et al. 2011; Rujopakarn et al. 2013). While luminous and ultraluminous IR galaxies (LIRGs and ULIRGs) in the local universe are compact (<1 kpc; e.g., Condon et al. 1991), higher-redshift galaxies with similar IR luminosities have extended regions of dust emission (∼few kiloparsecs; e.g., Elbaz et al. 2011; Rujopakarn et al. 2011, 2013). The conjecture of these works is that the extended distribution of dust in high-z IR-luminous galaxies leads to overall cooler dust temperatures than are seen in local (U)LIRGs.
The third set of claims suggest that galaxy SEDs do not, in fact, evolve substantially with redshift but that any perception in evolution is due to telescope selection effects. Such selection effects have been known in the literature for some time; for example, 850 μm detection may preferentially select the coldest IR-luminous galaxies at any given redshift given the steep temperature dependence of (e.g., Eales et al. 2000; Chapman et al. 2004b). In analyzing several different samples from 0 < z < 5, Casey et al. (2018b) found no evidence for evolution in the distribution of galaxies in LIR–λpeak space, though this is only argued using aggregated literature fits that do not account for sample incompleteness or biases. Similarly, Dudzevičiūtė et al. (2020) found no evolution in dust temperature at fixed IR luminosity for massive dusty galaxies between 1.5 < z < 4, though their sample is limited to galaxies selected at 850 μm, and selecting samples at 850 μm will preferentially select for cold sources (e.g., Eales et al. 2000; Chapman et al. 2004a; Casey et al. 2009). Recent works by Simpson et al. (2019) and Reuter et al. (2020) also found no conclusive evidence of evolution.
In this paper we aim to quantitatively address possible evolution in the LIR–λpeak relationship from z ∼ 0 to z ∼ 2 and to test the hypothesis that LIR–λpeak does not evolve over this redshift range. Our goal is to do so using an approach that investigates sample and observing band biases in dust SED fitting in a uniform manner. To do that, we fit dust SEDs for ∼4700 galaxies from three different samples from the literature to explore telescope selection biases that may influence measurement of galaxies' dust SEDs, and then we place our best-fit SEDs in context against different claims in the literature. This paper is organized as follows. In Section 2 we describe each of the samples we curated for our analysis. In Section 3 we describe the SED modeling technique we employ. In Section 4 we describe the LIR–λpeak relation measured first for a z ∼ 0 sample and then go on to test its redshift evolution using subsequent higher-redshift samples. Section 5 presents a discussion of our results in context of literature studies, and we make recommendations for standardizing IR/millimeter SED fitting procedures for fair comparison across works. Finally, we present the MCIRSED tool we built to carry out our analysis in the Appendix. Throughout this work we adopt the Planck 2018 flat cosmology with H0 = 67.4 km s−1 Mpc−1, and Ωm = 0.315 (Planck Collaboration et al. 2020) and a Kroupa stellar initial mass function (Kroupa 2001; Kroupa & Weidner 2003).
2. Data Selection
The aim of this study is to investigate possible redshift evolution in the average dust temperature of galaxies. To this end we make use of three different catalogs from the literature that collectively span the redshift range of 0 < z < 2 giving us a broad dynamic range of IR-luminous galaxies that allows us to assemble an unbiased sample with well-sampled dust SEDs across a large redshift range. Each sample has multiple photometric observations at wavelengths spanning roughly the nominal range of thermal dust emission, 8–1000 μm with between six and 11 photometric constraints (with a median of eight constraints). We require that our samples have observations between rest frame 40 < λpeak < 200 μm in order to constrain λpeak. This corresponds to a temperature range of 10–80 K assuming the case of an optically thin blackbody. We also require that each galaxy have a spectroscopic or high-quality photometric redshift derived from optical and infrared (OIR) photometry (Δz/(1 + z) ≲ 0.1). It is crucial that redshift constraints be independent of the IR/millimeter photometry, otherwise our results may shield an inherent bias.
Our analysis focuses on three key samples. The first, called the Infrared Astronomical Satellite (IRAS) sample, is the low-redshift sample that forms the anchor understanding of dust SEDs in the local universe. The next sample is H-ATLAS, which extends to z ∼ 0.4, and finally the COSMOS sample, which extends to z ∼ 2. The distribution of each of these samples in LIR–z space is shown in Figure 1, and Table 1 summarizes their defining attributes, further described below.
Table 1. Listing, for Each Sample, of the Wavelengths of Observation, Detection Limits, Sample Size, Survey Area, Redshift Range, Median Redshift, and Parent Sample References
Sample Name | IRAS | H-ATLAS | COSMOS |
---|---|---|---|
Wavelengths (μm) | 12 (WISE), 12 (IRAS), 22, 25, 60, 100 (IRAS), 100 (Herschel), 160, 250, 350, 500 | 22, 100, 160, 250, 350, 500 | 24, 70, 100, 160, 250, 350, 500, 850, 1100, 1200 |
Selection of Sample | S/N > 5 (>179 mJy) at 60 μm and S/N > 4 (>29 mJy) at 250 μm | S/N > 3 (>132 mJy) at 100 μm and S/N > 4 (>29 mJy) at 250 μm | S/N > 3 in at least two bands: 100 μm (>5.34 mJy), 160 μm (>11.6 mJy), 250 μm (>17.4 mJy), 350 μm (>18.9 mJy), or 500 μm (>20.4 mJy) |
Number of Detections (Upper Limits) in Each Band a | 12 μm (WISE): 502 (9, 2%), 12 μm (IRAS): 16 (495, 97%), 22 μm: 378 (133, 26%), 25 μm: 65 (446, 87%), 60 μm: 511 (0, 0%), 100 μm (IRAS): 364 (147, 29%), 100 μm (Herschel): 461, (50, 9%), 160 μm: 463 (48, 9%), 250 μm: 511 (0, 0%), 350 μm: 499 (12, 2%), 500 μm: 401 (110, 22%) | 22 μm: 1656 (511, 24%), 100 μm: 2167 (0, 0%), 160 μm: 1459 (705, 33%), 250 μm: 2167 (0, 0%), 350 μm: 1801 (366, 17%), 500 μm: 563 (1604, 74%) | 24 μm: 1990 (0, 0%), 70 μm: 1588 (402, 20%), 100 μm: 1666 (324, 12%), 160 μm: 1713(277, 14%), 250 μm: 1971 (19, 1%), 350 μm: 1827 (163, 9%), 500 μm: 1406 (584, 29%), 850 μm: 185 (1805, 90%), 1100 μm: 18 (1972, 99%), 1200 μm: 5 (1985, 99.7%) |
Sample Size (Galaxies) | 511 | 2167 | 1990 |
Survey Solid Angle (deg2) | 660 | 162 | 2 |
Redshift | 0.025 < z < 0.19 | 0.05 < z < 0.4 | 0.15 < z < 2.0 |
Median Redshift | 0.050 ± 0.002 | 0.127 ± 0.002 | 0.701 ± 0.008 |
References | Wang et al. (2014), Valiante et al. (2016), Bourne et al. (2016), Smith et al. (2017), Maddox et al. (2018), Furlanetto et al. (2018) | Valiante et al. (2016), Bourne et al. (2016) | Lee et al. (2013), Simpson et al. (2019), Weaver et al. (2022) |
Note. Uncertainties on the median redshifts are the standard deviations of 1000 bootstrapped calculations of the median redshift.
a The number of galaxies with detections in each band with signal-to-noise ratio (S/N) ≥ 3. The number of galaxies with upper limits in each band (S/N < 3) are listed in parentheses as well as the percentage of the total that have upper limits.Download table as: ASCIITypeset image
We limit our analysis to z < 2 because galaxies at higher redshifts have significantly lower-quality photometric constraints on their dust SEDs. This is primarily due to the lack of sensitivity of Herschel SPIRE to LIR ≲ 1013 L⊙ SEDs at z > 2 (e.g., Casey et al. 2012a). In addition to low signal-to-noise ratio (S/N) measurements of z > 2 far-IR (FIR) data, z > 2 dusty star forming galaxy (DSFG) tend to have a lower proportion of reliable redshift constraints with a much higher occurrence rate of sources lacking OIR counterparts entirely (e.g., Wardlow et al. 2011). In contrast, ≳98% of z < 2 DSFGs are detected in 24 μm data and corresponding deep OIR imaging in the COSMOS field (e.g., Magdis et al. 2012; Weaver et al. 2022), which makes this analysis possible at z < 2.
2.1. IRAS Sample
We build our lowest-redshift sample, which we will refer to as the IRAS sample, by performing a spatial cross match between two separate catalogs. The first catalog contains IRAS (Neugebauer et al. 1984) sources from the Revised IRAS Faint Source Redshift Catalog of galaxies (RIFSCz; Wang et al. 2014). This sample is selected at 60 μm and contains observations at 12, 25, 60, and 100 μm covering galactic latitudes ∣b∣ > 20° in unconfused regions of the sky at 60 μm (where "unconfused" corresponds to fewer than eight sources per beam with 6' radius) with order of 60,000 sources. The second catalog is data releases 1 and 2 of the Herschel Astrophysical Terahertz Large Area Survey catalog, which covers ∼120,000 sources (H-ATLAS; Bourne et al. 2016; Valiante et al. 2016; Smith et al. 2017; Furlanetto et al. 2018; Maddox et al. 2018). This sample is selected at 250 μm and contains observations at 100, 160, 250, 350, and 500 μm from the SPIRE and PACS instruments covering 660 deg2.
To create the RIFSCz, Wang et al. (2014) selected galaxies detected with IRAS from the IRAS Faint Source Catalog (FSC; Moshir et al. 1993) that have S/N > 5 in the 60 μm band. The FSC contains ∼1.7 × 105 sources with only about 40% of sources detected at 60 μm. In order to obtain accurate positional estimates, Wang et al. (2014) used the likelihood ratio technique (e.g., Sutherland & Saunders 1992) to match FSC sources to 3.4 μm counterparts in the Wide-field Infrared Survey Explorer (WISE) All-Sky Data Release (Wright et al. 2010). This wavelength is used because it is the deepest band in WISE and it samples a galaxy's SED at a wavelength where it is likely to be detected from either warm dust or stellar emission mechanisms. The authors then cross match the IRAS sources with WISE astrometry (the latter is more precise) and then to several other wide-field surveys: the Sloan Digital Sky Survey Data Release 10 (SDSS DR10; Ahn et al. 2014) using a matching radius of 3'', the Galaxy Evolution Explorer All-Sky Survey Source Catalog (Seibert et al. 2012) using a matching radius of 5'', the Two Micron All Sky Survey (2MASS Huchra et al. 2012), which is already included with the WISE catalog, and the first public release of the Planck catalog of compact sources (Planck Collaboration 2013; Planck Collaboration et al.2014) using a matching radius of 3'. Each matching radius used by Wang et al. (2014) is chosen based on the spatial resolution of each match. Sources are retained in the RIFSCz catalog even if they are not matched to detections in these ancillary catalogs. See Wang et al. (2014) for more details. In the present work, focusing on IR SEDs, we use IRAS and WISE flux densities and redshifts for our analysis.
We further cross match RIFSCz sources with the H-ATLAS catalog (Bourne et al. 2016; Valiante et al. 2016; Smith et al. 2017; Furlanetto et al. 2018; Maddox et al. 2018) to fill out the SEDs at longer wavelengths. H-ATLAS presents sources detected at S/N > 4 at 250 μm (corresponding to a 4σ depth of 29 mJy) in an area of the sky spanning 660 deg2. The catalog contains two data releases: DR1, covering the North Galactic Pole (NGP; 500 deg2), cross matched with SDSS DR10 (Ahn et al. 2014) for redshifts and optical counterparts and positions, and DR2, covering the South Galactic Pole (SGP; 162 deg2).
Sources in the RISFCz catalog have astrometric positions based on the best available precision where reliable counterparts are identified in Wang et al. (2014): either from SDSS DR10, 2MASS, or WISE. From the Wang et al. (2014) astrometry, we further cross match RIFSCz sources with both DR1 and DR2 of the H-ATLAS catalog using search radii that correspond to the radius of the average point-spread function (PSF) of the best available ancillary catalog provided by Wang et al. (2014). H-ATLAS DR1 and the NGP field of DR2 are matched to RIFSCz sources with a search radius of 1'' because the RIFSCz catalog in this region is based on SDSS DR10 astrometry, precise to 1''. However, there is no SDSS DR10 coverage of the SGP field in H-ATLAS DR2, so sources in the SGP field are matched using a less restrictive search radius of 16 for counterparts in the 2MASS catalog whose astrometry is slightly less well constrained (154 sources are matched). In the absence of 2MASS counterpart matches, sources from the SGP are matched to the WISE astrometry given in the RIFSCz using a matching radius of 3'' (14 sources are matched this way). Note that contamination from false positives is negligible within this search radius given the low source density of both RIFSCz sources (∼1 deg−2) and possible 2MASS counterparts (∼50 deg−2). Cross matching sources from the RIFSCz to H-ATLAS data releases 1 and 2 results in 780 matches or 2% of RIFSCz sources, which is the expected proportion of the full RIFSCz catalog because H-ATLAS only covers 2% of the solid angle spanned by the RIFSCz.
We limit the IRAS sample to only include galaxies with spectroscopic redshifts (spec-zs). This requirement reduces the sample from 780 to 750 galaxies (96% of the original) spanning the redshift range z < 0.48 with median . This reduction does not show a bias in redshift within the sample. Requiring spec-zs reduces the sample size by only 4% and thus is a reasonable choice because uncertainties in adopted redshift will translate into uncertainties on our measured rest-frame peak wavelengths and IR luminosities.
Because the aim of this work is to look for redshift evolution in the correlation between the peak wavelength and the IR luminosities of galaxies, we want a clean snapshot of the LIR–λpeak correlation without influence from any potential redshift evolution. We therefore further restrict the sample to only include galaxies in a redshift range that does not evolve strongly with cosmological time and yet is beyond the local volume. We limit our sample to galaxies between 0.025 <z < 0.19. The lower-redshift bound is specifically chosen to remove spatially resolved sources. It corresponds to the redshift where a galaxy with a radius of 10 kpc would be resolved across two 250 μm Herschel beams (18'' × 2 = 36''). This removes some of the uncertainty associated with the more extreme aperture corrections required to derive accurate photometric constraints in the local volume. The upper-redshift bound is chosen based on the time frame over which the star formation rate of galaxies on the star formation "main sequence" evolves by less than a factor of two from our lower-redshift bound (Speagle et al. 2014). In other words, the galaxies' average SFRs (at fixed stellar mass) do not evolve by more than two times over this interval. This factor of two is chosen because it corresponds to the intrinsic width of the main sequence itself (∼0.3 dex). Thus, in this snapshot in time, the main sequence does not evolve by more than its intrinsic width. This additional redshift restriction leaves 511 out of 750 galaxies remaining in the sample (68%). We also tested setting the lower-redshift bound to the redshift where a galaxy with radius 10 kpc fits within a single beam (at z = 0.01) and found this does not change the overall results of this paper but does increase the uncertainty in our measurement of the LIR–λpeak relation, as it limits the IRAS sample to only 257 out of the 780 original galaxies (33%). We therefore continue with the less restrictive 0.025 < z < 0.19 selection to maximize the dynamic range of luminosities in our sample. We find that this sample is >90% complete above >340 mJy and >81% complete above >179 mJy at 60 μm. Figure 1 shows the distribution of redshifts versus the IR luminosities for the IRAS sample along with the other samples that we will describe below in Sections 2.2 and 2.3.
Observational constraints included in the IRAS sample are WISE 12 μm, IRAS 12 μm, WISE 22 μm, IRAS 25 μm, IRAS 60 μm, IRAS 100 μm, Herschel 100 μm, Herschel 160 μm, Herschel 250 μm, Herschel 350 μm, and Herschel 500 μm, with inclusion in the sample predicated only on a detection at 60 μm. These wavelengths allow us to probe approximately the full range of dust emission. Because the 12 μm band may capture PAH emission at the redshifts covered by this sample, we test that the inclusion of two 12 μm bands—from both IRAS and WISE–does not bias our measurement of IR luminosities and peak wavelengths. We find that the median IR luminosities and peak wavelengths of galaxies in this sample differ by less than a percent when excluding either set of observations at 12 μm. This difference is smaller than the average uncertainty in these parameters, so we conclude that the inclusion of both 12 μm bands does not introduce additional bias.
To summarize, our selection function for this sample is galaxies that are (1) detected at S/N > 5σ at 60 μm in the IRAS FSC, (2) are present in the H-ATLAS catalogs (S/N > 4σ at 250 μm), (3) have a spec-z in either RIFSCz or H-ATLAS, and (4) are in the redshift range 0.025 < z < 0.19, taken to represent a snapshot in time of the local universe. See Table 1 for a list of the wavelengths included and selection limits for this sample as well as the other samples described below. Figure 2 illustrates the detection limits for each sample in LIR–λpeak space at their median redshifts with contours showing the distribution of sources. The goal of these selections is to create samples that are minimally biased with respect to SED dust temperature at a given IR luminosity. We will discuss this further in Section 5.
Download figure:
Standard image High-resolution imageFinally, in order to test that the IRAS sample selection and observed filter set is not biased with respect to dust temperature, we generate mock SEDs covering a grid in LIR–λpeak space at peak wavelengths and IR luminosities that are broader than those we find in the sample, namely peak wavelengths between 35 and 200 μm, which corresponds to dust temperatures between 103 and 12.5 K, respectively. We find no offset between the inputs to these noise-perturbed mock sources and the measured values for LIR and λpeak at the signals to noise representative of the IRAS sample with the wavelength sampling of the IRAS sample. In other words, over a range of mock SEDs that is more diverse than the actual observed IRAS sample, we find no bias due to the wavelength coverage or S/N of the IRAS sample. If there were galaxies that populated parameter space in this grid we would be able to accurately measure their luminosities and peak wavelengths. We return to a discussion of this test, with respect to any bias in this combination of filters, in Section 4.2 after introducing the mock galaxy samples in more detail.
2.2. H-ATLAS Sample
Our second sample, which we will refer to as the H-ATLAS sample, expands the redshift horizon of this study beyond z ∼ 0.2, probing redshift evolution out to z ∼ 0.4. The sample is defined by galaxies detected at 250 μm with S/N > 4 over an area of 161.6 deg2 from data release 1 of H-ATLAS (Bourne et al. 2016; Valiante et al. 2016). For this sample, we make exclusive use of H-ATLAS DR1 sources because Bourne et al. (2016) performed a careful cross matching with WISE photometry (Wright et al. 2010), giving us important constraints on the sample's short wavelength dust SED. There are of the order of 70,000 galaxies in H-ATLAS DR1. We further restrict our analysis to galaxies that are detected at 100 μm with S/N > 3, which significantly helps constrain the galaxies' λpeak and reduces the sample to 3407 galaxies (5% of all H-ATLAS DR1 galaxies). The sky coverage with PACS and SPIRE in the H-ATLAS DR1 is roughly the same, but the PACS instrument is much less sensitive than SPIRE. This accounts for the relative dearth of galaxies with S/N > 3 at 100 μm. We will discuss the impact of this 100 μm detection cut further in Section 4.2, but to summarize here, it effectively reduces the dynamic range in IR luminosity to the same range used for the IRAS sample while also dramatically improving the SED fit quality by adding a precise constraint near the peak. In other words, the requirement of a 100 μm detection removes low-LIR sources below the IR luminosity range studied in this manuscript.
Although H-ATLAS DR1 includes WISE observations at 12 and 22 μm, we use only the 22 μm band in our SED fits because PAH emission is expected to contribute to the 12 μm band, depending on the redshift, and our modeling does not account for PAH emission. In the IRAS sample, we included WISE 12 μm, IRAS 12 μm, WISE 22 μm, and IRAS 25 μm because we have as many as five to six constraints at wavelengths shorter than the peak wavelength of emission, so possible PAH contamination was downweighted by the other observations. On the short wavelength end, the H-ATLAS sample only has observations at 12 μm, 22 μm, and 100 μm, so PAH contamination in the 12 μm band is more likely to skew our analysis. While we do not require a detection with WISE, we do limit our sample to galaxies observed with WISE. Excluding those without WISE coverage removes 723 galaxies, reducing our sample to 2679 galaxies (by 22%). This sample covers 0 < z ≲ 0.8, but we limit our analysis to galaxies between 0.05 < z < 0.4, due to a significant drop in H-ATLAS survey sensitivity beyond z ∼ 0.4; i.e., H-ATLAS is only sensitive to ULIRGs (>1012 L⊙) beyond z ∼ 0.4, but their sky density at those redshifts is significantly lower, making average dust SEDs difficult to constrain. As we did for the IRAS sample, we use a lower-redshift bound to avoid including spatially resolved sources. For this sample, we choose a lower limit of z = 0.05 because this is the redshift where a galaxy with radius 10 kpc is unresolved by Herschel at 250 μm (i.e., within one beam). This lower limit reduces the number of galaxies to 2301, and the upper limit reduces the number of galaxies to 2167 (3% of all H-ATLAS DR1 galaxies).
Valiante et al. (2016) estimated the completeness of DR1 to be greater than 95% for galaxies with flux densities above 42.15 mJy at 250 μm. This corresponds to 94% of galaxies in our sample. To address the possible inclusion of false positives, or galaxies whose FIR emission is incorrectly matched to an optical/near-infrared source within the redshift range of interest at z < 0.4, we evaluate the likelihood of random, nonphysical associations in existing optical/near-infrared imaging in the H-ATLAS survey. We find the likelihood of a random match to an OIR source at the depth of SDSS DR10 data (22nd magnitude) at z < 0.4 to be 0.7% within a 3'' matching radius, which would correspond to 15 sources. We also investigate whether it is likely that there are sources present in H-ATLAS whose optical counterparts are not detectable in the available optical imaging from SDSS DR10. 2 The infrared excess, the ratio of the IR to the UV luminosities (LIR/LUV ≡ IRX) of sources at z = 0.4 that have U-band flux densities at the detectability limit of SDSS DR10 (AB magnitude 22) and Herschel 250 μm fluxes at the detectability limit of H-ATLAS, corresponds to IRX ∼ 4 ×105, which is much higher than a more typical value of IRX for DSFGs in the literature ∼103–104 (e.g., Howell et al. 2010; Casey et al. 2014). Adopting an IRX value of 104 implies an AB magnitude limit of 18, implying all OIR counterparts should be detected in SDSS DR10.
The included wavelengths of observation for the H-ATLAS sample are then 22, 100, 160, 250, 350, and 500 μm. See Figure 1 for the redshift distribution versus IR luminosity of the H-ATLAS sample. Figure 2 shows the detection limits at the median redshift of the H-ATLAS sample with contours showing the distribution of sources in LIR–λpeak space. To summarize, our selection function for this sample is galaxies that have (1) S/N > 4 at 250 μm and S/N > 3 at 100 μm in H-ATLAS DR1, (2) are observed with WISE (though no detection is necessary), and (3) lie between 0.05 < z < 0.4.
2.3. COSMOS Sample
Our third sample, which we will refer to as the COSMOS sample, extends out to z = 2. It contains Herschel-selected galaxies from the COSMOS field (Scoville et al. 2007) originally presented in Lee et al. (2013; hereafter L13). L13 measures Spitzer MIPS flux densities from Sanders et al. (2007), Herschel SPIRE and PACS flux densities from Oliver et al. (2012) and Lutz et al. (2011), 1.1 mm AzTEC (Scott et al.2008; Aretxaga et al. 2011), and 1.2 mm MAMBO (Bertoldi et al. 2007) flux densities. They use positional priors from 24 μm (Le Floc'h et al. 2009) and 1.4 GHz (Schinnerer et al. 2007) following the cross-identification (XID) linear inversion technique (Roseboom et al. 2010, 2012). The authors then match the 24 μm positions to Ks -band counterparts from the COSMOS-WIRCam near-infrared imaging survey (McCracken et al. 2010) using a 2'' search radius. Finally, these Ks -band positions are matched with a 1'' search radius to the photometric redshift catalog of Ilbert et al. (2009) to derive photometric redshift constraints. This results in a total of 39,333 sources. Most of these are not significantly detected in the IR/millimeter and so are excluded from the analysis in this paper upon applying further selection cuts.
In the present paper, we update the L13 sample and methodology using more recent COSMOS data releases. We start with the 4220 galaxies in the L13 catalog with S/N > 3 in at least two PACS or SPIRE bands (11% of all L13 galaxies). L13 uses only galaxies with S/N > 5 in at least two PACS or SPIRE bands (1810 galaxies, using confusion limits as a floor on the errors on flux density). We explored both S/N cuts for our final sample and found that pushing down to S/N > 3 added significant dynamical range to the luminosity limit of our analysis and does not detract from the quality of our SED constraints. We then cross match L13 positions for these 4220 galaxies with Atacama Large Millimeter/submillimeter Array (ALMA) sources in the A3COSMOS catalog (Liu et al. 2019) given the significant uncertainty in the initial 24 μm/1.4 GHz association with Herschel flux densities. Magdis et al. (2011) showed that 98% of Herschel sources at z < 2 will be detected in deep Spitzer 24 μm imaging. Similarly, ALMA detections centered on Herschel-selected sources are physically associated due to the rarity of both populations on the sky. Therefore the likelihood that an ALMA source located within the MIPS 24 μm PSF radius is not associated with the galaxy responsible for 24 μm emission at z < 2 is low (<1%). A3COSMOS contains all available ALMA archival continuum pointings in the COSMOS field with detections of 756 unique galaxies across ∼280 arcmin2. We use a search radius of 3'', the beam radius of the Spitzer 24 μm band, to match 24 μm positions to ALMA-detected counterparts. ALMA detections were found for 147 sources in L13 (4% of the original 4220 sources). Of these, we find that 20 sources (14% of the 147 matches) were misidentified in the L13 sample based on the adopted positional prior. In other words, the original OIR counterpart identified within the 24 μm beam by L13 was found to be spatially offset from an ALMA source, also within the 24 μm beam, by larger than 20 kpc for 14% of the subsample with ALMA measurements available. These mismatches did not affect the photometry of these sources, as the XID linear inversion technique simply measures the fluxes in all bands at the positions of 24 μm Spitzer sources. A new matched position within the beam of 24 μm emission only results in a new matched OIR counterpart and hence a new redshift.
After adopting the new positions from the A3COSMOS subset, we cross match all source positions (including those without ALMA data) with the COSMOS2020 photometric redshift catalogs (Weaver et al. 2022). Their photometric redshifts are updated to make use of the near-IR and optical data from the UltraVISTA and Subaru HSC surveys' deeper imaging in the field. The COSMOS2020 photometric redshift (photo-z) catalog consists of two separate photometric catalogs generated using different source extraction techniques as well as two photo-z fitting techniques applied to those two catalogs, resulting in four unique photo-z catalogs. See Weaver et al. (2022) for more details. Following the procedure adopted by L13 and Le Floc'h et al. (2009), we perform our spatial cross match from long-wavelength astrometry (ALMA if available, otherwise Spitzer 24 μm) separately on all COSMOS2020 catalogs using a 2'' search radius, and adopt the nearest neighbor match. These works found that by adopting a 2'' search radius, they were able to reduce the fraction of multiple matches while still taking into account most of the positional uncertainty associated with the 3'' width of the MIPS 24 μm PSF. For sources with ALMA positions, we adopt a search radius of 1'', as this is the size of the average ALMA beam in A3COSMOS. Thus, from COSMOS2020, we have between one and four photometric redshift estimates for each galaxy. There are 4011 galaxies matched out of the original 4220 sources with S/N > 3 in at least two Herschel bands (95%). Next we remove galaxies that are marked as masked in both source catalogs (for proximity to a bright star that obfuscates the OIR source photometry), leaving only 2295 galaxies of the original 4011 with matches (57%). Rather than fitting these masked galaxies with the original L13 photo-zs, we choose to only retain the subset of sources matched to COSMOS2020 to retain consistency in how redshifts are measured within the sample. We do not find that this cut hinders our final analysis in any way, as the remaining sample is still sufficiently large.
Where spectroscopic redshifts are available in the COSMOS2020 catalog, we use them instead of photometric redshifts. Of the 2295 sources, 285 have spec-zs (12%). For the remaining galaxies, we adopt a single photo-z. For those with two photo-z estimates from the COSMOS2020 catalogs, we use the lower z because galaxies in this sample are statistically more likely to be at lower z. For galaxies with three photo-z estimates, we adopt the median redshift. And finally, for galaxies with four photo-z estimates, we adopt the redshift that is the closest to the median. We do this rather than adopting the true median so that we may retain the redshift uncertainties included with the COSMOS2020 catalogs. There are 2005 sources with four photo-z estimates, 83 with three photo-z estimates, 206 with two photo-z estimates, and one source with one photo-z estimate. The median standard deviation of photo-z estimates is 0.02 for sources with four estimates. Note that only a minority of sources (423 of 2295, or 18%) have standard deviations in their two to four photometric redshift estimates greater than 0.1. Additionally, the majority of our galaxies have very similar photometric redshift estimates to those from L13, with σΔz/(1+z) = 0.009. See Figure 1 for the distribution of COSMOS sample redshifts and IR luminosities relative to the other samples.
Galaxies in the COSMOS sample span 0.02 < z < 4.8. We limit our analysis to 0.15 < z < 2 due to the poor sensitivity of Herschel for IR-luminous sources beyond z ∼ 2 and potential bias in the dust temperature measured from those SEDs beyond that epoch. Unlike the H-ATLAS sample where a lower-redshift bound of 0.05 is chosen to exclude galaxies resolved with Herschel, for the COSMOS sample, we increase our lower-redshift bound to 0.15 because the COSMOS field is much smaller than that of H-ATLAS, and a small area provides a low number of sources per given bin of Δz below z < 0.15. Whereas there are 72 sources between 0.05 < z < 0.15 in the COSMOS sample, there are 1314 sources in the H-ATLAS sample over this same range. Low number statistics between 0.05 < z < 0.15 are likely to result in a bias to our analysis, particularly given the lower IR luminosities of that subsample, so we require >100 galaxies per Δz = 0.1 bin on the low-redshift end of the COSMOS sample. We achieve this by adopting a lower-redshift bound of 0.15. This reduction takes the COSMOS sample to 1990 sources.
The last step in our sample selection procedure is to cross match the COSMOS sample with sources from the S2COSMOS catalog (Simpson et al. 2019) using a search radius of 65, the beam radius of SCUBA-2 at 850 μm, in order to fold in their 850 μm data to our SED analysis. This results in 124 detections with S/N > 3, and nine with S/N > 5. For galaxies in the COSMOS sample that do not have a matched S2COSMOS source, we adopt the flux in the S2COSMOS 850 μm map at the location of our source. For the uncertainty on this flux density, we adopt the rms from the S2COSMOS 850 μm rms map. The wavelengths of observation for our COSMOS sample are then 24, 70, 100, 160, 250, 350, 500, 850, 1100, and 1200 μm. See Figure 1 for the redshift distribution of sources in the COSMOS sample versus IR luminosity, and see Figure 2 for the detection limits at the median redshift of the COSMOS sample with contours showing the distribution of sources in lirlp space. We find that this sample is >90% complete at flux densities >10 mJy at 250 μm. To summarize, our selection function for this sample is galaxies from L13 that have (1) S/N > 3 in at least two of five Herschel bands (100, 160, 250, 350, and 500 μm), and (2) have redshifts between 0.15 < z < 2.0.
3. Modeling Galaxies' IR SEDs
To model the SEDs of galaxies between 8 and 1000 μm, we use a piecewise function consisting of a mid-infrared (MIR) power law and an FIR modified blackbody. The model takes the form:
where λ is the rest-frame wavelength, α is the MIR power-law slope, Npl and Nbb are normalization constants, λ0 is the wavelength where the dust opacity equals unity, β is the dust emissivity index, T is the luminosity-weighted characteristic dust temperature, h is the Planck constant, k is the Boltzmann constant, and c is the speed of light. The two piecewise functions connect seamlessly at the wavelength where the slope of the modified blackbody is equal to the slope of the power-law function.
To fit this function to the data presented herein, we built a package called Monte Carlo InfraRed SED or MCIRSED. It is described in further detail in the Appendix. To summarize, we use Markov Chain Monte Carlo (MCMC) to sample the posteriors of our fit parameters and provide confidence intervals. The fit parameters that may be fixed or free are the power-law slope, α, the dust emissivity index, β, and the wavelength where the opacity equals unity, λ0. The peak wavelength and IR luminosity are always free parameters. Figure 3 shows example fits to a galaxy from each of our samples with their associated two-dimensional joint probability distributions. The only parameter that is fixed for all galaxies in our analysis is λ0 = 200 μm, consistent with measurements or assumptions made for similarly IR-luminous galaxies in the literature (e.g., Blain et al. 2003; Draine 2006; Conley et al. 2011; Rangwala et al. 2011; Greve et al. 2012; Conroy 2013; Spilker et al. 2016; Simpson et al. 2017; Zavala et al. 2018; Casey et al. 2019).
Download figure:
Standard image High-resolution imageFor our fiducial IRAS sample, we use flat bounded priors on α, β, and Td. The bounds are chosen to restrict sampling to parameter space that corresponds to conditions that are physical though not restrictive. For the H-ATLAS and COSMOS samples, we use normally distributed priors based on the best fits to the IRAS sample. Details on the choice of parameter priors are given in Section Appendix. When fitting galaxies with upper limits in any of the wavelength bands with unspecified measurement uncertainty, we adopt a flux density of 0.0 with an uncertainty equal to the implied 1σ upper limit of the given data set. However, if a band has a flux density and uncertainty reported in the parent catalog, we fit with those reported values, even if the S/N of the observation is low or negative. In practice, measurements with negative S/N (originating from negative flux density and positive uncertainty) are limited to S/Ns close to zero, allowing fits to remain exclusively at positive flux densities and still within the typical uncertainty on the measurement. The best-fit parameters reported throughout this work are the median values of the posterior distributions, and the uncertainties are the 16th and 84th percentiles (±1σ for Gaussian distributions).
4. Measuring the LIR–λpeak Correlation
Our investigation into whether there is redshift evolution in the bulk SED characteristics of dust in galaxies between 0 < z < 2 is anchored to our analysis of the empirically established correlation between LIR and λpeak. We choose LIR as the fundamental quantity to which we compare dust temperatures, via λpeak, for two reasons: (1) IR luminosity is the most fundamental quantity we can derive directly from a galaxy's dust SED other than λpeak, making it directly measurable for all galaxies in our samples, (2) it has been shown to be more closely correlated with galaxy-integrated dust temperature (Burnham et al. 2021) than several alternate quantities like stellar mass, sSFR, or distance from the "main sequence" (see Chapman et al. 2003; Chanial et al. 2007; Symeonidis et al. 2013; Magnelli et al. 2014; Lutz et al. 2016). Though the star formation surface density, ΣIR, seems to be more fundamentally correlated with dust temperature than LIR (as one might suspect given its dependence on dust geometry; e.g., Lutz et al. 2016; Burnham et al. 2021), the lack of high-resolution infrared imaging, and thus measurement of dust sizes, for the vast majority of our galaxies prevents direct analysis of possible evolution in ΣIR versus λpeak. Thus, we anchor our work to the LIR versus λpeak plane, as the next best alternate for unresolved sources, for the rest of this paper.
The correlation between IR luminosity and dust temperature (where Td ∝1/λpeak) has been well documented in the literature. Early studies of IRAS galaxies in the local universe demonstrate that IR color, a parameterization of dust temperature, correlates with IR luminosity (e.g., Andreani & Franceschini 1996; Dunne et al. 2000; Dale et al. 2001; Chapman et al. 2003). The launch of Herschel revealed that galaxies at higher redshifts also obeyed a correlation between dust temperature and luminosity (e.g., Hwang et al. 2010; Casey et al. 2012a, 2012b; Magnelli et al. 2012; Symeonidis et al. 2013; Casey et al. 2018b).
The specific hypothesis we are testing is that the correlation between IR luminosity and peak wavelength does not evolve with redshift. This is suggested to be the case in some recent papers (Casey et al. 2018b; Reuter et al. 2020), though Casey et al. (2018b) pointed out that their analysis has limited power to constrain the relation at z > 2 because of how few galaxies are in their samples at these redshifts. Similarly, the sample presented by Reuter et al. (2020) is limited to ∼100 sources over a range of redshifts between 1.9 < z < 6.9. Establishing whether there is redshift evolution in the LIR–λpeak correlation, especially using a uniform approach to SED fitting is important because it might imply a corresponding evolution in the average properties of dust in the ISM of galaxies across these redshifts.
To model the LIR–λpeak correlation, we begin with the equation from Casey et al. (2018b) that relates the peak wavelength to the LIR of galaxies:
Here Lt is fixed to 1012 L⊙, λt is the peak wavelength of galaxies at Lt, and η is the power-law index. Note that Lt is simply a normalization constant and is fixed to 1012 L⊙ as is done in Casey et al. (2018b); its adopted value does not impact the measured results. Because we find the data in each of our samples is normally distributed in log(LIR)–log(λpeak) space, we also model the distribution of galaxies about this best-fit line as a Gaussian using the general prescription of Kelly (2007). Our model then becomes:
where is a Gaussian with mean μ ≡ 0, and standard deviation , which we will refer to as σλ for convenience. Also for simplicity, we will hereafter refer to the set of all fit parameters as Θ = (λt , η, σλ ).
Each sample is fit using MCMC with noninformative priors on all parameters to derive independent measurements of Θ. The rest of this section presents measurements for several sets of galaxies, starting with the IRAS sample, which we use as a basis, indicative of the relationship in the local universe at z < 0.2. We then proceed to simulate mock galaxies, adopting the derived relation from the IRAS sample. The mock galaxies are used to infer the influence of telescope selection biases on IR-luminous galaxy samples, which become more prominent at higher redshifts. Lastly, we present fits to both the H-ATLAS and COSMOS samples and compare those fits to the expectations from the mock galaxy samples passed through the same observational selection filters.
4.1. Fitting the z < 0.2 IRAS Sample as an Anchor
The IRAS sample serves as our low-redshift anchor to which we compare the H-ATLAS and COSMOS samples that extend farther out to high redshift. The left panel of Figure 4 shows the LIR–λpeak correlation fit for the IRAS sample. The right panel of Figure 4 shows a histogram of the log of the peak wavelength measurements minus the log of the best-fit model peak wavelengths, illustrating the scatter about the LIR–λpeak relation. The distribution is well fit by a Gaussian with a tail of galaxies at shorter peak wavelengths, or hotter dust temperatures. AGNs are a possible extra source of heating for dust in galaxies. They are capable of heating torus dust to temperatures from 50 to 100 K to as high as the dust sublimation temperature, ∼1000–1500 K (e.g., Donley et al. 2012). The galaxy-integrated dust temperature may or may not be shifted to warmer temperatures because of AGNs, depending on the emergent AGN luminosity. To remove bias in our fits to the LIR–λpeak correlation due to the presence of the hotter tail of sources in the IRAS sample, we iteratively sigma-clip sources at >3σ, consisting of 27 out of 511 sources (5%). We find that approximately 26% of these outliers (seven out of 27) contain known AGNs by cross referencing with the first bright quasi-stellar object survey (Gregg et al. 1996), the 13th edition of the catalog of quasars and active nuclei (Véron-Cetty & Véron 2010), and the all sky catalog of WISE-selected AGNs from Secrest et al. (2015). Note that the fraction of cold ("inliers") similarly identified as AGNs in the same catalogs is significantly lower (7%, or 35 of 484) than 26%. For the other samples, we do not sigma-clip, as we do not find a substantial population of outliers like those in the IRAS sample. The reason they are present in the IRAS sample is likely due to the 60 μm selection. Figure 2 shows how a 60 μm selection is more sensitive to low-luminosity warm outliers compared to the selection functions that drive the H-ATLAS and COSMOS samples. Though AGNs are present in our samples and have been shown to account for about 10%, on average, of the total IR luminosities of their host galaxies (e.g., Iwasawa et al. 2011), they do not seem to affect the global relationship measured here. Note that if we were not to clip these outliers from our sample, the derivation of the ΘIRAS parameter set would still be consistent with our findings within measurement uncertainty; however, we find that this clipping provides the most appropriate fit by excluding sources known not to follow the bulk relationship drawn from a normal distribution in about a given IR luminosity.
Download figure:
Standard image High-resolution imageThe best-fit parameters for the IRAS sample, ΘIRAS, are given in Table 2. In the following subsections, we will use these values as our basis for comparison with our other samples to investigate redshift evolution.
Table 2. Best-fit Parameters of the LIR–λpeak Correlation for Each Redshift Bin across All Samples
Sample | Redshift Range | λt | η | σλ | Number of Galaxies |
---|---|---|---|---|---|
IRAS | 0.025 < z < 0.19 | 92 ± 2 | −0.09 ± 0.01 | 0.046 ± 0.003 | 511 |
H-ATLAS | 0.05 < z < 0.1 | 96 ± 3 | −0.08 ± 0.07 | 0.031 ± 0.003 | 729 |
0.1 < z < 0.2 | 95 ± 2 | −0.079 ± 0.010 | 0.030 ± 0.002 | 926 | |
0.2 < z < 0.3 | 92 ± 4 | −0.11 ± 0.04 | 0.032 ± 0.007 | 328 | |
0.3 < z < 0.4 | 96 ± 2 | −0.05 ± 0.03 | 0.029 ± 0.007 | 184 | |
COSMOS | 0.15 < z < 0.5 | 97±3 | −0.074 ± 0.009 | 0.052 ± 0.006 | 650 |
0.5 < z < 1.0 | 96 ± 1 | −0.09 ± 0.01 | 0.059 ± 0.004 | 743 | |
1.0 < z < 1.5 | 101 ± 2 | −0.15 ± 0.03 | 0.056 ± 0.005 | 359 | |
1.5 < z < 2.0 | 96 ± 5 | −0.11 ± 0.04 | 0.067 ± 0.006 | 238 |
Note. Each bin is consistent with the case of no redshift evolution since the redshift range of the IRAS sample, even if the best-fit parameters do not agree within errors with the fit to the IRAS sample. This is because we find overlap between mock, observable galaxies representing each of these redshift bins. See the posterior distributions of observed and mock samples in Figure 6. The detection limits of each sample are responsible for any apparent evolution in the correlation between IR luminosity and peak wavelength.
Download table as: ASCIITypeset image
4.2. Generating Mock Galaxies
Beyond the nearby universe, the detection limits of FIR/submillimeter facilities can, depending on the wavelength of observation, be biased with respect to the galaxies' dust temperatures, rendering either very cold or very warm SEDs undetectable. This is already demonstrated in our analysis by the presence of likely AGN-heated hot sources in the IRAS sample (driven by the 60 μm selection) not present in the H-ATLAS and COSMOS samples. If one tries to fit the LIR–λpeak correlation for subsamples of galaxies selected at different wavelengths without first accounting for these relative selection biases, the measured fit parameters Θ may themselves be biased. Such a bias is a function of sample selection wavelength, detection limit, and redshift. Figure 2 shows the detection limits at the median redshift of each sample plotted with contours of detected galaxies across all redshifts in each sample. For galaxies to be detected in a given sample, they need to have higher IR luminosities than indicated by the thick colored lines, which denote the flux density selection process outlined in Table 1. For example, to be detected in the COSMOS sample, galaxies must lie to the right of at least two curves in the rightmost panel of Figure 2: each curve represents the detection limit in each of the five bands used to define the catalog. For the H-ATLAS sample, the inclusion of the 100 μm detection requirement acts like a cut in IR luminosity; it is insensitive to dust temperature. We find this additional selection greatly improves our constraints on the measurement of peak wavelengths of galaxies in this sample.
To determine the degree to which measuring Θ at any given redshift is biased by selection functions, we generate mock galaxies to represent each data set in this paper. These mock galaxies are modeled such that they follow the local IRAS sample derived LIR–λpeak correlation. In other words, we assert in the mock samples that the LIR–λpeak correlation does not evolve with redshift. These mock galaxies are generated using the Θ parameter set fit to the IRAS sample, and are therefore used to test the null hypothesis, i.e., that there is no intrinsic redshift evolution in Θ parameters.
In order to generate representative galaxies for each sample, we use the infrared luminosity function from Zavala et al. (2021) based on the Casey et al. (2018b) model to calculate the expected number of galaxies of each IR luminosity that exist in the given survey solid angle. Note that these IR luminosity functions are consistent with many other IR luminosity functions in the literature at z ≲ 2 (e.g., Magnelli et al. 2009; Casey et al. 2012a, 2012b; Gruppioni et al. 2013; Magnelli et al. 2013). We randomly assign IR SEDs to the mock galaxies by sampling a normal distribution in log(λpeak) with width σλ around the best-fit LIR–λpeak correlation and with α and β values drawn from the posterior distributions of the IRAS sample. Photometry is then generated for the mock galaxies at the wavelengths where observations exist for each sample with appropriate instrument and confusion noise added. We then remove mock galaxies that fall below the detection limits of the survey as defined by the thresholds in Table 1 in each redshift bin. Thus, we generate mock versions of each sample in both survey volume and selection biases as they would exist if the null hypothesis were true. Then, if the measured Θ from the mock samples are statistically consistent with the Θ measured from the observed samples in a given redshift bin, then that redshift bin is consistent with the null hypothesis, i.e., that there is no redshift evolution in the LIR–λpeak relationship since z ∼ 0.
We test for selection bias in the IRAS sample in this same manner, though by construction we might expect agreement, given that mock galaxies are drawn from the derived parameter set Θ fit to the IRAS sample. This would not be the case if there was a strong selection bias, however. We find that the fits to the mock IRAS sample are consistent within errors with the measurements from the observed sample. This suggests that the detection limits for the IRAS sample are not presenting a strong bias for our measurement of Θ in the nearby universe. Additionally, as is detailed in Section 2.1, we find no offset between the inputs and best-fit values of mock galaxies that we generate in a grid covering a larger range of LIR–λpeak space than exists in the IRAS data. Together, these two tests suggest that there is not a strong bias for our measurement of Θ in the nearby universe.
4.3. Is There Evidence for Redshift Evolution in LIR–λpeak?
With mock samples that match our observed H-ATLAS and COSMOS samples in hand, we investigate whether or not there is evidence of redshift evolution in the LIR–λpeak relation. To do this, we divide both mock and observed samples into discrete redshift bins for which we measure Θ. We choose bin widths that provide an adequately large dynamic range in the subsamples' LIR (≳1 dex) and at least a few hundred galaxies per bin, both of which are needed to accurately constrain the parameters of Θ after accounting for the added uncertainty due to selection biases. For the H-ATLAS sample, we use redshift bins of width Δz = 0.1 between 0.05 < z < 0.4 with the lowest redshift bin truncated to 0.05 < z < 0.1. For the COSMOS sample, we use bins with width Δz = 0.5 ranging between 0.15 < z < 2, again with a truncated lowest redshift bin with Δz = 0.35.
Figure 5 shows the H-ATLAS and COSMOS samples in the LIR–λpeak plane in purple alongside mock galaxies made to mimic them in orange. We find that for the most part, the mock galaxies qualitatively resemble the distribution of real galaxies in both the H-ATLAS and COSMOS samples. The black dashed line shows the best fit to the IRAS sample, and the orange and purple lines show 100 random samples from the MCMC fits to real and mock galaxies, respectively. Because the mock galaxies are drawn from a distribution mirroring the nearby IRAS sample, we test for redshift evolution by comparing the best-fit parameter sets of the real samples to the mock galaxy samples. If the two fits are statistically consistent within a 95% confidence interval (i.e., 2σ), that would constitute direct observational evidence that the LIR–λpeak relation does not evolve.
Download figure:
Standard image High-resolution imageFigure 6 compares the best-fit parameter set Θ = (λt, η, σλ ) from Equation (3) between real and mock galaxies in the H-ATLAS and COSMOS samples. Because we find that variations in Θ associated with generating a volume limited set of mock galaxies are larger than the uncertainties derived when fitting any one set of mock galaxies (∼2× higher than uncertainties), we simulate each redshift subsample for H-ATLAS and COSMOS 100 times using input Θ that are drawn from the posterior distribution of the IRAS sample, ΘIRAS. We then measure Θ in each trial to generate a distribution of measured Θ for each subsample. Similarly, we also fit 100 bootstrap iterations of the real data for each subsample to generate realistic joint probability distributions of Θ.
Download figure:
Standard image High-resolution imageTo statistically assess the agreement or disagreement of our measured parameter set Θ for a given sample and redshift bin, we compute the ratio of slopes and intercepts between mock and real samples, ηmock/ηreal and λt,mock/λt,real. If mock and real galaxies are statistically consistent, these ratios will be approximately equal to one. We find that indeed, ηmock/ηreal and λt,mock/λt,real are consistent with 1 within the 95% confidence interval (2σ) for all subsamples and redshifts. In other words, we rule out the possibility of an evolving LIR–λpeak relationship at a confidence of >99.9997% (or 4.6σ significance), which we calculate by integrating the two-dimensional probability density distribution functions in derived parameters η and λt at values that would be >3σ offset from those measured in ΘIRAS. Thus we conclude for all subsamples that the data are consistent with no redshift evolution in the correlation between the IR luminosities of galaxies and the peak wavelength of their dust emission between 0 < z < 2.
The rightmost panels of Figure 6 show the width of the LIR–λpeak distribution in each redshift bin for all samples. The best fit to the IRAS is shown as a green dashed line, the scatter of observed galaxies about the LIR–λpeak correlation, σλ , is shown in purple, and the scatter of the distributions of mock galaxies is shown in orange. The measured σλ of the real galaxies are consistent with the measurements both from the IRAS sample and the mock samples for each redshift bin.
5. Discussion
We have shown that the correlation between the peak wavelengths of dust emission and the IR luminosities of galaxies, i.e., the LIR–λpeak relation, does not evolve between 0 < z < 2. We have done so by demonstrating that galaxy samples out to z ∼ 2 are fully consistent with the local relation when their selection wavelengths and depths are properly accounted for, in both sample selection and SED fitting. Similarly, we find that the scatter about the LIR–λpeak relation, σλ , does not evolve out to z ∼ 2 within measurement uncertainty.
5.1. Physical Interpretation of the LIR–λpeak Relationship
We interpret our results to mean that IR-luminous galaxies at high-redshifts are not dramatically different in size or geometry than those in the local universe, assuming the dust opacity does not significantly vary between sources or does not evolve with redshift. The z ∼ 0 relation we measure in LIR–λpeak from the IRAS sample exhibits only a subtle IR–luminosity dependency on λpeak as evidenced by measured slope of η = − 0.09 ± 0.01. For context, over four decades of luminosity, from 109 L⊙ to 1013 L⊙, the mean λpeak measured is only expected to change by a factor of 2.3 ± 0.3. The intrinsic scatter about the relation, measured via the parameter σλ , is of the order of the same factor (×1.1). The subtlety of the relationship and large intrinsic scatter can be interpreted with some understanding of galaxies' dust-emitting sizes and source geometries.
The size and geometry of a dust-emitting region relate directly to the emergent IR luminosity and peak wavelength as would be expected from the Stefan-Boltzmann Law (e.g., Hodge et al. 2016; Simpson et al. 2017), where luminosity is proportional to dust temperature to the fourth power and to surface area (for an optically thick blackbody). A nice discussion of the applicability of the Stefan-Boltzmann Law to galaxies' dust emission is offered in Burnham et al. (2021), who show that IR luminosity surface density is the most fundamental tracer of a galaxy's integrated dust temperature, or λpeak (see also Chanial et al. 2007; Lutz et al. 2016). Using the equation relating the best-fit surface density of IR luminosity to the peak wavelength of the dust SED from Burnham et al. (2021), , we find that for a galaxy with LIR =1012 L⊙, the difference between galaxies at peak wavelengths ±1σt corresponds to a difference of only 0.2 dex (<2×) in the effective radius of the emitting dust. A wide array of dust geometries in individual galaxies, from compact and clumpy to extended and diffuse, then accounts for the intrinsic scatter in the LIR–λpeak relation at fixed LIR. This scatter translates to ±5 K temperature differences at λpeak = 100 μm, and an effective half-light radius difference of 0.4 kpc for galaxies of typical radii of 0.8 kpc, assuming the case of an optically thick blackbody. Because galaxy sizes (as measured by dust continuum; e.g., Hodge et al.2016; Simpson et al. 2017) only vary by a factor of a few and not orders of magnitude, the LIR–λpeak relation itself has only shallow dependence on LIR. In other words, in our best-fit relation for the IRAS sample, η = − 0.09 ± 0.01 implies a size–luminosity relationship such that (consistent with , measured by Fujimoto et al. 2017).
Though this work does not directly measure the size of the dust-emitting region in our samples, nor does it quantify the dust geometry through morphological analysis, our finding of consistent nonevolving SEDs in the LIR–λpeak plane argues for unchanging dust sizes from z ∼ 2 to z ∼ 0. Similarly, it argues that the breadth of sizes in those galaxies also does not evolve. This naturally follows when considering the IR-luminous population (>1011 L⊙) is, on a whole, representative of fairly massive galaxies that have likely established most of their mass reservoirs—whether in stars or gas supply that will be converted to stars—and will not continue to grow substantially in size, whether they are at z ∼ 2 or z ∼ 0.
5.2. Comparison with Literature Dust Temperatures: Evolution or No Evolution?
Our finding that the LIR–λpeak relationship does not evolve from 0 < z < 2 is perhaps seen to be in conflict with other works in the literature that claim galaxies' temperatures evolve with redshift. One set of claims argues that higher-redshift galaxies have hotter dust temperatures compared with galaxies at z ∼ 0. This is based on observations of galaxies between 1 ≲ z ≲ 5 (e.g., Magdis et al. 2012; Magnelli et al. 2014; Béthermin et al. 2015; Faisst et al. 2017; Schreiber et al. 2018; Faisst et al. 2020), and theoretical modeling, primarily at z ≳ 5 (e.g., De Rossi et al. 2018; Arata et al. 2019; Liang et al. 2019; Ma et al. 2019; Sommovigo et al. 2020). These higher temperatures are attributed to higher sSFRs, higher star formation surface densities, or lower dust abundances relative to z ∼ 0 samples.
Another set of claims argues that dust temperatures are colder at high redshift than at z ∼ 0 (e.g., Chapman et al. 2002; Pope et al. 2006; Symeonidis et al. 2009; Hwang et al. 2010; Kirkpatrick et al.2012; Symeonidis et al. 2013; Magnelli et al. 2014;Kirkpatrick et al. 2017). This is attributed to galaxies having higher dust masses (e.g., Kirkpatrick et al. 2017), higher dust opacities or emissivities, or more extended spatial distributions of dust at higher redshift (e.g., Symeonidis et al. 2009; Elbaz et al. 2011; Rujopakarn et al. 2013) as compared with z ∼ 0.
A third set of claims argues that the dust temperatures of galaxies do not evolve with redshift. These works find no evidence of redshift evolution in LIR–λpeak space for their samples (e.g., Casey et al. 2018b; Dudzevičiūtė et al. 2020; Reuter et al. 2020), which we also find. How is it possible that there are such seemingly contradictory conclusions in the literature?
Our results are, indeed, consistent with more than just the third set of these claims. Observational works such as those by Magnelli et al. (2014), Béthermin et al. (2015), and Schreiber et al. (2018) found that dust temperatures increase between low and high redshift by studying samples of galaxies at fixed stellar mass on the main sequence. The star formation rates of galaxies on the main sequence increase as redshift increases (e.g., Speagle et al. 2014), which naturally results in an increase in IR luminosity. Figure 7 highlights how a nonevolving LIR–λpeak relation then translates to a perceived evolution in dust temperatures when drawing from main-sequence galaxies at fixed stellar mass. At high redshift, the SFR (thus LIR) at fixed stellar mass is higher (e.g., Speagle et al. 2014), and galaxies with higher IR luminosities have hotter SEDs (e.g., Casey et al. 2018b; Burnham et al. 2021). The tracks of fixed stellar mass in this figure are derived from our LIR–λpeak relationship, where LIR is converted to a star formation rate (Kennicutt & Evans 2012) then to stellar mass (using the Speagle et al. 2014 relation). We then convert λpeak to dust temperature using a simple optically thin modified blackbody, as in Bouwens et al. (2020). We see remarkable agreement between our projected LIR–λpeak curves at fixed stellar masses and measurements from literature works claiming measurement of hotter SEDs at higher redshifts (e.g., Magnelli et al. 2014; Béthermin et al. 2015; Schreiber et al. 2018). While the conversion between λpeak and dust temperature is reliant on an understanding of the underlying opacity of dust, there is roughly a linear scaling between the dust temperatures estimated using the optically thin assumption and other opacity models. So while the absolute dust temperatures may not be known, the trend toward "hotter" SEDs as perceived in Figure 7 should be relatively robust. An illustration of this can be seen with the Béthermin et al. (2015) sample. Their calculated dust temperatures (black circles) do not agree with those that we calculate for their sample (having refit them using MCIRSED; open circles), though both show monotonic increase in temperature with redshift. This offset is due to differences in the model assumed to translate λpeak to Td between our two works. We attribute the increase in temperature with redshift solely to the increasing SFR of galaxies on the main sequence, in line with expectations, given a nonevolving LIR–λpeak relation.
Download figure:
Standard image High-resolution imageIn contrast to works claiming that higher-redshift galaxies have hotter temperatures, there are also a number of literature works that show higher-redshift galaxies may have cooler SEDs at fixed LIR (e.g., Chapman et al. 2002; Pope et al. 2006; Symeonidis et al. 2009; Hwang et al. 2010; Kirkpatrick et al. 2012; Symeonidis et al. 2013; Magnelli et al. 2014; Kirkpatrick et al. 2017). Figure 8 shows the LIR–λpeak correlation as reported from a handful of different works in the literature, color coded roughly by redshift. Blue points represent samples at low redshift, red points represent samples of high-redshift galaxies, and cream points are representative of samples at intermediate redshifts. We have converted dust temperatures to λpeak using the fixed values of β and λ0 from each sample, where provided by the authors; otherwise, we use Wien's Law to convert, as was done in the case of Chapman et al. (2005), Magnelli et al. (2012), and Hwang et al. (2010). We plot these works because we are interested in the general trend with redshift, not the absolute scaling of dust temperature. From what is presented here from the literature, there is a clear offset between the lowest-redshift samples and the high-redshift samples, with some of the intermediate-redshift samples aligning with the low-redshift relation and some with the high-redshift relation.
Download figure:
Standard image High-resolution imageIn this paper we find no such evolution. We attribute the difference in conclusions to the manner of constraining these galaxies' SEDs to measure λpeak. Some of these works, in an effort to compare to the most complete local samples of galaxies, used the IRAS RBGS without requiring submillimeter photometry just redward of the peak (e.g., Casey et al. 2014). Indeed, such observations did not exist for large numbers of galaxies until the launch of Herschel. Figure 9 shows that when fitting SEDs to IRAS data alone from local samples, the measurement of λpeak is biased toward warmer dust temperatures than when fitting to data covering the full range of the dust SED. Without data in the 160–500 μm range, intrinsically colder SEDs (with λpeak ≳ 100 μm) become untethered, with very poor λpeak constraints. On average, we find that IRAS photometry alone will systematically underestimate λpeak for such systems, i.e., skewing fits toward hotter dust SEDs than they may intrinsically have. Furthermore, we find a systematic offset between IRAS 100 μm measurements and Herschel PACS observations at 100 μm. The median ratio of flux at 100 μm from PACS to that from IRAS for our IRAS sample is 1.13 ± 0.01), shown in Figure 9. This systematic bias for the sample's 100 μm flux densities results in warmer fitted dust temperatures when SEDs are fit to IRAS data exclusively rather than the full suite of IRAS, WISE, and Herschel described in Section 2.1. With the introduction of Herschel's photometric constraints on the Rayleigh–Jeans tail, such biases are eliminated. The stringent requirement we place on our local universe IRAS sample (to have Herschel-SPIRE constraints from H-ATLAS) translates to colder net SEDs in the LIR–λpeak relation than had been calculated previously. We have effectively removed the bias whereby colder systems had been fit to SEDs with warmer temperatures for lack of long-wavelength data.
Download figure:
Standard image High-resolution imageOne last comparison we offer is a comparable fit to the Arp 220 system, one of the best-known ULIRGs in the nearby universe. Literature works often describe Arp 220 as potentially having a hotter dust SED than the average ULIRG at high redshift (λpeak,Arp220 ∼ 60 μm versus λpeak,SMG ∼ 70–80 μm; e.g., Dudzevičiūtė et al. 2020). However, when we refit Arp 220's well-sampled IR/millimeter SED (e.g., Eales et al. 1989; Rigopoulou et al. 1996; Sargsyan et al. 2011; Brown et al. 2014) using MCIRSED, we find that, in fact, it is well within the expected scatter for galaxies of its luminosity at any redshift 0 < z < 2 (see Figure 8).
5.3. Evolution of Temperatures at z > 2?
We do not analyze galaxies beyond z ∼ 2 because Herschel-SPIRE observations beyond that redshift are limited to very bright, rare sources (LIR ≳ 1013 L⊙). Extending the analysis to higher redshift would require sufficient dynamic range in LIR with full SED sampling. Sources in our samples at z > 2 do not provide enough dynamic range per redshift bin to perform this analysis, and other literature samples either lack statistics or an unbiased sampling of SEDs. We have demonstrated the importance of using observations that bracket the peak wavelength of the dust SED when trying to accurately measure λpeak. Unfortunately, other samples do not bracket the peak wavelength of dust emission until z ∼ 7, when the observed-frame peak reaches ∼850 μm, which is measurable from the ground to great depth thanks to ALMA. Though there exist a number of works that fit dust temperatures to galaxies beyond z > 2, including SCUBA-2 and South Pole Telescope samples (e.g., Dudzevičiūtė et al. 2020; Reuter et al. 2020), these samples do not have enough galaxies per redshift bin to allow for the analysis presented in the present work to be performed. Investigating whether there is evolution at z > 2 will require millijansky sensitivity in the 250–500 μm wavelength range, which does not currently exist (1 mJy at, e.g., 500 μm corresponds to a galaxy of LIR = 8.4 × 1010 L⊙ at z = 4). Earth's atmosphere is opaque at those wavelengths whenever the atmosphere is not extremely dry. This constraint limits surveys to small areas and, hence, small samples of galaxies. One such survey, reaching a ∼1 mJy depth at 450 μm, is STUDIES (Wang et al. 2017), which covers a 151' region in the COSMOS field and detects too few sources to perform an analysis such as the one presented in this paper. Future space-based FIR missions like the proposed Origins Space Telescope or the 2020 decadal recommendation of an FIR probe could provide these constraints for the z > 2 universe. In the absence of direct constraints on galaxy SEDs down to ∼1 mJy, stacking analyses can provide helpful insight, but the different stacking techniques used in the literature in highly confused FIR maps can result in a broad range of predictions that are not especially constraining (e.g., Viero et al. 2013; Béthermin et al. 2015).
5.4. Recommendations for Fitting Dust SEDs
The analysis presented in this paper has shown that disparate techniques in IR/millimeter SED fitting and galaxy sample selection can manifest in very different interpretations of galaxy dust SEDs. Our finding of broad consistency between SEDs from the local universe out to z ∼ 2 was shown only by accounting for both sample selection wavelengths and by approaching SED fitting using a uniform technique. Our SED fitting technique has been tuned to be broadly applicable to most dusty galaxies in the literature that may lack extensive photometric constraints in the IR/millimeter.
Based on work in this paper, we have assembled a list of "best practices" in implementing IR/millimeter SED fitting for the community, which will facilitate easier cross-reference comparisons and eventual measurement of broad physical trends in galaxy populations out to higher redshifts. These recommendations are:
- 1.In most cases, galaxies have a dearth of IR/millimeter data (three to five photometric constraints) and as a result, should not be fit to models with more free parameters than data. Though this is standard scientific practice, it is sometimes violated given the intrinsically complex nature of SED models and the relative dearth of data, especially in this wavelength regime. Most galaxy IR/millimeter SEDs are superpositions of modified blackbodies that may be approximated as a dominant cold blackbody joined with an MIR power law with no more than five free parameters.
- 2.The number of free parameters in IR/millimeter SED fitting should be adjusted according to existing data constraints. Gaussian priors for parameters should be used where data have low signal-to-noise (i.e., S/N < 5), and parameters should be fixed in the absence of data. For example, the MIR power-law slope (when using MCIRSED or other piecewise modified blackbody with an infrared power law) should be set to 2.3 ± 0.5 if fewer than two photometric constraints exist at rest-frame wavelengths λ ≲ 80 μm. Similarly if fixing λ0 = 200 μm (as we have done here), β should be set to 1.96 ± 0.43 if there are fewer than two photometric constraints longward of rest frame ≳250 μm. These are the median parameter set values for our fitted IRAS sample.
- 3.λ0 should be fixed if there are no direct constraints on the dust column density via spatially resolved observations used to constrain dust mass surface density.
- 4.Dust temperatures cannot be directly constrained for the vast majority of galaxies that have poorly sampled SEDs and no spatially resolved dust continuum observations. In lieu of dust temperatures, SED fitting should focus on measurement of λpeak, the rest-frame peak wavelength of the SED in Sν . Measurements of λpeak, which serves as a proxy for dust temperature, are directly comparable between works, even when different SED modeling techniques are employed (assuming the dust opacities are roughly comparable). The same is not true of dust temperatures.
We have made the MCIRSED code publicly available 3 to help facilitate the use of these recommendations. While this list should be regarded as a useful guide, it requires custom attention, given the above recommendations. Similarly, galaxies with especially thorough IR/millimeter coverage (e.g., like some local (U)LIRGs, and high-redshift lensed objects and quasars) may not be fit well by our modeling technique, in which case more detailed radiative transfer models may be appropriate. In all instances, we recommend that community members first reflect on their goals in deriving an IR/millimeter SED for a galaxy or sample of galaxies before approaching the task of SED fitting, and use the appropriate tool for the type of measurement that needs to be made.
6. Summary
We have fit the dust SEDs of ∼4700 galaxies from different studies in the literature that span redshifts 0 < z < 2 to investigate possible redshift evolution in galaxies' dust temperatures. This analysis is carried out within the context of the LIR–λpeak plane, or between galaxies' IR luminosities and their rest-frame peak wavelengths of their dust SEDs, the latter of which is model independent. 4 We find no evolution in the LIR–λpeak relation between 0 < z < 2, which implies that, at fixed LIR, galaxies at z ∼ 0 are relatively similar to those at z ∼ 2. Specifically, we rule out the possibility that the slope and intercepts of the LIR–λpeak relation evolve at 0 < z < 2 with 99.9997% confidence (corresponding to 4.6σ). We find that the scatter about the LIR–λpeak relation is also unchanged from z ∼ 2 to z ∼ 0. Such scatter likely traces the diversity of dust sizes and dust geometries in IR-luminous galaxies.
We parameterize the LIR–λpeak correlation as a Gaussian distributed about a line such that where λt = 92 ± 2 μm, η = − 0.09 ± 0.01, and N is a Gaussian distribution with μ ≡ 0 and σλ = 0.046 ± 0.003. This result is consistent with other works in the literature that have found that the dust temperature of galaxies does not evolve with redshift over the range 0 < z < 4 (e.g., Casey et al. 2018b; Simpson et al. 2019; Dudzevičiūtė et al. 2020; Reuter et al. 2020). This result is also consistent with seemingly conflicting claims in the literature, which argue that dust temperature evolves with redshift and that galaxies are hotter at high redshift (e.g., Béthermin et al. 2015; Schreiber et al. 2018). This conflict is resolved when we consider that the LIR–λpeak relation does not evolve but the SFR–M⋆ relation does, and the measured increasing temperatures are a function of samples with fixed stellar masses. Our result also conflicts with claims in the literature that argue that dust SEDs are colder at higher redshift than in the local universe (e.g., Pope et al. 2006; Symeonidis et al. 2009, 2013). The data was often limited in these samples to IRAS observations only in the low-redshift reference sample with which the high-redshift samples are compared, and we find that fitting SEDs to IRAS data only can bias to warmer dust temperatures. In the present work, we fit high-quality data that spans the full range of the dust SED and find no such offset in temperature between local galaxies and those at higher redshifts.
Accounting for selection effects is crucial in the analysis of galaxies' IR/millimeter SEDs. We model selection bias by generating mock galaxies that are distributed in LIR–λpeak space as they would be with the case of no evolution in the LIR–λpeak correlation between 0 < z < 2. We then test for statistical deviation of the true data from the modeled mock samples and find that all of the samples we have drawn from out to z ∼ 2 are consistent with the hypothesis that LIR–λpeak does not evolve.
At present, the analysis performed in this paper cannot be extended to redshifts above z ∼ 2 because sensitive, high signal-to-noise observations are not available for large samples of galaxies in redshift bins, which would allow for the study of redshift evolution.
We recommend that the number of free parameters used in dust SED fitting be adjusted according to the number of existing data constraints. Accurate measurement of the peak wavelength of dust emission depends on having high-quality constraints on both sides of the SED's true peak wavelength.
We thank Brendan Bowler, Lina Fernanda González Martínez, Milos Milosavljevic, Justin Spilker, Kres̆imir Tisanić, and Jorge Zavala for help and useful discussions.P.M.D. and C.M.C acknowledge financial support by NSF grants AST-1714528 and AST-1814034, and the University of Texas at Austin College of Natural Sciences. C.M.C also acknowledges support from the Research Corporation for Science Advancement from a 2019 Cottrell Scholar Award sponsored by IF/THEN, an initiative of Lyda Hill Philanthropies. This research has made use of the NASA/IPAC Infrared Science Archive, which is funded by the National Aeronautics and Space Administration and operated by the California Institute of Technology. This research made use of Astropy, a community-developed core Python package for Astronomy (Astropy Collaboration et al. 2013), and the python packages Matplotlib (Hunter 2007), Numpy (van der Walt et al. 2011), and Pandas (McKinney 2010). This research also made use of the NASA/IPAC Extragalactic Database (NED), which is operated by the Jet Propulsion Laboratory, California Institute of Technology, under contract with the National Aeronautics and Space Administration.
Appendix: Bayesian Modeling with PyMC3
The model we employ to fit the flux densities of galaxies between 8 and 1000 μm is:
where λ is the rest-frame wavelength, α is the mid-infrared (MIR) power-law slope, Npl and Nbb are normalization constants, λ0 is the wavelength where the dust opacity equals unity, β is the dust emissivity index, T is the luminosity-weighted characteristic dust temperature, h is the Planck constant, k is the Boltzmann constant, and c is the speed of light. Npl and Nbb are tied to each other and to LIR. At short wavelengths, the SED is dominated by a power law with slope α. Physically, this shorter-wavelength regime represents the warmer dust emission originating near star-forming regions or AGNs whose dust mass is distributed with a power-law distribution of temperatures (e.g., Kovács et al. 2010). At long wavelengths, the SED is dominated by a modified Planck function with a self-absorption factor, , with optical depth given by . Physically, this represents the cold dust reservoir, which contains most of the mass of dust in galaxies' ISM. The transition between the two piecewise regimes occurs at the wavelength where the derivative of the modified blackbody in log–log space equals α. This piecewise form follows the methodology of, e.g., Blain et al. (2003), Younger et al. (2009), and Roseboom et al. (2013). The model has the advantage of accounting for potential MIR excesses while also minimizing the number of free parameters, which is very important given the dearth of observations at wavelengths between 8 and 1000 μm. While there are galaxies for which such a model may be incomplete—for example, dusty, luminous quasars—the vast majority of galaxies with nondominant AGN can be modeled using this simple technique.
Previous work by Casey (2012) has shown that SED fits with a modified blackbody plus MIR power law produce statistically better fits than fits with IR templates, despite having fewer free parameters. If one's goal is to fit the IR luminosity and peak wavelength, as is our goal in the present work, it is better to use the model we adopt here than to fit templates. Such a model has been used widely in the literature (e.g., Blain et al. 2002, 2003; Younger et al. 2009; Casey 2012) because it produces good fits to data and has fewer free parameters than methods like template fitting (e.g., Chary & Elbaz 2001; Dale & Helou 2002; Siebenmorgen & Krügel 2007) and models with power-law distributions of dust temperatures (e.g., Kovács et al. 2010).
The fit parameters that may be fixed when fitting are the power-law slope, α, the dust emissivity index, β, and the wavelength where the opacity equals unity, λ0. The peak wavelength and IR luminosity are always free parameters. Consistent with previous works (e.g., Sajina et al. 2006; Hayward et al. 2012; Spilker et al. 2016), we find that the parameters β, Td, and λ0 have significant covariance with one another under the current limitations of long-wavelength data sets. Note, however, the covariance does not extend to λpeak, which is measured after the fact from the resulting trial fit.
A.1. Choice of Parameter Priors
We choose to fix λ0 = 200 μm for all galaxies analyzed in this paper for a few reasons. First, our observations—like the vast majority of all galaxies—do not have sufficient wavelength coverage near the peak of dust emission to constrain λ0 as a free parameter. Second, we do not have resolved dust continuum measurements for these galaxies. Third, fixing λ0 = 200 μm allows our peak wavelength constraints to be more easily compared with other works in the literature that do the same. Last, our fitting function is, at face value, phenomenological rather than physical. In reality, each galaxy has a distribution of dust temperatures rather than a single "cold" temperature representative of the bulk luminosity of the ISM. A galaxy with a distribution of luminosity-weighted dust temperatures may emit radiation in such a way that mimics opacity effects from the self-absorption factor, (1 − e−τ ) or a shallower measured dust emissivity index, β. Typical values of λ0 measured for similarly IR-luminous galaxies or adopted in the literature range from ≈100–200 μm (e.g., Blain et al. 2003; Draine 2006; Conley et al. 2011; Rangwala et al. 2011; Greve et al. 2012; Conroy 2013; Spilker et al. 2016; Simpson et al. 2017; Zavala et al. 2018; Casey et al. 2019).
MCIRSED employs Markov Chain Monte Carlo (MCMC) to construct posterior distributions for our fit parameters and derive confidence intervals. We use the Python package, PyMC3 (Salvatier et al. 2016), with its Hamiltonian Monte Carlo fitter (HMC) and the No-U-Turn Sampler (NUTS; Hoffman & Gelman 2011) to efficiently explore parameter space without requiring the user to tune the HMC parameters. The NUTS algorithm takes advantage of the gradients of the Bayesian likelihoods in order to converge more quickly than other sampling methods for models with high dimensionality. One of the key strengths of using the PyMC3 package with NUTS is that no prior knowledge or testing of step size or number of steps per sample is required for a given galaxy. We refer the reader to Hoffman & Gelman (2011) for the NUTS algorithm and more detail on the tuning phase with the NUTS sampler.
PyMC3 requires the user to specify the model to fit to the data, the prior distributions for each parameter associated with the model, the number of samples to run, and the number of tuning steps to use. Tuning is a phase of fitting done prior to sampling in which the step size and number of steps per sample are automatically varied to minimize the sampling time while trying to reach the target acceptance rate. We choose 3000 tuning steps for all of our fits and exclude those sampling steps from our final analysis. We find that adding more tuning steps does not change the fits for our galaxies. We sample the posterior distributions of each galaxy 5000 times because with this number we find our sampling always converges with >1000 effective samples.
For the IRAS sample, we use flat bounded priors on α, β, and Td. The bounds are chosen to restrict sampling to parameter space that corresponds to conditions that are physical though not restrictive. The dust temperature may not be lower than the temperature of the cosmic microwave background (CMB) at the redshift of the galaxy being fitted and not hotter than 1200 K, a temperature that corresponds to some of the hottest quasars' host galaxies' ISMs (e.g., Glikman et al. 2006).
The MIR power-law slope, α, is constrained to be between 0 < α < 15. Higher α values asymptotically approach the case of a pure Planck function with α = 15 being nearly indistinguishable. See the left panel of Figure 10 for a plot of example SEDs with a range of α values from 0.5 to 15. The α = 15 curve and the Planck function are indistinguishable.
Download figure:
Standard image High-resolution imageThe dust emissivity index, β, is constrained to be between 0.5 < β < 5.5. Some of the highest observed dust emissivity values are β = 3.7 (Kuan et al. 1996; observed in the giant molecular cloud, Sagittarius B2), so we choose a limit beyond that of 5.5. MCIRSED allows the user to set the prior boundaries if one wishes to use higher or lower bounds. For the galaxies analyzed in the present work, we find none of the bounds affect our sampling of the posteriors, as would be evidenced by truncated posterior distributions. See the right panel of Figure 10 for a plot of example SEDs with 0.5 < β < 5.5.
For the H-ATLAS and COSMOS samples, which do not have as high-quality constraints as the IRAS sample, we use the measured average IRAS sample parameters as a basis, adopting normal priors on α and β with medians and standard deviations from the posterior distributions from the IRAS sample. The adopted values are μα = 2.3, σα = 0.5, μβ = 1.96, and σβ = 0.43. The median emissivity, β = 1.96 is fully consistent with several recent publications that found similarly steep values for the emissivity spectral index in various high-redshift galaxies with high-quality FIR constraints (e.g., Kato et al. 2018; Casey et al. 2021). We choose to adopt these priors because the minority of galaxies that produce unphysical fits when run with flat priors have more physical fits using normal priors. For instance, we find that 7.5% of COSMOS galaxies have β that are higher than the highest measured value in the literature of 3.7 (Kuan et al. 1996) when fitted with flat bounded priors. However, the adoption of normal priors results in much lower β values (median 2.03). The reason for the high β values is that the S/Ns of observations that constrain β are low. In the absence of firm constraints, the Gaussian priors return more reasonable values of β, rather than something closer to an average of all allowed values between 0.05 and 5.5. For sources with high-S/N observations on the RJ tail, the best-fit values of β are unaffected by the choice of a flat bounded prior or that of a Gaussian prior taken from the posterior of the IRAS sample fits. Fitting galaxies with Gaussian priors in the case of low-S/N observations on the Rayleigh–Jeans tail, for instance, is less restrictive than fixing β.
A.2. Correction for Cosmic Microwave Background Emission
Though it is not necessary to account for CMB heating of dust SEDs in this paper (as all sources are at z < 2), we have build MCIRSED to be used for galaxies at any redshift, including those at z > 5, where CMB heating has a nonnegligible effect on the SED. MCIRSED allows users to choose whether to correct for the effects of observing a galaxy against a nonnegligible CMB temperature. By default, the CMB correction is off, but one may toggle it on when calling the MCMC fitting function. Work by da Cunha et al. (2013) demonstrates that if the CMB at the redshift of the observed galaxy is nonnegligible compared with the dust heating due to the interstellar radiation field, corrections need to be made (1) to the observed dust temperature in order to isolate the dust heating due only to the interstellar radiation field, and (2) to the luminosity of the galaxy observed against the nonnegligible background continuum. When this setting is enabled in MCIRSED, in addition to the parameters mentioned above, the parameter (the temperature the dust would have if bathed in the z = 0 CMB) and (the flux density intrinsic to the source without contribution from the CMB) will also be sampled. These quantities are defined as:
where is the observed flux density, Bν is the Planck function, TCMB(z) is the temperature of the CMB at the redshift of the galaxy, and T is the dust temperature measured from the best fit to the data, as above.
where is the temperature of the CMB at z = 0, and β is the dust emissidvity index. See da Cunha et al. (2013) for more detail.
Footnotes
- 1
Publicly available at github.com/pdrew32/mcirsed.
- 2
Both the IRAS and COSMOS samples have deeper relative OIR imaging (IRAS is also matched to SDSS DR10, but sources are at lower redshift than the H-ATLAS sample, and the COSMOS OIR imaging is much deeper than that of SDSS DR10 with average broadband depths ∼26–27 AB), so we can conclude that all galaxies in each sample have detectable OIR counterparts in their corresponding optical imaging surveys.
- 3
Publicly available at github.com/pdrew32/mcirsed.
- 4
While λpeak is model independent, the derivation of dust temperature from a given λpeak is model dependent and relies on some knowledge of a given galaxy's dust opacity.