The following article is Open access

Constraints From Dwarf Galaxies on Black Hole Seeding and Growth Models With Current and Future Surveys

, , , and

Published 2023 March 29 © 2023. The Author(s). Published by the American Astronomical Society.
, , Citation Urmila Chadayammuri et al 2023 ApJ 946 51 DOI 10.3847/1538-4357/acbea6

Download Article PDF
DownloadArticle ePub

You need an eReader or compatible software to experience the benefits of the ePub3 file format.

0004-637X/946/1/51

Abstract

Dwarf galaxies are promising test beds for constraining models of supermassive and intermediate-mass black holes (MBHs) via their BH occupation fraction (BHOF). Disentangling seeding from the confounding effects of mass assembly over a Hubble time is a challenging problem that we tackle in this study with a suite of semianalytical models (SAMs). We show how the measured BHOF depends on the lowest BH mass or active galactic nucleus (AGN) luminosity achieved by a survey. To tell seeding models apart, we need to detect or model all AGNs brighter than 1037 erg s−1 in galaxies of M* ∼ 108−10 M. Shallower surveys, like eRASS, cannot distinguish between seed models even with the compensation of a much larger survey volume. We show that the AMUSE survey, with its inference of the MBH population underlying the observed AGNs, strongly favors heavy seed models, growing with either a power-law Eddington ratio distribution function or one in which BH accretion is tied to the star formation rate (i.e., the AGN-main sequence, AGN-MS, model). These two growth channels can then be distinguished by the AGN luminosity function at >1040 erg s−1, with the AGN-MS model requiring more accretion than observed at z ∼ 0. Thus, current X-ray observations favor heavy seeds whose Eddington ratios follow a power-law distribution. The different models also predict different radio scaling relations, which we quantify using the fundamental plane of BH activity. We close with recommendations for the design of upcoming multiwavelength campaigns that can optimally detect MBHs in dwarf galaxies.

Export citation and abstract BibTeX RIS

Original content from this work may be used under the terms of the Creative Commons Attribution 4.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

1. Introduction

Massive black holes (MBHs) are expected to reside in the core of every galaxy with M* ≳ 1010 M, but the BH occupation fraction (BHOF) in galaxies at lower stellar masses are poorly constrained. Unlike stellar-mass BHs, the origin of MBHs remains an open question, in particular, if there exist multiple pathways for their formation. Key candidate formation channels include remnants of Population III stars (Bromm & Loeb 2003), merged remnants in nuclear star clusters (Gürkan et al. 2004; Portegies Zwart et al. 2004; Baumgardt 2017; Natarajan 2021), the direct collapse of pregalactic gas disks in which fragmentation and star formation is suppressed (e.g., direct-collapse black holes, DCBHs; Begelman et al. 2006; Lodato & Natarajan 2006; Mayer et al. 2010; Tanaka & Li 2014; Inayoshi et al. 2015), or dark stars, powered by dark matter annihilation rather than nuclear fusion (Spolyar et al. 2008; Freese et al. 2010; see also the recent review by Volonteri et al. 2021). Several seeding models are capable of reproducing observations in higher-mass galaxies under different assumptions for BH growth (Milosavljević et al. 2009; Valiante et al. 2016; Ricarte & Natarajan 2018a), because observations largely probe later times in the universe relative to the extremely early seed formation epochs. The models are all calibrated to match z = 0 observations and reproduce observations of high BH/galaxy masses as a function of time. However, to distinguish truly between the models, we either require observations at z ≳ 7 when seeds are expected to assemble (Treister et al. 2013; Ricarte & Natarajan 2018b; Lusso et al. 2022) and/or lower mass galaxies at recent epochs (see Reines 2022 for a review).

Despite being modulated by complex growth mechanisms over cosmic time, some signatures of the initial MBH seeding mechanisms have been demonstrated to survive in late-time observables, primarily the low-redshift occupation fraction of MBH in dwarf galaxies (Ricarte & Natarajan 2018a; Volonteri et al. 2021). While most, if not all, massive galaxies seem to host central MBHs with masses that are empirically seen to be correlated with host galaxy properties (Kormendy & Ho 2013; McConnell & Ma 2013; Bogdán & Goulding 2015), many low-mass galaxies may not host a central MBH at all; and if they do, they may possibly follow different scaling relations (Fontanot et al. 2015) or none at all (Baldassare et al. 2020). Multiwavelength observations have been finding ever fainter MBHs in dwarf galaxies in the radio (Reines et al. 2020), via stellar dynamics (Baldassare et al. 2015; Nguyen et al. 2018), and from optical line ratios/variability (Reines et al. 2013; Moran et al. 2014; Baldassare et al. 2018; Mezcua & Dominguez Sanchez 2020). Any census of these low-mass MBHs is bound to be incomplete since some are too faint—perhaps even completely inactive—and therefore evade detection by these methods that rely on accretion activity. To infer the full occupation fraction of MBH, regardless of activity, one must incorporate detections as well as upper limits across multiple wavelengths and rely on some, preferably flexible, models. Such work has been attempted successfully in the X-ray (Miller et al. 2015; Gallo & Sesana 2019) and optical (Kelly & Shen 2013).

But how to test MBH seed and growth models with the observed occupation fractions? The BHs that exist at z = 0, by and large, do not reflect the masses of the initial seeds, as these are the result of growth averaged over cosmic time (Mezcua 2019). We require models that trace the MBH population detected locally back to their high redshift seeds via growth histories. Semianalytical models (SAMs) have proved to be a computationally efficient way to do so, as they involve populating and assembling halos over cosmic time, using well-defined physical models for BH growth. In one such set of models, halos are populated with MBHs adopting various scaling relations, which permit tracking their growth over cosmic time in secular as well as merger-driven accretion modes (Ricarte & Natarajan 2018b, 2018a; Ricarte et al. 2019). SAMs enable fairly comprehensive probing of the parameter space of growth trajectories coupled with the different seeding mechanisms, quantifying how their combinations produce very different populations at z = 0. In this paper, we probe these differences in detail and examine which observables offer powerful discrimination, specifically on the issue of initial seeding.

The paper is structured as follows. In Section 2, we present an overview of the SAM and the scaling relations used to create the mock catalogs. Section 3 compares the BHOF using different cuts on the BH mass and luminosity, and is compared to X-ray observations. Section 4 makes recommendations for upcoming multiwavelength surveys, discussing the capabilities and limitations of each and the synergies between them. We close with conclusions and future prospects in Section 5.

2. Tracking BH Growth over Cosmic Time

2.1. SAMs

By construction, SAMs are set up to explore a range of seeding and growth modes for a population of BHs as a function of redshift. Time slices from these SAMs can then be compared to measured BHs. In this work, we focus on the SAM developed by Ricarte & Natarajan (2018b) and explore new predictions for the low-mass end of the local BH mass function. In particular, we include the properties of the host galaxies explicitly in the scheme and do so flexibly by exploring multiple scaling relations between them and MBH properties. It is this new feature that allows us to delve more deeply into further predictions in combination with the Bayesian inference framework in this work.

Intriguingly, as first noted and discussed in detail in Ricarte & Natarajan (2018a), signatures of seeding survive to late times via the observed occupation fraction, though there is dependence on the details of their mass assembly history. There are currently other SAMs from Somerville et al. (2008) and Bower et al. (2010) and updated versions thereof, which do not focus on seeding and include recipes for star and galaxy formation as well as MBH growth, and these models inevitably have many more tuneable parameters. Here, we stick with a pared-down, phenomenologically driven model that is primarily focused on seeding signatures from nearly local observations (z ∼ 0). The Ricarte & Natarajan (2018a) SAM considers two seed models: either a massive and rare "heavy" seed typically produced from DC following the seeding prescription of Lodato & Natarajan (2006) or, Volonteri et al. (2008), or abundant "light" seeds drawn from a power-law (PL) initial mass function between 30 and 100 solar masses motivated by cosmological simulations (Hirano et al. 2015). For each seed, the SAM explored in Ricarte & Natarajan (2018b) follows three different MBH growth prescriptions:

  • 1.  
    PL Model: in this model, major halo mergers trigger a burst of growth at the Eddington rate until MBHs reach the mass determined by the observed local Mσ* relation. When MBHs are not in the burst mode, their accretion rates are drawn from a PL tuned to help reproduce low-redshift luminosity functions (Ricarte & Natarajan 2018b).
  • 2.  
    Active Galactic Nucleus-Main Sequence (AGN-MS) Model: major halo mergers trigger burst mode growth at the Eddington rate until MBHs reach the Mσ* relation. When MBHs are growing secularly, their accretion rates are set to be 10−3 × the star formation rate of their host. This mode is motivated by the empirical relation determined by Mullaney et al. (2012). Since star formation is not actually tracked in this SAM, scaling relations are instead used to estimate the stellar masses of galaxies as a function of halo mass and redshift (Ricarte & Natarajan 2018b).
  • 3.  
    Broadline Quasars (BLQ) Model: major halo mergers (defined to be those with a halo mass ratio of at least 1:10) trigger burst mode growth. Additionally, at each time step the accretion rate is drawn from a distribution of Eddington rates observationally inferred for BLQs in SDSS (Kelly & Shen 2013). Specifically, we adopt a redshift-evolving log-normal distribution fit by Tucci & Volonteri (2017). 3  No additional steady accretion mode for mass growth is added since the AGN luminosity function is adequately predicted with this prescription (Ricarte et al. 2019). However, as discussed in this work, this assumption fails when considering deep surveys of dwarf galaxies that are not selected a priori to be AGN.

In this model, only 10% of the halo mergers lead to BH mergers. This is done to prevent the overgrowth of BHs in the most massive halos, whose BHs would otherwise grow mostly from BH–BH mergers at low redshift. We do not expect this assumption to have an impact on the dwarf galaxies considered in this work, which have much lower merger rates. In our study, we do not include satellite halos, since the SAM does not include a prescription for environmental processes such as ram-pressure/tidal stripping, which in turn are expected to affect the stellar mass as well as the growth of MBH. To increase our sample size, we compile the output from three snapshots at z = 0, z = 0.1, and z = 0.2. This yields a total of 6579 halos per seed × growth model, so the uncertainties from Poisson (small number) statistics on the predictions are small, even in the lower stellar mass bins. The scaling relations between the galaxy and MBH properties evolve very marginally in this redshift range, and therefore no further modification is applied to the z = 0.1 and 0.2 halos.

2.2. Creating Mock Catalogs from the SAM

SAMs provide the mass and mass accretion rates for MBHs at every snapshot. The simplest way to compute the luminosity is to convert the accretion rate into a bolometric luminosity with some efficiency epsilonf , and then use a bolometric correction factor to infer the luminosity in some specific energy/wavelength range. Like much of the literature, we assume a radiative efficiency of accretion epsilonf = 0.1. The SAM is known to contain too little scatter (Ricarte & Natarajan 2018a) since AGN are essentially always on. In practice, a duty cycle would increase the scatter in the observed instantaneous luminosity of the BHs. Meanwhile, we know that relatively complete surveys like AMUSE (Gallo et al. 2008; Miller et al. 2012), with a limiting luminosity of 2 × 1038 erg s−1 at 0.5–7.0 keV, find AGNs in ∼40% of galaxies with 108 < M*/ M < 1012. AMUSE is a remarkably uniform and complete survey of AGNs in nearby galaxies performed with the Chandra X-ray telescope, observing 200 galaxies—some in the field, some in the Virgo galaxy cluster—to a limiting flux of 1038.3 erg s−1, as shown in Figure 1 with a dotted line. While more recent catalogs exist, they are compiled from the Chandra archive and are thus necessarily nonuniform in exposure (e.g., Gallo & Sesana 2019). We measure the detected fraction of AGN in each SAM as:

Equation (1)

,

Equation (2)

where wi is the relative abundance of the host galaxy stellar mass in the SAM compared to the observations. The sum in the numerator is over Na galaxies with LX > 0 and in the denominator is over all Ng galaxies. The weighting corrects for the different stellar mass functions of the SAMs and the observed sample. Both the numerator and denominator include galaxies at 108 < M*/ M < 1012. The PL and AGN-MS growth models predict the existence of many AGNs below the AMUSE flux limit, shown as the dotted horizontal line in each panel.

Figure 1.

Figure 1. The LX M* scaling relation for each of the SAMs assuming epsilon = 10%. The duty cycle fd marks the fraction of MBHs with LX > 0, and σ is the rms error of the fit. The dotted line shows the flux limit of the AMUSE survey (Gallo et al. 2008; Miller et al. 2012). The PL model predicts many AGNs below this flux limit; the dashed lines, therefore, show an additional fit for only AGNs with ${L}_{X}\gt {L}_{\mathrm{lim}}$. The color of each point represents the stellar velocity dispersion of the galaxy, as shown in the color bar.

Standard image High-resolution image

If the SAMs are taken at face value, 75%–90% of the galaxies in the AGN-MS and PL models would have had AGN detections in an AMUSE-like survey. We, therefore, add scatter to the LX in these SAMs until the detected fraction, as defined above, is 40% for an AMUSE-like survey. These values are shown in Table 1, and the LX M* relations for the SAMs with the added scatter are shown in Figure 2.

Figure 2.

Figure 2. Same as Figure 1, but with enough scatter added to the LX to reproduce the observed AGN detection fraction of the AMUSE survey when each galaxy is weighted by the number density of observed galaxies of the same stellar mass. The additional scatter for each model is shown in Table 1. This process was not carried out on the BLQ model, since its intrinsic duty fraction is far below what is observed.

Standard image High-resolution image

Table 1. Scaling Relations for the PL and AGN-MS SAMs, Measured for 108 < M*/ M < 1010 and with σadd Adjusted So That ∼40% of the Galaxies Have AGNs with $\mathrm{log}({L}_{X}/\mathrm{erg}\ {{\rm{s}}}^{-1})\gt 38.3$, the Limiting Luminosity of the AMUSE Survey

Model σadd β α ${\sigma }_{\mathrm{intr}}$
PopIII—PL0.50.5034.01.13
PopIII—AGN-MS1.90.3634.80.28
DCBH.- PL0.60.637.31.28
DCBH—AGN-MS1.80.3534.70.29

Download table as:  ASCIITypeset image

Shen et al. (2020) measured the bolometric correction for the 0.5–2 keV and 2–10 keV X-ray bands and for the infrared B band, as a function of bolometric luminosity. This corresponds to 1/kband = Lbol/Lband = 10–30 in each of the X-ray bands and 4–7 in the B band, each with a scatter of about 50%. For each BH, we draw the bolometric correction from a normal distribution with the mean and scatter based on its bolometric luminosity.

3. Results

In this section, we compare the BHOF for the six SAM models (three accretion modes × two seeding models) as detailed above with published values in the literature, applying cuts to the SAM outputs to match the observational selection functions.

3.1. Occupation Fractions and Selection Criteria

We focus first on how observational selection can impact the derived BHOF and explore the optimal future survey strategies that will enable potential discrimination between seeding models. We assume that BH masses are available from the full range of possible observational probes and documented empirical correlations—either from dynamical modeling of galaxy cores or other techniques. AGNs, for instance, have been identified via their optical variability (Baldassare et al. 2018, 2020) or line ratio diagnostics (Cid Fernandes et al. 2010; Kewley et al. 2013; Polimera et al. 2022); masses are then measured via reverberation mapping (e.g., Peterson et al. 2004) or direct dynamical modeling (e.g., Walsh et al. 2013; Seth et al. 2014; Nguyen et al. 2017), and then used to construct scaling relations based on easily observed quantities like emission line strengths. Therefore, we proceed with the premise that we have in hand MBH measurements/upper limits for every single galaxy in our SAM survey sample. We first investigate the impact of the completeness of the mass measurements on our inference of the BHOF.

Figure 3 shows the occupation fraction for different values of BH mass completeness. If we assume in the best-case scenario that all BHs above 103 M can be detected, then the Population III seeds produce a higher occupation fraction than the DCBH models. This is particularly noticeable for galaxies with M* < 109 M. We note that this clearly reflects the efficiency and abundance of the initial seeds, as light seeds are more ubiquitous and are expected to form more efficiently compared to heavy seeds in the SAMs. If, however, we can only detect BHs more massive than 3 × 104 M, then light seeds growing through the AGN-MS channel predict focc = 1 for all stellar masses; for the other two growth channels, the seed models become indistinguishable for M* < 4 × 108 M. If we can only find MBHs above 106 M, information about seeding cannot be cleanly disentangled from accretion. These findings clearly show that the detectable present-day mass needs to be close to the initial assumed "heavy" seed mass from high redshifts, namely, 104−5 M, in order to be disentangled from the confounding effects of the accretion physics.

Figure 3.

Figure 3. The fraction of galaxies containing BHs as a function of stellar mass that lie above different mass cuts. The dotted lines show the SAM with Population III seeds, while the solid lines show heavier DCBH seeds. The colors indicate how the MBHs grow following the BLQ (blue), PL (green), and AGN-MS (orange) models, as described in Section 2.1.

Standard image High-resolution image

In summary, if the initial seeds have not grown much by any of the accretion modes or mergers, then their memory of seeding is retained in the BHOF as measured even at low redshift.

In practice, MBHs are usually detected when they are accreting and luminous. The scaling relations used to measure BH masses, regardless of wavelength deployed or method used, all suffer from large intrinsic scatter (Woo & Urry 2002) and are most reliable at X-ray wavelengths where obscuration is the lowest. Therefore, we stick to X-ray observations for comparison with the SAMs for the rest of this study. Figure 4 shows the fraction of galaxies with AGNs detected at 0.5–2.0 keV as a function of the survey detection limit. The BLQ prescription is ruled out because it predicts almost no AGNs in dwarf galaxies. A relatively shallow survey of 1040 erg s−1 would only be marginally able to distinguish between the AGN-MS and PL growth channels modes, with the former predicting 10%–20% more detections in dwarf galaxies; if all AGNs above 1038 erg s−1 are found, the detected fraction is identical for both of these growth channels. We need a survey that either directly detects, or can model the presence of, all AGNs with luminosities down to 1036 erg s−1 if the heavy and light seed models are to be clearly distinguished.

Figure 4.

Figure 4. The detected AGN fraction for different survey luminosity limits. Here, the instantaneous accretion rates from the SAMs are converted to the observed luminosities with the bolometric corrections from Shen et al. (2020) and a radiative efficiency epsilonf = 0.1. The BLQ prescription assumes that MBHs only accrete and radiate during a merger, and thus predict almost no activity at z = 0. Since we do observe AGNs at z = 0, we can rule this model out. For ${L}_{\mathrm{lim}}\leqslant {10}^{37}\ \mathrm{erg}\ {{\rm{s}}}^{-1}$, the Population III and DCBH seed models predict very different occupation fractions at M* ≲ 108 M. Applying a higher luminosity cut then differentiates between the PL and AGN-MS models, with the latter predicting a higher luminous fraction.

Standard image High-resolution image

Figures 3 and 4 reveal that most of the differences between the models are apparent at the lowest halo masses. First, this highlights the importance of using a parent sample of galaxies that has stellar masses complete down to 108 M, pushing down an order of magnitude compared to current observational limits. Figure 5 therefore shows the occupation fraction of galaxies with stellar masses in the range 108−1010 M as a function of the minimum detectable BH mass. If all the information we have is the BH and stellar masses, we can only tell different seed models apart if we find every MBH at least as massive as 104 M. If we can separately confirm that the growth channel is like the AGN-MS model, then we can distinguish seeds using all MBHs above 105 M.

Figure 5.

Figure 5. The total MBH occupation fraction for galaxies with 108 < M*/M < 1010, as a function of the smallest detectable MBH mass. If the growth channel is AGN-MS, seed models can be distinguished for M > 105 M; otherwise, all MBH above 104 M need to be detected for a given galaxy sample, assuming the sample is stellar mass complete.

Standard image High-resolution image

In practice, again, we almost always detect BHs when they are luminous. Figure 6 shows the fraction of galaxies with stellar masses of 108 < M*/ M < 1010 with detectable AGNs, as a function of survey-limiting luminosity. Heavy and light seeds can be distinguished if the survey has a limiting luminosity better than 1037 erg s−1.

Figure 6.

Figure 6. Similar to Figure 5, but for different limiting values for AGN luminosity at 0.5–2.0 keV detectable by an X-ray survey.

Standard image High-resolution image

Figure 7 shows the AGN bolometric luminosity function for the six models, with observations from Ueda et al. (2011) in black for comparison. The uncertainties incorporate 1000 draws from the uncertainty in the bolometric corrections; the BLQ model uncertainties are not shown because, due to its low fraction of active MBHs, has error bars down to 0 at all luminosities. The AGN-MS and PL models have diverging luminosity functions above 1042 erg s−1, with the former model requiring more growth at late times to match the Mσ relation. The PL model is thoroughly consistent with the observations, although the AGN-MS model cannot be entirely ruled out due to the large scatter required to bring it into an agreement with the observed active fraction in the AMUSE survey.

Figure 7.

Figure 7. The bolometric luminosity function of the z = 0 AGNs produced in the different SAMs. The three growth channels produce very distinct luminosity functions at z = 0.

Standard image High-resolution image

3.2. Coupling the SAMs to Bayesian Inference

As noted above, the BHOF depends on the minimum AGN luminosity that we can measure or model. Studies like Miller et al. (2015) compute the BHOF from observed AGN luminosities and upper limits, using a model where the LX M* relation contains log-normal scatter. Therefore, this BHOF accounts for AGNs even below the nominal luminosity limit of the survey, which is 1038 erg s−1. Since they did not publish the scaling relations inferred by their Bayesian model, we remeasure them here. To recap, the original study assumed that the BHOF had the functional form of a sigmoid function:

Equation (3)

The tuning parameter M*,0 dictates how rapidly the occupation fraction jumps from 0 to 1; in other words, a higher value for this parameter means that galaxies with lower stellar masses are less likely to host super-MBHs than their more-massive counterparts. Put another way, a higher M*,0 means a lower BHOF.

A Bayesian inference model then constrains M*,0 simultaneously with the normalization, slope, and scatter of the LX M* relation:

Equation (4)

This approach thus assumes (1) that the AGN duty cycle can be described in terms of the time spent below the detection limit if the LX M* scaling relation is a PL with log-normal scatter and (2) that we have no prior knowledge of this scaling relation. Again, Figure 2 shows the LX M* relation for each SAM, with points below the AMUSE detection threshold shown as translucent.

Right away, we can rule out the BLQ prescription as the primary growth channel, because this predicts that only 3%–4% of the galaxies at z < 0.2 contain AGNs, far below the 40% detection fraction in AMUSE. This prescription was motivated from observations of the brightest quasars, the high-luminosity end of the Sloan Digital Sky Survey (SDSS) quasar luminosity function. Since the abundance of bright quasars drops precipitously at low redshifts, the BLQ SAM is self consistent in predicting low active fractions at low z. The fact that this differs with observations of low-mass galaxies shows that this growth channel, which is bursty and merger driven, cannot be important in dwarf galaxies. Next, we aim to distinguish between the AGN-MS and PL growth channels.

Both the PL and AGN-MS models predict very high (75%–99%) occupation fractions. From Figure 4, we know that we need to apply a luminosity cut to the SAMs before comparing them to observations. For the Miller et al. (2015) study, this is not the same as the limiting luminosity of the observations. Rather, the BHOF in that study includes all the AGNs whose luminosities fall within log-normal scatter from the mean value of the PL Lx M* relation. Therefore, we start by measuring this σ.

The probability of the parameter set [α, β, σ, M*,0] for detection is defined as:

Equation (5)

where N(xμ, σ) is the normal probability distribution function centered at μ with standard deviation σ. For the nth nondetection, the probability that there is a BH is given by the latent variable In , which in Miller et al. (2015) depended only on the scaling relation and occupation fraction:

Equation (6)

where C is the cumulative probability of the normal distribution. This says that a nondetection means either that there is no BH, or there is a BH and it is on but the luminosity is below the detection limit ${L}_{\mathrm{lim}}$. The latent variable In is then set to either 1 or 0 by drawing from a binomial distribution with ptrue = pBH. For further details of the inference pipeline, we refer the reader to Section 2 of Miller et al. (2015).

As a test, we run the AMUSE sample through a Markov Chain Monte Carlo (MCMC) pipeline four times, each time using a different scaling relation between LX M* in Table 1, as inferred from Figure 1. The resulting posterior on M*,0, the stellar mass at which the occupation fraction is 50%, is shown in Figure 8. Surprisingly, these posteriors are almost indistinguishable from each other.

Figure 8.

Figure 8. Posterior distributions for the tuning parameter M*,0 of the BHOF. The black line allows the MCMC sampler to fit the parameters of the scaling relation in addition to M*,0, whereas the remaining lines use the scaling relations from Table 1. The PL growth models have larger intrinsic scatter and are therefore slightly less constraining.

Standard image High-resolution image

The best-fit value of σ for AMUSE is 2; as seen in Table 1, this is very comparable to the total scatter required in each SAM to match the AMUSE detection fraction. Thus, the BHOF inferred assuming an LX M* relation with such a large scatter would capture all the AGNs in the SAMs with both the PL and AGN-MS growth channels, and no luminosity cut needs to be applied to the SAM MBHs.

Finally, Figure 9 compares the AMUSE constraints applied to each of the four SAMs. The observations are clearly consistent with the heavy seed models growing via either the PL or AGN-MS channels. The light seed models, even with large scatter in their LX M* relations, cannot be reconciled with an X-ray observed sample like AMUSE.

Figure 9.

Figure 9. The AMUSE best fit and 1σ constraints on the BHOF, shown as the black solid line and gray shaded region, respectively. The SAM occupation fractions now use no luminosity cut, since the inference pipeline is designed to infer the presence of BHs below the detection threshold. The observations clearly prefer heavy seed models.

Standard image High-resolution image

3.3. Prospects for Future Surveys

A key result of our study is that distinguishing between MBH seeding and growth models relies crucially on detecting the least-massive MBH (or least-luminous AGN) in galaxy samples that are stellar mass complete down to at least 108 M. We show here how such a survey could be constructed by following up on existing surveys.

The RESOLVE survey (Eckert et al. 2015) was designed to be complete in baryonic mass down to 109.3 M, and thus to even lower stellar masses, within z < 0.025 (d < 110 Mpc). Figure 10 shows the spatial (left) and stellar mass (right) distributions of the RESOLVE galaxies, with the latter demonstrating stellar mass completeness above 5 × 108 M. The orange histogram shows only galaxies above decl. = 30°, where emission from the Milky Way is negligible, and even faint extragalactic AGNs could be detected. In principle, a survey could include galaxies at ∣decl.∣ < 30° as long as they are in the direction opposite to the Galactic Center; we are being conservative in our recommendation here to account for telescope scheduling difficulties, which may make it difficult to point at extragalactic fields at all times. The RESOLVE survey contains >10,000 such galaxies.

Figure 10.

Figure 10. Map (left) and stellar mass function (right) of the RESOLVE survey (Eckert et al. 2015). This survey is stellar mass complete above 5 × 108 M out to z = 0.025, or a distance of ∼110 Mpc. The area above the dotted line in the map indicates the region where emission from the Milky Way is negligible, and extragalactic surveys can probe fainter sources. The stellar mass function of the RESOLVE galaxies in this field is shown in orange on the right.

Standard image High-resolution image

Using this subset of galaxies as a parent sample, we create three different mock catalogs. Each assumes a different value of M*,0 and thus a different BHOF. Where an MBH exists, we assign an X-ray luminosity drawn from the AMUSE observed LX M* relation, including the scatter. We pass it through the Bayesian inference pipeline described above, assuming survey luminosity limits of 1036 erg s−1 and 1038 erg s−1. Figure 11 shows the posterior distributions recovered for each of the mocks. Higher occupation fractions are harder to rule out than lower ones, due to the large scatter inherent in the LX M* relation. While broad, the mode of each distribution matches the input value for ${L}_{\mathrm{lim}}={10}^{36}\ \mathrm{erg}\ {{\rm{s}}}^{-1}$. For shallower surveys, if the intrinsic BHOF is high, the survey would return a flat posterior on M*,0 even for this high level of completeness. On the one hand, this suggests that the uncertainties obtained from the AMUSE survey may be biased low. On the other hand, it confirms that higher values of M*,0 would have been ruled out with high confidence.

Figure 11.

Figure 11. The recovered posterior on $\mathrm{log}({M}_{* ,0})$ for mock catalogs of 10,000 galaxies at z < 0.025 observed with an X-ray survey with a luminosity completeness above 1036 erg s−1. The faintest AGNs created by the scaling relation are above this threshold, at 4 × 1036 erg s−1. The MCMC sampler assumes that the LX M* relation and its scatter are well measured at higher stellar masses and do not change for dwarf galaxies. The posteriors are broadened by the scatter in the scaling relation and by the probabilistic nature of populating galaxies with BHs if focc(M*,0) < 1. Nevertheless, the mode of the probability density function tracks the true M*,0 used to generate each mock.

Standard image High-resolution image

On the other hand, we have checked that the eFEDS survey, despite its uniform coverage overlapping with ∼500 times more galaxies than AMUSE and detection of several new AGN candidates in dwarf galaxies (Latimer et al. 2021a), yields flat posteriors on M*,0, and that this situation is not expected to improve even with the deepest, polar regions of eRASS8. The key takeaway from this analysis is that survey depth, and the resulting completeness in stellar mass and MBH mass/luminosity, are much more powerful than survey volume when measuring the underlying BHOF.

4. Discussion

4.1. Confusion with Stellar BH Systems

A key challenge at low MBH masses, and therefore luminosities, is confusion with other sources such as ultra- or hyperluminous X-ray sources (ULXs/HLXs) (Swartz et al. 2004, 2011), which are likely the high Eddington ratio end of the accreting stellar BH population (Stobbart et al. 2006; Sutton et al. 2013), although some studies argue that the brightest of these may indeed be intermediate-mass BHs (IMBHs; Miller et al. 2003; Bellovary et al. 2010; Sutton et al. 2012). ULXs may be differentiated from IMBHs by a lack of radio emission or a position offset from the galaxy center (Thygesen et al. 2023), although Bellovary et al. (2021) showed that in the shallow potentials of dwarf galaxies, MBHs could live off center. Multiband X-ray observations can enable spectral studies to differentiate further between MBHs and high-accreting stellar BHs, and multiepoch observations may find different variability scales. Further contamination at the faintest ends may arise from populations of X-ray binaries (XRBs), although this can potentially be modeled via low-scatter scaling relations with the stellar mass and star formation rate (Lehmer et al. 2010, 2019).

4.2. What Would a Low-redshift Multiwavelength Campaign Need to Look Like?

Whether the BH selection is by mass (Figure 3) or luminosity (Figure 4), seeding models cannot be distinguished unless we find every MBH ≳104 M. This finding was also independently arrived at in recent work by Burke et al. (2022) using future variability surveys. These smallest MBHs have been recently discovered using optical variability (Baldassare et al. 2018) and line diagnostics (Baldassare et al. 2015; Mezcua & Dominguez Sanchez 2020; Polimera et al. 2022). Emission line widths, such as Hα, can yield BH masses, although these can easily be confused with ongoing star formation.

MBHs are, however, notorious for the wide variety and variability of their spectral energy distributions (SEDs). This means that we do not yet have a complete understanding of how the emission from the reprocessed material feeding a BH is distributed at different wavelengths. Below, we summarize some of the opportunities and challenges at various wavelengths for future low-redshift surveys.

4.2.1. Optical and IR

Optical emission from AGN is highly susceptible to obscuration by dust, either in the torus of the AGN itself or elsewhere in the host galaxy (e.g., Barger et al. 2005). Dust-absorbed optical photons are then reemitted in the infrared (IR); however, infrared surveys thus far have lacked the sensitivity and/or spatial resolution to capture this population well enough (Latimer et al. 2021b). Marleau et al. (2017) found 62 AGN candidates in dwarf galaxies (106 < M*/ M < 109) in the Wide-field Infrared Survey Explorer (WISE) all-sky survey, but BH masses require an assumption about Eddington ratios and therefore span three orders of magnitude. The current situation stands to be transformed by low-redshift data from the James Webb Space Telescope (JWST), particularly if there are appropriately designed surveys with well-understood selection functions.

4.2.2. Spectroscopic Line Ratios

A common way to find AGNs is using spectroscopic line ratios, but these selection criteria were developed for more-massive BHs and can fail for dwarf AGNs. Polimera et al. (2022) found a population of AGNs in dwarf galaxies using new line ratio diagnostics, exceeding previous detection rates in the galaxy sample by 3%–16%. Mezcua & Dominguez Sanchez (2020) found that for 23 of 34 AGNs in dwarf galaxies with resolved IFU spectroscopy, the AGN signal gets washed out if integrated over the entire galaxy. Recently, Cann et al. (2018) has shown that more IMBHs should show up via coronal emission lines in JWST observations of nearby dwarfs.

4.2.3. Optical/IR Variability Selection

Using a similar, phenomenologically motivated model, Burke et al. (2022) recently presented forecasts for the detection prospects of actively accreting BHs hosted in dwarf galaxies using their optical variability. Focusing on the intermediate BH mass regime, they use observational constraints on optical variability as a function of BH mass to generate mock light curves. In a similar vein to the work presented here, adopting several different models for the BHOF, including one for off-nuclear IMBHs, they quantify differences in the predicted properties of BH mass and luminosity functions in local dwarf galaxies. Modeling the fraction of variable accreting BHs as a function of host galaxy stellar mass, including selection effects, they report that the BHOFs for "heavy" and "light" initial seeding scenarios can be discriminated with variability alone at the 23σ level for galaxy host stellar masses below ∼108 M with data from the upcoming planned synoptic surveys with the Vera C. Rubin Observatory. Current variability-centered searches for low-mass BHs are starting to yield dividends, for instance, Ward et al. (2022) find ∼200 dwarf AGNs using optical variability, and show that 81% of them would be missed by spectroscopic searches.

4.2.4. Radio Wavelengths

AGN have also been detected in nearby dwarf galaxies through their radio emission, which stand out due to the very steep spectral index (Davis et al. 2022; Sargent et al. 2022; Yang et al. 2022). Liodakis (2022) models the expected emission from jets in IMBHs in dwarf galaxies and predicts that ∼40% of them will be detectable with ngVLA, SKA, and the Rubin observatory.

Using the fundamental plane of BH activity (Merloni et al. 2003), we can use BH masses and X-ray luminosities to predict the expected radio emission for the different SAMs. The most recent measurement of this relation by Gultekin et al. (2019):

Equation (7)

where the LR is the radio continuum luminosity at 5 GHz and LX is the X-ray luminosity integrated over 2–10 keV. The best fit to their observations was μ0 = 0.55 ± 0.22, ξμ R = 1.09 ± 0.10, and ξμ L = − 0.59 ± 0.15, with an intrinsic scatter of −0.04 in log(M). Figure 12 shows the predictions derived using this observed fit for the expected LR for each of the SAMs. The heavy seed models predict slightly higher radio emission for dwarf AGNs, but almost no sources in galaxies with M* < 107 M. Additionally, in the heavy seed scenario, the PL growth channel shows larger scatter in radio luminosities for the faint AGNs in dwarf galaxies. This allows an additional way to distinguish between growth channels, other than the high-end luminosity function mentioned above, without having to assume that the same growth channels dominate for MBHs of all masses/luminosities.

Figure 12.

Figure 12. Predicted radio luminosity at 5 GHz from the three different growth channels, assuming the fundamental plane of BH activity from Gultekin et al. (2019). As in the X-rays, the AGN-MS model predicts higher radio luminosities on average and with lower scatter than the PL model, whereas the BLQ prescription predicts most super-MBH to be inactive at z = 0. The scalings are similar for the two seed models, but heavier seeds produce almost no sources for stellar masses below 107 M. The dotted line shows a flux limit of 1 mJy, corresponding to the FIRST survey (Becker et al. 1995).

Standard image High-resolution image

Combining optical, infrared, and X-ray observations would allow a self-consistent determination of the bolometric luminosity, removing the need for the uncertain bolometric correction factors that are currently adopted (Thornton et al. 2008; Koliopanos et al. 2017). Such a follow-up campaign to measure the masses of all the BHs detected via the range of methods above would greatly improve measurements of the BHOF.

4.3. The Possibility of Multiple Seeds

In this idealized study, we have examined in detail the effects of seed and growth channels in isolation. It is likely that BHs may grow form via multiple channels. Spinoso et al. (2022) used a SAM to populate halos from the Millennium-II Simulation with both heavy and light seeds, and evolved their growth to z = 0 using a two-phase prescription, corresponding to the observed quasar and radio modes of AGNs. They find that with such a multiple-channel seeding model, the slope of the BH–stellar mass relation is flatter for dwarf galaxies than their more-massive counterparts. We see this in Figure 1 in the models that grow with a PL Eddington ratio distribution function (ERDF), irrespective of the seeding model, although the difference is starker for heavy seeds. Interestingly, this heavy seed PL ERDF model is also the one that is most consistent with the AMUSE observations. This is a promising sign for future X-ray surveys, suggesting that we may find more dwarf AGNs by simply pushing to slightly lower luminosities.

5. Conclusions

We have assessed the feasibility of distinguishing between models of MBH seeding and growth in the presence of observational limitations. Using a suite of SAMs, we applied cuts on the BH mass, BH X-ray luminosity, and host galaxy stellar mass to mimic observational selection functions. We find that:

  • 1.  
    If we can find every MBH above 103 M, light seeds predict a ∼100% occupation fraction down to stellar masses of 108 M, whereas for heavy seeds, this occupation fraction falls to 50% at 108 M. The total occupation fraction of MBHs at 108 < M*/M < 1010 is ∼100% and ∼70% for the light and heavy seeds, respectively.
  • 2.  
    If galaxies grow via the AGN-MS channel, the seed models can be distinguished if we can find every MBH above 3 × 104 M and are stellar mass complete down to 108 M. With a PL ERDF, the occupation fraction of >3 × 104 M MBHs is 20% at 108−109 M for the heavy seed model, versus ∼0% for the light seed model. The two growth channels can then be told apart by looking only at BHs >106 M, which are much more abundant with the AGN-MS growth channel than the PL.
  • 3.  
    We use bolometric corrections from Shen et al. (2020) to determine the X-ray luminosities of simulated AGNs. An X-ray survey of limiting luminosity 1036 erg s−1 in the 0.5–2.0 keV band can distinguish clearly between heavy and light seed models. If this limit is 1038 erg s−1, both seed models with either the AGN-MS or PL growth channels are indistinguishable; at higher luminosity limits, most models produce focc ∼ 0 at all stellar masses.
  • 4.  
    The BLQ prescription, which assumes purely merger-driven growth and reproduced the BLQ luminosity function at high redshifts, predicts almost no luminous AGNs at z = 0, inconsistent with observations. We therefore conclude that it is not a viable mechanism for the growth of dwarf AGNs.
  • 5.  
    The constraining power of a flux-limited survey can be enhanced by physical modeling of the underlying MBH population. We use mock catalogs to confirm that Bayesian pipelines such as that of Miller et al. (2015) are able to recover the true intrinsic occupation fraction, using a combination of AGN detections and upper limits. The inferred BHOF is consistent with heavy seeds growing via either the PL or AGN-MS channels. The two growth channels produce different luminosity functions at z = 0, however, with the PL model being more consistent with the observations of Ueda et al. (2011).
  • 6.  
    Using the fundamental plane of BH activity, we predict that heavy seeds will produce more luminous dwarf AGNs, but almost no sources in galaxies with M* < 107 M. While for light seeds the two growth channels produce identical radio signals, for heavy seeds, the PL model has a larger scatter in the LR M* relation. This will be testable with upcoming radio surveys like ASKAP and MeerKAT.
  • 7.  
    We emphasize that future optical/IR, variability-selected, and radio surveys of AGNs in dwarf galaxies must emphasize the stellar mass completeness of the galaxy sample, since AGN nondetections are likely to be correlated with galaxy detectability ways that are nontrivial to model.

In summary, we show concretely that it is possible to test models of both the seeding and growth of super-MBHs robustly using low-redshift surveys. The surveys need to be carefully designed, achieving high completeness at low stellar and BH masses and luminosities. Multiwavelength campaigns will help break degeneracies. In most cases, pushing current surveys just one order of magnitude deeper will help distinguish between radically different early seeding models.

We thank Paul Tiede for helpful discussions of the statistical technique, and Ned Taylor for discussions on optical surveys. U.C. and Á.B. acknowledge support from the Smithsonian Institution through NASA contract NAS8-03060. U.C. was also supported by NASA grant GO0-21070X. A.R. and P.N. gratefully acknowledge support from the Black Hole Initiative at Harvard University, which is funded by grants from the John Templeton Foundation and the Gordon and Betty Moore Foundation.

Appendix

A.1. Bolometric Corrections

The most uncertain part of our model is the connection between the MBH mass and its X-ray luminosity. Observationally, we know that the MBH population is diverse—there are Type I (Richards et al. 2006) and Type II AGN, Seyfert galaxies, FR I and FR II jets, blazars (Giommi et al. 2009; Bottcher et al. 2017), BLQs, narrowline quasars (NLQs), low-luminosity AGNs (Almeida et al. 2021), and a variety of radio sources. A large part of this diversity has to do with dust obscuration, which affects X-ray wavelengths far less than optical/IR. However, there is expected to be intrinsic variability in the SEDs at different BH accretion rates, depending on what process is driving the luminosity on BH event horizon scales.

We adopted the bolometric correction to the 0.5–2.0 keV band from Shen et al. (2020), which is a function of the bolometric luminosity; they found a similar scaling for the 2–10 keV band. Their measurements agree very well with Duras et al. (2020); for the earlier Lusso et al. (2012) study, the corrections agree for Lbol ≳ 1043 erg s−1, but diverge significantly for fainter sources. Using the results of Lusso et al. (2012) would thus yield much more pessimistic predictions for the X-ray surveys; we note, however, that these scaling relations were computed for AGNs with Lbol ≳ 1043 erg s−1, far above the dwarf-AGN regime. Vasudevan & Fabian (2007) found that the bolometric correction to the 2–10 keV band is better correlated to the Eddington ratio. Woo & Urry (2002) went so far as to argue that there was no correlation between BH luminosity and mass other than due to circular reasoning in inferring the former from the latter. A relationship between the two is crucial for inferring BH demographics unless we rely on dynamical measurements of the BH mass, which are only possible for the nearest galaxies, or reverberation mapping, which is possible only for Type I AGN with an unobscured BL region.

A.2. Cosmic Downsizing

We do not know whether the Eddington distribution—the ratio of a BH's luminosity to its Eddington luminosity—is the same as for more-massive galaxies, or if there is a "downsizing" effect where less-massive galaxies host, on average, more-active BHs. There is then a degeneracy between the M*LBH,X scaling relation and the occupation fraction. The most robust constraints must jointly model these two effects, using an unbiased sample of host galaxies (Gallo et al. 2010).

Footnotes

  • 3  

    Here, the probability distribution of Eddington ratios λ is given by $P{(\lambda )=\exp {(-(\mathrm{ln}\lambda -\mathrm{ln}{\lambda }_{c}(z))}^{2}/(2\sigma (z)}^{2})/(2\pi \sigma (z)\lambda )$, where $\mathrm{log}{\lambda }_{c}=\max (\{-1.9+0.45z,\mathrm{log}0.03\})$ and $\sigma (z)=\max (\{1.03-0.15z,0.6\})$.

Please wait… references are loading.
10.3847/1538-4357/acbea6