A NEW TEST OF THE STATISTICAL NATURE OF THE BRIGHTEST CLUSTER GALAXIES

, , and

Published 2010 May 12 © 2010. The American Astronomical Society. All rights reserved.
, , Citation Yen-Ting Lin et al 2010 ApJ 715 1486 DOI 10.1088/0004-637X/715/2/1486

0004-637X/715/2/1486

ABSTRACT

A novel statistic is proposed to examine the hypothesis that all cluster galaxies are drawn from the same luminosity distribution (LD). In such a "statistical model" of galaxy LD, the brightest cluster galaxies (BCGs) are simply the statistical extreme of the galaxy population. Using a large sample of nearby clusters, we show that BCGs in high luminosity clusters (e.g., Ltot ≳ 4 × 1011h−270L) are unlikely (probability ⩽3 × 10−4) to be drawn from the LD defined by all red cluster galaxies more luminous than Mr = −20. On the other hand, BCGs in less luminous clusters are consistent with being the statistical extreme. Applying our method to the second brightest galaxies, we show that they are consistent with being the statistical extreme, which implies that the BCGs are also distinct from non-BCG luminous, red, cluster galaxies. We point out some issues with the interpretation of the classical tests proposed by Tremaine & Richstone (TR) that are designed to examine the statistical nature of BCGs, investigate the robustness of both our statistical test and those of TR against difficulties in photometry of galaxies of large angular size, and discuss the implication of our findings on surveys that use the luminous red galaxies to measure the baryon acoustic oscillation features in the galaxy power spectrum.

Export citation and abstract BibTeX RIS

1. INTRODUCTION

Over the past decade, the features in the cosmic microwave background angular power spectrum and galaxy clustering power spectrum due to the baryon acoustic oscillations (BAO) have emerged to be a premier standard ruler, and strong cosmological constraints have been derived using this technique (e.g., Spergel et al. 2003; Eisenstein et al. 2005; Cole et al. 2005; Padmanabhan et al. 2007). In the galaxy BAO measurements, the tracer population of the large-scale matter distribution often employed is the so-called luminous red galaxies (LRGs). These are massive elliptical galaxies characterized by an old and passively evolving stellar population (Eisenstein et al. 2001) and tend to be the dominant, central galaxies in group or cluster scale dark matter halos (e.g., Zheng et al. 2009; throughout this paper we do not refer to the most luminous galaxy in clusters and groups differently, but simply use the term brightest cluster galaxies (BCGs)).

Recognizing the constraining power of BAO measurements, an impressive array of ongoing and planned cosmological experiments have adopted this method as the main survey component (e.g., BOSS,6 WiggleZ,7 HETDEX,8 ATLAS,9 ADEPT, WFMOS). An important issue faced by these surveys is the control of systematics, including, e.g., the corrections for the galaxy bias and the redshift distortion. In practical terms, it is necessary to know the statistical properties of the BAO tracer population to exquisite detail (such as the way they "populate" their host halos and their luminosity function). As the LRGs are used as the tracer in most of the aforementioned experiments, it is critical to test for the homogeneity of the LRG population. In this paper, we address two key questions: are BCGs different from other cluster galaxies? Are BCGs distinct from non-BCG, luminous, red galaxies in clusters and groups?

Because of their brightness and uniformity in luminosity, the BCGs have long been regarded as an ideal standard candle (e.g., Humason et al. 1956; Scott 1957). Through the systematic investigations separately led by Sandage and Gunn (e.g., Sandage 1972, 1973; Sandage & Hardy 1973; Gunn & Oke 1975; Kristian et al. 1978; Hoessel et al. 1980; Schneider et al. 1983), however, it was realized that various corrections (such as dependences of the BCG luminosity on the cluster richness class (Abell 1958) and Bautz–Morgan type (Bautz & Morgan 1970), and the effect of galactic cannibalism (Ostriker & Tremaine 1975)) have to be applied before any cosmological implication from the BCG Hubble diagram can be extracted.

The small dispersion in BCG luminosity (e.g., ∼0.3 mag as determined by Sandage 1972) also stimulated the discussion on their origin (Scott 1957; Peebles 1968; Geller & Peebles 1976; Tremaine & Richstone 1977; Hausman & Ostriker 1978; Geller & Postman 1983; Merritt 1985; Bhavsar & Barrow 1985; Lin & Mohr 2004; Loh & Strauss 2006; von der Linden et al. 2007; Bernardi et al. 2007; Skibba et al. 2007; Vale & Ostriker 2008; and Liu et al. 2008 to name a few): whether they are simply "the brightest of the bright" or a different population from other cluster galaxies all together. If the former hypothesis were true, the BCG luminosity distribution (LD) simply results from sampling of a universal LD of all cluster galaxies, and its small dispersion in magnitude reflects the steepness of the bright end of the universal LD. To test for such a "statistical nature" of the BCGs, Tremaine & Richstone (1977, hereafter TR) devised a couple of statistics based on the mean and dispersion of the magnitude difference between the first- and second-ranked galaxies ($\overline{\Delta }$ and σΔ, respectively), and the dispersion of the BCG magnitude (σ1). The basic idea is that, if the BCGs are simply the statistical extreme of the cluster galaxy population, both σ1 and σΔ would be greater or comparable to $\overline{\Delta }$. More specifically, under the assumptions that (1) the numbers of galaxies in non-overlapping magnitude intervals are independent random variables, and (2) the LD drops to zero rapidly at the extreme bright end,10 the following conditions need to be satisfied: $T_1\equiv \sigma _1/\overline{\Delta }\ge 1$ and $T_2 \equiv \sigma _{\Delta }/\overline{\Delta }\gtrsim 0.82$ (see also Loh & Strauss 2006).

Using LRGs to identify dense environments such as groups and clusters, Loh & Strauss (2006) found a large magnitude gap ($\overline{\Delta }\sim 0.8$ mag) between the first- and second-ranked galaxies in ∼12,000 LRG-selected groups and clusters (for which the majority of the LRGs are the BCGs). They concluded that the LRGs are inconsistent with being the statistical extreme of the universal galaxy LD, based on the tests proposed by TR.

One fundamental issue faced by studies of very luminous galaxies, such as BCGs, concerns their luminosity. On the practical side, for nearby BCGs, measuring their total light is nontrivial. The majority of BCGs are giant elliptical or cD galaxies, whose surface brightness profile is typically flatter than that of normal elliptical galaxies, and can extend to tens or hundreds of kpc (Tonry 1987). To measure the "total" magnitude requires careful subtraction of the sky background. On the physical side, the cD envelope may well extend into the intracluster space, and it is sometimes difficult to separate the luminosity of the BCGs from that of the intracluster stars (e.g., Gonzalez et al. 2005). In this study, we will use data from the Sloan Digital Sky Survey (SDSS; York et al. 2000), which is not deep enough to detect contributions from the intracluster stars. However, it is known that the current SDSS pipeline has difficulty handling photometry of galaxies with large angular extents, and therefore may seriously underestimate the luminosity of BCGs (e.g., Lauer et al. 2007; von der Linden et al. 2007; Abazajian et al. 2009). In such cases, Δ will be biased low. Although corrections to BCG photometry (if at all possible) may decrease the value of T1, it may have the opposite effect on T2, making the interpretation of the TR tests more difficult (see Section 4.3).

Here we propose a new way to examine the statistical nature of the BCGs that is less dependent on a robust measurement of the BCG luminosity and Δ, and apply it to a large sample of nearby clusters. Our method is both conceptually and operationally very simple: with a sample of N clusters, whose total galaxy luminosities are Ltot,i, where i = 1, 2, ..., N, one can combine all member galaxies to form a composite cluster. By drawing galaxies in random11 from the composite cluster to create N mock clusters whose total luminosities are matched to the observed ones (see Figure 1), the most luminous mock galaxies constitute the "statistical" BCG sample. One can then examine the distribution of the BCG luminosities (Lbcg) for the observed and mock clusters. In particular, we look for deviations from the statistical expectation from the correlation between Lbcg and Ltot. As cluster luminosity is correlated with its mass (e.g., Lin et al. 2004), a comparison of the LbcgLtot correlation between the observed and mock clusters may provide some insights into possible cluster mass dependence (if any) in the BCG formation mechanism(s).

Figure 1.

Figure 1. Illustration of our method for obtaining the LD of all cluster galaxies and for creating mock clusters. By combining member galaxies from all N clusters, we get a "galaxy pool" that by definition should be representative of the LD. By drawing galaxies in random from the galaxy pool we then create N clusters whose total luminosities are matched as close as possible to the real ones (see Section 3.1 for details). Our galaxy selection is limited to Mr = −20.

Standard image High-resolution image

The great virtue of the TR tests is that they only rely on a minimal set of assumptions concerning the nature of the LD and is immune to any cluster-to-cluster variation. Our proposed test, on the other hand, although relying on the universality of the LD, is independent of the shape of the LD. In this sense the two approaches are highly complementary.

In Section 2, we describe our cluster sample and the galaxy data. We show the results of our tests in Section 3 and present a comparison with the TR tests in Section 3.3. We apply our test to the second BCGs (hereafter G2s) in Section 3.4. Extensive tests have been carried out in order to show the robustness of our results (Section 4). We conclude by discussing the implications of our finding on the BCG formation scenarios and LRG-based BAO surveys in Section 5.

Throughout this paper, we adopt a flat ΛCDM cosmological model where ΩM = 1 − ΩΛ = 0.3 and H0 = 70h70 km s−1Mpc−1.

2. CLUSTER SAMPLE AND GALAXY DATA

Our clusters are drawn from an updated version of the C4 cluster catalog (Miller et al. 2005), which uses data from the fifth data release of the SDSS (Adelman-McCarthy et al. 2007). Of the 2037 clusters in the sample, we restrict ourselves to 494 that lie within the redshift range z = 0.030–0.077, with velocity dispersion σ>200 km s−1, and contain at least two luminous galaxies (see below). While the lower limit in redshift helps to alleviate problems in photometric measurements of bright galaxies with large angular extents in SDSS, the upper limit is chosen to ensure that nearly all galaxies with Mr ⩽ −20 have redshifts measured by SDSS (i.e., extinction-corrected petrosian magnitude rp ⩽ 17.77, among other selection criteria; Strauss et al. 2002). Note that the characteristic magnitude for cluster red galaxies is M*,r = −21.70 (see Table 4 below), based on our cluster sample. In addition, as we will combine galaxies from all clusters to form a composite cluster, the small redshift range chosen prevents any redshift evolution within our cluster galaxy sample.

Our first task is to assign cluster membership to galaxies in the SDSS main sample. For every cluster, we include only red galaxies with redshift |zzcl| ⩽ 3σ/c, with r-band absolute petrosian magnitude Mr ⩽ −20, and are projected within 0.8h−170 Mpc of the cluster center (defined as the peak of bright galaxy density distribution; Miller et al. 2005). Here zcl and c are the cluster redshift and speed of light, respectively. A galaxy is considered red if the model color ur ⩾ 2.2 (e.g., Strateva et al. 2001). To calculate the absolute magnitudes, we use the k-correction based on the New York University value-added galaxy catalog (Blanton et al. 2005). In addition, a correction is made to convert the petrosian magnitudes into "total" magnitudes based on the surface brightness profile of the galaxies, following Graham et al. (2005).

On average, about 94% of galaxies in the SDSS main sample have fibers assigned to them (Strauss et al. 2002). This fraction becomes smaller in crowded fields, such as the clusters (see Miller et al. 2005 and below), due to the size of the fiber plugs. It is therefore critical to correct for such an incompleteness when trying to include all cluster galaxies using the SDSS main sample. For every cluster we first calculate the mean gr and ri colors $\bar{c}_{gr}$ and $\bar{c}_{ri}$ of the spectroscopically confirmed member galaxies, then assume the galaxies that (1) satisfy the above selection criteria (ur ⩾ 2.2, rp ⩽ 17.7, projected distance ⩽0.8h−170 Mpc), (2) were targeted for spectroscopy but were not assigned a fiber, and (3) have $|g-r-\bar{c}_{gr}|\le 0.15$ and $|r-i-\bar{c}_{ri}|\le 0.15$ as probable cluster members (hereafter referred to as the photometric members). We treat these galaxies exactly the same way as those with spectroscopic redshift measurements and derive the absolute magnitudes by assuming they are at the cluster redshift.

Combining the lists of the spectroscopic and photometric members, we record the absolute magnitudes of the galaxies for each cluster. The 5980 galaxies with Mr ⩽ −20 from all 494 clusters form our composite cluster, or a "galaxy pool" that by definition has the statistical galaxy LD (see Section 1). Among these galaxies, 871 are photometric members (including 97 BCGs).

In our analysis we will construct two cluster subsamples according to the total r-band luminosity Ltot from the member galaxies: a high-luminosity sample, consisting of 124 systems with Ltot > Ldiv ≡ 3.7 × 1011h−270L, and a low-luminosity sample with 370 systems whose LtotLdiv. The cluster luminosity Ltot is the sum of luminosities from all red member galaxies (including the photometric members) more luminous than Mr = −20, projected within a radius of 0.8h−170 Mpc. As the SDSS main sample has complete spectroscopic coverage (modulo fiber collision) for Mr ⩽ −20 for all our clusters, no extrapolation in the luminosity function is needed to obtain Ltot. The division luminosity Ldiv is chosen so that each subsample contains roughly equal number of galaxies.

In Figure 2, we show the distribution of number of galaxies for the two subsamples. The histograms are normalized to the total number of clusters in the respective samples (red/solid: high-luminosity; blue/dashed: low-luminosity). We see that >92% of low-luminosity systems have at least five red galaxies above our chosen magnitude limit. As we will show below, our results are not dependent on the faintest or most luminous systems.

Figure 2.

Figure 2. Red/solid and blue/dashed histograms show the normalized distribution of number of galaxies for the high- and low-luminosity subsamples, respectively. More than 92% of systems in the low-luminosity sample contain at least five red galaxies (more luminous than Mr = −20, located within 0.8h−170 Mpc).

Standard image High-resolution image

Figure 3 presents the normalized redshift distribution dN/dz of the two subsamples (red/dotted: high-luminosity; blue/dashed: low-luminosity). We find that the two samples have similar dN/dz: the median, mean, and 1σ scatter of the high-luminosity sample are 0.0662, 0.0624, and 0.0126, respectively. The same quantities for the low-luminosity sample are 0.0616, 0.0593, and 0.0126. The green/solid curve in the figure is the differential comoving volume element dV/dz, scaled to the height of the distributions. The similarity between the redshift distributions and dV/dz suggests that our samples are roughly volume limited and should be representative of the nearby groups and clusters.

Figure 3.

Figure 3. Red/dotted and blue/dashed histograms show the normalized redshift distribution for the high- and low-luminosity subsamples, respectively. The two samples have similar distributions. For comparison, the green/solid curve shows the (arbitrarily) scaled differential comoving volume element dV/dz. As the redshift distributions roughly follow dV/dz, our samples are close to volume-limited.

Standard image High-resolution image

3. THE LbcgLtot CORRELATION AND THE STATISTICAL NATURE OF BCGs

Correlations between the luminosity of the BCGs and the properties of host clusters such as mass or total luminosity have been noted by previous studies (e.g., Lin & Mohr 2004; Yang et al. 2005; Hansen et al. 2009). Here we utilize the BCG luminosity–cluster luminosity correlation (LbcgLtot) to show that BCGs are indeed special—at least in luminous/massive clusters.

In Figure 4 (top panel), we show the observed LbcgLtot correlation for the C4 clusters. The magenta squares in the top panel are the mean of BCG luminosity $\overline{L_{{\rm bcg,obs}}}$, as a function of Ltot. The meaning of the cyan crosses will be described below.

Figure 4.

Figure 4. LbcgLtot correlation for the real clusters (top panel) and one of the Monte Carlo realizations of mock cluster ensemble (middle panel). The squares in the top and middle panel are the mean of the BCG luminosity for the real and the mock clusters ($\overline{L_{{\rm bcg,obs}}}$ and $\overline{L_{{\rm bcg,sim}}}$), respectively. Also shown in both panels as crosses are the mean BCG luminosity averaged over 200 Monte Carlo realizations, $\left< \overline{L_{{\rm bcg,sim}}} \right>$. In the bottom panel, we show the distribution of $d_{{\rm obs}}= \log \overline{L_{{\rm bcg,obs}}} - \log \left< \overline{L_{{\rm bcg,sim}}}\right>$ (blue solid points) and $d_{{\rm sim}}= \log \overline{L_{{\rm bcg,sim}}} - \log \left< \overline{L_{{\rm bcg,sim}}}\right>$ (red open points). The real BCGs are systematically more luminous than the statistical ones in luminous clusters. The errorbars of dobs are derived from 200 bootstrap resampling of the observed LbcgLtot correlation.

Standard image High-resolution image

3.1. The Mock Clusters and BCGs

To check for the statistical nature of BCGs, we proceed as follows. Based on the observed LD, we generate many realizations of the LbcgLtot correlation, from which we derive the mean LbcgLtot relation that is expected if the BCGs are drawn from the same LD as other cluster galaxies. We can then compare the observed mean LbcgLtot relation and determine if it is consistent with the mean from the mock data.

We show in the middle panel of Figure 4 one such realization. Basically, for each cluster in our sample, we create a corresponding mock cluster by randomly drawing galaxies from the galaxy pool, until the total luminosity of the mock cluster matches that of the observed one (see also Figure 1). Specifically, suppose the first N galaxies give a total luminosity of LN < Ltot, and the next mock galaxy brings the total luminosity to LN + 1 > Ltot. We keep the (N + 1)th galaxy if LN + 1Ltot < |LNLtot|, otherwise we only use the first N galaxies. Because of this procedure, although the total luminosity for massive (luminous) clusters can be matched fairly well, for low luminosity clusters the difference in Ltot between the mock and real clusters can be large.

The black squares in the middle panels of Figure 4 show the mean in BCG luminosity, $\overline{L_{{\rm bcg,sim}}}$ in this particular ensemble of simulated clusters. The cyan crosses in both the top and middle panels are the mean of $\overline{L_{{\rm bcg,sim}}}$ from 200 Monte Carlo realizations of the mock cluster ensemble, which we denote as $\left< \overline{L_{{\rm bcg,sim}}}\right>$. It is found that for luminous clusters (e.g., Ltot > 3.7 × 1011h−270L), the observed $\overline{L_{{\rm bcg,obs}}}$ is higher than $\left< \overline{L_{{\rm bcg,sim}}}\right>$ from mock clusters (e.g., compare the magenta squares with the cyan crosses in the top panel). For low-luminosity clusters, the observed and mock values are comparable.

We list in Table 1 the definition of various terms employed in this and the following subsections.

Table 1. Definition of Terms

Notation Meaning
$\overline{L_{{\rm bcg,obs}}}$ mean BCG luminosity of the real clusters (in 15 Ltot bins)
$\overline{L_{{\rm bcg,sim}}}$ mean BCG luminosity of one Monte Carlo ensemble (in 15 Ltot bins)
$\left< \overline{L_{{\rm bcg,sim}}}\right>$ average of $\overline{L_{{\rm bcg,sim}}}$ over 200 Monte Carlo ensembles; this is the statistical expectation
dobs $\equiv \log \overline{L_{{\rm bcg,obs}}} - \log \left< \overline{L_{{\rm bcg,sim}}}\right>$
dsim $\equiv \log \overline{L_{{\rm bcg,sim}}} - \log \left< \overline{L_{{\rm bcg,sim}}}\right>$

Download table as:  ASCIITypeset image

3.2. Statistical Significance

By comparing the observed and mock LbcgLtot correlations we infer that BCGs in real clusters have systematically higher luminosities than the mock BCGs. One might easily imagine a systematic measurement error that systematically gives too large a luminosity for bright galaxies. But that would not produce the effect we find. One would need a systematic error that increased the brightness of BCGs but had a less dramatic effect on second brightest galaxies in other (more luminous) clusters of the same intrinsic luminosity. Here we quantify the significance of the difference between the two populations.

Using 200 realizations of the mock cluster ensemble (each containing 494 clusters), we calculate $\left< \overline{L_{{\rm bcg,sim}}}\right>$, as well as the difference between logarithms of the mean BCG luminosity $\overline{L_{{\rm bcg,sim}}}$ from individual realizations and $\left< \overline{L_{{\rm bcg,sim}}}\right>$, $d_{{\rm sim}}= \log \overline{L_{{\rm bcg,sim}}} - \log \left< \overline{L_{{\rm bcg,sim}}}\right>$. By dividing the cluster sample into 15 bins in Ltot, each containing roughly equal number of clusters, for each mock cluster ensemble we have 15 evaluations of the statistic dsim. One such example is shown in the lower panel of Figure 4 (open red symbols). We see that dsim roughly scatters about zero. We expect that for mock clusters, the distribution of dsim should follow a Gaussian, which is shown as the blue histogram in the top panel of Figure 5. The green curve is a Gaussian fit to the histogram.

Figure 5.

Figure 5. Distribution of the statistic d, defined as the difference between logarithms of the mean BCG luminosity $\overline{L_{{\rm bcg}}}$ and the mean of $\overline{L_{{\rm bcg,sim}}}$ from 200 Monte Carlo realizations of the mock cluster ensemble. The distribution of the observed dobs is shown as the blue histograms, while that for the mock clusters is shown as the red ones. The top, middle, and bottom panels show the results for the whole cluster sample (494 systems), for high luminosity clusters (Ltot > Ldiv = 3.7 × 1011h−270L; 124 systems), and for low luminosity clusters (LtotLdiv; 370 systems), respectively. In each panel the green dotted curve is a Gaussian fit to the red histogram. For the high luminosity clusters the difference between the real and mock data is significant at the 0.028% level (see Table 2).

Standard image High-resolution image

We can similarly calculate $d_{{\rm obs}} = \log \overline{L_{{\rm bcg,obs}}} - \log \left< \overline{L_{{\rm bcg,sim}}}\right>$ for the real clusters. We show dobs as a function of Ltot in the lower panel of Figure 4 as the solid blue points; it is apparent that dobs correlates positively with cluster luminosity. The errorbars of dobs are derived from 200 bootstrap resampling of the observed LbcgLtot correlation. The distribution of dobs is further shown as the red dashed histogram in the top panel of Figure 5 and is clearly different from a Gaussian. A Kolmogorov–Smirnov (K-S) test shows that the probability of dobs to be drawn from the same distribution of dsim is $\mathcal {P}=0.79\%$.

We examine in more detail the distribution of d in high and low luminosity clusters in the other two panels of Figure 5. The distribution of d for the high (low)-luminosity subsample is shown in the middle (bottom) panel. As we note above, in high luminosity clusters the real BCGs are on average more luminous than the mock ones, resulting positive dobs, and there is only a 0.028% probability that they are drawn from the same LD, based on a K-S test. On the other hand, $\overline{L_{{\rm bcg,obs}}}$ in low luminosity clusters scatter about $\left< \overline{L_{{\rm bcg,sim}}}\right>$, and dobs and dsim have a much higher probability ($\mathcal {P}=54.5\%$) to be drawn from the same distribution. These results are recorded in Table 2 (the column under "galaxy pool;" for the meaning of the other columns, see Section 4.2).

Table 2. Probability of BCGs being the Statistical Extreme

    Method for Creating Mock Clusters
Sample Ncla Galaxy Pool Gaussian+Schechterb Single Schechterb
All 494 0.8% 0.5% 0.6%
High 124 0.03% 0.01% 0.02%
Low 370 54.5% 39.0% 44.5%

Notes. aNumber of clusters; the division of high and low luminosity subsamples is 3.7 × 1011h−170L (see Section 2). bSee Section 4.2 for more details.

Download table as:  ASCIITypeset image

3.3. Tremaine–Richstone Tests

Next we compare our results with those from the TR tests. Based on the whole cluster sample, we find that σ1 = 0.58, $\overline{\Delta }=0.62$, and σΔ = 0.49, resulting in $T_1=\sigma _1/\overline{\Delta } = 0.93$ and $T_2=\sigma _\Delta /\overline{\Delta }=0.78$. Recall that if the BCGs are "statistical," we would have obtained T1T1,lim = 1 and T2T2,lim = 0.82 (Section 1; but see below). Note that these lower limits are derived under rather general assumptions about the form of the LD (TR). For example, assuming the underlying LD follows the Schechter (1976) form, the limits are T1,lim,sch ≈ 1.16 and T2,lim,sch ≈ 0.88.

Dividing the clusters into high and low luminosity subsamples (using the same division luminosity $L_{{}\rm div}$ as in Section 3.2), we find $(\sigma _1, \overline{\Delta }, \sigma _{\Delta }, T_1, T_2)=(0.39,0.55,0.43,0.70,0.79)$ and (0.54, 0.65, 0.50, 0.84, 0.77), respectively. That T1 for the whole sample is larger than T1 for both high and low luminosity subsamples is due to the weak, positive correlation between Lbcg and Ltot.

Both our test and the TR tests indicate that, using the whole cluster sample, the BCGs are not drawn from the same LD as other cluster galaxies. Separating the low luminosity clusters from the luminous ones, our test suggests that this conclusion is mainly driven by the BCGs in the luminous clusters. The T1 statistic implies a larger deviation from the statistical expectation for BCGs in high luminosity clusters (T1,limT1 = 0.30), compared to their counterparts in less luminous clusters (T1,limT1 = 0.16).

It is important to quantify how significant are the deviations of the Ti (i = 1, 2) values we observe from the statistical limits. Without assuming any particular form of the LD, we evaluate the significance empirically, using the "galaxy pool" from all the member galaxies. More specifically, we construct ensembles of mock clusters following the method described in Section 3.1; we can then derive the mean and dispersion of the two statistics from the distributions of T1 and T2 based on the mock clusters (see Figure 6). For all clusters in our sample, when the BCGs are generated statistically, the mean and standard deviation of the two statistics are $\overline{T_1}=1.01$, $\sigma _{T_1} = 0.05$, and $\overline{T_2}=0.90$, $\sigma _{T_2} = 0.03$. We find that about 3.5% of the Monte Carlo realizations have T1 as low as the observed value (0.93), but only 0.007% have T2 as low as 0.78, the observed value. We denote these fractions as p1p1(⩽T1,obs) and p2p2(⩽T2,obs). For high luminosity clusters, ($\overline{T_1}, \sigma _{T_1}, p_1, \overline{T_2}, \sigma _{T_2}, p_2)=(0.96,0.08,0.014\%,0.84,0.06,14\%)$. For low luminosity ones, the values are (0.90, 0.05, 8.8%, 0.88, 0.04, 0.07%).

Figure 6.

Figure 6. Distribution of the T1 and T2 values for the real clusters (based on 5000 bootstrap resampling) and mock clusters (derived from 105 ensembles). The left panels show the distribution for T1, while the right panels are for T2. From top to bottom, we show the results for all the clusters, high and low luminosity subsamples, respectively. In each panel, the solid histogram is the distribution of the mock clusters (with the short solid arrow indicating the mean value), and the dotted histogram is that for the real clusters (with the mean indicated by the short dotted arrow). The overlapping area under the solid and dotted histograms represents an estimate of the likelihood the observed value is consistent with the statistical expectations (see Table 3). Looking at the middle left panel (for T1 values of high luminosity clusters), one sees that the dotted histogram is at the extreme of the solid red histogram and would conclude that BCGs are unlikely to be statistical (based on the T1 statistic). However, in the middle right panel (for T2), the dotted histogram is well within the dotted blue histogram, suggesting the BCGs are statistical. The opposite situation happens when one looks at the bottom panels (for low luminosity clusters).

Standard image High-resolution image

Additionally, we can estimate the distribution of the observed Ti via bootstrap resampling. The overlapping area under the "observed" and the "statistical" distributions (the latter from the mock cluster ensembles, as mentioned above) then gives an estimate of the likelihood that the observed Ti is consistent with the statistical expectation. We denote the likelihood as qiqi(Ti,obsTi,stat), i = 1, 2. For the whole cluster sample, we find (q1, q2) = (8.8%, 1.4%), For high and low luminosity clusters, the results are (0.35%, 23.4%) and (14.9%, 0.64%), respectively.

For clarity, these results are summarized in Table 3. We present the results for the whole sample, as well as the high and low luminosity subsamples. For each (sub)sample, the numbers under "obs." are the observed values, and the triplet of numbers under "stat." denotes the mean and standard deviation of the TR statistics based on 105 ensembles of mock data sets, and pi (i = 1, 2). Looking at p1 for the high and low luminosity clusters, one would conclude that BCGs in the faint clusters are much more likely to be statistical. One would draw the opposite conclusion if considering the T2 statistic, however. It is therefore not clear if the TR tests give a consistent picture (e.g., dependence on Ltot) for the degree of deviation of BCGs from the galaxy population in clusters.

Table 3. Tremaine–Richstone Statistics

  All Clusters Luminous Clusters Faint Clusters
  Obs.a Stat.b (p, q)c Obs.a Stat.b (p, q)c Obs.a Stat.b (p, q)c
T1 (0.93,0.01) (1.01, 0.05) (3.5, 8.8) (0.70,0.01) (0.96, 0.08) (0.01, 0.35) (0.84,0.01) (0.90, 0.05) (8.8, 14.9)
T2 (0.78,0.01) (0.90, 0.03) (0.007, 0.145) (0.79,0.01) (0.85, 0.06) (14, 23) (0.77,0.01) (0.88, 0.04) (0.07, 0.64)

Notes. aObserved value and the standard deviation based on 5000 bootstrap resampling. bStatistical expectation and the standard deviation based on 105 Monte Carlo realizations. cp is the proportion of the Monte Carlo realizations that give TR statistics as low as the observed value. q is the area of the overlapping region under the observed and statistical distribution of the TR values (see Figure 6). Both p and q are in %.

Download table as:  ASCIITypeset image

Using a large group sample from SDSS, Yang et al. (2008) examined the gap statistics. Although they did not explicitly calculate the values of Ti, using the results presented in their Figure 7, we can roughly estimate that for clusters with virial mass log M ≈ 14.8, (T1, T2) ∼ (1.4, 0.7), and for clusters with log M ≈ 14.0, (T1, T2) ∼ (0.6, 0.8). We see that the Ti values have opposite trends with cluster mass: based on T1, the BCGs in low mass systems are special; using T2 one would be lead to the opposite conclusion.

In their study of LRG-selected groups and clusters, Loh & Strauss (2006) found that $(\sigma _1, \overline{\Delta }, \sigma _{\Delta }, T_1, T_2)=(0.30,0.87,0.52,0.35,0.59)$ at z ≈ 0.12. They also examined the richness dependence of the TR statistics. For the richest systems (75%–100% quartile in the richness distribution) they found (T1, T2) = (0.75, 0.71); for those with richness in the 25%–50% quartile (with about 2–3 galaxies on the red sequence), (T1, T2) = (0.27, 0.34). The results of Loh & Strauss (2006) suggest a dependence of the TR statistics on richness that is opposite to our findings (if their richness estimator correlates well with the total luminosity).

The main cause for the differences seems to be in the magnitude gap, Δ. Although for the richest systems, Loh & Strauss (2006) obtained $\overline{\Delta }\sim 0.55$, a value comparable to ours, their finding of $\overline{\Delta }\sim 1$ for the much poorer systems (25%–50% quartile in richness) is much higher than our value. Because Loh & Strauss (2006) looked for galaxy concentrations around LRGs using (primarily) photometric data, we suspect many of their low-richness systems may not be real, but simply chance projections of galaxies with similarly red colors (see purity of the maxBCG algorithm; see Koester et al. 2007). On the other hand, we note that Yang et al. (2008) also found that $\overline{\Delta }\gtrsim 1$ for their low mass systems (log M ∼ 13).

These results seem to suggest that, when considered together, the two TR statistics may not unambiguously determine the statistical nature of BCGs with respect to cluster mass (or its proxy, such as total galaxy number or luminosity).

The disagreements among the different studies may be attributed to several factors (such as the way BCGs are selected, the way background contamination is treated, or even the group/cluster selection). A possible resolution would be to compare the selection of BCG and G2 on a cluster-by-cluster basis, for the clusters that are present in multiple catalogs.

3.4. Are G2s Special?

We have applied a new method to confirm that the BCGs are special. This approach can be used to check if G2s are statistical, that is, are their LD consistent with that of the overall cluster galaxy population. The results can then be used to answer the second question we set out to address: are BCGs different from other luminous, red, cluster galaxies?

We proceed in a similar fashion as in the previous sections. The main difference is that we are now comparing the observed Lg2Ltot correlation with the corresponding mean relation from mock clusters. Requiring that mock clusters must have at least two members causes a small modification to the way mock clusters are constructed, which mainly affects the low luminosity clusters. Therefore, we will set a lower luminosity limit at Llim = 1.6 × 1011h−270L and define the low luminosity subsample (234 systems) with LlimLtotLdiv, where Ldiv = 3.7 × 1011h−270L. The 124 cluster high-luminosity subsample remains the same as before.

In Figure 7, we show the distribution of the quantity d for G2s, which is analogous to Figure 5 for the BCGs. The bottom, middle, and top panels are the results for the low-luminosity subsample, high-luminosity subsample, and the combined sample (358 systems), respectively. In all samples considered, we find that G2s are consistent with the statistical expectation. K-S tests suggest that $\mathcal {P}=26\%$, 55%, and 24% for all, high-, and low-luminosity clusters, respectively. Our results are not sensitive to the exact value of Llim.

Figure 7.

Figure 7. Distribution of the statistic d for G2. The distribution of the observed dobs is shown as the blue/dashed histograms, while that for the mock clusters is shown as the red/solid ones. The bottom, middle, and top panels show the results for a low luminosity subsample (1.6 × 1011h−270LLtotLdiv; 234 systems), the high luminosity subsample (Ltot > Ldiv = 3.7 × 1011h−270L; 124 systems), and the combined sample (358 systems). In all samples considered, we find that G2s are consistent with the statistical expectation.

Standard image High-resolution image

This result immediately suggests that BCGs are distinct from G2s, and very likely, other luminous member galaxies.

This simple exercise also demonstrates one advantage of our method over the TR tests: while the latter by definition could not determine the statistical nature of G2s, our method can in principle be applied to even the third- or lower-ranked member galaxies.

4. SYSTEMATICS

In this section, we examine the robustness of our findings on various aspects in the analysis, including the selection of cluster red galaxies in SDSS, the way mock cluster and galaxy samples are constructed, and the issue of photometry of galaxies with large angular extent.

4.1. Sensitivity on Galaxy and Cluster Sample Selection

We first check the sensitivity of our results on the galaxy selection criteria, by repeating our test using a much more stringent set of conditions to assign cluster memberships for red galaxies. In addition to the basic requirements (ur ⩾ 2.2, rp ⩽ 17.7, projected within 0.8h−170 Mpc), for the spectroscopic members, we only include galaxies whose (1) redshifts satisfy c|zzcl| ⩽ v, where v ≡ min(2σ, 1500 km s-1); (2) gr and ri colors fall in the range $\bar{c}-1.5\sigma _{c}$ to $\bar{c}+2.5\sigma _{c}$, where $\bar{c}$ and σc are the mean and dispersion of the gr and ri colors from the galaxies at redshifts broadly consistent with the cluster; and (3) "concentration parameter" cmr90/r50 ⩾ 2.6, where r90 and r50 are radii that enclose 90% and 50% of the Petrosian flux, respectively; this last condition aims to select galaxies with early-type morphology (Strateva et al. 2001). As for the photometric members, along with the filters in the r-band flux, ur color, spatial distribution, and morphology, we require that their gr and ri colors to lie between $\bar{c}-\sigma _{c}$ and $\bar{c}+1.5\sigma _{c}$, where $\bar{c}$ and σc are now derived from the spectroscopic members.

These criteria reduce our cluster and galaxy sample; only 344 clusters containing 2819 galaxies are included. We find that the probability $\mathcal {P}$ that dobs and dsim are drawn from the same distribution is 1.5%, 0.15%, and 67% for the whole, high-luminosity, and low-luminosity (sub)samples. These values confirm the results found in Section 3.2.

We note in Section 2 that 20% (≈97/494) of the BCGs are photometrically identified. Some of these may be foreground/background galaxies. To evaluate the effect of possible contamination due to these photometric BCGs, we repeat our analysis using the 397 clusters whose BCGs are spectroscopically confirmed members (with the membership assignment criteria of Section 2). Based on the galaxy pool constructed from the 3661 galaxies in these clusters, we find that $\mathcal {P}=1.7\%$, 0.28%, and 27%, for the whole, high-luminosity, and low-luminosity (sub)samples.

Although the absolute values of $\mathcal {P}$ changes somewhat with respect to our nominal values recorded in Table 2, the trend remains clear that the BCGs in high luminosity clusters are much less likely to be drawn from the same LD as other cluster members when compared to their counterparts in lower luminosity clusters.

Let us comment next on one effect our cluster selection may have on the results. The requirement that the clusters need to host at least two galaxies with Mr ⩽ −20 potentially excludes systems dominated by a single ∼M* galaxy, such as the "fossil groups" (which are defined to have Δ ⩾ 2 and extended X-ray emissions). These systems are believed to evolve in isolation, with the last major merger with other galactic systems being long enough in the past that a dominant central galaxy can result from the dynamic friction (Ponman et al. 1994). The BCGs in these groups would then deviate significantly from the statistical extreme, which makes them distinct from the other BCGs in low luminosity groups included in our sample. One might be concerned that the exclusion of fossil groups from our cluster sample may have contributed to our conclusion that, for the lower luminosity systems, there is no statistically significant deviation of BCGs from the statistical distribution of all cluster galaxies. However, given the rarity of the fossils (number density ∼2 × 10−6h370 Mpc−3; e.g., La Barbera et al. 2009; Voevodkin et al. 2010), we expect there to be at most ∼20 such systems in the volume we sample (z = 0.03–0.077, ≈5700 deg2), and therefore our conclusion should be robust.

4.2. Construction of Mock Clusters

We first investigate whether the way galaxy pool is constructed affects our results. Instead of pouring galaxies from all clusters into one pool, we can create two pools separately for galaxies in high and low luminosity clusters. High and low luminosity mock clusters are then created from the respective pools. We find that $\mathcal {P}=0.98\%$ and 92% for the high and low luminosity clusters, respectively. The trend is clear that BCGs in low luminosity clusters are far more likely to be the statistical extreme than their cousins in high luminosity systems.

Second, we note that instead of utilizing the galaxy pool, we can construct mock clusters using fits to the observed LD of cluster galaxies. The LDs for the whole cluster sample, and for the high- and low-luminosity subsamples, are measured by the method developed in Lin & Mohr (2004), to which we refer the reader for more details. The results are shown in Figure 8. From top to bottom are the LDs for the high luminosity clusters, the whole sample, and the low luminosity clusters.

Figure 8.

Figure 8. Composite LDs in the r band for the 494 C4 clusters at z = 0.030–0.077. The red, green, and blue symbols represent the LD for high luminosity clusters, the whole cluster sample, and low luminosity clusters, respectively. The curves are Gaussian+Schechter function fits to the LD (see Equation (1)); the best-fit parameters are given in Table 4.

Standard image High-resolution image

When BCGs are included, the Schechter function is not a good description of the LDs. Instead, the bright end may be more appropriately described by a log-normal distribution. Therefore, in fitting the LDs we consider both a bright end-truncated Schechter function plus a Gaussian,

Equation (1)

where $\mathcal {M}=10^{-0.4 (M-M_*)}$ and a single Schechter function. The best-fit parameters are given in Table 4.

Table 4. LD Parameters in r-band

Sample Single Schechter Gaussian+Schechtera
  α ϕ* M* α ϕ* M* Mch $\tilde{\phi }_{*}$ $\tilde{M}_{*}$ σ2M
All −1.01 8.53 −21.92 −0.93 10.25 −21.70 −22.40 1.38 −22.25 0.32
High −1.17 12.62 −22.13 −0.99 18.30 −21.73 −22.60 1.90 −22.57 0.25
Low −0.93 6.34 −21.78 −0.73 8.92 −21.28 −22.05 1.35 −21.98 0.33

Note. aSee Equation (1) for the definition of the parameters.

Download table as:  ASCIITypeset image

Using the Gaussian+Schechter function fit to the whole sample as the LD, we find that for the whole, high-, and low-luminosity samples, $\mathcal {P}=0.47\%$, 0.013%, and 39%, respectively. If using the Schechter function fit to the LD of the whole sample, $\mathcal {P}=0.57\%$, 0.015%, and 44% for the three (sub)samples, respectively. These results are recorded in the third and fourth columns of Table 2.

We thus conclude that our results are robust irrespective of the way the mock clusters are constructed. Another implication from this exercise is that our method does not rely on largely complete spectroscopic data set like SDSS; as long as (1) one can determine the LD accurately (e.g., via a statistical background subtraction method; Lin et al. 2004), and (2) the cluster membership of BCGs can be reliably assigned, our test can be applied.

4.3. Sensitivity on Photometry of BCGs

We remark in Section 1 concerning the difficulty in the measurement of the total light of the BCGs. Here we examine the effect of underestimation of BCG luminosity (and more generally, luminosity of galaxies with large angular extents) on both our test and the TR tests. We address this issue by using two methods to obtain the best estimates of the true galaxy magnitudes for our galaxy sample. The first one is a re-measurement of the galaxy photometry from the reduced SDSS images for every galaxy in our sample. The second one employs a statistical correction to the official SDSS photometry.

Simply put, the reason that the SDSS photometric pipeline has difficulties dealing with the photometry of large galaxies is because it is defaulted to use the "local" sky flux measured within regions of 256 × 256 pixels (1farcm7 × 1farcm7) in the CCD frames. In the case that a large fraction of a given region is occupied by galaxies, the sky background level will be biased high, and in turn the galaxy magnitude will be lower than the true value. A remedy is to use the "global" sky level measured in frames of 2048 × 1498 pixels (13farcm5 × 9farcm8). Following this line of reasoning, von der Linden et al. (2007) devised a way to recover the true magnitude, given the crowdedness of a field, and the relative values of the local and global sky background.

As our first approach, we adopt the methodology of von der Linden et al. (2007) to apply corrections to the magnitudes of the galaxies and change the cluster luminosity accordingly, and then use the modified galaxy catalogs (and the galaxy pool) to conduct our test and the TR tests. It is found that $\mathcal {P}=1.2\%$, 0.012%, and 49%, for the whole, high luminosity, and low luminosity cluster (sub)samples, respectively. As for the TR tests, for the whole sample, (T1, p1, T2, p2) = (0.99, 11.1%, 0.77, 2 × 10−4); for the high and low luminosity subsamples, (T1, p1, T2, p2) = (0.68, 5 × 10−5, 0.77, 2.4%) and (0.84, 7.1%, 0.77, 3 × 10−4), respectively.

Using an independent photometric reduction software, Hyde & Bernardi (2009) quantified the degree of underestimation of the magnitudes of large galaxies by the official SDSS pipeline. Their tests suggest that, for early-type galaxies with effective radius ≈10'', the mean magnitude deficit is about 0.25 mag, with a 68% scatter about 0.04 mag. They provided a fitting function that gives the mean deficit $\overline{\Delta m}$ as a function of the angular size θ, which we use as the basis of our second method for recovering the true galaxy magnitudes. For every galaxy in our sample, we assume its magnitude was underestimated by the SDSS pipeline by $\overline{\Delta m}(\theta)+\delta m$, where δm is a Gaussian random variate with dispersion of 0.04 mag, and θ is determined from the de Vaucouleur fits to the galaxy surface brightness profile (see Hyde & Bernardi 2009 for details).

For the galaxy and cluster catalogs thus modified, we find that (T1, T2) = (0.99, 0.77) for the whole sample, and the probability to obtain T1 and T2 from the galaxy pool as low as the "observed" values is 11.3% and 8 × 10−5, respectively. On the other hand, the probability that dobs and dsim are drawn from the same distribution is $\mathcal {P}=1.2\%$. Breaking the sample into high- and low-luminosity subsamples, the results are (T1, p1, T2, p2) = (0.68, 5 × 10−5, 0.78, 2.2%) for the luminous clusters and (0.84, 7.5%, 0.77, 2 × 10−4) for the faint clusters. From our test we find that BCGs in high and low luminosity clusters have probabilities of $\mathcal {P}=0.012\%$ and $\mathcal {P}=49\%$ to be statistical.

These results suggest that both our test and the TR tests are insensitive to the uncertainties in galaxy photometry due to sky subtraction.

It is also important to check if the low luminosity clusters are systematically at lower redshifts than the luminous ones. If this were true, the issue with sky subtraction of the SDSS pipeline may affect the photometry of BCGs in low luminosity clusters (hereafter LLCBCG) more strongly than that of BCGs in high luminosity clusters (hereafter HLCBCG), causing the photometry of LLCBCG to be underestimated more than the case for HLCBCG. This would in turn affect our finding that LLCBCG are consistent with being drawn from the same LD as other cluster galaxies (Section 3.2). We have shown in Section 2 that the high and low luminosity clusters have very similar redshift distribution, and therefore we do not think the photometry of LLCBCG and HLCBCG is treated differently.

5. DISCUSSIONS AND SUMMARY

5.1. Implications on BCG Formation

The main result of this study is that BCGs as a whole have an LD that is distinct from that of the majority of red cluster galaxies (those with Mr ⩽ −20). This conclusion is primarily due to the high luminosities of HLCBCG, which has only 0.03% of probability to be drawn from the LD of all galaxies. On the contrary, LLCBCG are more likely to be simply the statistical extreme of the LD of all galaxies.

With the help of mock clusters, and the elements that come into the calculation of TR tests, one gains some insight into the BCG formation process. For high luminosity clusters, we find that $(\sigma _1,\sigma _\Delta,\overline{\Delta })=(0.39,0.43,0.55)$ from the real data, while the corresponding values from the mock data set are (0.42, 0.37, 0.44). Therefore, although σ1 in real and mock clusters are comparable, in real clusters $\overline{\Delta }$ is higher that that in mock ones (suggesting that real BCGs are on average 0.11 mag brighter than the statistical extreme). Physics of BCG formation drives the magnitude gap to be larger than the statistical expectation.

To explain the origin of HLCBCG, one therefore needs to invoke galactic mergers (e.g., Ostriker & Tremaine 1975; Hausman & Ostriker 1978; Lin & Mohr 2004; Cooray & Milosavljević 2005; Vale & Ostriker 2008), which is naturally expected within the hierarchical structure formation paradigm (e.g., Dubinski 1998; De Lucia & Blaizot 2007). However, the details of the mergers (e.g., major mergers between BCG and very luminous galaxies, minor mergers between BCG and M*-type galaxies, or the "galactic cannibalism"12) remain to be understood. For example, while Lin & Mohr (2004) suggested that major mergers are a viable route for forming HLCBCG, Vale & Ostriker (2008) were more in favor of minor mergers.

In principle, the extent of late mergers can be constrained by the LbcgLtot correlation, or statistics related to the magnitude gap between BCG and second-ranked galaxy (e.g., Milosavljević et al. 2006; Yang et al. 2008). We have developed a simple merger model for cluster galaxies and will present constraints on the importance of mergers based on the observations obtained in this paper in a future publication (Y.-T. Lin & J. P. Ostriker 2010, in preparation).

After confirming that BCGs are indeed different from the bulk of cluster galaxy population, it is natural to ask if BCGs are different from other luminous/massive cluster galaxies (e.g., MM* − 1). It has long been known that BCGs follow different scaling relations from other early-type galaxies (e.g., Oegerle & Hoessel 1991; Lauer et al. 2007; Desroches et al. 2007; Bernardi et al. 2007), in the sense that the BCGs are larger, less dense, and have lower velocity dispersion, compared to other early type galaxies of the same luminosity. However, at present it is not clear if the structure of the BCGs (in terms of Sersic fits) is indeed different from non-BCG early-type galaxies of comparable color and stellar mass (see Guo et al. 2009).

We address this question somewhat indirectly; in Section 3.4, it is shown that the LD of G2s in high luminosity clusters are consistent with that of the whole cluster galaxy population. (Our method could not be robustly applied to examine the statistical nature of G2s in low luminosity clusters, unfortunately.) Although we do not check the third-ranked brightest members, we believe a similar result will emerge for them. Effectively, we suggest that among the most luminous, red, cluster members, only BCGs in high luminosity clusters show significant deviations from the statistical extreme, and therefore these galaxies are a distinct population.

Our results suggest both BCGs in low luminosity/mass clusters and G2s in luminous/massive clusters are the statistical extreme of the galaxy LD. As the latter are very likely BCGs (in clusters or groups that merge with the current host clusters) themselves at earlier stages of their lives, this finding seems to give a self-consistent picture of the luminous galaxy evolution within the hierarchical cluster evolution scenario.

5.2. Cosmological Implications

The results and reasoning presented in Sections 3.4 and 5.1 suggest that LRGs that are BCGs in high luminosity (massive) clusters are different from other LRGs (e.g., those that are LLCBCG and G2s or lower-ranked galaxies in high luminosity clusters), and call for very careful modeling and selection of the LRGs.

Suppose the finding that HLCBCG and LLCBCG are two distinct populations continues to hold toward higher redshifts. In a flux-limited survey, the LRGs at higher redshifts would be intrinsically more luminous than those at lower-z. Therefore, a larger fraction of LRGs at higher-z would be composed of HLCBCG, when compared to LRGs at lower-z. When accounting for the Malmquist bias present in the LRG sample, one cannot simply assume that BCGs follow the same LD as LRGs as a whole, otherwise the inferred luminosity would be lower than the true value. This potential systematic effect would be larger toward higher-z.

It is thus important to take into account the difference in the LD of HLCBCG and LLCBCG when creating mock LRG catalogs for BAO surveys. Unfortunately, the BCGs in this study are at lower redshifts (z < 0.1) even compared to SDSS and SDSS-II LRG studies (at z ∼ 0.3; e.g., Eisenstein et al. 2005), and therefore the LDs we measure (see the Gaussian fits in Table 4) are not readily applicable. Accordingly our study strongly motivates for a systematic investigation on the LbcgLtot correlation at z > 0.3, either through direct observations (similar to the methodology used in this paper, utilizing spectroscopic data from, e.g., BOSS or GAMA surveys), through the halo occupation distribution analysis (e.g., Zheng et al. 2009), or the non-parametric method of Vale & Ostriker (2006).

5.3. Summary

The question of whether the LD of BCGs is drawn from the same LD as other cluster galaxies has been extensively discussed over the last four decades. Here we propose a simple new test to examine the statistical nature of the BCGs and to supplement the classical tests proposed by Tremaine & Richstone (1977).

Our basic idea is to shuffle the cluster galaxies and see how likely the observed LD of BCGs can be reproduced. The procedure is shown in Figure 1. Using a sample of 494 clusters from the C4 catalog at z = 0.03–0.077, we combine all member galaxies with Mr ⩽ −20 to form a big pool of galaxies (5980 in total). We then randomly pick up galaxies from the pool to create 494 mock clusters, whose total luminosities Ltot are matched to that of the real clusters as close as possible. This way we can compare the correlation between the BCG luminosity Lbcg and Ltot of real and mock clusters (see Figure 4). Of course, there are more than one way to create an ensemble of mock clusters, so we repeat this Monte Carlo process to generate 200 mock ensembles.

The averaged BCG luminosity from all 200 realizations of mock data sets, $\left< \overline{L_{{\rm bcg,sim}}}\right>$, gives the expected value when BCGs are statistical, as a function of cluster luminosity. The differences between the logarithms of mean mock BCG luminosity ($\overline{L_{{\rm bcg,sim}}}$) from each ensemble and $\left< \overline{L_{{\rm bcg,sim}}}\right>$ represent the degree of scatter expected if BCGs are stochastically selected from a global LD. In Figure 5, we compare the distribution of $d_{{\rm sim}}= \log \overline{L_{{\rm bcg,sim}}} - \log \left< \overline{L_{{\rm bcg,sim}}}\right>$ with that of $d_{{\rm obs}}= \log \overline{L_{{\rm bcg,obs}}} - \log \left< \overline{L_{{\rm bcg,sim}}}\right>$ and find that the real BCGs are more luminous than the statistical expectation; there is 0.8% of probability for dobs to be drawn from the same distribution of dsim (see Table 2). Separating the clusters into high- and low-luminosity subsamples (with a division luminosity of Ldiv = 3.7 × 1011h−270L), we conclude that the difference is mainly coming from BCGs in high luminosity clusters (only 0.03% chance to be statistical). The luminosities of BCGs in low luminosity clusters are roughly consistent with the global LD.

We extend the analysis to discuss the statistical nature of the second brightest galaxies (G2s) and find strong evidence that (in high luminosity clusters), G2s have a LD similar to that of the whole cluster galaxy population.

We also apply the tests proposed by TR to our cluster sample and record the results in Table 3. The two statistics (T1 and T2) both suggest a small probability for BCGs to be statistical, consistent with our findings. However, examining the TR statistics for the high and low luminosity clusters reveals confusing trends. According to the T1 statistic, BCGs in the high luminosity clusters are much less likely to be statistical than their counterparts in the low luminosity clusters. However, the T2 statistic suggests the opposite conclusion.

Our results confirm previous findings that BCGs in high luminosity clusters are a distinct population from other cluster galaxies and tentatively supports the physical mechanism of cannibalism (Ostriker & Tremaine 1975), which is essentially dynamical friction. We also suggest that BCGs are distinct from other non-BCG LRGs. As the majority of LRGs are the brightest member in groups and clusters, such an effect should be taken into account in selecting a homogeneous sample of LRGs for ongoing and future BAO surveys.

We thank the anonymous referee for helpful comments and suggestions that improved the paper. We are grateful to Scott Tremaine and Bob Nichol for insightful comments on the paper. Y.T.L. thanks Michael Strauss, Antonio Vale, and Jim Gunn for helpful discussions, and I.H. for constant encouragement. Y.T.L. acknowledges supports from the Princeton–Católica Fellowship, NSF PIRE grant OISE-0530095, FONDAP-Andes, and the World Premier International Research Center Initiative, MEXT, Japan.

Funding for the SDSS and SDSS-II has been provided by the Alfred P. Sloan Foundation, the Participating Institutions, the National Science Foundation, the U.S. Department of Energy, the National Aeronautics and Space Administration, the Japanese Monbukagakusho, and the Max Planck Society, and the Higher Education Funding Council for England. The SDSS Web site is http://www.sdss.org/.

The SDSS is managed by the Astrophysical Research Consortium for the Participating Institutions. The Participating Institutions are the American Museum of Natural History, Astrophysical Institute Potsdam, University of Basel, University of Cambridge, Case Western Reserve University, The University of Chicago, Drexel University, Fermilab, the Institute for Advanced Study, the Japan Participation Group, The Johns Hopkins University, the Joint Institute for Nuclear Astrophysics, the Kavli Institute for Particle Astrophysics and Cosmology, the Korean Scientist Group, the Chinese Academy of Sciences, Los Alamos National Laboratory, the Max-Planck-Institute for Astronomy, the Max-Planck-Institute for Astrophysics, New Mexico State University, Ohio State University, University of Pittsburgh, University of Portsmouth, Princeton University, the United States Naval Observatory, and the University of Washington.

Footnotes

  • 10 

    This condition is solely to ensure the integrals behave well in the TR analysis. The "extreme bright end" can be at arbitrarily high luminosity (S. Tremaine 2009, private communication). Other than these two assumptions, we note that the form of the LD can be very general for the TR tests to be applicable—the LD does not even have to be continuous.

  • 11 

    In the process of creating the N mock clusters, a galaxy can be picked more than once, or not selected at all.

  • 12 

    We would like to point out that the cannibalism proposed by Ostriker and co-workers is not binary mergers per se. It is based on the computable tendency of the most massive stellar systems to spiral to the centers of clusters via dynamical friction and to merge with other systems already residing at the center.

Please wait… references are loading.
10.1088/0004-637X/715/2/1486