GalWeight: A New and Effective Weighting Technique for Determining Galaxy Cluster and Group Membership

, , and

Published 2018 June 27 © 2018. The American Astronomical Society. All rights reserved.
, , Citation Mohamed H. Abdullah et al 2018 ApJ 861 22 DOI 10.3847/1538-4357/aac5db

Download Article PDF
DownloadArticle ePub

You need an eReader or compatible software to experience the benefits of the ePub3 file format.

0004-637X/861/1/22

Abstract

We introduce GalWeight, a new technique for assigning galaxy cluster membership. This technique is specifically designed to simultaneously maximize the number of bona fide cluster members while minimizing the number of contaminating interlopers. The GalWeight technique can be applied to both massive galaxy clusters and poor galaxy groups. Moreover, it is effective in identifying members in both the virial and infall regions with high efficiency. We apply the GalWeight technique to MDPL2 and Bolshoi N-body simulations, and find that it is >98% accurate in correctly assigning cluster membership. We show that GalWeight compares very favorably against four well-known existing cluster membership techniques (shifting gapper, den Hartog, caustic, SIM). We also apply the GalWeight technique to a sample of 12 Abell clusters (including the Coma cluster) using observations from the Sloan Digital Sky Survey. We conclude by discussing GalWeight's potential for other astrophysical applications.

Export citation and abstract BibTeX RIS

1. Introduction

The problem of contamination of kinematic samples of galaxies in clusters by foreground and background galaxies is longstanding. It arises because of the fact that only the projected positions and velocities of galaxies are measured in redshift surveys. Due to the lack of knowledge about the motion perpendicular to the line of sight, it is difficult to judge a priori which of the galaxies found close to a cluster in projected space are actually bound to it and a good tracer of the underlying potential. Excluding fiducial members or including unbound galaxies, or interlopers, may lead to significantly incorrect estimates of the cluster mass.

Several methods have been suggested in the literature to address this problem. All these methods aim at cleaning the galaxy sample by removing nonmembers before attempting a dynamical analysis of the cluster. Some algorithms utilize only the redshift information, such as (i) the 3σ-clipping method (Yahil & Vidal 1977), which iteratively eliminates interlopers with velocities greater than 3σ; (ii) the fixed gapper technique (Beers et al. 1990; Zabludoff et al. 1990) by which any galaxy that is separated by more than a fixed value (e.g., 1σ of the sample or 500–1000 km s−1) from the central body of the velocity distribution is rejected as a nonmember; or (iii) the jackknife technique (Perea et al. 1990), which removes the galaxy whose elimination causes the largest change in the virial mass estimator. These methods are primarily based on statistical rules and some selection criteria. Other algorithms utilize both position and redshift information, such as (i) the shifting gapper technique (Fadda et al. 1996) which applies the fixed gapper technique to a bin shifting along the distance from the cluster center, or (ii) the den Hartog & Katgert (1996) technique that estimates the maximum (escape) velocity as a function of distance from the cluster center calculated either by the virial or projected mass estimator (e.g., Bahcall & Tremaine 1981; Heisler et al. 1985).

In addition to the techniques described above, the spherical infall models (hereafter referred to as SIMs, e.g., Gunn & Gott 1972; Yahil 1985; Regős & Geller 1989; Praton & Schneider 1994) can determine the infall velocity as a function of distance from the cluster center. The SIM in phase-space has the shape of two trumpet horns glued face to face (Kaiser 1987), which enclose the cluster members. However, studies show that clusters are not well fit by SIMs in the projected phase-space diagram, because of the random motion of galaxies in the cluster outer region caused by the presence of substructure or ongoing mergers (van Haarlem & van de Weygaert 1993; Diaferio 1999). A recent investigation (Abdullah et al. 2013) showed that SIMs can be applied to a sliced phase-space by taking into account the distortion of phase-space due to transverse motions of galaxies with respect to the observer and/or rotational motion of galaxies in the infall region in the cluster rest frame. However, that is out of the scope of the current paper.

Another sophisticated method is the caustic technique described by Diaferio (1999) that, based on numerical simulations (Serra & Diaferio 2013), is estimated to be able to identify cluster membership with ∼95% completeness within 3rv (rv is the virial radius defined below). The caustic technique depends on applying the two-dimensional adaptive kernel method (hereafter, 2DAKM, e.g., Pisani 1993, 1996) to galaxies in phase-space (Rp, vz), with the optimal smoothing length ${h}_{\mathrm{opt}}=(6.24/N)\sqrt{({\sigma }_{{R}_{p}}^{2}+{\sigma }_{{v}_{z}}^{2})/2}$, where ${\sigma }_{{R}_{p}}$ and ${\sigma }_{{v}_{z}}$ are the standard deviations of projected radius and line-of-sight velocity, respectively, and N is the number of galaxies. ${\sigma }_{{R}_{p}}$ and ${\sigma }_{{v}_{z}}$ should have the same units and therefore the coordinates (Rp, vz) should be rescaled such that $q={\sigma }_{v}/{\sigma }_{{R}_{p}}$, where q is a constant that is usually chosen to be 25 or 35 (additional details about the application of this technique may be found in Serra et al. 2011).

One more technique that should be mentioned here is the halo-based group finder (Yang et al. 2005, 2007). Yang et al. (2007) were able to recover true members with ∼95% completeness in the case of poor groups ($\sim {10}^{13}\,{M}_{\odot }$). However, they found that the completeness dropped to ∼65% for rich massive clusters ($\sim {10}^{14.5}\,{M}_{\odot }$). Also, theirs is an iterative method that needs to be repeated many times to obtain reliable members. Moreover, its application depends on some assumptions and empirical relations to identify the group members.

This paper introduces a simple and effective new technique to constrain cluster membership that avoids some issues facing other techniques, e.g., selection criteria, statistical methods, assumption of empirical relations, or need for multiple iterations. The paper is organized as follows. The simulations used in the paper are described in Section 2. In Section 3, the GalWeight technique is introduced and its efficiency at identifying bona fide members is tested on MultiDark N-body simulations. In Section 4, we compare GalWeight with four well-known existing cluster membership techniques (shifting gapper, den Hartog, caustic, SIM). We apply GalWeight to 12 Abell clusters (including the Coma cluster) in Section 5, and present our conclusions in Section 6. Throughout this paper, we adopt ΛCDM with ${{\rm{\Omega }}}_{m}=0.3$, ${{\rm{\Omega }}}_{{\rm{\Lambda }}}=0.7$, and ${H}_{0}\,=100$ h km s−1 Mpc−1, h = 1.

2. Simulations

In this section, we describe the simulated data that we use in this work in order to test the efficiency of the GalWeight technique to recover the true membership of a galaxy cluster.

1. MDPL2: The MDPL24 simulation is an N-body simulation of 38403 particles in a box of comoving length 1 h−1 Gpc, mass resolution of $1.51\times {10}^{9}$ h−1 M, and gravitational softening length of 5 h−1 kpc (physical) at low redshifts from the suite of MultiDark simulations (see Table 1 in Klypin et al. 2016). It was run using the L-GADGET-2 code, a version of the publicly available cosmological code GADGET-2 (Springel 2005). It assumes a flat ΛCDM cosmology, with cosmological parameters ${{\rm{\Omega }}}_{{\rm{\Lambda }}}=0.692$, ${{\rm{\Omega }}}_{m}=0.307$, Ωb = 0.048, n = 0.96, σ8 = 0.823, and h = 0.678 (Planck Collaboration et al. 2014). MDPL2 provides a good compromise between numerical resolution and volume (Favole et al. 2016). It also provides us with a large number of clusters of different masses extended from $0.7\times {10}^{14}$ to $37.4\times {10}^{14}$ h−1 M (the range used to test the efficiency of GalWeight).

2. Bolshoi: The Bolshoi simulation is an N-body simulation of 20483 particles in a box of comoving length 250 h−1 Mpc, mass resolution of $1.35\times {10}^{8}$ h−1 M, and gravitational softening length of 1 h−1 kpc (physical) at low redshifts. It was run using the Adaptive Refinement Tree code (Kravtsov et al. 1997). It assumes a flat ΛCDM cosmology, with cosmological parameters (${{\rm{\Omega }}}_{{\rm{\Lambda }}}=0.73$, Ωm = 0.27, Ωb = 0.047, n = 0.95, σ8 = 0.82, and h = 0.70). Bolshoi provides us with clusters of higher mass resolution than MDPL2. Thus, we use both simulations to test the efficiency of GalWeight to recover the true membership.

For both simulations, halos are identified using the bound density maximum (BDM) algorithm (Klypin & Holtzman 1997; Riebe et al. 2013) that was extensively tested (e.g., Knebe et al. 2011) and identifies local density maxima, determines a spherical cut off for the halo with overdensity equal to 200 times the critical density of the universe (ρ = 200ρc) for MDPL2 and 360 times the background matter density of the universe (ρ = 360ρbg), and removes unbound particles from the halo boundary. Among other parameters, BDM provides virial masses and radii. Virial mass is defined as ${M}_{v}=\tfrac{4}{3}\pi 200{\rho }_{c}{r}_{v}^{3}$ for MDPL2 and ${M}_{v}=\tfrac{4}{3}\pi 360{\rho }_{\mathrm{bg}}{r}_{v}^{3}$ for Bolshoi (see Bryan & Norman 1998; Klypin et al. 2016). The halo catalogs are complete for halos with circular velocity ${v}_{c}\geqslant 150$ km s−1 for MDPL2 (Klypin et al. 2016) and ${v}_{c}\geqslant 100$ km s−1 for Bolshoi (e.g., Busha et al. 2011; Klypin et al. 2011; Old et al. 2015).

For both MDPL2 and Bolshoi, the phase-space (line-of-sight velocity vz versus projected radius Rp) of a distinct halo (cluster) is constructed as follows. We assume the line of sight to be along the z-direction and the projection to be on the xy plane. We select a distinct halo of coordinates (${x}^{h},{y}^{h},{z}^{h}$) and velocity components (${v}_{x}^{h},{v}_{y}^{h},{v}_{z}^{h}$), and then we calculate the observed line-of-sight velocity of a subhalo, taking the Hubble expansion into account, as ${v}_{z}=({v}_{z}^{g}-{v}_{z}^{h})+{H}_{0}({z}^{g}-{z}^{h})$, where (${x}^{g},{y}^{g},{z}^{g}$) and (${v}_{x}^{g},{v}_{y}^{g},{v}_{z}^{g}$) are the coordinates and velocity components of the subhalo, respectively. Finally, we select all subhalos within a projected radius of5 ${R}_{p,\max }\,=10$ h−1 Mpc from the center of distinct halo and within a line-of-sight velocity interval of $| {v}_{z,\max }| =3500$ km ${{\rm{s}}}^{-1}$. These values are chosen to be sufficiently large to exceed both the turnaround radius and the length of the Fingers-of-God (hereafter, FOG) effect, which are typically $\sim 7\mbox{--}8\,{h}^{-1}$ Mpc and ∼6000 km s−1, respectively, for massive clusters. The turnaround radius rt is the radius at which a galaxy's peculiar velocity (vpec) is canceled out by the global Hubble expansion. In other words, it is the radius at which the infall velocity vanishes (${v}_{\inf }={v}_{\mathrm{pec}}-H\,r=0$).

3. The Galaxy Weighting Function Technique (GalWeight)

In this section, we describe the GalWeight technique in detail and demonstrate its use by applying it interactively to a simulated cluster of mass $9.37\times {10}^{14}$ h−1 M selected from the Bolshoi simulation. Figure 1 shows the phase-space distribution of subhalos (galaxies) near the center of the simulated cluster.

Figure 1.

Figure 1. Line-of-sight velocity vz as a function of projected radius Rp in the extended region around a simulated cluster of mass $9.37\times {10}^{14}$ h−1 M selected from the Bolshoi simulation. FOG is clearly seen in the main body of the cluster within ${R}_{p}\lesssim 1$ Mpc h−1. The effect of the mass concentration in and around the cluster is manifested as a concentration of galaxies around the vz = 0 line well outside the cluster itself. Interlopers are mostly galaxies at large projected distances and large peculiar velocities. In Section 3 and in Figures 24, we show in detail how GalWeight can be applied to this cluster to distinguish between interlopers and cluster members (Figure 5).

Standard image High-resolution image

The GalWeight technique works by assigning a weight to each galaxy i according to its position (${R}_{p,i}$, ${v}_{z,i}$) in the phase-space diagram. This weight is the product of two separate two-dimensional weights, which we refer to as the dynamical and phase-space weights. The dynamical weight (see Section 3.1 parts A.1 and A.2, and Figure 4(a), which is the product of Figures 2(b) and 3(b)) is calculated from the surface number density Σ(Rp), velocity dispersion ${\sigma }_{{v}_{z}}({R}_{p})$, and standard deviation ${\sigma }_{{R}_{p}}({v}_{z})$ profiles of the cluster. The phase-space weight (see Section 3.1 part B and Figure 4(b)) is calculated from the two-dimensional adaptive kernel method that estimates the probability density underlying the data and consequently identification of clumps and substructures in the phase space (Pisani 1996). The total weight is then calculated as the product of the dynamical and phase-space weights (see Section 3.1 part C and Figure 4(c)). The advantage of using the total weight rather than the dynamical weight or the phase-space weight alone is discussed in Section 3.3.

Figure 2.

Figure 2. Weighting function along projected radius Rp for the simulated cluster of mass $9.37\times {10}^{14}$ h−1 M selected from Bolshoi (see Section 3.1 A.1). The left panel (a) shows the function ${{ \mathcal D }}_{{R}_{p}}$ derived from the data (black points, Equation (1)), normalized by Equation (2), and fitted by ${{ \mathcal W }}_{{R}_{p}}$ (red curve, Equation (4)). The right panel (b) presents its corresponding probability density function in the phase-space diagram. As shown in (a) and (b), the weighting is greatest at Rp = 0 and decreases outwards.

Standard image High-resolution image
Figure 3.

Figure 3. Weighting function along line-of-sight velocity vz for the simulated cluster selected from Bolshoi. The left panel (a) shows the function ${{ \mathcal D }}_{{v}_{z}}$ calculated from the data (black points, Equation (5)), normalized by Equation (6), and fitted by ${{ \mathcal W }}_{{v}_{z}}$ (blue curve, Equation (7)). The right panel (b) presents its corresponding probability density function in phase space. As shown in (a) and (b), the applied weight is greatest at vz = 0 and decreases as the absolute line-of-sight velocity increases.

Standard image High-resolution image
Figure 4.

Figure 4. Weights to be applied as a function of position in line-of-sight velocity/projected radius phase space for the simulated cluster selected from the Bolshoi simulation. Panel (a) shows the dynamical weight ${{ \mathcal W }}_{\mathrm{dy}}.$ (The product of the weights shown in Figures 2(b) and 3(b).) Panel (b) presents the phase-space weight ${{ \mathcal W }}_{\mathrm{ph}}$ calculated from the 2DAKM. The total weight ${{ \mathcal W }}_{\mathrm{tot}}={{ \mathcal W }}_{\mathrm{dy}}\times {{ \mathcal W }}_{\mathrm{ph}}$ is shown in panel (c) with explicitly drawing three contour weights. The weight ${{ \mathcal W }}_{\mathrm{dy}}$ is maximum at the origin (0, 0) and decreases along both the Rp and vz axes and ${{ \mathcal W }}_{\mathrm{ph}}$ gives higher weight for galaxy clumping around the center and substructures as well. Note that the scaling for each panel is independent, with magenta representing maximum values.

Standard image High-resolution image

3.1. Galaxy Weighting Functions

A. Dynamical Weighting, ${{ \mathcal W }}_{\mathrm{dy}}({R}_{p},{v}_{z}):$ In calculating the dynamical weighting function, we assume that the weighting we apply should be larger at the cluster center, i.e., at the origin in phase space (Figure 1), and decay along both the Rp and vz phase space axes. This weighting function is, therefore, a product of two individual weighting functions; one that decays along the Rp-axis and the other along the vz-axis as described below.

A.1. Rp-axis Weighting Function, ${{ \mathcal W }}_{{R}_{p}}({R}_{p}):$ In order to calculate the projected radius weighting function, ${{ \mathcal W }}_{{R}_{p}}({R}_{p})$, we select two properties that are strongly correlated with projected radius and with the dynamical state of a cluster.

The first property is the surface number density profile Σ(Rp), defined as the number of galaxies per unit area as a function of distance from the cluster center. It has its maximum value at the cluster center and decreases with radial distance, and is also strongly correlated with the mass distribution of the cluster. The significance of introducing Σ(Rp) for calculating ${{ \mathcal W }}_{{R}_{p}}$ is that the velocities of member galaxies in the core of some clusters can be as high as $\approx 3000$ km s−1. It produces the Kaiser or FOG effect (see Kaiser 1987). This FOG distortion is the main reason that many membership techniques fail to correctly identify galaxies in the core with high line-of-sight velocities as members. Thus, Σ(Rp) is essential to recover the members in the cluster core. In other words, ignoring Σ(Rp) means missing some of the cluster members in the core.

The second property is the projected velocity dispersion profile, ${\sigma }_{{v}_{z}}({R}_{p})$. The significance of introducing ${\sigma }_{{v}_{z}}({R}_{p})$ for calculating ${{ \mathcal W }}_{{R}_{p}}$ is that it characterizes the dynamical state of a cluster from its core to its infall region. Specifically, there is random motion of member galaxies in the infall region due to the presence of substructures and recent mergers (e.g., van Haarlem 1992; van Haarlem & van de Weygaert 1993; Diaferio & Geller 1997). This effect of random motion can be taken into account implicitly in ${\sigma }_{{v}_{z}}({R}_{p})$. This is the main reason why the SIM technique fails in the cluster outskirts in the projected phase-space. Thus, ${\sigma }_{{v}_{z}}({R}_{p})$ is essential to recover the members in the cluster infall region. In other words, ignoring ${\sigma }_{{v}_{z}}({R}_{p})$ means missing some of the cluster members in the infall region.

Thus, the weighting ${{ \mathcal W }}_{{R}_{p}}({R}_{p})$ in the projected radius direction can be calculated by introducing the function ${{ \mathcal D }}_{{R}_{p}}({R}_{p})$ that is given by

Equation (1)

with the normalization

Equation (2)

where ${R}_{p,\max }$ is the maximum projected radius in phase space. The denominator ${R}_{p}^{\nu }$, where the slope of the power law ν is a free parameter in the range $-1\lesssim \nu \lesssim 1$. The factor ${R}_{p}^{\nu }$ is introduced in Equation (1) to provide flexibility and generalization for the technique. The free parameter ν is selected to adjust the effect of the distortion of FOG in the core and the distortion of the random motion in the outer region. It is defined as

Equation (3)

where σFOG is the velocity dispersion of the core galaxies and ${\sigma }_{\mathrm{rand}}$ is the velocity dispersion of the galaxies outside the core.

The function ${{ \mathcal D }}_{{R}_{p}}({R}_{p})$, calculated from the data, is contaminated by interlopers that cause scattering, especially at large projected distances (see black points in the left panel of Figure 2). Therefore, in order to apply a smooth weighting function, we fit ${{ \mathcal D }}_{{R}_{p}}({R}_{p})$ with an analytical function. Any analytical function that is a good fit to ${{ \mathcal D }}_{{R}_{p}}({R}_{p})$ could be utilized. In this paper, we choose to use the function

Equation (4)

which has four parameters: a is a scale radius ($0\lt a\lesssim 1$), γ is a slope of the power law ($-2\lesssim \gamma \lt 0$), and ${{ \mathcal A }}_{0}$ and ${{ \mathcal A }}_{\mathrm{bg}}$ are the central and background weights in the Rp-direction. These parameters are determined by applying the chi-squared algorithm using the Curve Fitting MatLab Toolbox. Note that the analytical function we selected here has the same form as the generalized King model (King 1972; Adami et al. 1998).

Thus, the weight ${{ \mathcal W }}_{{R}_{p}}({R}_{p,i})$ of each galaxy can be calculated according to its projected radius Rp from the cluster center. The weighting along Rp is shown in Figure 2(a), where the function ${{ \mathcal D }}_{{R}_{p}}({R}_{p})$ is normalized using Equation (2). The data are smoothed and approximated using Equation (4) (shown by the red line). The right panel (b) shows the projected radius weight function in phase space.

A.2. vz-axis Weighting Function, ${{ \mathcal W }}_{{v}_{z}}({v}_{z}):$ In phase-space, most members are concentrated near the line vz = 0 and the number of members decreases with increasing absolute line-of-sight velocity. The weighting function along the vz-axis can, therefore, be approximated by the histogram of the number of galaxies per bin, ${N}_{\mathrm{bin}}({v}_{z})$, or equivalently the standard deviation of projected radius, ${\sigma }_{{R}_{p}}({v}_{z})$, directed along the line-of-sight velocity axis, normalized by the total number of galaxies Ntot in the cluster field. In other words, the weighting in the line-of-sight velocity direction can be calculated by introducing the function ${{ \mathcal D }}_{{v}_{z}}({v}_{z})$ that is given by

Equation (5)

with the normalization

Equation (6)

where ${v}_{z,\max }$ is the maximum line-of-sight velocity of phase space. As above, to obtain a smooth weighting function in vz, the histogram or equivalently ${{ \mathcal D }}_{{v}_{z}}({v}_{z})$ can be fitted by an analytical function. In this paper, we select an exponential model that is given by

Equation (7)

where ${{ \mathcal B }}_{0}$ is the central weight, ${{ \mathcal B }}_{\mathrm{bg}}$ is the background weight in vz, and b is scale parameter ($-0.01\lesssim b\lt 0$). Again, these parameters are determined by applying the chi-squared algorithm using the Curve Fitting MatLab Toolbox. The weighting along vz is shown in Figure 3(a), where the function ${{ \mathcal D }}_{{v}_{z}}({v}_{z})$ (black points) is normalized using Equation (6). The data are smoothed and approximated by Equation (7) for an exponential model (blue curve). The right panel (b) shows the resulting exponential-model weight as a function of location in line-of-sight velocity/projected radius phase space. As shown in (a) and (b), the applied weight is greatest at vz = 0 and decreases as the absolute line-of-sight velocity increases.

We can now construct a two-dimensional dynamical weight ${{ \mathcal W }}_{\mathrm{dy}}({R}_{p},{v}_{z})$ by multiplying ${{ \mathcal W }}_{{R}_{p}}({R}_{p})$ and ${{ \mathcal W }}_{{v}_{z}}({v}_{z})$ together:

Equation (8)

${{ \mathcal W }}_{\mathrm{dy}}({R}_{p},{v}_{z})$ is shown in the left panel of Figure 4, and is the product of the weights shown in Figures 2(b) and 3(b). The weight is maximum at the origin and decreases along both Rp and vz.

To sum up, the dynamical weight is calculated from three properties (surface number density Σ(Rp) and velocity dispersion ${\sigma }_{{v}_{z}}({R}_{p})$ along Rp, and standard deviation of projected radius ${\sigma }_{{R}_{p}}({v}_{z})$ along vz), which are correlated strongly with the dynamics of the cluster. This weight takes into account the effects of the FOG in the cluster core and the random motion of galaxies in the infall region.

B. Phase-Space Weighting, ${{ \mathcal W }}_{\mathrm{ph}}({R}_{p},{v}_{z}):$ This weighting is the coarse-grained phase-space density, which can be simply calculated by the two-dimensional adaptive kernel method (2DAKM, e.g., Silverman 1986; Pisani 1996). The kernel density estimator is the estimated probability density function of a random variable. For N galaxies with coordinates (x, y) = (Rp, vz), the density estimator is given by

Equation (9)

where the kernel K(t) is given by Gaussian distribution

Equation (10)

and ${h}_{j,i}={\lambda }_{i}{h}_{j}$ is the local bandwidth, ${h}_{j}={\sigma }_{j}{N}^{-1/6}$ is the fixed bandwidth for two-dimensional space and ${\sigma }_{j}$ is the standard deviation for j = {x, y}. The term ${\lambda }_{i}={[\gamma /{f}_{0}({x}_{i},{y}_{i})]}^{0.5}$ and $\mathrm{log}\gamma ={\sum }_{i}\mathrm{log}{f}_{0}({x}_{i},{y}_{i})/N$, where ${f}_{0}({x}_{i},{y}_{i})$ is given by Equation (9) for ${\lambda }_{i}=1$ (see also, Shimazaki & Shinomoto 2010).

Consequently, applying 2DAKM for the distribution of galaxies in the phase-space demonstrates high weights for positions of high-density distribution of galaxies. Therefore, the main purpose of introducing the phase-space weight is to take into account the effect of the presence of any clump or substructure in the field that cannot be counted by the dynamical weight. Also, the phase-space weight is introduced to reduce the excessive increase of dynamical weight near the center (see Section 3.3). The phase-space weight ${{ \mathcal W }}_{\mathrm{ph}}({R}_{p},{v}_{z})$ is shown in Figure 4(b) that gives more weights for galaxies in clumps and substructures, and from the distribution of galaxies in the cluster field this weighting function is maximum around the cluster center.

C. Total Weighting, ${{ \mathcal W }}_{\mathrm{tot}}({R}_{p},{v}_{z}):$ The total weighting function is calculated as

Equation (11)

and shown in Figure 4(c) for the simulated cluster. It shows the probability distribution function (PDF) of the total weight ${{ \mathcal W }}_{\mathrm{tot}}({R}_{p},{v}_{z})$. The weighting in Figure 4(c) is then applied to individual galaxies. Figure 5(a) shows Figure 1 once again, but now after applying the "total weighting." We still need to separate cluster members from interlopers. We explain how to do that in Section 3.2.

Figure 5.

Figure 5. Identification of the simulated cluster membership from weighted galaxies. Panel (a) shows the weight of each galaxy in line-of-sight velocity/projected radius phase space (magenta color indicates higher weight). Panel (b) shows a histogram or PDF of the weight applied to each galaxy, ${{ \mathcal W }}_{\mathrm{tot}}({R}_{p,i},{v}_{z,i})$. 1DAKM fitting returns a bimodal PDF in this example of the simulated cluster. We choose to use the number density method (NDM, Abdullah et al. 2013) to identify the contour weight value that separates cluster members from interlopers. This is shown by the solid red vertical line in panel (c) and solid red line in panel (a). 1σ confidence intervals are shown by the two red dashed lines. The two vertical dashed black lines represent the virial and turnaround radii, where the cluster members are those enclosed by the best contour line and within the turnaround radius. We impose one additional cut, shown by the black solid lines in panel (a), cutting the red contour line in the very inner radius by the maximum vz of the enclosed members.

Standard image High-resolution image

3.2. Membership Determination

Figure 5(a) shows the weight of each galaxy in the simulated cluster phase space. The question is now how to utilize the weighted galaxies in phase space to best identify cluster members. One would like to identify a single, optimal weight value that separates cluster members from field galaxies, i.e., identify the best contour weight to select in panel (a). One way is to consider the PDF, or histogram of the total weight for all galaxies, which is shown in Figure 5(b). Fitting the PDF using a 1DAKM reveals two obvious peaks (bimodal PDF). One might imagine simply drawing a vertical line to separate the members located on the right with higher weights from the interlopers located on the left. However, not all clusters show this bimodality in the PDF of ${{ \mathcal W }}_{\mathrm{tot}}$. Another way could be to exclude all galaxies that have weights less than, for example, 3σ from the average value of the main peak (i.e., ${{ \mathcal W }}_{\mathrm{cut}}\,={{ \mathcal W }}_{\mathrm{peak}}-3\sigma $). However, attempting to do the separation by either of these two ways is subjective.

Therefore, we prefer to select the optimal contour weight by utilizing the number density method (hereafter, NDM), a technique that was introduced in Abdullah et al. (2013). The goal in applying the method here is to find the optimal contour weight (or line) that returns the maximum number density of galaxies. In other words, we select a certain contour line (weight) and calculate its enclosed area and number of galaxies, Nin (which contribute positively), then account for the number of galaxies, Nout (which contribute negatively) located outside this contour line. Then, the number density of this contour line can be calculated by (${N}_{\mathrm{in}}-{N}_{\mathrm{out}})/\mathrm{Area}$ (see Figure 9 in Abdullah et al. 2013).

In Figure 5(c) the PDF of the number density of galaxies calculated by NDM is plotted for weights (contour lines) in the range $-12\leqslant \mathrm{log}{{ \mathcal W }}_{\mathrm{tot}}\leqslant -6$. The optimal contour line corresponds to the maximum number density of galaxies, the value of weight that should be utilized as the separator of cluster members from interlopers is shown by the red vertical solid line with 1σ confidence intervals shown by the two red vertical dashed lines. This optimal contour line with 1σ confidence are shown as solid and dashed red lines in panel (a), respectively.

As shown in Figure 5(a), the optimal contour line extends to large distances ($R\sim 10\,{h}^{-1}$ Mpc) and not all galaxies within this boundary are members. Therefore, the last step of GalWeight is to determine a cutoff radius within which the galaxies are assumed to be bounded. Thus, the cluster members are defined as the galaxies enclosed by the optimal contour line and within the cutoff radius. This cutoff radius can be adopted as the virial radius rv (which is the boundary of the virialized region) or the turnaround radius rt (which is the boundary of the cluster infall region). Note that the main goal of this paper is to introduce and test the efficiency of GalWeight to recover the true members in the virial and infall regions using simulations. Thus, knowing the virial radius of each simulated cluster, we test the efficiency of GalWeight at rv, 2rv, and 3rv projected on the phase-space diagram as described in Section 3.4 and Table 1 (see, e.g., Serra & Diaferio 2013). However, for our sample of the 12 Abell clusters (observations) rv and rt are determined from the mass profile estimated by the virial mass estimator and NFW mass profile (Navarro et al. 1996, 1997) as discussed in Section 5.

Table 1.  Efficiency of the GalWeight Technique Determined by Calculating fc and fi at rv, 2rv, and 3rv for a Sample of ∼3000 Clusters from the MDPL2 and Bolshoi Simulations

Mass Range Mean Number of   fc     fi  
(1014 h−1 M) Halos rv 2rv 3rv rv 2rv 3rv
(1) (2) (3) (4) (5) (6) (7) (8) (9)
MDPL2
0.73–37.39 4.28 1500 (All) 0.993 ± 0.014 0.986 ± 0.015 0.981 ± 0.013 0.112 ± 0.035 0.096 ± 0.048 0.113 ± 0.051
0.73–2.00 1.37 253 0.998 ± 0.050 0.992 ± 0.016 0.981 ± 0.018 0.096 ± 0.039 0.098 ± 0.050 0.118 ± 0.053
2.00–4.00 3.16 617 0.993 ± 0.015 0.983 ± 0.016 0.979 ± 0.012 0.113 ± 0.034 0.099 ± 0.050 0.118 ± 0.053
4.00–8.00 5.37 484 0.989 ± 0.016 0.984 ± 0.013 0.982 ± 0.011 0.118 ± 0.032 0.099 ± 0.045 0.117 ± 0.049
8.00–37.39 11.20 146 0.988 ± 0.013 0.988 ± 0.010 0.988 ± 0.013 0.121 ± 0.028 0.105 ± 0.043 0.122 ± 0.045
Bolshoi
0.70–10.92 1.53 500 (1500) (All) 0.995 ± 0.011 0.981 ± 0.021 0.971 ± 0.020 0.126 ± 0.045 0.217 ± 0.109 0.226 ± 0.102
0.70–2.00 1.31 415 (1194) 0.996 ± 0.011 0.983 ± 0.0208 0.972 ± 0.019 0.124 ± 0.047 0.218 ± 0.109 0.227 ± 0.102
2.00–4.00 2.70 72 (252) 0.992 ± 0.012 0.975 ± 0.023 0.967 ± 0.025 0.133 ± 0.040 0.128 ± 0.103 0.227 ± 0.105
4.00–8.00 4.43 11 (48) 0.990 ± 0.012 0.970 ± 0.022 0.961 ± 0.022 0.131 ± 0.039 0.207 ± 0.113 0.217 ± 0.103
8.00–10.92 9.68 2(6) 0.997 ± 0.004 0.982 ± 0.024 0.973 ± 0.025 0.130 ± 0.018 0.270 ± 0.116 0.250 ± 0.094

Note. fc is the completeness or the fraction of the number of fiducial members identified by GalWeight as members relative to the actual number of members. fi is the contamination or the fraction of interlopers incorrectly assigned to be members. Columns: (1) cluster mass range; (2) cluster mean mass per bin; (3) the actual number of clusters per bin in simulations and the number between brackets gives the number of clusters in different orientations for Bolshoi; (4)–(6) and (7)–(9) are fc and fi for each mass bin at rv, 2rv, and 3rv, respectively.

Download table as:  ASCIITypeset image

We impose one additional cut, shown by the solid black lines highlighted by black circles in panel (a), to cut the red contour line in the very inner radius by the maximum vz of the enclosed members. This is because, in some cases, the optimal contour line extends to very high velocities in the innermost region ($R\lesssim 0.25\,{h}^{-1}$ Mpc) without including any other members, so it is not necessarily to show this tail of the contour line.

The main steps in applying the GalWeight technique to determine cluster membership are summarized below:

  • 1.  
    Make an appropriate cut in Rp and vz, and plot galaxies in line-of-sight velocity/projected radius phase space. In this paper, we use ${R}_{p,\max }=10$ h−1 Mpc and $| {v}_{z,\max }| =3500$ km ${{\rm{s}}}^{-1}$.
  • 2.  
    Calculate the function $\tfrac{{\rm{\Sigma }}({R}_{p})\sigma ({R}_{p})}{{R}_{p}^{\nu }}$ and fit it with an analytical model (e.g., Equation (4)) to obtain ${{ \mathcal W }}_{{R}_{p}}({R}_{p})$.
  • 3.  
    Calculate the function ${\sigma }_{{v}_{z}}({v}_{z})$ and fit it with an analytical model (e.g., Equation (7)) to obtain ${{ \mathcal W }}_{v}(v)$.
  • 4.  
    Determine the dynamical weighting, ${{ \mathcal W }}_{\mathrm{dy}}({R}_{p},{v}_{z})\,={{ \mathcal W }}_{{R}_{p}}({R}_{p})\times {{ \mathcal W }}_{{v}_{z}}({v}_{z})$.
  • 5.  
    Apply the 2DAKM in phase space to determine the phase-space weighting, ${{ \mathcal W }}_{\mathrm{ph}}({R}_{p},{v}_{z})$.
  • 6.  
    Calculate the total weight ${{ \mathcal W }}_{\mathrm{tot}}({R}_{p},{v}_{z})={{ \mathcal W }}_{\mathrm{dy}}({R}_{p},{v}_{z})\,\times {{ \mathcal W }}_{\mathrm{ph}}({R}_{p},{v}_{z})$.
  • 7.  
    Plot the PDF for all galaxy weights and apply a cut, retaining all galaxies with weight larger than this cut as members (NDM is used here to determine the optimal value of the cut).
  • 8.  
    Determine the cutoff radius (rv or rt) using a dynamical mass estimator and identify cluster members as those enclosed by the optimal contour line and within the cutoff radius.

3.3. Why Do We Use Total Weight Rather Than Dynamical or Phase-space Weights?

One may ask why we depend on the total weight to assign cluster membership rather than using the dynamical weight or phase-space weight alone. We present Figure 6 to help answer this question. It shows the phase space of three Bolshoi simulated clusters (see Section 2). Using simulated clusters provides the advantage that true members are known definitively. Figure 6 shows the optimal contour lines determined by applying, separately, the dynamical weight (green line), the phase-space weight (blue line), and the total weight (black line). The red points show true members within 3rv.

Figure 6.

Figure 6. Application of dynamical, phase-space, and total weights (green, blue, and black lines, respectively) to three simulated clusters taken from the BolshioP simulation (Section 2). The red points show true members within 3rv. Applying the dynamical weight alone (green) results in the inclusion of many galaxies within R ∼ 1 Mpc h−1 with very high line-of-sight velocities. Applying the phase-space weight alone (blue), fails to recover some members in the core while simultaneously incorrectly including some interlopers at large distances due to the presence of nearby clusterings and clumps. The total weight (black), the product of the dynamical and phase-space weights, recovers true members effectively in both the core and infall regions (see Table 1).

Standard image High-resolution image

In Figure 6, the dynamical weight ${{ \mathcal W }}_{\mathrm{dy}}({R}_{p},{v}_{z})$ (green; see also Figure 4(a)) is seen to be very smooth and idealized. In other words, ${{ \mathcal W }}_{\mathrm{dy}}({R}_{p},{v}_{z})$ describes well an isolated galaxy cluster in phase-space. It does not take into account the effects of nearby clusters, clumps, and/or substructures. Also, it shows an excessive increase near the cluster center ($\sim 1\,{h}^{-1}$ Mpc) and incorrectly includes interlopers near the center that have very high velocities. This effect is due to introducing Σ(Rp) in ${{ \mathcal W }}_{\mathrm{dy}}({R}_{p},{v}_{z})$, where the surface number density is very high near the cluster center. However, ignoring ${{ \mathcal W }}_{\mathrm{dy}}({R}_{p},{v}_{z})$ leads to missing some cluster members, especially those that close to the center in phase space. Thus, ${{ \mathcal W }}_{\mathrm{dy}}({R}_{p},{v}_{z})$ cannot be used on its own to assign cluster membership, but it is very important for correctly identifying members with high line-of-sight velocities.

Figure 6 demonstrates that, on its own, phase-space weighting ${{ \mathcal W }}_{\mathrm{ph}}({R}_{p},{v}_{z})$ also has some difficulty in recovering true cluster members (blue; see also Figure 4(b)). This is because it does not take into account the FOG effect in the cluster core, where those members that have high velocities do not have high concentration, so they are assigned low weights in phase space and not counted as members. Also, the presence of nearby clusterings and substructures have the effect of widening the "optimal" contour line. Consequently, it is very difficult to separate true members from galaxies (interlopers) located in nearby clumps. This results in the inclusion of some interlopers in the infall region. In summary, using ${{ \mathcal W }}_{\mathrm{ph}}({R}_{p},{v}_{z})$ alone, simultaneously excludes some true members near the cluster center and includes some interlopers in the infall region.

We have shown that both the dynamical weight and phase-space weight have issues in identifying true members when applied alone. However, as the black solid line in Figure 6 shows, the total weight (the product of the dynamical and phase-space weights), is very effective. It can simultaneously identify cluster members moving with high velocities in the core (${R}_{p}\lesssim 1$ Mpc h−1) as well as members moving with random motions in the infall regions (${R}_{p}\sim 3{r}_{v}$).

3.4. Testing the Efficiency of GalWeight on MDPL and Bolshoi Simulations

To further demonstrate and quantify the GalWeight technique when assigning membership, we again utilize the MDPL2 and Bolshoi6 simulations from the suite of MultiDark simulations. The efficiency of GalWeight can be quantified by calculating two fractions defined as follows. The first is the completeness fc, which is the fraction of the number of fiducial members identified by GalWeight as members in the projected phase-space relative to the actual number of 3D members projected in the phase-space. The second is the contamination fi, which is the fraction of interlopers incorrectly assigned to be members, projected in the phase space (see, e.g., Wojtak et al. 2007; Serra & Diaferio 2013). Ideally, of course, GalWeight would return fractions of fc = 1 and fi = 0.

MDPL2 provides us with 1500 simulated clusters with masses ranging from $0.73\times {10}^{14}\,{h}^{-1}\,{M}_{\odot }$ to $37.4\times {10}^{14}\,{h}^{-1}\,{M}_{\odot }$ to which we can apply GalWeight. We calculate the fractions fc and fi at three radii—${r}_{v},2{r}_{v}$, and 3rv. As shown in Table 1, the mean values of fc and fi within rv are 0.993 and 0.112, respectively, for the 1500 clusters overall. Also, the fraction fc decreases from 0.993 at rv to 0.981 at 3rv.

For Bolshoi, we have about 500 clusters with masses greater than $0.70\times {10}^{14}\,{h}^{-1}\,{M}_{\odot }$. In order to increase the cluster sample of Bolshoi to 1500 clusters, we randomly select different lines of sight or ordinations for each distinct halo in additional to the original line of sight along the z-direction (see column 3 in Table 1 for Bolshoi). Then, we apply GalWeight to each cluster. The mass range of the sample is $0.70\times {10}^{14}\,{h}^{-1}\,{M}_{\odot }$ to $10.92\times {10}^{14}\,{h}^{-1}\,{M}_{\odot }$ as shown in Table 1. The mean values of fc and fi within rv are 0.995 and 0.126, respectively, for the 1500 clusters overall. Also, the fraction fc decreases from 0.995 at rv to 0.971 at 3rv.

The main reason that some interlopers are assigned as members (fi = 0.113 for MDPL2 and fi = 0.226 for Bolshoi, as maximal value) is because of the triple-value problem (Tonry & Davis 1981). That is, there are some foreground and background interlopers that appear to be part of the cluster body because of the distortion of phase-space. The effect of the triple-value problem is apparent in Figure 7 (discussed below), where most of the interlopers assigned as members are embedded in the cluster body. We defer a discussion of how GalWeight may be adapted to overcome this problem to a future work.

Figure 7.

Figure 7. Application of the GalWeight technique (solid black lines) to 12 simulated clusters selected from the MDPL simulation (Section 2). Red points show fiducial members within 3rv. The virial mass ($\mathrm{log}{M}_{v}$ h−1 M) and number of members within rv is shown for each cluster. Clearly, GalWeight does well in effectively identifying members with high accuracy in both the virialized and infall regions for structures ranging in mass from rich clusters to poor groups.

Standard image High-resolution image

In order to demonstrate the ability of GalWeight to assign membership in the case of both poor and massive clusters, we divide the 1500 clusters (for each simulation) into four mass bins as shown in Table 1. The fraction fc varies from 0.998 (0.996) for the poor clusters of mean mass $1.44\,\times {10}^{14}\,(1.13\times {10}^{14})\,{h}^{-1}\,{M}_{\odot }$ to 0.988 (0.997) for the massive clusters of mean mass $11.34\times {10}^{14}\,(9.68\times {10}^{14})\,{h}^{-1}\,{M}_{\odot }$ at rv for MDPL2 (Bolshoi). We conclude that GalWeight can be applied effectively to a range of cluster masses with high efficiency.

Figure 7 shows examples of GalWeight being applied to 12 simulated Bolshoi clusters (solid black lines), where red and gray points show fiducial members and interlopers, respectively, within 3rv. The 12 clusters shown in Figure 7 are ranked by virial mass, with the most massive cluster ($10.92\times {10}^{14}$ h−1 M) shown in the upper left corner and the least massive one ($1.06\times {10}^{14}$ h−1 M) shown in the lower right corner. The figure demonstrates that GalWeight can effectively recover cluster membership for rich massive galaxy clusters as well as small or poor groups of galaxies with the same efficiency.

In summary, applying GalWeight to the suite of MDPL2 and Bolshoi simulations demonstrates that GalWeight can successfully recover cluster membership with high efficiency. It also further demonstrates that it can simultaneously identify members in both the virial and infall regions by taking into account the FOG effect and the random motion of galaxies in the infall region. Furthermore, it can be applied to both rich galaxy clusters and poor groups of galaxies with the same efficiency (see Table 1).

4. A Comparison of Membership Techniques

In this section, we perform a general comparison between GalWeight and four other well-known techniques (shifting gapper, caustic, den Hartog, and SIM) without doing any quantitative comparison. We defer testing the efficiency of different membership techniques to recover the 3D true members of clusters and the influence of the determining their dynamical masses to a future work (see, e.g., Wojtak et al. 2007).

We begin by showing how each technique fares when it is applied in turn to two simulated clusters with masses of $10.92\times {10}^{14}$ h−1 M and $4.24\times {10}^{14}$ h−1 M from the Bolshoi simulation, shown in Figure 8. Making the assumption that the cluster is spherical, fiducial members are assumed to lie within three virial radii, 3rv, and are shown as 2D members in the phase-space (red points) in each panel of Figure 8. We select this radius (3rv) in order to examine the ability of each technique to recover true members not only within the virial radius but also in the infall region, i.e., the region of a cluster that extends from the viral radius rv to the turnaround radius rt, where ${r}_{t}\sim \,2\mbox{--}4\,{r}_{v}$. Shown in each panel by the solid black line is the optimal choice of demarcation contour separating members and field galaxies identified by our GalWeight technique. For reasons of space, we do not describe each of the four techniques (shifting gapper, den Hartog, caustic, and SIM) in detail here. However, we summarize them below and refer the reader to the references for more information.

Figure 8.

Figure 8. Example of four well-known membership techniques applied to two simulated clusters with masses of $10.92\times {10}^{14}$ h−1 M (top panels) and $4.24\,\times {10}^{14}$ h−1 M (bottom) from the Bolshoi simulation (Section 2). In each panel, the red points represent fiducial cluster members within 3rv, and the solid black lines show the demarcation contour enclosing cluster members, identified by applying our new technique (GalWeight). The open blue circles in panels (a), (b), (e), and (f) show members identified by the shifting gapper technique using Nbin = 10 and Nbin = 15, respectively. Panels (c) and (g) show the caustic technique employing rescale parameters of q = 25 (cyan lines) and q = 35 (pink lines) and also the Den Hartog technique (dotted black lines). The Yahil SIM (dark green lines) and Regős SIM (light green lines) techniques are presented in panels (d) and (h). GalWeight recovers fiducial members with high accuracy, improving upon the shifting gapper and den Hartog techniques simultaneously at small and large projected radii, the caustic techniques at small projected radius and the SIM technique at large projected radius ($\sim 3{r}_{v}$).

Standard image High-resolution image

The shifting gapper technique (Fadda et al. 1996) works by first placing galaxies into bins according to their projected radial distance from the cluster center. The user has the freedom to choose the number of galaxies per bin that they believe is best-suited to each application of the technique. Commonly chosen values are Nbin = 10 or 15. For each bin, the galaxies are sorted according to their velocities, then any galaxy separated by more than a fixed value (e.g., 1σ of the sample or 500–1000 km s−1) from the previous one is considered an interloper and removed. Fadda et al. (1996) used a gap of 1000 km s−1 and a bin of 0.4 h−1 Mpc or larger in order to have at least 15 galaxies. The open blue circles in panels (a), (e) and (b), (f) of Figure 8 represent the members identified by this technique, where the number of galaxies utilized per bin was Nbin = 10 and Nbin = 15, respectively. The gray points symbolize interlopers. Clearly, membership identification depends heavily upon the choice of Nbin, as there are many differences between the galaxies identified as members in panels (a), (e) and (b), (f). Additionally, in both cases, some true members of the two clusters are missed, especially at small projected radius. Furthermore, the shifting gapper technique depends on the choice of the velocity gap used to remove interlopers in each bin. A choice of a high-velocity gap results in the identification of a large fraction of interlopers as cluster members, while the choice of a low-velocity gap results in missing true cluster members (Aguerri et al. 2007).

The application of the caustic technique (e.g., Alpaslan et al. 2012; Serra & Diaferio 2013) is shown in panels (c) and (g) of Figure 8 for two rescale parameters, q = 25 (cyan lines) and q = 35 (pink lines). Although this technique is quite successful when applied to the cluster outskirts, it misses some of the true members located within the core, which are the most important galaxies affecting the dynamics of the clusters. They are missed because the caustic technique does not take into account the effects of the FOG distortion. Also, the caustic technique cannot be applied to small galaxy groups. Furthermore, applying the caustic technique is rather subjective and relies upon how the caustics can be inferred from the data (Reisenegger et al. 2000; Pearson et al. 2014). Nonetheless, it is still a powerful technique for estimating cluster masses.

The application of the den Hartog technique (den Hartog & Katgert 1996) is also shown by the dotted black lines in Figure 8, panels (c) and (g). This technique estimates the escape velocity as a function of distance from the cluster center by calculating the virial mass profile (see Section 5), ${v}_{\mathrm{esc}}(R)=\sqrt{\tfrac{2{{GM}}_{\mathrm{vir}}(R)}{R}}$, where G is the gravitational constant. The figure demonstrates that this technique is very biased toward including many far interlopers. In addition, its application relies on assumptions of hydrostatic equilibrium and spherical symmetry.

Panels (d) and (h) in Figure 8 show the application of two spherical infall models (SIMs). The Yahil (1985) and Regős models (Regős & Geller 1989) are shown by dark green and light green lines respectively. Note that one needs to determine the mass density profile and the background mass density in order to apply the SIM technique and determine the infall velocity profile (e.g., van Haarlem & van de Weygaert 1993). We determine the mass density profile for the simulated cluster from the NFW model (Navarro et al. 1996, 1997, Equations (15) and (16)), knowing its concentration c, virial radius rv, and scale radius ${r}_{s}={r}_{v}/C$. Also, the background mass density is given by ${\rho }_{\mathrm{bg}}={{\rm{\Omega }}}_{m}\,{\rho }_{c}$.

As shown in Figures 8(d) and (h), SIMs have difficulty identifying true members in the infall region in projected phase space. This is due to the fact that the effect of random motion of galaxies in the infall regions (van Haarlem & van de Weygaert 1993; Diaferio 1999) causes some members in the cluster outskirts to be missed. A recent investigation by our own team (Abdullah et al. 2013) has shown that SIMs can successfully be applied to sliced phase space by taking into account some kinds of distortions such as the transverse motion of galaxies with respect to the observer and/or rotational motions of galaxies inside the cluster. However, this is out of the scope of the current paper.

5. Observations—Application to a Sample of 12 Abell Clusters

In this section, we apply GalWeight to a sample of 12 Abell galaxy clusters, with galaxy coordinates and redshifts taken from SDSS-DR127 (hereafter, SDSS-DR12 Alam et al. 2015). In order to demonstrate the technique for both massive and poor clusters, we selected clusters with Abell richness parameters ranging from 0 to 3 (Abell et al. 1989). We deliberately selected some clusters that were almost isolated and others that had clumps or groups of galaxies nearby in order to demonstrate how the technique performs under these different scenarios. We apply the GalWeight technique only to this pilot sample of 12 clusters in this paper, deferring application to the entire SDSS-DR13 sample of ∼800 clusters to a later paper.

The data sample is collected as follows. The NASA/IPAC Extragalactic Database (NED)8 provides us with a first approximation of the angular coordinates and redshift of the center of our cluster sample (${\alpha }_{c}$, ${\delta }_{c}$, zc). We then download the coordinates and redshifts (right ascension α, declination δ, and spectroscopic redshift z) for objects classified as galaxies near the center of each cluster from SDSS-DR12 (Alam et al. 2015). The next step is to apply the binary tree algorithm (e.g., Serra et al. 2011) to accurately determine the cluster center (${\alpha }_{c}$, ${\delta }_{c},{z}_{c}$) and create a line-of-sight velocity (vz) versus projected radius (Rp) phase-space diagram. Rp is the projected radius from the cluster center and vz is the line-of-sight velocity of a galaxy in the cluster frame, calculated as ${v}_{z}=c(z-{z}_{c})/(1+{z}_{c})$, where z is the observed spectroscopic redshift of the galaxy and zc is the cluster redshift. The term $(1+{z}_{c})$ is a correction due to the global Hubble expansion (Danese et al. 1980) and c is the speed of light.

We then apply GalWeight to the 12 Abell clusters as described in detail in Section 3 in order to get the optimal contour line. The final step is to determine the virial radius, rv, at which ρ = 200ρc and the turnaround radius, rt, at which ρ = 5.55ρc (e.g., Nagamine & Loeb 2003; Busha et al. 2005; Dünner et al. 2006) from all galaxies located inside the optimal contour line of a cluster

In order to calculate these two radii, we should first determine the cluster mass profile. The cluster mass can be estimated from the virial mass estimator and NFW mass profile (Navarro et al. 1996, 1997) as follows.

The viral mass estimator is given by

Equation (12)

where ${v}_{z,i}$ is the galaxy line-of-sight velocity and Rij is the projected distance between two galaxies (e.g., Limber & Mathews 1960; Binney & Tremaine 1987; Rines et al. 2003).

If a system extends beyond the virial radius, Equation (12) will overestimate the mass due to external pressure from matter outside the virialized region (The & White 1986; Carlberg et al. 1997; Girardi et al. 1998). The corrected virial mass can be determined using the following expression:

Equation (13)

where S(r) is a term introduced to correct for surface pressure. For an NFW density profile and for isotropic orbits (i.e., the projected, ${\sigma }_{v}$, and angular, ${\sigma }_{\theta }$, velocity dispersion components of a galaxy in the cluster frame are the same, or equivalently the anisotropy parameter $\beta =1-\tfrac{{\sigma }_{\theta }^{2}}{{\sigma }_{r}^{2}}=0$), S(r) can be calculated by

Equation (14)

where $x=r/{r}_{s}$, rs is the scale radius, $\sigma (\lt r)$ is the integrated three-dimensional velocity dispersion within r, and ${\sigma }_{v}(r)$ is a projected velocity dispersion (e.g., Koranyi & Geller 2000; Abdullah et al. 2011).

The mass density within a sphere of radius r introduced by NFW is given by

Equation (15)

and its corresponding mass is given by

Equation (16)

where ${M}_{s}=4\pi {\rho }_{s}{r}_{s}^{3}[\mathrm{ln}(2)-(1/2)]$ is the mass within rs, ρs = δsρc is the characteristic density within rs and ${\delta }_{s}=({{\rm{\Delta }}}_{v}/3){c}^{3}{\left[\mathrm{ln}(1+c)-\tfrac{c}{1+c}\right]}^{-1}$, and the concentration $c={r}_{v}/{r}_{s}$ (e.g., Navarro et al. 1997; Rines et al. 2003; Mamon et al. 2013).

The projected number of galaxies within a cylinder of radius R is given by integrating the NFW profile (Equation (15)) along the line of sight (e.g., Bartelmann 1996; Zenteno et al. 2016)

Equation (17)

where Ns is the number of galaxies within rs that has the same formula as Ms, and g(x) is given by (e.g., Golse & Kneib 2002; Mamon et al. 2010)

Equation (18)

Thus, we can fit rs for each cluster to get S(r) from Equation (14) and calculate the corrected mass profile Mv(r) from Equation (13). Also, the NFW mass profile is calculated from Equation (16). Then, rv, at which Δ = 200ρc, can be calculated from the viral or NFW mass profiles. While rt, at which Δ = 5.55ρc, can be determined from NFW mass profile only. We cannot determine rt from the virial mass profile because the assumption of hydrostatic equilibrium is invalid.

Finally, after we calculate rv and rt (from NFW mass profile) the cluster membership can be defined as all galaxies enclosed by the optimal contour line and within rt, as shown by the red points in Figure 9. It is worth noting once again that GalWeight is effective at taking into account the effects of the FOG distortion in the innermost regions and the random motion of galaxies in the cluster infall region. Moreover, GalWeight is not affected by the presence of substructures or nearby clusters or groups as demonstrated, for example, for A2063 and A2065. Furthermore, GalWeight can be applied both to rich clusters such as A2065 and A1656 and to poor galaxy groups such as A1459 and A2026.

Figure 9.

Figure 9. Application of the GalWeight technique to 12 Abell clusters from SDSS-DR12 (see also Table 2). The solid black lines shows the optimal contour line and the two dashed vertical lines show the virial and turnaround radii respectively. The red points show galaxies identified as clusters members—those enclosed by optimal contour line and rt. Also shown in each panel is the cluster virial mass ($\mathrm{log}{M}_{v}$ h−1 M) and number of galaxies within rv.

Standard image High-resolution image

In order to compare our results with the literature, we calculate the radii and their corresponding masses at three overdensities, Δ500 = 500ρc, Δ200 = 200ρc, and Δ100 = 100ρc as shown in Table 2. The sample is displayed in order of decreasing NFW M200 mass. A complete list of NFW parameters is also provided in Table 2.

Table 2.  Dynamical Parameters Derived for the Sample of 12 Abell Galaxy Clusters

Cluster zc Virial Mass Estimator NFW Mass Estimator NFW Parameters
    r500 M500 r200 M200 r100 M100 r500 M500 r200 M200 r100 M100 rs Ms c
(1) (2) (3) (4) (5) (6) (7) (8) (9) (10) (11) (12) (13) (14) (15) (16) (17)
A2065 0.073 1.27 11.95 1.78 12.97 2.29 13.86 1.20 10.11 1.78 13.01 2.36 15.30 0.22 1.90 8.16
A1656 0.023 1.07 7.14 1.58 9.06 2.07 10.30 1.06 6.78 1.58 9.09 2.11 10.96 0.26 1.61 6.01
A2029 0.078 0.99 5.68 1.49 7.65 1.97 8.84 0.94 4.73 1.49 7.67 2.07 10.31 0.61 2.80 2.46
A2142 0.090 0.97 5.24 1.47 7.27 2.03 9.64 0.91 4.35 1.47 7.32 2.05 10.02 0.67 2.99 2.19
A2063 0.035 0.80 2.91 1.17 3.73 1.54 4.23 0.81 3.10 1.18 3.76 1.55 4.27 0.08 0.40 14.40
A1185 0.033 0.75 2.38 1.08 2.89 1.42 3.29 0.72 2.20 1.08 2.91 1.44 3.48 0.17 0.49 6.53
A0117 0.055 0.65 1.55 0.93 1.86 1.26 2.31 0.61 1.29 0.93 1.88 1.27 2.37 0.24 0.46 3.87
A2018 0.088 0.55 0.92 0.90 1.67 1.35 2.84 0.55 0.95 0.90 1.67 1.27 2.36 0.48 0.82 1.83
A1436 0.065 0.49 0.68 0.89 1.64 1.25 2.27 0.61 1.29 0.89 1.64 1.18 1.92 0.10 0.23 8.68
A1983 0.045 0.58 1.09 0.85 1.39 1.09 1.51 0.57 1.03 0.85 1.41 1.14 1.71 0.16 0.27 5.37
A1459 0.020 0.50 0.73 0.72 0.85 0.92 0.89 0.50 0.73 0.72 0.87 0.95 0.97 0.04 0.08 18.4
A2026 0.091 0.46 0.54 0.71 0.82 0.91 0.88 0.47 0.59 0.71 0.83 0.96 1.03 0.16 0.19 4.32

Note. Radii and their masses are calculated by virial and NFW mass estimators at overdensities of Δ = 500, 200, and 100 ρc. The radius and mass are in units of h−1 Mpc and 1014 h−1 M. Columns: (1) cluster name; (2) cluster redshift; (3)–(4), (5)–(6), and (7)–(8) are radii and their corresponding masses calculated by the virial mass estimator at overdensities of Δ = 500, 200, and 100, respectively; (9)–(10), (11)–(12), and (13)–(14) are radii and their corresponding masses calculated by an NFW model at overdensities of Δ = 500, 200, and 100, respectively; (15)–(17) are scale radius, its corresponding scale mass, and concentration of NFW parameters.

Download table as:  ASCIITypeset image

In Table 3, we list ratios of radii and masses for each of the 12 Abell clusters using our GalWeight-determined method (assuming an NFW profile) divided by previously published values, $({r}_{\mathrm{NFW}}/{r}_{\mathrm{ref}})$ and $({M}_{\mathrm{NFW}}/{M}_{\mathrm{ref}})$ respectively, at overdensities of Δ = 500, 200, and 100ρc. Column 8 of Table 3 also lists the ratio of GalWeight-determined masses relative to those estimated from the caustic technique (Rines et al. 2016), (${M}_{\mathrm{NFW}}/{M}_{\mathrm{caus}}{)}_{200}$, at Δ = 200ρc. Table 3 clearly shows that the radii and masses estimated for a cluster are strongly dependent on the technique used to assign membership and remove interlopers (see Wojtak et al. 2007). The ratio $({r}_{\mathrm{NFW}}/{r}_{\mathrm{ref}})$ ranges between 0.63 and 1.55, while the ratio $({M}_{\mathrm{NFW}}/{M}_{\mathrm{ref}})$ ranges between 0.58 and 2.18.

Table 3.  GalWeight-determined Ratios of Radii and Mass for each of the 12 Abell Clusters Compared to Previously Published Values

Cluster (rNFW/rref) (MNFW/Mref) (MNFW/Mcaus)200
(1) (2) (3) (4) (5) (6) (7) (8)
  Δ500 Δ200 Δ100 Δ500 Δ200 Δ100 Rines et al. (2016)
A2065 1.70 (5), 1.15 (8) 1.04 (8), 1.11 (11) 1.50 (8) 1.12 (8), 1.30 (11) 3.84
A1656 1.16 (5) 0.89 (4), 1.05 (9) 1.04 (6) 1.09 (5) 0.72 (4), 1.16+, (9) 1.12 (6) 2.02
A2029 1.05 (5), 0.93 (8) 0.93 (8), 0.89 (11) 0.92 (5), 0.80 (8) 0.82 (8), 0.88 (11) 1.42
A2142 0.91 (8), 1.30+, (10) 1.30+, (10), 0.94 (11) 0.96 (12) 0.66 (8) 1.80+, (10), 0.75 (11) 0.86 (12) 2.27
A2063 1.30 (8) 1.18 (8) 1.05 (12) 2.18 (8) 1.65 (8) 1.03 (12) 1.40
A1185 1.01, (3) 2.77, (3) 1.37
A0117 0.83, (1), 1.05 (2) 0.58, (1)
A2018 0.82 (2), 1.18 (13) 0.94
A1983 0.85 (7) 0.90, (3), 0.81 (7) 1.18 (7) 0.99, (3), 1.03 (7) 1.64
A1436 1.55+, (10) 0.64, (1), 1.24+, (10) 0.64, (1), 1.30+, (10)
A1459 0.94, (1) 0.84 (1)
A2026 0.63 (2)

Note. Columns: (1) cluster name; (2)–(4) ratio of GalWeight radii to those in the literature at overdensities of Δ = 500, 200, and 100 ρc respectively, assuming an NFW model; (5)–(7) ratio of GalWeight masses to those in the literature at overdensities of Δ = 500, 200, and 100 ρc respectively, assuming an NFW model; (8) ratio of GalWeight masses assuming an NFW model to those calculated from the caustic technique in Rines et al. (2016) at Δ = 200ρc. References: (1) Abdullah et al. (2011); (2) Aguerri et al. (2007); (3) Girardi et al. (2002); (4) Kubo et al. (2007); (5) Laganá et al. (2011); (6) Lokas & Mamon (2003); (7) Pointecouteau et al. (2005); (8) Reiprich & Böhringer (2002); (9) Rines et al. (2003); (10) Rines & Diaferio (2006); (11) Sifón et al. (2015); (12) Wojtak & Lokas (2010). Symbols: + caustic technique, shifting gapper, SIM.

Download table as:  ASCIITypeset image

The cluster masses from the literature tabulated in Table 3 have been calculated in various ways. Below, we explicitly compare our values to those obtained from applying the shifting gapper, SIM, and caustic methods.

First, comparing to the shifting gapper technique (see (${}^{\bullet }$) in Table 3, Girardi et al. 2002; Sifón et al. 2015), we find that the ratio $({M}_{\mathrm{NFW}}/{M}_{\mathrm{ref}})$ is larger than unity in some cases (A2065, A1185) and smaller than unity in others (A2029, A2142). This is because members assigned by this technique, and consequently the mass calculated, depend on the selection criteria of number of galaxies and velocity gap per bin. As discussed before, the choice of a high-velocity gap includes more members and consequently larger mass and vice versa.

Second, comparing to the SIM method (see () in Table 3, Abdullah et al. 2011), we note that the mass ratio $({M}_{\mathrm{NFW}}/{M}_{\mathrm{ref}})$ is less than unity for the three clusters A0117, A1436, and A1459. This is because SIM includes more galaxy members inside the virial region even though they are very far from the cluster body. This is due to the assumption of conservation of mass that influences the validity of SIM in the innermost region (see Figure 6 in Abdullah et al. 2011).

Third, comparing to the caustic technique (see (+) in Table 3, Rines et al. 2003; Rines & Diaferio 2006; Rines et al. 2016), we specifically calculate the ratio ${({M}_{\mathrm{NFW}}/{M}_{\mathrm{caus}})}_{200}$ as listed in Table 3, column 8. It demonstrates that this ratio is larger than unity for seven clusters, and the highest ratio is for A2065, for which the estimated mass from NFW is four times that expected from the caustic technique. As described above, the main reason for this discrepancy is that the caustic technique does not take into consideration the effect of FOG. Thus, it misses more members inside the virial region and consequently expects lower masses.

We compare again GalWeight with the four well-known techniques (shifting gapper, caustic, den Hartog, and SIM) for the Coma cluster as shown in Figure 10. The figure (see also Figure 8) demonstrates that the GalWeight performs very favorably against established methods, taking into account as it does the effects of the FOG distortion at small projected radius well as the random motion of galaxies in the infall region. In order to apply SIM to the Coma cluster, the spatial number density profile is calculated from the NFW model (Navarro et al. 1996, 1997). Also, we assume that the background number density ${\rho }_{\mathrm{bg}}=0.0106$ h3 ${{\rm{Mpc}}}^{-3}$, which is calculated using the parameters of the Schechter luminosity function (${\phi }^{* }=0.0149$ h3 ${{\rm{Mpc}}}^{-3}$, ${M}^{* }-5\mathrm{log}h=-20.44$, and α = −1.05 for r magnitude, Blanton et al. 2003).

Figure 10.

Figure 10. Example of four well-known membership techniques applied to the cluster. The blue open symbols and solid lines are the same as those in Figure 8. Clearly GalWeight (solid black lines) appears to identify cluster members well, both in the virialized and infall regions of phase space.

Standard image High-resolution image

Because of the presence of interlopers, estimates of cluster mass tend to be biased too high and estimates of cluster concentration tend to be biased too low. Our work suggests that applying GalWeight rather than another technique to determine cluster membership before applying a dynamical mass estimator (virial theorem, NFW model etc.) likely results in a more accurate estimate of the true cluster mass and concentration. In a future work, we will compare the efficiency of different membership techniques to assign membership and their influence on estimating cluster mass using different mass estimators.

6. Discussion and Conclusion

In this paper, we introduced the Galaxy Weighting Function Technique (GalWeight), a powerful new technique for identifying cluster members. specifically designed to simultaneously maximize the number of bona fide cluster members, while minimizing the number of contaminating interlopers.

GalWeight takes into account the causes of different distortions in the phase-space diagram and is independent of statistical or selection criteria. It can recover membership in both the virial and infall regions with high accuracy and is minimally affected by substructure and/or nearby clusters.

We first demonstrated GalWeight's use by applying it interactively to a simulated cluster of mass $9.37\times {10}^{14}$ h−1 M selected from Bolshoi simulation. Next, we tested the efficiency of the technique on ∼3000 clusters selected from the MDPL2 and Bolshoi simulations with masses ranging from $0.70\times {10}^{14}\,{h}^{-1}\,{M}_{\odot }$ to $37.4\times {10}^{14}\,{h}^{-1}\,{M}_{\odot }$. The completeness and interloper fractions for MDPL2 are ${f}_{c}=0.993,0.992$ and 0.981 and ${f}_{i}=0.096,0.098$ and 0.118, while for Bolshoi ${f}_{c}=0.995,0.981$ and 0.971 and ${f}_{i}=0.126,0.217$ and 0.226 within rv, 2rv, and 3rv, respectively. We then compared its performance to four well-known existing cluster membership techniques (shifting gapper, den Hartog, caustic, and SIM). Finally, we applied GalWeight to a sample of 12 Abell clusters of varying richnesses taken from SDSS-DR12. By assuming an NFW model and applying the virial mass estimator, we determined the radius and corresponding mass at overdensities of Δ500, Δ200, and Δ100. The virial mass (at Δ200) of the sample ranged from $0.82\times {10}^{14}$ h−1 M to $12.97\times {10}^{14}$ h−1 M, demonstrating that GalWeight is effective for poor and massive clusters. In the future, we plan to apply GalWeight to a larger SDSS sample of galaxy clusters at low and high redshifts.

We believe that GalWeight has the potential for astrophysical applications far beyond the identification of cluster members, e.g., identifying stellar members of nearby dwarf galaxies, or separating star-forming and quiescent galaxies. We also plan to investigate these applications in a future work.

We thank Brian Siana for useful discussions. We also thank Gary Mamon for his useful comments on the paper. Finally, we thank the reviewer for suggestions that improved this paper. G.W. acknowledges financial support for this work from NSF grant AST-1517863 and from NASA through programs GO-13306, GO-13677, GO-13747, and GO-13845/14327 from the Space Telescope Science Institute, which is operated by AURA, Inc., under NASA contract NAS 5-26555, and grant number 80NSSC17K0019 issued through the Astrophysics Data Analysis Program (ADAP).

Footnotes

Please wait… references are loading.
10.3847/1538-4357/aac5db