Are Individuals Perceived as More Attractive within a Group? A Confirmative Study of Group Attractiveness Effect and the Cheerleader Effect in China

The stereotype “what is beautiful is good” suggests that having an attractive physical appearance is very important to people’s lives. Physical attractiveness, as an important personal trait, plays vital roles for shaping one’s first impression, and for influencing one’s subsequent evaluation and selection, based on obvious visual features. However, the contextual difference of the physical attractiveness between a group and its group members has been given less attention. For this reason, Van Osch et al. proposed and tested that the perceived physical attractiveness of a group is greater than the average attractiveness of its members (GA-effect), while Walker and Vul found that an individual as a group member is perceived to be more attractive in group context than in isolation (cheerleader effect). Our present work seeks to replicate these two effects on a sample of 1005 Chinese participants to check whether such effects also exist in China, a collectivist culture. Participants were asked to rate the degree of attractiveness presented in each photo stimuli and in each rating condition, and the results show that: (a) the participants’ ratings of physical attractiveness of a group was significantly higher than the average attractiveness of its members (i.e., replicating the GA-effect); (b) the participants’ ratings of physical attractiveness of an individual was evaluated higher in a group than in isolation (i.e., replicating the cheerleader effect); and (c) the larger the group size, the larger the GA-effect. This validating study may aid in understanding human behavior, judgments, and perceptions according to the GA-effect and the cheerleader effect used in a different context in China.


Introduction
The "what is beautiful is good" stereotype is powerful and widespread [1]. People who are more attractive may receive greater attention and concern [2], receive better judgment [3]), have more friends [4], a more successful career [5], and a longer life expectancy [6]. An attractive individual is imbued with almost all sorts of positive qualities, especially in western European and northern American cultures [7]. The phenomenon of "what is beautiful is good" has been studied for decades [7][8][9][10][11][12][13]. Physical attractiveness refers to people's evaluations of individuals' physical appearances based on people tend to pay more attention to individuals that are most attractive. A common phenomenon in daily life is that attractive individuals may receive longer gaze durations and larger cones of gaze. Scientific evidence, by utilizing eye tracking technique, also finds that more attractive group members receive more visual attention than less attractive group members [10]. Thus, selective attention may lead to the most attractive members of a group exhibiting greater influence on the valuation of the group. For instance, researchers found that people rate faces that they were more familiar with as more attractive compared to those who are not so familiar [23]. Therefore, based on the filter model of selective attention, people's evaluations of the physical attractiveness of those whom they are most concerned about in a group may influence their valuation of the whole group. If people think highly of someone in the group, they may think highly of their group as a whole, though this effect may be moderated by several factors such as target sex, target baseline level of attractiveness, and the level of attractiveness of the group members [24]. On the contrary, if people are greatly influenced by the least attractive members in the group, they may weaken the evaluation of the whole group [25].
The group attractiveness effect (GA-effect) implies that people perceive groups of individuals as more attractive than the average attractiveness of their members [10]. To support this claim, Van Osch and colleagues conducted nine studies that examined what happens when people evaluate the physical attractiveness of a group compared with the same attractiveness of its group members to test whether the judgments of this attractiveness can be explained by the averaging rule for group impressions [26]. We would expect the evaluation of a group's physical attractiveness to be based on the average attractiveness of its members [27]. However, in 8 out of 9 studies, Van Osch et al. found that people judge the physical attractiveness of a group to be more attractive than the average of its members. Although the GA-effect was originally found in European culture, its generalization is worth testing-especially in an Eastern culture such as China-since follow-up studies disclosed that a group's facial attractiveness is indeed the average attractiveness of its members [13].

Cheerleader Effect
The cheerleader effect was popularly mentioned in an American TV show named How I Met Your Mother [28], and it refers to a phenomenon that a girl will look more attractive if she is in a group of girls rather than in an isolated situation. To test the existence of the cheerleader effect, Walker and Vul [10] presented the individual's facial image in a group and in isolation, and found that an individual's face was rated as more attractive in a group than in isolation, thus confirming the cheerleader effect in a laboratory setting [11]. Ojiro et al. replicated the design of the Experiment 4 conducted by Walker and Vul, and they found the cheerleader effect was not significant in a Japanese culture based on very small size effect [18]. Then, a group of psychologists [15,16] investigated whether the spatial arrangement of the faces in a group would influence the magnitude of the cheerleader effect. Their findings demonstrated that faces were again rated as significantly more attractive in each group configuration than in isolation, and the spatial location of the target face did not influence the size of the cheerleader effect. Thus, the results supported the cheerleader effect in Australia, another individualistic culture, and confirmed that cheerleader effect is a robust phenomenon. A very recent work offered limited evidence for the role of hierarchical encoding in the cheerleader effect and continued to support the cheerleader effect [16]. Ying and colleagues provided evidence that supports the ensemble averaging in facial attractiveness, since one's brain automatically averages each individual's facial attractiveness together to create the gist of the collective attractiveness. This confirmed the robustness and replicability of the friend effect in the cheerleader effect, whereby simply being in the company of friends makes an individual look more attractive in a group. Similarly, if the viewer has prior exposure to unattractive groups of faces, then a face will again seem more attractive when subsequently viewed by comparison [14].

Overview of the Present Work
Two studies were conducted to replicate GA-effect and cheerleader effect. Specifically, in Study I, three sub studies were conducted based on 11 sets of photo stimuli, in order to replicate the original studies by Van Osch and colleagues ("Study 1a", "Study 1b", and "Study 2" for female groups, "Study 3" for male groups, "Study 4" for mixed gender groups, and "Study 5" for within-subject design). We also tested the cheerleader effect using same sample in Study Ia, Ib, and Ic.
Study Ia was a between-subject design, in which participants enrolled in a university campus were randomly and evenly assigned to three different conditions: in the group condition, they were asked to rate a group as a whole regarding its attractiveness (GR condition); in the other two conditions, they were required to respond to the attractiveness of each single individual (IR condition), or to rate the attractiveness of an individual while they were seen within the group (GMR condition).
Study Ib was a within-subject design, in which university students were called to respond to attractiveness ratings based on GR, GMR, and IR conditions. Study Ic was again a within-subject design, however, the participants were organizational workers. Study II aimed to replicate only the GA-effect by a within-subject design based on six groups of photo stimuli with only faces, following the design of the original studies conducted by Van Osch and colleagues ("Study 4" for mixed gender groups, and "Study 5" for within-subject design), in order to eliminate other outlying factors (hip-waist ratio) that may affect the attractiveness rating. Participants were asked to rate a group as a whole in the GR condition, and subsequently rate the attractiveness of each individual while they were seen within the group (GMR condition). Furthermore, a group size effect on the different types of attractiveness ratings was examined.

Sample
A total number of 813 subjects were enrolled in Study I. Specifically, in Study Ia, 522 university students (236 males, 286 females) aged between 18 and 27 (M = 20.2, SD = 2.6) were randomly and evenly assigned to each of the following conditions: the group rating (GR) condition; the group member (GMR) rating condition, and the individual rating (IR) condition; thus, Study Ia was a between-subject design. Study Ib was a within-subject design, with 155 first-year university students (60 males, 95 females) aged between 18 and 20 (M = 18.6, SD = 0.8) enrolled to judge the attractiveness in sequence for GR, GMR, and IR conditions. Study Ic was also a within-subject design, however, instead of the university students' sample, 136 organizational workers (76 males, 60 females) were enrolled in the MPA program at Southwest Jiaotong University, China, so that the varying ages (M age = 36.6, SD = 9.8) with different professional backgrounds were considered to cover a broader sample of ages and professions. As Study I did not include any clinical data collection nor pose any psychological harm to participants, the ethics committee of the Southwest Jiaotong University did not demand an ethical application from our project conducted among an adult population. Still, both oral and written informed consent was obtained from the participants prior to our survey and they were assured that participation was anonymous and the data usage was for scientific purposes only.

Study Design and Procedure
The photo stimuli were provided by 75 university students who were active in the online public social network platform named Wechat. In order to ensure the diversity of appearance, features such as varied gender, with or without glasses, hair length, and different facial expressions were considered in the selection criteria, and they were chosen with relatively high, medium, and low attractiveness (pilot tests of one-way ANOVA with attractiveness factor at high, medium, and low levels) with significant difference (all ps < 0.05) among each comparison of the attractiveness. The photo stimuli were assigned into three groups which represented three different rating conditions. Specifically, in the GR condition, several individuals appeared together in one photo stimuli in order to be rated as a whole group; in the GMR condition, several individuals' appearances synthesized together in a single photo stimulus with their corresponding Arabic number for indication (see Figure 1); and in the IR condition, where each individual was presented in a single photo stimulus. platform named Qualtrics Survey Software, by creating survey links based on different rating conditions. First, all photo stimuli for each rating conditions were separately prepared and presented in each questionnaire, thus three online questionnaires were created. Secondly, three online survey links were prepared by putting these questionnaires on the web. The questionnaires were then administered by sending the links to the potential participants in exchange for researchers' course credits at Southwest Jiaotong University. It was worth noting that the questionnaire was comprised of two parts: the first part contained questions about the participants' social demographic background, and the second part was a set of questions asking the attractiveness of each presented photo stimuli, based on a 10-point Likert-type scale (i.e., how attractive do you find this person? From 1 = not at all attractive, to 10 = very attractive). Specifically, the example photo stimuli (for illustration purpose) regarding each condition is indicated in Figure 1.

Measures
In the GR condition, the participants were asked to rate a total of 11 group photo stimuli with a varied number of displayed persons, from 3 to 11 (See Table 1). The participants were asked to respond the attractiveness of the group, considering the following criteria or indicators such as: (1) the physical attractiveness (how pretty the people in the photograph were), (2) friendliness (how kind the people in the photograph seemed), and (3) social attractiveness (to what extent would you want to belong with these people). Then they were required to indicate the degrees of attractiveness of the group (1 = not at all attractive, 10 = very attractive). All the selected photo stimuli were obtained with the owner's informed consent, and they were used in the following three experimental conditions. In the IR condition, the participants were asked to rate each individual photo based on the subject's attractiveness. In the GMR condition, the participants were asked to locate a specifically targeted person according to the Arabic number among a group of individuals within one photo stimuli (see Figure 1). In the GR condition, the participants were asked to rate the overall attractiveness of a photograph within which several individual members were synthesized together. All photo stimuli were presented in the online platform named Qualtrics Survey Software, by creating survey links based on different rating conditions. First, all photo stimuli for each rating conditions were separately prepared and presented in each questionnaire, thus three online questionnaires were created. Secondly, three online survey links were prepared by putting these questionnaires on the web. The questionnaires were then administered by sending the links to the potential participants in exchange for researchers' course credits at Southwest Jiaotong University. It was worth noting that the questionnaire was comprised of two parts: the first part contained questions about the participants' social demographic background, and the second part was a set of questions asking the attractiveness of each presented photo stimuli, based on a 10-point Likert-type scale (i.e., how attractive do you find this person? From 1 = not at all attractive, to 10 = very attractive). Specifically, the example photo stimuli (for illustration purpose) regarding each condition is indicated in Figure 1.

Measures
In the GR condition, the participants were asked to rate a total of 11 group photo stimuli with a varied number of displayed persons, from 3 to 11 (See Table 1). The participants were asked to respond the attractiveness of the group, considering the following criteria or indicators such as: (1) the physical attractiveness (how pretty the people in the photograph were), (2) friendliness (how kind the people in the photograph seemed), and (3) social attractiveness (to what extent would you want to belong with these people). Then they were required to indicate the degrees of attractiveness of the group (1 = not at all attractive, 10 = very attractive). In the GMR condition, the participants were required to evaluate the physical attractiveness of each person shown simultaneously in the single photo set, and each member in the group photo had an Arabic number at the top of their head in order to facilitate distinction. A total of 11 group photo sets that were displayed in GR condition were shown in the experiment, among which 5 were female group photo sets, 4 were male group photo sets, and 2 were mixed-group photo sets. For each photo, participants were asked to rate the following question: how attractive do you find group member X? (1 = not at all attractive, 10 = very attractive).
In the IR condition, the participants were asked to rate each individual photo that contained only a single person, and a total of 21 personal profile photos were shown. These photos were taken from 3 groups of photo sets among the photo stimuli in two previous conditions. For each photo sets, the participants were asked to answer the questionnaire items based on a 10-point Likert-type scale (i.e., how attractive do you find this person? From 1 = not at all attractive to 10 = very attractive).

Data Analysis
The questionnaire was distributed and returned through the researchers' network, because having a convenient sampling improved the flexibility and ability of respondents to accept the survey, and facilitated the diffusion of the questionnaire. Data gathering lasted for 4 months, and data analysis was conducted in SPSS (25.0, IBM, Armonk, NY, USA) for analysis of variance (ANOVA).

Was the Group Attractiveness Equal to the Average Attractiveness of Its Members (GR vs. GMR)? Validation of the GA-Effect
For Study Ia, a between-subject design, we conducted the univariate analysis of variance (ANOVA) with rating condition as independent variable (2 levels), while rating scores as dependent variable. Prior to conducting a series of follow-up ANOVA analyses, the homogeneity of variance assumption was tested for all 11 photo stimuli. Based on a series of Levene test, the homogeneity of variance assumption was considered satisfied (with p > 0.05), as reported in Table 1. The group was perceived as more attractive than the average attractiveness of each group member, across each photo stimuli, and with no exception of any single significance level (with one p < 0.001, four ps < 0.005, six ps < 0.05).
The partial eta squared (η p 2 ) for measuring the size effect based on the cut-off points for small (0.02-0.12), medium (0.13-0.25), and large effects (>0.26) according to Cohen [29], indicated an overall small effect, with six cases around η p 2 = 0.02, which generally confirmed the group attractiveness effect.
Instead of testing the between-subject effect on GA-effect in Study Ia, assigning different university students to rate different rating conditions, we conducted Study Ib adopting a within-subject design by asking another group of university student to rate each of the three conditions randomly. One-way analysis of variance (ANOVA) was adopted to compare the difference between the group ratings and group member ratings. The homogeneity of variance assumption for all 11 photo stimuli was considered satisfactory (with p > 0.05) before a series of follow-up comparisons. As indicated in Table 2, the group as a whole was rated to be more attractive than the average of the individual group members, and as such, this difference was significant except in one single case p > 0.05 (with five ps < 0.001, three ps < 0.005, two ps < 0.05). The partial eta squared (η p 2 ) for measuring the size effect indicated an overall small effect (with one case of medium effect at 0.127), based on the criteria, the results generally confirmed the GA-effect. Both Study Ia and Study Ib focused on the university sample, thus in Study Ic, we enrolled a group of 136 organizational workers from different professional backgrounds. We conducted one-way ANOVA with rating condition as a two-level factor (1-GMR condition, 2-IR condition), and rating score as dependent variable, on the premise of the satisfied homogeneity of variance assumption (p > 0.05). As indicated in' Table 3, the individuals were perceived more attractive when they were in the group context than in the group member context, and this difference was significant with the majority of ps < 0.005 (only one case p > 0.05). The partial eta squared (η p 2 ) for measuring the size effect based on the criteria of 0.02-0.12, 0.13-0.25, and above 0.26 which are cut-off points for small, medium, and large effects according to Cohen [29], showing an overall small effect, generally supporting the GA-effect. To test whether there is any difference on the attractiveness rating between the individual and that individual within a specific group, a series of one-way analysis of variance were carried out to compare the difference of IR and GMR ratings. The homogeneity of variance assumption was satisfied (p > 0.05) prior to conducting a series of follow-up ANOVA analyses. The results revealed in Table 4 shows that individuals were rated as more attractive when they were placed in a group, and most differences were observed to be significant (the majority of ps < 0.05, with three exceptions ps > 0.05). The partial eta squared (η p 2 ) for measuring the size effect indicated a small effect (with fourteen cases) medium effect (with five cases) to large effect (with two cases), based on the cut-off criteria, thus confirming the cheerleader effect. A series of one-way analyses of variance, with rating conditions as the independent variable and rating scores as the dependent variable, were conducted to compare the difference between group member rating and individual rating, under the satisfactory compliance of the homogeneity of variance assumption (p > 0.05). As indicated in Table 5, the group member ratings of attractiveness were greater than the individual ratings, and this difference was significant with the majority of ps < 0.001 (with fifteen ps < 0.001, two ps < 0.005, two ps < 0.05, when with the other two cases ps > 0.05). The partial eta squared (η p 2 ) for measuring the size effect indicated a small to medium effect (except one case 0.000) based on the cut-off value. Therefore, the individuals were perceived as more attractive than in the group member context, which again confirmed the cheerleader effect. The previous two pools of university samples supported the cheerleader effect, with the inclusion of another 136 workers which were recruited for rating the individual's attractiveness within a group or alone. A series of one-way ANOVA were conducted based on the satisfying homogeneity of variance assumption (p > 0.05). The results in Table 6 indicate that individuals were perceived as more attractive when they were rated in the group member context rather than alone; and this difference was significant with the majority of ps < 0.001 (with one case close to the borderline p = 0.08). The partial eta squared (η p 2 ) for measuring the size effect indicated a small to medium effect based on the criteria, and thus mainly supported cheerleader effect. The results for testing size effect as indicated in Table 7 revealed that there was a significant size effect in the group effect (p < 0.001), when the number of individuals increased from three in group photo set P5 to eight in group photo set P6, both GR rating and GMR rating increased and both significantly differed (p < 0.005). Specifically, overall, the same individual stimulus was rated higher in a larger size group photo set in comparison to a smaller size group photo set (though with one case such difference level p > 0.05). For instance, when the first individual in group photo set P7 was put in a larger size group in P8, she was perceived as more attractive than in a smaller group.

Sample
A total of 500 Chinese participants were contacted for Study II, of which 399 responded, excluding those who were inattentive or for whom there were missing data on crucial study variables. In total, 192 participants were retained with ages ranging from 20 to 48; of these 37.5% were male, and 80% held a university degree. Such a convenient sample improved the flexibility and capability of respondents to accept the survey, due to the distribution of the questionnaire among researchers' networks. The participants' oral and written informed consent was obtained before the experiment, and their participation was anonymous. In addition, the ethics committee of the Southwest Jiaotong University does not demand ethical application from projects conducted among an adult population, which does not include any clinical data collection nor pose any psychological or physical harm to the individuals.

Materials, Study Design and Procedure
Since using the whole body (compared to just the face) in the photo stimuli could cause confounding effects for attractiveness rating, following Study 6 by Van Osch et al. [10], we restricted attractiveness within the faces in Study II to replicate the GA-effect. In order to ensure diversity in appearance, facial features such as glasses, hair length, and skin color were considered in the selection criteria, and the 54 portrait facial photos were all arranged in groups of nine with a combination 3 × 3 matrix. For illustration purposes, example photo stimuli as study materials are depicted in Figure 2. Each photo comprises a mixed gender of both Asian and European adult faces to ensure the integration of various facial features. appearance, facial features such as glasses, hair length, and skin color were considered in the selection criteria, and the 54 portrait facial photos were all arranged in groups of nine with a combination 3 × 3 matrix. For illustration purposes, example photo stimuli as study materials are depicted in Figure  2. Each photo comprises a mixed gender of both Asian and European adult faces to ensure the integration of various facial features. We combined each individual face and synthesized 9 of them together in a single group photo. In other words, photo stimuli were presented to the participants in groups of nine which were distributed as a 3 × 3 matrix by a combination of English letter (A-C) and Arabic number (1-3). The participants were asked to respond to questions which were then divided into two parts. Participants were first asked to rate the group's overall facial attractiveness of the photo stimuli in GR condition (see Figure 2: GR condition), considering the following criteria or indicators such as: (1) physical attractiveness (how pretty the people in the photograph were), (2) friendliness (how kind the people in the photograph seemed), and (3) social attractiveness (to what extent would you want to belong with these people). Then, they were required to indicate the degrees of attractiveness of the group (1 = not at all attractive, 10 = very attractive): How attractive do you find this group (1 = not at all attractive, 10 = very attractive)? After this, they were asked to rate the attractiveness of each individual face in GMR condition (see Figure 2: GMR condition): How attractive do you find group member A1 (1 = not at all attractive, 7 = very attractive)? Thus, Study II was a within-subject design with rating conditions as a factor (two levels with respect to two conditions), where rating scores were the dependent variables.

Data Analysis
Data collected from the online Qualtrics software platform were loaded into the SPSS (25.0) for further analysis. Analysis of variance with rating conditions as a factor (2 levels: GR vs. GMR), and rating scores as dependent variables were applied for comparing the mean differences between the groups. We combined each individual face and synthesized 9 of them together in a single group photo. In other words, photo stimuli were presented to the participants in groups of nine which were distributed as a 3 × 3 matrix by a combination of English letter (A-C) and Arabic number (1-3). The participants were asked to respond to questions which were then divided into two parts. Participants were first asked to rate the group's overall facial attractiveness of the photo stimuli in GR condition (see Figure 2: GR condition), considering the following criteria or indicators such as: (1) physical attractiveness (how pretty the people in the photograph were), (2) friendliness (how kind the people in the photograph seemed), and (3) social attractiveness (to what extent would you want to belong with these people). Then, they were required to indicate the degrees of attractiveness of the group (1 = not at all attractive, 10 = very attractive): How attractive do you find this group (1 = not at all attractive, 10 = very attractive)? After this, they were asked to rate the attractiveness of each individual face in GMR condition (see Figure 2: GMR condition): How attractive do you find group member A1 (1 = not at all attractive, 7 = very attractive)? Thus, Study II was a within-subject design with rating conditions as a factor (two levels with respect to two conditions), where rating scores were the dependent variables.

Data Analysis
Data collected from the online Qualtrics software platform were loaded into the SPSS (25.0) for further analysis. Analysis of variance with rating conditions as a factor (2 levels: GR vs. GMR), and rating scores as dependent variables were applied for comparing the mean differences between the groups.

Results
Was group attractiveness greater than the average attractiveness of its members (GR vs. GMR)? One-way ANOVA was adopted for the difference of group rating and group member rating, based on having homogeneity of variance assumption satisfied (p > 0.05). As indicated in Table 8, overall, the group was perceived as more attractive than that attributed by the constitutive group members, and this difference was significant for 5 photo sets with all ps < 0.05, with one photo set at the borderline (p < 0.069). The partial eta squared (η p 2 ) for measuring the size effect indicated a small to medium effect based on the criteria, thus confirming that the GA-effect was consistent with previous findings by Van Osch et al. [10].

Discussion
By investigating Chinese participants' evaluations on physical attractiveness, StudyI expanded the research on GA-effect to another cultural context, and confirmed previous findings on the higher perceived attractiveness of a group compared to the attractiveness of its individual members [10], as well as the higher attractiveness of an individual within a group context rather than in isolation [11,14]. Study II again confirmed the existence of GA-effect by investigating Chinese participants' evaluations on both Chinese, Korean, and European facial photo stimuli, which was distinct with prior findings obtained in Korea in that the group attractiveness are an average [13]. In general, our main findings are: (a) the group's physical attractiveness is significantly greater than the average attractiveness of its members, which confirmed the GA-effect; (b) an individual is more attractive in a group setting compared to when he or she is seen alone, which conformed the cheerleader effect; and (c) the larger the group size, the greater the GA-effect.
To illustrate our findings on the GA-effect, the filter model of selective attention [30,31] may help explain why people are motivated to selectively be attracted to the most charming members: as people were faced with a large amount of information, they would selectively receive parts of the information that pertained to their interest. Particularly, in keeping with the "what is beautiful is good" stereotype, people tended to focus on the most attractive members of a given group. Therefore, when people judged the physical attractiveness of a group, they were more likely to be influenced by the most appealing members of that group, making the most attractive individuals account more for overall group attractiveness, thus increasing the group attractiveness [32]. This seemed to confirm that the obsession with beauty or attractiveness is not unique to modern Western culture, but can be also found in Eastern cultures, or even around the world in almost all societies that have been studied [33]. Moreover, when rating attractiveness, we found that in the experimental procedure, no matter which order the participants rate on attractiveness-, GMR first as in Study Ib and Study Ic, or GR first as in Study II-, the GA-effect was consistently found. This reflected on the fact that ratings of the group attractiveness were always higher than the attractiveness of its individual members, thus fully supporting the existence of the GA-effect.
By replicating Walker and Vul's work, the results generated from Study Ia, Ib, and Ic demonstrated that individuals give higher attractiveness ratings when targets are in group member condition (GMR) in comparison to the individual rating condition (IR). Such a result confirmed the cheerleader effect [11] and was consistent with very recent studies [14][15][16][17]24], therefore extending this finding to a collectivistic culture with a stronger effect size compared to previous studies [18]. According to the explanations proposed by Walker and Vul [11], when rating attractiveness: (1) people's visual system would automatically compute ensemble representations of faces presented in a group; (2) making the individual members of the group biased toward this ensemble average; and (3) average faces were more attractive. Thus, individuals are rated as more attractive in group based on the interplay of these three cognitive phenomena. Findings that individuals were perceived as more attractive in group member context than in isolation from present dataset supported the cheerleader effect [11], irrespective of an individual that is presented via body image in Study I or via facial image in Study II. As people's evaluations are influenced by the environmental context and/or other group members, such as ecological conditions may influence one's judgment of physical attractiveness [27]. When referring to a person's attractiveness, participants preferentially pay more attention towards the interested target's attractiveness, automatically integrating the attractive characteristics or stimuli in the group into one average, and this average would affect people's judgment of group members' attractiveness. Average characteristics or stimuli were more attractive [23,34], which may have increased people's rating on group members' attractiveness leading to individual physical attractiveness garnering higher ratings when in a group, rather than in isolation.
Regarding the effect of the group size on attractiveness, Van Osch et al. [10] found that group size was a significant moderator of the GA-effect, that is, the GA-effect was more likely to be found in larger groups, such as those with six or more members [35]. Our studies, controlling the size of the group, also supported this finding. One possible explanation might be that this is the result of selective attention: as the number of a group increases, people notice more of the objects of interest in that group, which makes more attractive individuals exact much greater influence on the group's overall judgment, thereby leading to greater deviations from the average attractiveness within larger groups [36]. Another reason might be the result of the priming effect; as Price and colleagues mentioned in previous work, when the number increases, it activates an internal representation of relative quantity that directly affects the internal representations of magnitude, thus in turn affecting judgements of the average [37]. Moreover, consistent with previous studies [35][36][37], our results which were revealed from present datasets could not allow us to draw further conclusions on numbers that could be enough for generating such a size effect.

Implications
Theoretically speaking, the generalization of two previously tested attractiveness effects where either GA-effect or cheerleader effect, are confirmed in a collectivist culture as represented in China. Specifically, both Study I and Study II confirmed previous work on GA-effect within European culture [35] which was different from previous findings obtained in Korea [13]. Study I also confirmed the cheerleader effect that was found in the U.S. [11], consisted with previous work validated in Japan [18], as well as in Australia [15,16]. In this sense, the present work may provide additional evidence for understanding how and why people's perceptions, judgement, and behaviors can be influenced by attractiveness, irrespective of individual or collectivist culture forms [38]. In a deeper sense, our work added evidence for understanding the selective attention theory, as well as the mechanism of hierarchical encoding in the cheerleader effect.
Practically speaking, the findings yielded from the present datasets suggest that people are more attractive when they are evaluated in the group context, and this result illustrates the importance of the environmental or contextual factors in evaluating physical attractiveness. On one hand, it suggests that choosing a good group may help facilitate self-image and increase attractiveness [39]; on the other hand, it warns us that individual physical attractiveness evaluated in group may be influenced by some other group members within that group, so we should be more careful when making judgments about people's physical attractiveness, whether alone or in groups. Secondly, the overall group physical attractiveness is greater than the average attractiveness of the group members, suggesting that people may selectively pay attention to someone in the group that they find more attractive, and such attention and preference to attractive individuals will affect people's evaluations of the whole group. Evidence found that when rating the attractiveness of romantic partners, being in a group context increased the attractiveness of an individual when they met the minimum level of attractiveness of the group, which comprised of some highly attractive individuals [24]. For example, in commercial advertisement, the adoption of a group of individuals as advertising spokespersons may be more likely to be noticed and welcomed by consumers; in leisure activities such as singing shows or talent shows, the inclusion of celebrities may be more likely be appreciated and viewed by the audience. Men across cultures place greater importance than women do on physical attractiveness of a prospective partner, choosing a group in which one's attractiveness can be increased seems to be a good mating strategy for women [14,40]. As the size of the group increase, the larger the GA-effect, suggesting that the number of team members may affect the physical attractiveness of the overall team, as well as each member in this team. For example, in the business and management negotiation, the selection and combination of a team with more attractive members may increase the partner's evaluation of the whole team of that company and help achieve the negotiation agreement. In general, the attractiveness effect, where either GA-effect or cheerleader effect, seems widespread in our daily lives, not only affecting our judgment of people's physical attractiveness, but more importantly, perhaps being an important basis for our judgment and decision-making in many walks of life. Previous works speculated that the GA-effect and the cheerleader effect might influence the above-mentioned situations, but there is no evidence yet. Therefore, this confirmative study could help us understand human behavior, judgments, and perceptions over the attractiveness effects used in different contexts and across cultures [41].

Limitations and Future Research
This paper replicated some of the studies by Van Osch and colleagues in order to investigate whether the group attractiveness effect is detectable in a non-European country such China. Our results confirmed the existence of such a GA-effect in Chinese context through two sub-studies. The present work also validated the cheerleader effect in Chinese context. However, a few limitations warrant notice. First, following the design of the original study, we neglected the fact that some of the ratings of physical attractiveness of an individual may not only be based on facial attractiveness, but also on body height or body shape or clothing, or even age or ethnicity which are known to influence participants' perceptions, and in this sense we applied only faces in Study II to rule out above mentioned influential factors. Second, although we recruited both university students and real workers, this sample size may be not large enough to represent the whole Chinese population, thus it may affect the generalization of our findings. Also, regarding an eventual gender effect, the present sample failed to test the influence of the participant's gender difference in various ways of rating time length of participants' attention to the stimuli, their eye-tracking, or other factors on GA-effect as in study by Van Osch et al. Thus, future work may continue analyzing the gender effect to compare male and female attractiveness within larger sample sizes, as well as that of attractiveness being rated by the same or opposite genders. Future work may also continue to study different sample groups to determine the attractiveness ratings based on different ethnicities, Chinese stimuli versus foreign (i.e., European) stimuli. Finally, in terms of tools and instruments, both qualitative and quantitative accounts including additional physiological data might be gathered to ensure understanding of these effects. Possible causes and effects of GA-effect intuitively and accurately facilitate application of the GA-effect in the fields of everyday life, such as advertising, entertainment, management negotiation, and so on, as possible research directions for future work.

Conclusions
Two studies replicated the GA-effect and the cheerleader effect. Overall, our findings confirm that the GA-effect is detectable in a collectivistic cultural context in China, that is to say, the perceived physical attractiveness of a group was greater than the average attractiveness of its group members [10], and the perceived physical attractiveness of an individual was greater in the group context than in isolation [11]. It seems that group context may help yield advantages for increase of people's attractiveness, however, we are not sure if it is the factor of collectivistic culture that can account for this, due to present datasets. As such, further research may benefit from designing experiments for exploring the underlying mechanism by investigating psychological and neural factors.