Although recognizing a facial expression seems effortless, it requires integration of multiple visual features (Smith, Cottrell, Gosselin, & Schyns, 2005). Because expression perception is vital for social interactions, the visual system likely incorporates expression-relevant information from other sensory modalities. The auditory modality may provide especially strong crossmodal cues to facial expressions, since vocalizations tend to accompany affective states.

Indeed, researchers have demonstrated that emotion-related speech sounds such as prosody (see below), as well as less explicit emotion-conveying sounds such as music (e.g., Spreckelmeyer, Kutas, Urbach, Altenmüller, & Münte, 2006), can influence the perception of facial expressions. For example, when a face and voice are concurrently presented, the visual and auditory forms of information both contribute to emotion perception proportionally to their expressive strength (Massaro & Egan, 1996). Additionally, emotion classification improves when emotionally congruent faces and voices are bimodally (vs. unimodally) presented (de Gelder & Vroomen, 2000; Kreifelts, Ethofer, Grodd, Erb, & Wildgruber, 2007). These results suggest that concurrent visual and auditory information facilitates emotion judgments, but the results do not necessarily provide evidence of crossmodal interactions in facial expression perception.

More relevant to the present study, visual classification of happy and fearful faces is speeded when emotionally congruent voices are presented concurrently (Dolan, Morris, & de Gelder, 2001). Similarly, visual classification across the continuum of happy and sad faces is biased by concurrently presented happy and sad voices (de Gelder & Vroomen, 2000). These findings suggest that emotion-conveying prosody crossmodally influences visual classification of facial expressions. Ethofer et al. (2006) showed that emotion-conveying voices also influence visual perception of facial expressions by asking participants to rate expressions ranging from fearful to happy on a schematic scale of negative to positive expressions. Simultaneously presented fearful voices made participants rate neutral-to-fearful faces as more fearful, whereas happy voices had no effect.

Although previous research has provided clear evidence of auditory–visual interactions in facial expression perception, researchers have investigated these auditory effects only with single, isolated faces. However, people typically see emotional faces in group as well as dyadic contexts, and it is unclear whether crossmodal effects obtained with a single face would generalize to a crowd of faces. Although such a generalization may seem intuitive, neural responses differ when multiple faces simultaneously fall within the large receptive fields of face-selective visual neurons, as compared to when only a single face falls within those receptive fields (Desimone, 1991; Kastner et al., 2001; Sweeny, Grabowecky, & Suzuki, 2009; Zoccolan, Cox, & DiCarlo, 2005). It is thus possible that crossmodal effects might differ for a single face and a crowd of faces. Furthermore, instead of prosody, which can be interpreted differently depending on cultural and linguistic contexts (e.g., Scherer, Banse, & Wallbott, 2001), we used laughter, which universally and strongly conveys positive emotion.

We present a surprising demonstration that laughter produces opposite effects on perceived facial expressions, depending on whether one views a single emotional face or an emotional face in a crowd of faces.

Experiment 1: Effects of laughter on perception of a single face versus multiple faces

Method

Participants

The participants in all experiments were Northwestern University undergraduate students, gave informed consent to participate for partial course credit, had normal or corrected-to-normal visual acuity and normal hearing, and were tested individually in a dimly lit room. A group of 18 (10 female, 8 male) students participated in this experiment.

Stimuli and procedure

The participants were shown brief displays of schematic faces (~100 ms, varying between 94 and 106 ms because of the 75-Hz monitor refresh rate). Each face (subtending 1.15° [horizontal] × 1.49° [vertical] of visual angle) assumed one of three expressions, denoted by the curvature of the mouth: an upward-curved mouth indicated a happy face, a flat mouth indicated a neutral face, and a downward-curved mouth indicated a sad face. The curved mouths (0.80° × 0.14°) and the flat mouth (0.86° × 0.06°) were of the same length.
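The reported range follows directly from frame quantization: on a 75-Hz display, stimulus durations are whole multiples of the refresh period. A minimal arithmetic sketch (ours, not the authors' presentation code):

```python
# Frame quantization on a 75-Hz CRT (illustrative arithmetic only).
frame_ms = 1000 / 75                 # one refresh = ~13.33 ms
for n_frames in (7, 8):
    print(f"{n_frames} frames = {n_frames * frame_ms:.1f} ms")
# 7 frames ~ 93.3 ms and 8 frames ~ 106.7 ms, matching the reported
# ~94-106 ms range for the nominally 100-ms display
```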

On crowd trials, eight schematic faces were arranged in a circular array (5.50° radius) around a central fixation point. A third of the crowd trials contained a happy face (in one of eight randomly selected locations) presented among seven neutral faces; another third of the crowd trials contained a sad face presented among seven neutral faces; and the remaining crowd trials contained eight neutral faces. On single-face trials, a happy, sad, or neutral face was presented alone (in one of eight randomly selected locations) without the additional crowd of seven neutral faces.
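For concreteness, the display geometry can be reconstructed as follows. This is a sketch under our own assumptions (the authors do not report the angular positions of the eight array slots):

```python
import math
import random

# Eight equally spaced positions on a 5.50-deg-radius circle around
# fixation; the emotional face occupies a randomly chosen slot.
RADIUS_DEG = 5.50
positions = [(RADIUS_DEG * math.cos(2 * math.pi * i / 8),
              RADIUS_DEG * math.sin(2 * math.pi * i / 8))
             for i in range(8)]        # (x, y) in degrees of visual angle
target_slot = random.randrange(8)      # location of the happy or sad face
```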

Participants were told that every display contained either a happy or sad face and that their task was to report the perceived intensity of the happy or sad expression. Because of the brief stimulus duration, even neutral faces appeared happy or sad, presumably due to neural noise (Sweeny, Grabowecky, Kim, & Suzuki, 2011). Using visual stimuli identical to those used in the present study, we previously assessed response confidence and verified that participants indeed perceived negative and positive expressions, even when neutral faces were briefly presented (Sweeny et al., 2011).

On half of the trials, the visual display (100 ms) was presented simultaneously with a sound clip of a child laughing (1,000 ms, 67 dB SPL). The sound carried no information about the location or expression of the faces. A blank display (1,000 ms) followed, to allow the laughter clip to play for its full duration. No sound was presented on the remaining trials. All conditions were randomly intermixed across 96 trials, preceded by 10 practice trials. The trial sequence and timing are shown in Fig. 1.
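The design can be summarized as a small factorial trial list. The sketch below is our reconstruction from the counts reported above (2 contexts × 3 target expressions × 2 sound conditions, 8 repetitions each = 96 trials), not the authors' code:

```python
import random
from itertools import product

# Build and shuffle the Experiment 1 trial list (reconstruction).
contexts = ("single", "crowd")
expressions = ("happy", "neutral", "sad")
sounds = ("laughter", "none")
cells = list(product(contexts, expressions, sounds))   # 12 conditions
trials = cells * (96 // len(cells))                    # 8 repetitions each
random.shuffle(trials)                                 # random intermixing
```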

Fig. 1

Each trial began with a central fixation marker. After 1,000 ms, the face display, showing either a single emotional face (right) or an emotional face among seven neutral faces (left), was presented for ~100 ms. On half of the trials, a sound was presented (laughter in Exp. 1, or the spoken word “laugh” in Exp. 3) simultaneously with the face display. A fixation screen was subsequently displayed for 1,000 ms to allow the sound clip to play its full duration. Participants were then shown a response screen to use when indicating the curvature that most closely matched the mouth of the emotional face that they had seen. The trial ended once a response was entered on the keyboard. The next trial began after 3,700 ms

Rather than using a symbolic response scale (Ethofer et al., 2006), we more directly probed auditory effects on visual perception by using a curvature-matching task. The response screen consisted of 10 curved segments, arranged from left to right in the order of a strongly downward-curved (labeled “1”) to a strongly upward-curved (labeled “10”) segment. The numerical curvature labels were proportional to the vertical stretch of the curved segments, which increased linearly from 0.14° (for curves labeled “5” and “6”) to 0.22° (for curves labeled “1” and “10”). Participants pressed a key to indicate the curvature that most closely resembled the mouth of the perceived face. For example, when they perceived a slightly upward-curved mouth, they might press “6,” whereas when they perceived a moderately downward-curved mouth, they might press “3.” The actual mouth curvature was always “6” for a happy face or “5” for a sad face. We made these the minimum curvatures in the response scale because previous research (Sweeny et al., 2011), along with our pilot results, demonstrated that perceived mouth curvatures tend to be exaggerated during brief viewing. The response scale was thus appropriate for the range of perceived curvatures.
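The scale can be reconstructed from the reported values as follows; the exact key-to-sagitta mapping is our assumption:

```python
import numpy as np

# Ten-step curvature scale: sagitta (vertical stretch) grows linearly
# from 0.14 deg at the midpoint (keys "5"/"6") to 0.22 deg at the
# extremes (keys "1"/"10"); negative = downward, positive = upward.
sagitta = np.linspace(0.14, 0.22, 5)                    # 5 magnitudes
scale = {str(6 - k): -s for k, s in enumerate(sagitta, 1)}       # keys 5..1
scale.update({str(5 + k): s for k, s in enumerate(sagitta, 1)})  # keys 6..10
# e.g., scale["1"] == -0.22 (strongly downward), scale["6"] == 0.14
```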

The visual stimuli were displayed on a color CRT monitor (1,024 × 768 pixels, 75 Hz) at a viewing distance of 100 cm, and were presented using MATLAB with the Psychophysics Toolbox (Brainard, 1997; Pelli, 1997). Sounds were presented through Sennheiser HD256 headphones (10-Hz to 20,000-Hz frequency response).

Results

To determine how laughter influenced the perceived intensity of each facial expression, we computed the differences in the average expression ratings between the laughter and no-sound trials for each participant and each expression (see Table 1 for all of the mean curvature ratings). Positive values indicate that laughter made a face appear happier, whereas negative values indicate that laughter made a face appear sadder. The reported t statistics are Bonferroni corrected in all experiments.
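In code, this analysis reduces to a one-sample t test on per-participant difference scores. The sketch below uses placeholder data and hypothetical array names; it illustrates the computation rather than reproducing the authors' analysis scripts:

```python
import numpy as np
from scipy import stats

# `laugh` and `silent`: hypothetical per-participant mean curvature
# ratings for one expression under the two sound conditions (n = 18).
rng = np.random.default_rng(0)
laugh = rng.normal(6.2, 0.5, 18)        # placeholder data
silent = rng.normal(5.8, 0.5, 18)       # placeholder data
diff = laugh - silent                   # > 0: face looked happier with laughter
t, p = stats.ttest_1samp(diff, 0.0)     # test difference against zero
p_bonf = min(p * 3, 1.0)                # Bonferroni over the 3 expressions
d = diff.mean() / diff.std(ddof=1)      # Cohen's d for the difference scores
```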

Table 1 Average expression (mouth curvature) ratings from Experiment 1

For single-face trials, laughter significantly increased the perceived intensity of the happy face, t(17) = 3.34, p < .05, d = 0.79, without significantly influencing the perceived intensity of the neutral or the sad face, t(17)s < 1.15, n.s. (left panel of Fig. 2). This pattern of results was confirmed by a contrast analysis with 1, –.5, and –.5 as the weights assigned to the happy, neutral, and sad expressions, respectively, t(17) = 2.54, p < .02, d = 0.60.
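A common way to implement such a within-subjects contrast is a one-sample t test on the weighted sum of each participant's effect scores; the sketch below (placeholder data, our reconstruction) illustrates this:

```python
import numpy as np
from scipy import stats

# `effects`: hypothetical (18, 3) array of laughter-minus-no-sound
# rating differences, columns ordered happy, neutral, sad.
effects = np.random.default_rng(1).normal(0, 0.4, (18, 3))  # placeholder
weights = np.array([1.0, -0.5, -0.5])     # single-face contrast
scores = effects @ weights                # one contrast score per participant
t, p = stats.ttest_1samp(scores, 0.0)
# The crowd analysis is identical with weights (.5, .5, -1).
```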

Fig. 2

Effects of laughter on the expression ratings (mouth curvature ratings) of the happy, neutral, and sad faces, for the single-face (left panel) and crowd (right panel) trials. Positive values indicate that laughter made a face appear happier, whereas negative values indicate that laughter made a face appear sadder, with zero indicating no effect (as compared to the no-sound trials). The error bars represent ±1 SEM. * p < .05. ** p < .01 (Bonferroni corrected)

For crowd trials, in contrast, laughter significantly increased the perceived intensity of the sad face, t(17) = –4.32, p < .01, d = –1.02, without significantly influencing the perceived intensity of the happy or the neutral face, t(17)s < 2.23, n.s. (right panel of Fig. 2). This pattern of results was confirmed by a contrast analysis with .5, .5, and –1 as the weights assigned to the happy, neutral, and sad facial expressions, respectively, t(17) = 6.61, p < .0001, d = 1.56.

These differential effects of laughter on single-face and crowd trials were further confirmed by a significant ANOVA interaction between facial expression and visual context, F(2, 16) = 7.45, p < .003, η² = .31.
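An interaction test of this kind can be sketched with a repeated-measures ANOVA, for example via statsmodels' AnovaRM (simulated placeholder data; the authors' actual analysis software is not reported):

```python
import numpy as np
import pandas as pd
from statsmodels.stats.anova import AnovaRM

# One laughter-effect score per subject x expression x context cell.
rng = np.random.default_rng(2)
rows = [{"subject": s, "expression": e, "context": c,
         "effect": rng.normal()}                 # placeholder scores
        for s in range(18)
        for e in ("happy", "neutral", "sad")
        for c in ("single", "crowd")]
df = pd.DataFrame(rows)
res = AnovaRM(df, depvar="effect", subject="subject",
              within=["expression", "context"]).fit()
print(res)   # the expression:context row corresponds to the reported interaction
```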

Overall, simultaneously presented laughter exaggerated a happy expression for a single face and exaggerated a sad expression in a crowd of neutral faces. The fact that laughter produced expression-specific and opposite effects depending on the visual context makes explanations based on response bias and arousal unlikely.

Experiment 2: Why does laughter exaggerate a sad expression in a crowd?

We investigated why laughter increases the perceived intensity of a sad face in a crowd. We hypothesized that laughter might make the crowd of neutral faces appear slightly happy, thereby enhancing the negativity of the sad target face via increased emotion contrast. This possibility is hinted at by a trend in Experiment 1, in which laughter tended to make the crowd of neutral faces appear slightly happier, t(17) = 2.23, p < .05 (without Bonferroni correction), d = 0.52. If this hypothesis is true, making the mouths of the neutral distractor faces upward curved by an equivalent amount should exaggerate the perceived intensity of the deviant sad expression to the same extent as laughter did in Experiment 1.

Method

Participants

A group of 10 new students (6 female, 4 male) participated.

Stimuli and procedure

The stimuli and procedures were the same as in Experiment 1, except that only crowd trials were presented, with each display containing a sad face presented among seven distractor faces. On a third of the trials, the seven distractor faces were neutral, as in Experiment 1: the neutral-crowd condition. On another third of the trials, the distractor faces had a slightly happy expression, deviating from neutral by a half-curvature step (0.80° × 0.07°), approximately equivalent to the intensity of upward curvature induced by laughter on the crowd of neutral faces in Experiment 1: the slightly-happy-crowd condition. On the remaining trials, the distractor faces had a “full” happy expression equivalent to that of the happy faces used in Experiment 1: the full-happy-crowd condition. The full-happy-crowd condition was included to test whether a larger curvature contrast would make the sad face appear sadder to a greater degree.

Participants indicated the perceived curvature of the mouth of the sad face. Because all of the targets were sad faces, the matching scale only contained downward curves, ranging from 1 (slightly downward curved) to 5 (strongly downward curved). The trial sequence was identical to that of Experiment 1, except that no sounds were presented. All face displays were randomly intermixed across 96 trials.

Results

To determine how the happy expression of the crowd influenced the perceived intensity of the target sad face, we computed the difference in the average expression ratings between each of the happy-crowd conditions and the neutral-crowd condition. In this way, the measure was comparable to that computed in Experiment 1. In particular, negative values indicated that a happy crowd made the target sad face appear sadder (see Table 2 for all of the mean curvature ratings).

Table 2 Average expression (mouth curvature) ratings from Experiment 2

Both the slightly happy crowd [t(9) = 4.06, p < .01, d = 1.28] and the full-happy crowd [t(9) = 5.21, p < .01, d = 1.64] made the sad target face appear sadder (Fig. 3). Furthermore, comparing Fig. 2 (right panel) with Fig. 3 illustrates that laughter in Experiment 1 and the slightly happy crowd in this experiment made the target sad face appear sadder to similar degrees, t(26) = 0.29, n.s. This supports the hypothesis that laughter exaggerated the sad expression in the crowd of neutral faces by making the neutral faces appear slightly happy. Slightly happy and full-happy crowds made the sad face appear sadder to similar degrees [t(9) = 1.49, n.s.], suggesting that the exaggeration of the sad target face by emotion contrast was not strongly sensitive to the degree of the emotion contrast.

Fig. 3

Effects of happy crowd expressions—either slightly happy or full happy (see main text for details)—on the expression ratings (mouth curvature ratings) of the sad target face. Negative values indicate that the happy crowd expressions made the sad target face appear sadder (as compared to the neutral crowd expressions), with zero indicating no effect. The error bars represent ±1 SEM. ** p < .01 (Bonferroni corrected)

Experiment 3: Are laughter effects mediated by a perceptual interaction or semantic association?

How does laughter influence perceived facial expressions? Does the specific acoustic profile of laughter crossmodally influence the visual processing of facial expressions, or is the effect of laughter mediated by abstract semantic interactions? To test these possibilities, we replaced laughter with the spoken word “laugh.” If the specific acoustic profile of laughter drives the crossmodal effect, the spoken word “laugh” should produce no effect. In contrast, if listening to laughter activates the concept of a laugh, which in turn influences facial expression perception through semantic associations, the spoken word “laugh” should produce effects similar to those of laughter.

Method

Participants, stimuli, and procedure

A group of 18 new students (12 female, 6 male) participated. The stimuli and procedure were identical to those of Experiment 1, except that the spoken word “laugh” replaced the laughter. The word, spoken with neutral emotion (carrying no emotional prosody), was recorded from the Merriam-Webster online dictionary (female voice).

Results

We determined how the spoken word “laugh” influenced the perceived intensity of expressions in single-face and crowd trials. As in Experiment 1, we analyzed the changes in average expression ratings between “laugh” and no-sound trials (see Table 3 for all of the mean curvature ratings).

Table 3 Average expression (mouth curvature) ratings from Experiment 3

The spoken word “laugh” did not influence average expression ratings, regardless of facial expression or visual context (Fig. 4). All effects of the spoken word “laugh” were nonsignificant, even without Bonferroni correction [t(17)s < 1.43, n.s.], and no interaction between facial expression and visual context was obtained, F(2, 16) = 0.70, n.s. Furthermore, the between-experiments (laughter effect from Exp. 1 vs. null effect with “laugh”) interaction was significant for the single-face condition, F(2, 33) = 4.18, p = .024, η² = .20, and marginal for the crowd condition, F(2, 33) = 2.35, p = .11, η² = .13.
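This between-experiments comparison amounts to a mixed-design ANOVA, with experiment (laughter vs. spoken word) as a between-subjects factor and facial expression as a within-subjects factor, run separately for the single-face and crowd conditions. A sketch using the pingouin package and placeholder data (our reconstruction, not the authors' analysis):

```python
import numpy as np
import pandas as pd
import pingouin as pg

# Sound-effect scores for one visual context (e.g., single face):
# 18 subjects per experiment x 3 expressions, placeholder values.
rng = np.random.default_rng(3)
rows = [{"subject": f"{grp}-{s}", "experiment": grp,
         "expression": e, "effect": rng.normal()}
        for grp in ("laughter", "word")
        for s in range(18)
        for e in ("happy", "neutral", "sad")]
df = pd.DataFrame(rows)
res = pg.mixed_anova(data=df, dv="effect", within="expression",
                     subject="subject", between="experiment")
print(res)   # the Interaction row corresponds to the reported F tests
```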

Fig. 4

Effects of the spoken word “laugh” on the expression ratings (mouth curvature ratings) of happy, neutral, and sad faces, for single-face (left panel) and crowd (right panel) trials. Positive values indicate that the spoken word “laugh” made a face appear happier, whereas negative values indicate that “laugh” made a face appear sadder, with zero indicating no effect (as compared to the no-sound trials). The error bars represent ±1 SEM

Taken together, these results suggest that the effects of laughter are mediated by crossmodal perceptual associations between the acoustics of laughter and facial expressions, and are unlikely to be mediated by an abstract semantic association.

Discussion

We investigated how hearing laughter influenced visual perception of happy, neutral, and sad facial expressions. Importantly, we determined how crossmodal effects of laughter depended on visual context by comparing the effects of laughter on the perception of a single face and of a face presented in a crowd.

When a single face was briefly presented, laughter increased the perceived intensity of a congruent happy expression without affecting the intensity of a neutral or sad expression. This result extends previous findings that emotion-conveying prosody both facilitates classification and biases expression perception for a congruent facial expression (de Gelder & Vroomen, 2000; Dolan et al., 2001; Kreifelts et al., 2007; Massaro & Egan, 1996). Laughter and positively valenced prosody both activate similar brain regions, including the middle temporal gyrus (MTG), middle superior temporal gyrus, anterior rostral prefrontal cortex, and superior temporal sulcus (Belin, Zatorre, Lafaille, Ahad, & Pike, 2000; Grandjean et al., 2005; Kotz et al., 2003; Kreifelts et al., 2007; Szameitat et al., 2010). Furthermore, the left MTG and left anterior fusiform gyrus are more strongly activated by simultaneously presented faces and voices than by either stimulus presented unimodally (Park et al., 2010; Pourtois, de Gelder, Bol, & Crommelinck, 2005). It is thus plausible that the congruency effects of laughter and prosody on the perceived expression of a single face might involve similar processes (e.g., mediated by MTG).

Surprisingly, the congruency effect of laughter became a contrast effect when an emotional face was presented in a crowd of neutral faces. Laughter substantially increased the perceived intensity of the incongruent sad expression, whereas it did not significantly affect the perception of the happy or neutral expressions. To our knowledge, such an auditory–visual contrast effect has not previously been reported. We have provided evidence suggesting that this contrast effect occurred because laughter made the neutral distractor faces appear slightly happy, thus comparatively increasing the perceived negativity of the sad face. A similar mechanism might explain why the enhancing effect of laughter on a single happy face was eliminated when the happy face was presented in a neutral crowd: even if laughter made the happy face in a crowd appear happier, this effect might have been diminished because laughter also made the distractor faces appear happy, decreasing the emotional contrast between target and distractors.

Irrespective of the underlying mechanisms, the fact that the same laughter produced opposite effects on a single face and a face in a crowd indicates that neither effect is likely to be attributable to simple response bias. Furthermore, we have evidence against the possibility that the laughter effects are explained by induced arousal. First, it is unclear why high arousal would selectively make a happy face appear happier in the single-face condition without affecting the appearance of either a sad face or a neutral face. Second, although the spoken word “laugh” presumably elicits less arousal than laughter does, it should still have increased arousal relative to the no-sound trials. Thus, if arousal alone could explain the laughter effect, the spoken word “laugh” should have produced a weaker but similar pattern of effects relative to actual laughter. However, this was not the case (cf. Figs. 2 and 4). Third, in a pilot study we replaced laughter with the sound of a crying child (highly arousing but negative in valence), and it produced no significant crossmodal effects, even without correction for multiple comparisons [t(5) < 1.69, n.s.]. Although a null result with a small sample size needs to be interpreted with caution, the pattern of nonsignificant variations produced by the crying sound did not resemble the effects of laughter. Thus, induced arousal alone is unlikely to account for the laughter effects.

Because we defined facial expressions using mouth curvature (in order to measure crossmodal effects of laughter on a clearly defined visual feature), it is possible that laughter directly influenced curvature perception rather than influencing the perception of facial expression. However, prior results from experiments using similar stimuli suggested that a curved arc presented as the mouth within a facial context is processed as an integral part of the face (e.g., Suzuki & Cavanagh, 1995; Sweeny et al., 2011). We also conducted a pilot study, which was the same as Experiment 1 except that the curved arcs were presented without any facial context. Laughter produced no effects on the perception of the upward-curved or downward-curved arcs, irrespective of visual context [t(17) < 1.42, all effects nonsignificant even without Bonferroni correction].

In summary, laughter produces a congruency effect for a single emotional face, making a happy face appear happier, but produces a contrast effect for an emotional face in a crowd, making a sad face appear sadder. These crossmodal effects of laughter cannot be explained by induced arousal, semantic activation of the concept of laughter, or the induction of a general positive mood from listening to laughter, which would have similarly affected the laughter trials and the randomly intermixed no-sound trials. These results suggest that the strongly context-dependent crossmodal effects of laughter are mediated by perceptual interactions between the acoustic processing of laughter and visual processing of facial expressions. An implication of our results is that crossmodal effects of emotion-conveying sounds on perceived facial expressions may generally depend on visual context. It may thus be informative to reexamine the previously reported effects of prosody, speech content (e.g., O’Sullivan, Ekman, Friesen, & Scherer, 1985), and mood (e.g., Bouhuys, Bloem, & Groothuis, 1995; Leppänen, Milders, Bell, Terriere, & Hietanen, 2004) on the perception of facial expressions using both a single face and a crowd of faces.