Introduction

Continual monitoring of others’ opinions and behaviors in relation to our own is necessary for successful social interactions, because these monitoring processes lead to behavioral adaptations when required by the situation at hand. During everyday performance monitoring, we encounter a variety of social and nonsocial (feedback) signals that provide information as to whether our behavior is appropriate. Several recent studies have demonstrated that early brain responses reflecting the activity of our performance monitoring system are sensitive to physical stimulus properties of external feedback stimuli. While Liu, Nelson, Bernat, & Gehring (2014) and Pfabigan, Sailer, & Lamm (2015) showed that perceptual properties such as similarity of perceptual features and stimulus size differentially influence performance monitoring correlates, we demonstrated that social versus nonsocial stimulus characteristics were reflected in early brain responses to feedback stimuli (Pfabigan, Gittenberger, & Lamm, 2019). The authors reported that social (vs. nonsocial) and complex (vs. noncomplex) feedback stimuli enhanced processing of feedback information at several successive stages. Thus, the mere presentation of stimulus characteristics implying a social setting was enough to modulate neural correlates of feedback processing—most likely reflecting a bottom-up effect of the social stimuli. However, we demonstrated these results only in a western sample (Pfabigan, et al., 2019), which limits generalizability to this cultural group.

The influence of culture is omnipresent in our daily lives and an increasing number of findings have demonstrated that culture influences overt behavior and underlying cognitive processes and neural correlates (for an overview see Han, 2017; Han et al., 2013). The influence of social stimulus characteristics on feedback processing might be particularly strong in cultures emphasizing connectedness between the self and significant others. East Asian individuals often are characterized to view and define their self in relation to others—known as interdependent self-construal style. In contrast, western individuals often are characterized as being more autonomous defining their self rather independently from others—known as independent self-construal style (Markus & Kitayama, 1991). Indeed, several studies reported a mediating role of interdependent self-construal and culture during performance monitoring in a social context (Hitokoto, Glazer, & Kitayama, 2016; Kitayama & Park, 2014; Pfabigan et al., 2018).

The current study was designed to replicate systemically and potentially extend our previous findings (Pfabigan, et al., 2019) in a Chinese sample (as a subgroup of East Asian individuals) characterized by an emphasis on interdependent self-construals. Replication studies can be seldom found in published research, although reproducibility is a basic and indispensable requirement of empirical science that should be more strongly acknowledged also in psychological science (Giner-Sorolla, 2012; Schmidt, 2009) to foster recent replication efforts (Open Science, 2015; Pashler & Harris, 2012).

The second goal of the current study pertained to a methodological aspect of the question of whether social stimulus characteristics influence neural correlates of performance monitoring. Previous research mostly applied the event-related potential (ERP) technique in this regard, which, however, provides only phase-locked information of the ongoing EEG. In contrast, time-frequency decomposition allows the assessment of rhythmic activity, which is modulated by a mixture of phase-locked and nonphase-locked activity in response to the eliciting event (Pfurtscheller & Lopes da Silva, 1999). Time-frequency analysis might provide more comprehensive insights into underlying neural processes than classical ERP analysis alone (Cohen, 2011; Makeig, Debener, Onton, & Delorme, 2004). In particular in the domain of performance monitoring and cognitive control, considerable advances have been made in the past years (Cavanagh & Frank, 2014; Cavanagh & Shackman, 2015). Consequently, we tested whether neural oscillations corroborate or even extend previous ERP findings regarding the processing advantage of social versus nonsocial feedback stimuli.

To address our first question, we assessed three different ERP components. The first one was the Feedback-Related Negativity peaking within 200–300 ms after feedback onset (FRN; Miltner, Braun, & Coles, 1997) at fronto-central electrode sites. Its negative deflection is more pronounced for unfavorable compared with favorable, unexpected compared with expected, and salient compared with nonsignificant feedback outcomes (Alexander & Brown, 2011; Gehring & Willoughby, 2002; Miltner, et al., 1997; Pfabigan, Alexopoulos, Bauer, Lamm, & Sailer, 2011a; Pfabigan, Alexopoulos, Bauer, & Sailer, 2011b). Recent research describes FRN amplitude variation as an unsigned prediction error signal indicting the surprise of unexpected events (Hayden, Heilbronner, Pearson, & Platt, 2011; Talmi, Atkinson, & El-Deredy, 2013). As the second ERP, we assessed the positive-going P300 component, which usually occurs within 300–500 ms after feedback onset with a peak at parietal electrode sites (Duncan Johnson & Donchin, 1977; Polich, 2007), reflecting more elaborate stimulus processing of motivational saliency and context updating in working memory (Bellebaum & Daum, 2008; Nieuwenhuis, Aston-Jones, & Cohen, 2005). As the third ERP, the frontal P2 component peaking around 180 ms after feedback onset was assessed, which is assumed to reflect early stages of attention capture and affective significance of target stimuli (Cuthbert, Schupp, Bradley, Birbaumer, & Lang, 2000; Potts, 2004; Potts, Martin, Burton, & Montague, 2006).

To address the second research question, we captured frontal midline theta (FMΘ) oscillations (approximately 4-8 Hz) as nonphase-locked neural correlates of performance monitoring. FMΘ oscillations are usually captured over fronto-central electrodes and have been proposed to be sensitive to the same experimental manipulations as error-, feedback-, and conflict-related ERP components (Cavanagh, Zambrano-Vazquez, & Allen, 2012; Cohen, Elger, & Ranganath, 2007). The midcingulate cortex (MCC) has been proposed as their neuronal origin (Cavanagh & Frank, 2014). Recent research suggests that FMΘ might be a key neural mechanism of the medial prefrontal cortex (mPFC) to guide cognitive control processes (Cavanagh & Frank, 2014; Cavanagh, et al., 2012). FMΘ oscillations are assumed to indicate the requirement of top-down control in a given situation. They might reflect the underlying neural mechanism by which top-down control is computed and communicated across broader neural networks (Cavanagh & Frank, 2014; Cavanagh, et al., 2012). Supporting this notion, enhanced FMΘ oscillations have been associated with enhanced cognitive performance (Cohen & Donner, 2013) and more generally with executive functioning during error and feedback processing (Enriquez-Geppert, Huster, Figge, & Herrmann, 2014).

Participants performed the established time estimation task (Miltner, et al., 1997), which allowed approximately comparable presentation numbers of positive and negative feedback stimuli. Thumbs up/down served as social feedback stimuli, while patterned plus/minus symbols served as nonsocial ones—all representing highly overlearned stimuli conveying valence information. Importantly, these stimuli were approximately matched for visual complexity, which was identified as another factor influencing performance monitoring ERPs (Pfabigan, et al., 2019).

We hypothesized that social feedback stimuli also would demonstrate their processing advantage compared with nonsocial ones in the Chinese sample—reflected in enhanced FRN, P300, and P2 amplitudes for social compared with nonsocial feedback stimuli (Pfabigan, et al., 2019). Moreover, because previous research has shown individual variations of brain activity related to variation of interdependent self-construal (Ma et al., 2014), a cultural trait that characterizes East Asian samples, we explored whether interdependent self-construal was related to the hypothesized processing advantage of social versus nonsocial feedback stimuli (Hitokoto, et al., 2016; Kitayama & Park, 2014). Based on previous research suggesting comparable underlying mechanisms of the investigated ERPs and neural oscillations, we assumed that FMΘ oscillations would also be enhanced for social versus nonsocial feedback stimuli (Cavanagh & Frank, 2014).

Methods

Participants

Fifty-six volunteers participated in the electroencephalography (EEG) study. Three of them were excluded due to data acquisition problems or excessive alpha activity artefacts. The remaining 53 participants (29 women) were on average aged 21.5 years (standard deviation [SD] = 2.21; range 18-26). Sex/gender was equally distributed in the sample (χ2(1) = 0.47, p = 0.492); no age differences were observed between women and men (t(51) = 1.62, p = 0.111). All participants were right-handed as assessed with a questionnaire (Li, 1983), had normal or corrected-to-normal vision, and reported no prior or current psychiatric or neurological disorder. Written informed consent was obtained before the experiment. The study was conducted in accordance with the Declaration of Helsinki (7th revision, 2013) and was approved by the Ethics Board of the School of Psychological and Cognitive Sciences, Peking University. Sample size was based on a priori power consideration relying on our previous study (Pfabigan, et al., 2019). It suggested that a sample size of at least 46 participants has sufficient power (1-β > 0.80) to detect the smallest social content effect of the previous study p2 = 0.20; effect size specification after Cohen (1988) using GPower 3.1; Faul, Erdfelder, Buchner, & Lang, 2009). We recruited ten additional participants to be prepared for potential dropouts.

Before EEG data collection, participants filled in the self-construal scale (SCS; Singelis, 1994) to assess their dispositional independent and interdependent self-construal styles. The SCS comprises 30 items (15 per scale), which have to be rated on a 7-point Likert scale ranging from 1 (strongly disagree) to 7 (strongly agree). The average ratings for independent and interdependent self-construals were calculated per participant to acknowledge orthogonality of the two constructs. Larger values represent more pronounced independence and interdependence.

Time estimation task

In a modified version of the time estimation task (Miltner, et al., 1997; Pfabigan, Sailer, & Lamm, 2015; Pfabigan, Zeiler, Lamm, & Sailer, 2014), participants were required to estimate the passing of one second via button press. The trial structure was as follows: first, a central black fixation dot on a grey screen was shown for 1,000 ms. Afterwards, a black star replaced the dot for 250 ms, which started the time estimation of the respective trial. During the subsequent 1,750 ms (blank grey screen), participants could indicate the estimated elapse of 1 second via pressing button 1 on a standard keyboard with their right index finger; 2,000 ms after star onset, performance feedback was presented for 1,000 ms. The following intertrial-interval (black fixation dot) varied randomly between 1,400-1,600 ms. Feedback was always provided based on individual performance. However, task difficulty was adjusted to the individual performance level with a staircase procedure to guarantee approximately comparable numbers of correct and incorrect trials. At the beginning, positive feedback was given in cases a button press fell in the time window of 900 to 1,100 ms after star onset. Subsequently, the size of this time window was adjusted based on individual performance in the preceding trial (Miltner, et al., 1997). After a trial with positive feedback (i.e., a correct time estimation), the time window was narrowed down by 10 ms at both ends of the window. After a trial with negative feedback (i.e., an incorrect time estimation), the time window became wider again by adding 10 ms at both ends. Consequently, the overall probability of positive and negative feedback was approximately 50% for all participants.

Social feedback stimuli consisted of naturalistic hand gestures of “thumbs up” or “thumbs down” (indicating either positive or negative feedback). Nonsocial feedback stimuli consisted of black patterned “+” or “−” symbols (indicating either positive or negative feedback). Based on our previous results, we used only stimuli for the current study that were considered complex stimuli (Pfabigan, et al., 2019). Thumbs up/down stimuli were presented in an allocentric perspective to emphasize that feedback is given externally and serves an evaluative function. Stimuli were presented on a grey background and were comparable in size and luminance (please refer to Pfabigan, et al., 2019, for further details on objective and subjective stimulus characteristics, and to Fig. 1 for an illustration of the stimuli). Participants were explicitly instructed that they should consider social and nonsocial feedback stimuli as equivalent indicators of their time estimation accuracy.

Fig. 1
figure 1

Trial timing is depicted on the left side of the figure, while the four different feedback stimuli are illustrated on the right side

The experiment consisted of ten training trials and 200 experimental trials. The respective 100 trials depicting either social or nonsocial stimuli were presented block-wise as previously suggested (Pfabigan, et al., 2014). Block order was counterbalanced across participants. Overall EEG data collection lasted approximately 20 minutes. Afterwards, participants rated the presented feedback stimuli regarding motivational significance (not motivating vs. very motivating) and arousal (very calm vs. very exciting) on 7-point Likert scales. They performed another experimental task afterwards, which is not in the scope of the current report. At the end of the experiment, participants received a financial remuneration of RMB 100 for their participation.

EEG acquisition and analyses

Stimulus presentation was controlled by E-Prime 2.0 (Psychology Software Tools, Inc., Sharpsburg, PA). Participants were seated in a sound-attenuated, shielded room approximately 90 cm in front of a 19” TFT monitor (AOC G2460p LED). EEG was recorded via 64 electrodes embedded in a fabric electrode cap (EASYCAP GmbH, Herrsching, Germany; 10-20 system, 64Ch Standard Brain cap, Asian fit). One electrode was placed 1 cm below the left eye to assess vertical eye-movements. EEG signals were collected with a time constant of 10 seconds up until 1,000 Hz, referenced online against FCz, whereas AFz served as ground electrode (BrainAmp DC; Brain Products GmbH, Gilching, Germany).

Behavioral data analysis

Differences in response times were calculated per participant and per condition between each trial and its preceding trial to describe changes in response times evoked by directly preceding feedback. These mean trial-to-trial changes were further separated for trials in which reaction time changes were classified as correct adjustments (i.e., the difference in estimation time between the current trial (n) and the previous trial (n-1) was larger than the difference between estimation time in trial (n) and the 1,000-ms goal) and those classified as incorrect adjustments (i.e., the difference in estimation time between the current trial (n) and the previous trial (n-1) was smaller than the difference between estimation time in trial (n) and the 1,000-ms goal; Pfabigan, et al., 2015). These changes in response time were subjected to a 2x2x2 repeated-measures ANOVA with the within-subject factors social content (social, nonsocial), feedback valence (negative, positive), and estimation adjustment (incorrect, correct) to investigate time estimation changes as a function of feedback conditions. Please refer to Supplementary Materials, chapter 1.7, for an alternative assessment of changes in time estimation behavior. In the current study, we introduced an additional measure of time estimation performance. We extracted the average time interval length during which an estimation was considered as correct to assess estimation quality. The interval length was subjected to a paired-samples t-test comparing social versus nonsocial feedback trials. Post-experimental ratings concerning motivational significance and arousal were analysed separately using 2x2 repeated measures ANOVAs with the within-subject factors social content and feedback valence.

EEG preprocessing

EEG data were analysed using EEGLAB 13_1_1b (Delorme & Makeig, 2004) implemented in Matlab R2014a. Data analyses were performed as parallel to our previous study as possible (Pfabigan, et al., 2019). A high pass (0.1 Hz) and a low-pass filter (cut-off frequency 30 Hz, roll-off 6 dB/octave) were applied offline to the EEG data. Channel FCz was reinstated, subsequently data were re-referenced to linked mastoids and extended infomax independent component analysis (ICA; Bell & Sejnowski, 1995) was applied to detect eye movement-related artefacts. After discarding these artefacts (on average 2.5 ICs per participant, SD = 0.93, range: 1-5), data segments of the four possible conditions were extracted starting 2,200 ms before feedback onset and lasting for 3,300 ms (social positive [Spos], social negative [Sneg], nonsocial positive [NSpos], and nonsocial negative [NSneg]). Subsequently, semiautomatic artefact correction was performed in EEGLAB. At first, trials with voltage values exceeding ±75 μV (function pop_eegthres) or voltage drifts >50 μV (function pop_eegrejtrend) were automatically marked by the algorithms. These trials were rejected in case visual inspection also indicated the presence of artefacts. Artefact-free segments were averaged participant- and condition-wise; on average 35.85 (SD = 8.52) trials per condition.

ERP analysis

The P2 component was scored as the most positive peak in the time window 130-230 ms after feedback onset. The FRN component was scored as peak-to-peak difference measure (P2 minus FRN component; Holroyd, Nieuwenhuis, Yeung, & Cohen, 2003; Pfabigan, Alexopoulos, Bauer, & Sailer, 2011b) to account for component overlap with the preceding P2 component. FRN peak amplitudes (interval 200–300 ms after feedback onset) and the amplitude values of the preceding positive P2 peak were extracted for all conditions at a fronto-central electrode cluster centred around FCz (the cluster comprised of electrodes F1, Fz, F2, FC1, FCz, FC2, C1, Cz, C2) to increase signal-to-noise ratio and to keep the statistical models as simple as possible (as suggested by Luck and Gaspelin, 2017). The P300 component also was assessed as peak-to-peak measure to account for component overlap with the preceding negative ERP (N2 minus P300 component; Pfabigan, Alexopoulos, Bauer, & Sailer, 2011b). P300 peaks (interval 250-500 ms after feedback onset) were extracted for all conditions at a parietal electrode cluster (comprising of electrodes CP1, CPz, CP2, P1, Pz, P2). Furthermore, assessing peak values for the three ERP components allowed investigating their latency variation, which provided valuable information regarding stimulus processing speed. For example, P300 latencies are assumed to reflect stimulus classification demands (Kutas, McCarthy, & Donchin, 1977). FRN, P300, and P2 peak and latency measures were analysed separately with two-way repeated-measures ANOVAs with the within-subject factors social content (social, nonsocial), and feedback valence (positive, negative). FRN and P2 peak latencies were measured from feedback onset to the corresponding FRN/P2 peak at the FCz electrode cluster; P300 peak latency from feedback onset to the corresponding P300 peak at the Pz electrode cluster.

Time-frequency analysis

We calculated trial- and electrode-wise event-related spectral perturbations (ERSPs) per participant with artefact-free trials using a complex Morlet wavelet decomposition implemented in the EEGLAB newtimef() function. Power was calculated from 4 to 20 Hz in 25 bins (zero-padded 3-cycle Morlet wavelets were used for the lowest frequency, increasing linearly by 0.5 cycles until the highest frequency). We used the whole time window of −2,200 to 1,100 ms around feedback onset for calculation to prevent edge effects influencing the FMΘ period. Furthermore, mean power with the time interval of −400 to −200 ms before feedback onset served as baseline interval. This interval was chosen to avoid interference of windowing post-feedback activity and padding values together, which could yield biased ERSP estimates. Subsequently, power values (in μV2) were averaged over all conditions and participants. In line with previous studies, pronounced FMΘ activity was observed at fronto-central electrode sites (Cavanagh & Frank, 2014; Cavanagh, et al., 2012; van der Veen, van der Molen, van der Molen, & Franken, 2016) (Figure 3A). Keeping our analysis strategies as comparable as possible, we used the same electrode cluster as for FRN/P2 analysis. FMΘ responses (4-7.2 Hz) were extracted from baseline-corrected ERSPs in the interval 200-400 ms after feedback onset at the frontal electrode cluster subject- and condition-wise (Osinsky, Seeger, Mussel, & Hewig, 2016). These values were analysed with a two-way repeated-measures ANOVA with the within-subject factors social content and feedback valence. To further extend the current replication, we computed an equivalent time frequency analysis with the data of the previous study. Please refer to Supplementary Material (chapter 1.6) for details and results. To explore whether dispositional self-construal was associated with neural correlates of performance monitoring and rating data, we calculated social content difference measures (Sneg minus NSneg; Spos minus NSpos) and tested their association with self-construal scores with Spearman correlations (Bonferroni-adjusted p < 0.0125; Rousselet & Pernet, 2012). We further tested whether social content difference measures of FRN amplitudes and FMΘ oscillations were related to each other using Spearmen correlations (Bonferroni-adjusted p < 0.025).

Statistical analyses were performed using SPSS 22 (SPSS Inc., IBM Corporation, NY) and Statistica 6.0 (StatSoft Inc., Tulsa, OK). Significant interaction effects were explored with planned contrasts (negative minus positive) when appropriate or Tukey post-hoc tests. The significance level was set at p < 0.05. Partial eta-squared (ηp2) is reported to indicate effect sizes for significant ANOVA results (Kirk, 1996), while Cohen’s d is reported for t-tests (Cohen, 1988). A winsorisation procedure (Wilcox, 2012) was applied per condition for ERP, FMΘ, and behavioral data to account for outliers before statistical analyses (see Kogler, Sailer, Derntl, & Pfabigan, 2017; Rauchbauer, Majdandžić, Hummer, Windischberger, & Lamm, 2015, for a detailed description). As noted by Keselman, Wilcox, & Lix (2003) and Wilcox (2010), including outlier values in analyses violates assumptions of general linear model estimations and should thus be avoided.

To demonstrate generalizability of the current results, we assessed whether ERP and FMΘ variation and behavioral correlates were influenced by participants’ sex/gender (via conducting additional analyses adding sex/gender as between-subject effect to the respective ANOVA models).

Results

Behavioral results

Table 1 summarizes the means and standard deviations of the behavioral data. The overall frequency of positive feedback during the experiment was 48.95% (SD = 2.78), which reflects the successful implementation of the staircase procedure. Positive feedback occurred equally often in social and nonsocial feedback conditions (t(52) = 0.40, p = 0.690).

Table 1 Behavioral and rating data

Trial-to-trial changes in response times were influenced by feedback valence (F(1,52) = 245.25, p < 0.001, ηp2 = 0.83), estimation adjustment (F(1,52) = 287.75, p < 0.001, ηp2 = 0.85), and by trend by social content (F(1,52) = 3.00, p = 0.089, ηp2 = 0.06; larger changes following nonsocial than social feedback). There was a significant interaction between feedback valence and estimation adjustment (F(1,52) = 251.14, p < 0.001, ηp2 = 0.83), the remaining interactions were not significant (all p values > 0.150). Post-hoc analyses revealed that trial-to-trial changes in reaction times were largest after negative feedback, yielding a correct estimation adjustment (all p values < 0.001). No differences in trial-to-trial changes in response times were observed between negative and positive feedback, leading to an incorrect estimation adjustment (p = 0.152). Correct estimation adjustments after positive feedback yielded larger trial-to-trial changes compared to both incorrect adjustments (both p values < 0.001), but smaller changes compared to correct adjustments after negative feedback (p < 0.001). The average interval length to receive correct feedback was shorter for social compared with nonsocial feedback trials (t(52) = 2.03, p = 0.048, d = 0.19) indicating that participants showed higher estimation quality for social compared with nonsocial feedback. Please refer to Supplementary Materials for an alternative quantification approach for trial-to-trial changes in estimation behavior (chapter 1.7).

Post-experimental ratings of motivational significance showed significant main effects of social content (F(1,52) = 10.11, p = 0.002, ηp2 = 0.16) and feedback valence (F(1,52) = 43.06, p < 0.001, ηp2 = 0.45). Their interaction was not significant (p = 0.658). Social and positive feedback stimuli were rated as more motivating than nonsocial and negative ones. Post-experimental arousal ratings showed significant main effects of social content (F(1,52) = 7.89, p = 0.007, ηp2 = 0.13) and feedback valence (F(1,52) = 29.87, p < 0.001, ηp2 = 0.37). Their interaction was not significant (p = 0.562). Social and positive feedback stimuli were rated as more arousing than nonsocial and negative ones.

Ratings of self-construals were as follows: SCS Independence (M = 4.62, SD = 0.59), SCS Interdependence (M = 4.86, SD = 0.64). Interdependent self-construal was rated significantly higher than independent self-construal (t(52) = −2.66, p = 0.010, d = 0.40). Trial-to-trial changes in response times, time estimation interval length, post-experimental ratings, and SCS scores were not influenced by participants’ sex/gender (all p values > 0.075). Only the frequency of positive feedback showed a social content x sex/gender interaction (F(1,52) = 5.09, p = 0.028, ηp2 = 0.09). Tukey post-hoc tests showed that men received more positive feedback during nonsocial trials than women during social trials (p = 0.031).

EEG results

To keep the results section concise, Table 2 shows the exact statistical values and the information as to whether the results of our previous study (Pfabigan, et al., 2019) were replicated. Please refer to Supplementary Material for corroborating results of alternative ERP quantification methods (chapters 1.1–1.5). Table 3 shows means and SD of the EEG results.

Table 2 Results of the ERP ANOVA models

The FRN ANOVA model yielded two significant main effects and a significant interaction on FRN peak-to-peak amplitudes. The interaction was driven by a larger difference between negative and positive feedback (ΔFRN) in social (M = 4.37 μV, SD = 3.55) than nonsocial stimuli (M = 0.58 μV, SD = 4.18; t(52) = −5.10, p < 0.001, d = 0.86). The ANOVA of FRN latencies yielded two significant main effects, i.e., the FRN component peaked earlier for social than nonsocial, and for positive than negative feedback.

The P300 ANOVA model yielded a significant main effect of social content. P300 amplitudes were more positive following social compared with nonsocial feedback. The P300 latency ANOVA model yielded two significant main effects. P300 peaks were observed earlier for social than nonsocial and for positive than negative feedback.

The P2 ANOVA model yielded two significant main effects and a significant interaction. Tukey post-hoc tests showed that P2 amplitudes were smallest after nonsocial negative feedback (both p values < 0.001); the other conditions did not differ from each other (all p values > 0.998). The P2 latency ANOVA model yielded only a significant main effect of social content. P2 amplitudes peaked earlier after social than nonsocial feedback.

Neither P2, FRN, nor P300 amplitudes were influenced by participants’ sex/gender (all p values > 0.110). See Fig. 2 for EEG results.

Fig. 2
figure 2

(a) Amplitude courses of P2 and FRN components at the frontal cluster. (b) Amplitude course of the P300 component at the parietal cluster. (c) Bar graphs of the valence difference (negative minus positive) of FRN amplitudes for nonsocial and social feedback. (d) Topographical maps (in μV) of these valence difference waves for the FRN component (FRN mean latency topography minus P2 mean latency topography per condition). Negative is drawn upward per convention. Feedback onset is at time point zero

The FMΘ oscillations ANOVA model (mean FMΘ power 200-400 ms after feedback onset) yielded significant main effects for social content (F(1,52) = 4.54, p = 0.038, ηp2 = 0.08) and feedback valence (F(1,52) = 36.88, p < 0.001, ηp2 = 0.42) and their interaction (F(1,52) = 13.02, p = 0.001, ηp2 = 0.20). Tukey post-hoc tests showed that social negative feedback elicited enhanced FMΘ than nonsocial negative feedback (p < 0.001), whereas social and nonsocial positive feedback did not differ regarding FMΘ (p = 0.995). Moreover, the difference between negative and positive feedback stimuli (ΔFMΘ) was larger for social (M = 1.13 μV2, SD = 1.10) than for nonsocial feedback (M = 0.46 μV2, SD = 1.22; t(52) = −3.61, p < 0.001, d = 0.58). See Fig. 3 for FMΘ results.

Fig. 3
figure 3

(a) FMΘ scalp topography 200-400 ms after feedback onset averaged over all participants and conditions. The black rectangle indicates the electrode cluster around FCz used for further analysis. (b) Time-frequency (TF) decomposition from 4 to 16 Hz in a time window starting 400 ms before feedback onset and lasting for 1,100 ms, averaged over all participants and conditions. The black rectangle denotes frequency and time window for FMΘ analysis. (c) FMΘ time courses in the same time window separated for the four feedback conditions. (d) Bar graphs of the valence difference (negative minus positive) FMΘ for nonsocial and social feedback. (e) Time-frequency (TF) decomposition of the valence difference for nonsocial and social feedback. The black rectangles denote frequency and time windows for FMΘ

Adding sex/gender to the ANOVA model, a social content x sex/gender interaction was observed (F(1,52) = 5.76, p = 0.020, ηp2 = 0.10) in addition to the social content x valence interaction. Tukey post-hoc tests showed that FMΘ power was higher for social compared with nonsocial stimuli in women (p = 0.010), whereas men showed no differences (p = 0.992). The three-way interaction was not significant (p = 0.424).

Correlation analyses

We observed no significant correlations between independent or interdependent self-construal scores and difference scores of ERP data (all p values > 0.110), FMΘ power (all p values > 0.104), or rating data (all p values > 0.173). FRN amplitudes and FMΘ power were not significantly correlated with each other for negative (rs = 0.229, p = 0.099) and positive (rs = 0.177, p = 0.204) social content difference measures, suggesting a differentiation between phase-locked and nonphase-locked stimulus processing during feedback presentation.

Table 3 Means and SD of ERPs and neural oscillations

Discussion

The current study was designed to replicate and extend previous findings by testing a processing advantage of social compared with nonsocial feedback in a Chinese sample. Our ERP results first replicated our previous findings almost entirely (Pfabigan, et al., 2019). We observed more pronounced amplitudes for social compared with nonsocial feedback stimuli in three successive feedback processing stages—indexed by P2, FRN, and P300 components (refer to Supplementary Materials, chapter 1.5, for a discussion on P300 amplitude variation in response to feedback valence). Moreover, the current behavioral results were in line with the previous findings. Correct adjustments of time estimation performance were preceded by negative feedback and resulted in largest changes in time estimation response times. By trend, adjustments were smaller following social stimuli. Adding a measure of estimation quality, we observed better estimations for social than for nonsocial stimuli—suggesting that the social content manipulation was indeed behaviorally relevant. As the second main goal, the current work conducted time-frequency analyses to corroborate the influence of social stimulus characteristics on neural correlates of performance monitoring. Similar to the FRN results, the difference of negative minus positive frontal midline theta oscillations (FMΘ) was larger for social compared with nonsocial feedback stimuli. This result also was observed when conducting a similar time-frequency analysis in the previous dataset. No significant associations of dispositional self-construal were found with neural or behavioural correlates.

Overall, the current ERP and FMΘ results corroborate the suggestion that social stimuli were processed preferentially and more elaborately compared with nonsocial ones. Time estimation quality even benefitted from using social feedback stimuli. This emphasizes again the intrinsic relevance of social stimuli during social interaction (Dunbar, 1998) and their link to guiding behavior (Hariri, Tessitore, Mattay, Fera, & Weinberger, 2002). This pattern was similarly observed in individuals with Chinese and western cultural backgrounds for such universal social communicative gestures, such as thumbs up/down. Such culturally universal brain activation further underlines the importance of social signals. Our results corroborate the notion of previous imaging studies suggesting that social versus nonsocial information content could be seen as a distinct stimulus category (Britton et al., 2006; Norris, Chen, Zhu, Small, & Cacioppo, 2004; Scharpf, Wendt, Lotze, & Hamm, 2010; Vrtička, Sander, & Vuilleumier, 2011; Wagner, Kelley, Haxby, & Heatherton, 2016). These studies reported that social versus nonsocial information led to preferential processing in cortical and subcortical areas (e.g., occipital, frontal areas, amygdala, and insula). Thereby, it has been suggested that social versus nonsocial information is processed in an additive or even interactive manner with emotional content dimensions, for which processing advantages are already established (Britton, et al., 2006; Hariri, et al., 2002; Norris, et al., 2004; Scharpf, et al., 2010; Vrtička, et al., 2011).

Furthermore, apart from our own study, recent electrophysiological evidence supports the claim of a processing advantage of social versus nonsocial information. Okruszek et al. (2016) reported a clear differentiation between social and nonsocial stimulus content for negative versus neutral complex visual scenes within the first 300 ms of stimulus processing. Using written stimulus material, Bayer, Ruthmann, & Schacht (2017) also demonstrated processing differences between social and nonsocial stimulus content for negatively valenced versus neutral sentences within the first 300 ms. Because both studies demonstrate that social content is processed at rather early stages in the processing stream, one could consider it as an important stimulus aspect. This is not surprising, because social stimuli convey powerful positive and negative social motives, such as affiliation and protection, but also disapproval and social exclusion (Baumeister, Vohs, DeWall, & Zhang, 2007; Norris, et al., 2004; Porges, 2003). A recent study (Schacht & Vrticka; 2018) reporting a social stimulus processing advantage also for positive stimuli suggests the component process model of emotion (Scherer, 2009) as theoretical model to explain these early social stimuli processing advantages. This theory proposes a series of relevance checks, such as novelty, intrinsic pleasantness, and goal relevance (Scherer, 2009) during stimulus evaluation. Based on their findings, Schacht & Vrtička (2018) suggest that social stimulus content is appraised during one of the first relevance checks, possibly as aspect of intrinsic pleasantness. However, in contrast to their findings, the current study observed processing advantages of social stimuli specifically for negative stimuli (FRN and FMΘ). This might be caused by the inherent nature of feedback stimuli, because negative outcomes are behavirally more relevant than positive ones.

As of yet, only a few studies investigated whether neural oscillations are sensitive to performance monitoring in a social context. Measuring intracranial EEG, Cristofori et al. (2013) tested whether FMΘ oscillations were responsive to a standard social exclusion situation. Indeed, the authors observed enhanced FMΘ activity in exclusion compared with inclusion trials at electrodes implanted in cortical and subcortical areas of the so-called pain matrix. This led them to conclude that FMΘ oscillations also might represent an indicator of social pain. However, exclusion trials might be more conflicting with one's own expectations than inclusion trials and therefore require enhanced cognitive control demands, which is another explanation for enhanced FMΘ band activity in these trials (Cavanagh & Frank, 2014). Controlling for the cognitive conflict confound, van Noordt, White, Wu, Mayes, & Crowley (2015) again observed enhanced FMΘ activity for rejection events compared with visually similar “not my turn” events in a child sample. Together, both results suggest that social exclusion trials were the most salient ones. Investigating neural oscillations with a social evaluation paradigm, van der Veen et al. (2016) and van der Molen, Dekkers, Westenberg, van der Veen, & van der Molen (2016) observed enhanced FMΘ for unexpected social feedback. While these studies collectively demonstrate that FMΘ oscillations can serve as reliable markers of performance monitoring in a social context, no study directly investigated the influence of social versus nonsocial context or confounding variables so far. Thus, the current results extend the previous studies by demonstrating that variation in FMΘ oscillations—as a measure of phase-locked and nonphase-locked neural activation—also reflected the distinction in social and nonsocial feedback stimuli (in the absence of expectancy or cognitive conflict manipulations). This further suggests that cognitive control processes implemented by the mPFC might become more strongly engaged when social versus nonsocial stimuli are involved (Cavanagh & Frank, 2014; Cavanagh, et al., 2012).

No significant association between FRN amplitudes and FMΘ oscillations were observed in the current study. This is not unexpected, because the current time-frequency decomposition comprises a mixture of phase-locked and nonphase-locked activity while FRN amplitude variation comprises only phase-locked activity (Pfurtscheller & Lopes da Silva, 1999).

Furthermore, we observed no reliable association of EEG measures and individual self-construal scores in the current study. This is in contrast to previous studies reporting an association between error- and feedback-related amplitude variation and interdependent self-construal scores (Hitokoto, et al., 2016; Kitayama & Park, 2014). Interpreting this null result cautiously, it might be possible that the current bottom-up approach of manipulating physical stimulus aspects is less susceptible to influences of dispositional personality traits than previous experimental set-ups. To investigate this suggestion, future studies should assess the interplay of bottom-up and top-down influences of social stimulus content and social context manipulations in within-subject designs. The processing advantage of social versus nonsocial feedback stimuli might be a rather universal effect that is not sensitive to self-construals, which have been shown to influence multiple cognitive and affective domains (Jiang, Varnum, Hou, & Han, 2013; Lin, Lin, & Han, 2008; Sui & Han, 2007; Wang, Ma, & Han, 2014).

Previous studies usually focused on top-down influence of social feedback by constructing ingenious experiments with cover stories in which participants were led to believe to directly receive feedback from others (Kujawa, Arfer, Klein, & Proudfit, 2014; Somerville, Heatherton, & Kelley, 2006). In contrast, the current study defines social feedback solely by its physical appearance, i.e., by the depiction of symbolic hand gestures. These gestures are used in face-to-face interactions on a regular basis to demonstrate favorable or unfavorable assessments by our conspecifics (Morris, 1994). Moreover, cross-cultural social media phenomena make use of exactly these gestures to convey social approval or disapproval. Both examples highlight that symbolic hand gestures as nonverbal signals are tightly linked to social interaction even in the absence of direct interaction partners. Thus, although the current study did not introduce an elaborate social setting, the sole use of pictures of social communication signals was strong enough to trigger rapid and spontaneous stimulus-driven saliency to take effect, thereby reflecting a processing advantage of social stimuli. Moreover, the stimuli used were matched for visual complexity, thereby minimizing the possibility that the observed effects were driven by other stimulus characteristics than the availability of social signals (Song et al., 2005).

Although the effects of sex/gender were not in the focus of the current study, we decided to address them in additional analyses and report the findings to provide a more exhaustive results overview—also including null results (see Eliot, 2011). In addition, van der Veen et al. (2016) suggested considering sex/gender as important individual difference when conducting research in the domain of social feedback. The authors observed differential sensitivity of their physiological markers in women and men when introducing a social evaluation situation. In contrast, the current study used a bottom-up approach to manipulate social feedback content, which was less susceptible to sex/gender differences. Only FMΘ oscillations differed in women and men when comparing overall effects of social versus nonsocial stimuli. Higher sensitivity to social versus nonsocial stimuli in women might reflect an evolutionary residue, socially acquired gender roles promoting social interaction, or a combination of both (Eagly, 2009). The interaction of biological sex and sociocultural gender roles during social cognition is far from understood and warrants future investigations (Pavlova, 2017). However, the observed FMΘ enhancement for social negative versus nonsocial negative feedback was not affected by sex/gender. This allows us to conclude that the current manipulation elicited rather comparable neural and behavioral results in all participants.

Our results imply that the use of hand gestures is an easy-to-use strategy to enhance the saliency of performance feedback stimuli. This could be relevant for clinical intervention and for neuropsychological rehabilitation programs. Middle-European and Chinese individuals showed comparable neural activation pattern when interpreting the hand gestures to adapt subsequent behavior. Future studies should assess large and culturally diverse samples to test whether the observed processing advantage of social stimuli is indeed independent of individuals’ cultural backgrounds.

Conclusions

Using a bottom-up approach as experimental manipulation, we replicated the processing advantage of social over nonsocial complex stimuli during feedback processing in a Chinese sample. Not only did ERP components show amplitude enhancement for social compared with nonsocial stimuli, also neural oscillations in the theta range mirrored the previously observed activation pattern—in particular the differentiation between negative and positive feedback. This early processing advantage of social over nonsocial stimuli seems to be independent of cultural background or dispositional personality traits associated with enhanced social interaction. This further corroborates the notion that the dimension of social versus nonsocial characteristics should be considered as separate stimulus category that is culturally universal.