Be Yourself and Behave Appropriately: Exploring Associations Between Incongruent Personality States and Positive Affect, Tiredness, and Cognitive Performance

Personality traits describe how people typically think, feel, and behave. Personality states describe how people think, feel, and behave in a given moment. In their daily lives, people often behave the way they typically do (they enact trait-congruent personality states), but occasionally behave differently from how they typically do (trait-incongruent personality states). Several theories propose that such incongruent personality states should be associated with undesirable outcomes such as less positive affect or more tiredness, but the current state of evidence is inconclusive and mostly based on one personality dimension: extraversion. In this study, we contribute to filling important gaps in the literature by examining congruence of personality dimensions other than extraversion, considering characteristics of the situation, and modeling congruence with state-of-the-art response surface analyses. We aimed to manipulate state honesty-humility and state agreeableness as well as perceived adversity and deception of the situation in a prisoner’s dilemma paradigm. The manipulations mostly had the intended effects but they also had additional unspecific effects on other personality states and situation characteristics. The study thus emphasized the difficulty of manipulating personality states, situation characteristics, and trait–state and state–situation congruence. In pre-registered analyses of variance, response surface analyses, and specification curve analyses, we then examined how trait–state congruence and state–situation congruence were associated with positive affect, tiredness, and performance in a numerical Stroop task. Neither trait–state congruence nor state–situation congruence were associated with positive affect, tiredness, or cognitive performance. However, in light of this study’s limitations, more studies that are carefully designed, carefully operationalized, and well-powered

Personality traits describe how people typically think, feel, and behave (Diener & Lucas, 2020). However, the way people think, feel, and behave in a given moment-their personality states-can change from moment to moment and from situation to situation. People most often enact personality states at or close to their trait level (Fleeson, 2001). These personality states are called trait-congruent personality states. However, because people enact personality states across the whole spectrum of a given trait on a regular basis, their personality states occasionally differ from their trait level (Fleeson, 2001). These personality states are called trait-incongruent personality states. For example, people who are characterized by high levels on the personality trait extraversion nevertheless occasionally spend time alone (introverted personality state), people who are characterized by high levels on the personality trait agreeableness occasionally argue with others (disagreeable personality state), and people who are characterized by low levels on the personality trait conscientiousness occasionally complete their to-do list (conscientious personality state).
But does it matter whether a personality state is congruent or incongruent with one's trait? Several theories (e.g., Gallagher et al., 2011;Moskowitz & Côté, 1995;Tett & Burnett, 2003) propose that people should experience more positive affect, less negative affect, and less tiredness when enacting trait-congruent personality states as opposed to trait-incongruent personality states. However, the current evidence regarding this question is rather inconclusive and previous research on the congruence of personality states has several gaps: First, almost all studies focused on extraversion (e.g., Fleeson et al., 2002;McNiel & Fleeson, 2006;Zelenski et al., 2012), whereas little is known about the other personality dimensions. Second, previous studies have examined a variety of outcomes of trait-incongruent personality states including positive affect, negative affect, effort, vitality, authenticity, performance in cognitive tasks, and handgrip strength, making it difficult to compare the findings (Fleeson & Wilt, 2010;Gallagher et al., 2011;Mc-Niel et al., 2010;Pickett, Hofmans, Feldt, et al., 2020;Zelenski et al., 2012). Third, most studies modeled interactions between personality traits and personality states to examine whether the association between extraverted personality states and positive affect, for example, depends on the trait extraversion level (e.g., Gallagher et al., 2011;Jacques-Hamilton et al., 2018;Zelenski et al., 2012). However, interactions may not be the most adequate operationalization of congruence (J. R. Edwards, 2002;Humberg, Nestler, et al., 2019). Finally, although there is initial evidence that the situational context is relevant to the association between congruence and relevant outcomes (Gallagher et al., 2011;Whelan, 2013), characteristics of the situation were usually not considered.
The present study contributes to filling these gaps by (1) examining congruence of trait and state agreeableness and trait and state honesty-humility, (2) including and comparing three different outcomes, (3) modeling congruence with state-of-the-art response surface analyses (Cronbach, 1958;J. R. Edwards, 2002J. R. Edwards, , 2007Humberg, Nestler, et al., 2019), and (4) considering characteristics of the situation.

Consequences of Trait-Incongruent Personality States
Different theories have argued that trait-congruent personality states should be associated with more desirable outcomes than trait-incongruent personality states. The behavioral concordance model (Moskowitz & Côté, 1995) and trait activation theory (Tett & Burnett, 2003;Tett & Guterman, 2000), for example, propose that people experience more pleasant affect when engaging in trait-congruent personality states than in trait-incongruent states. Moreover, the contra-trait effort hypothesis (Gallagher et al., 2011) argues that incongruent personality states require more effort than congruent personality states. Therefore, trait-incongruent personality states should be associated with tiredness, fatigue, and even with impaired performance on subsequent tasks requiring similar effort (Gallagher et al., 2011;Zelenski et al., 2012). Finally, the traitconsistency hypothesis (Fleeson & Wilt, 2010) proposes that people feel more authentic when engaging in trait-congruent personality states than in trait-incongruent states.
Despite this theoretical basis, empirical findings on the consequences of trait-congruent and trait-incongruent personality states are mixed. Most prominently, a series of lab experiments examined associations between trait extraversion, state extraversion, and affect (e.g., Fleeson et al., 2002): Participants were instructed to be either talkative and bold (i.e., extraverted) or reserved and quiet (i.e., introverted) in a 10-minute group discussion. Afterwards, they rated their behavior and affect during the discussion. These experiments modeled interactions between personality traits and personality states to examine whether the association between personality states and positive affect depends on the personality trait level. However, they did not find any evidence for such an interaction; instead, all participants, regardless of their trait extraversion levels, reported higher levels of positive affect and lower levels of negative affect when behaving extravertedly (Fleeson et al., 2002;McNiel et al., 2010;McNiel & Fleeson, 2006;Zelenski et al., 2012; for a similar finding in an experience sampling study, see Margolis & Lyubomirsky, 2020). Similar findings were also reported regarding authenticity. People generally felt more authentic when their behavior was more open, conscientious, extraverted, agreeable, and emotionally stable, regardless of whether this behavior was congruent or incongruent with their personality trait (Cooper et al., 2018;e.g., Fleeson & Wilt, 2010;Wilt et al., 2021).
Other studies-mostly using experience sampling designs-did find evidence for trait-congruence effects. In an experimental experience sampling study, people low on trait extraversion reported weaker increases in positive affect and stronger increases in negative affect, inauthenticity, and tiredness when behaving extravertedly than people high on trait extraversion (Jacques-Hamilton et al., 2018). Moreover, people who behaved less extravertedly than normal in a group discussion reported more effort (Gallagher et al., 2011) and reacted more slowly in a subsequent Stroop task (Zelenski et al., 2012). Additionally, two recent experience sampling studies modeled congruence with quadratic regressions to examine whether the association between personality states and affect takes the form of an inverted U with its peak at the trait level. These studies found that people who behaved less conscientiously or more extravertedly than normal experienced less positive affect and less vitality, respectively, than people behaving congruently (Pickett, Hofmans, Debusscher, et al., 2020;Pickett, Hofmans, Feldt, et al., 2020). However, most of these effects were asymmetric such that, for example, only incongruent personality states below the trait extraversion level, but not above the trait level, were reported to be more effortful (Gallagher et al., 2011;Pickett, Hofmans, Debusscher, et al., 2020;Pickett, Hofmans, Feldt, et al., 2020). The associations between trait-state congruence and relevant outcomes may thus depend on the trait level and the direction of the incongruence.
These examples demonstrate that congruence has been studied with a variety of study designs (from laboratory experiments to longitudinal experience sampling designs and experimental experience sampling) and statistical operationalizations (from trait-state interactions to difference scores to quadratic associations). Particularly in the case of statistical operationalizations, it is important to consider how these operationalizations differ and under which circumstances they support the congruence hypothesis. A method that is well suited for this purpose is response surface analysis (Cronbach, 1958;J. R. Edwards, 2002J. R. Edwards, , 2007Humberg, Nestler, et al., 2019). Response surface analysis models congruence with polynomial regressions that in-Be Yourself and Behave Appropriately: Exploring Associations Between Incongruent Personality States and Positive Affect,...

Collabra: Psychology
clude linear effects, quadratic effects, and interactions. A significant congruence effect (typically called fit pattern) is not represented by a significant interaction coefficient or a significant quadratic coefficient as used in previous studies, but by a more complex pattern of associations involving both interactions and curvilinear effects (J. R. Edwards, 2007;Humberg, Dufner, et al., 2019). This approach is currently considered best practice for examining congruence effects (J. R. Edwards, 2007;Humberg, Dufner, et al., 2019). Moreover, response surface analysis has two important advantages: First, it includes interactions and quadratic effects and therefore makes it possible to compare the results to those found in previous studies. Second, response surface analysis can also model and detect asymmetric effects as reported in some of the previous studies.
Overall, we take away three key points: First, most previous research indicated that there is no interaction between personality traits and personality states with regard to positive affect, though a few recent studies suggest otherwise. Second, it may be important to examine outcomes other than positive affect such as tiredness, authenticity, or cognitive measures. Third, more complex statistical models such as response surface analyses could improve the understanding of the associations between trait-incongruent personality states and relevant outcomes.

Trait-State Congruence in Context
Another important limitation of previous research on trait-state congruence is that most previous studies failed to acknowledge possible influences of the situation. However, personality and behaviors are always contextualized in situations, and situations may therefore affect associations between trait-state congruence and relevant outcomes. Situations contain, for example, information such as incentives or rules about which behaviors are appropriate and will be rewarded and which behaviors are inappropriate and will be punished (Funder, 2016). Behaving extravertedly at a funeral, for example, may have completely different consequences than behaving extravertedly at a party.
Many of the previous studies examining trait-state congruence used a group discussion paradigm (e.g., Fleeson et al., 2002;McNiel et al., 2010;McNiel & Fleeson, 2006;Zelenski et al., 2012). A group discussion represents a situation that probably calls for rather extraverted personality states. Extraverted personality states may be congruent with such a situation and therefore lead to more positive affect. In these studies, the situational context may therefore have confounded the results regarding trait-state congruence. Put differently: The reason why both people low and people high on trait extraversion experienced more positive affect when behaving extravertedly in the group discussions may be that extraverted personality states were appropriate to this situation. The associations between trait-state congruence and positive affect might be different when examined in a situation that does not reward extraverted personality states. Indeed, Whelan (2013) found that the association between extraverted personality states and positive affect weakened when the group discussion paradigm was changed to a group puzzling task. Moreover, Kritzler et al. (2020) found that the association between personality states and state affect was moderated by situation characteristics. For example, the association between state extraversion and state positive affect was stronger when the situation was perceived as more social.
Accordingly, theories such as person-environment fit (J. R. Edwards et al., 1998Edwards et al., , 2006Kristof, 1996) and trait-activation theory (Tett & Burnett, 2003;Tett & Guterman, 2000) propose that state-situation congruence, congruence between personality states and characteristics of the situation, is also associated with outcomes such as positive affect. That is, situation-congruent personality states should have positive consequences, whereas situation-incongruent personality states should have negative consequences. In the context of congruence with personality states, situations may best be conceptualized as situation characteristics from taxonomies such as the Situational Eight DIAMONDS (Rauthmann et al., 2014). Situation characteristics represent psychologically relevant and meaningful interpretations of situations such as their sociality, positivity, or adversity. Because these situation characteristics "describe situations similar to the manner in which people can be described with trait" (Rauthmann et al., 2014, p. 679) and have already been shown to be theoretically and empirically linked to personality dimensions such as the Big Five (e.g., Rauthmann et al., 2014Rauthmann et al., , 2015Rauthmann et al., , 2016, they are well suited for examining state-situation congruence.
Overall, state-situation congruence may have confounded the effects of trait-state congruence. Therefore, it is important to include state-situation congruence when examining trait-state congruence.

An Experimental Situation for Manipulating Personality States and Situation Characteristics
Like many previous studies, we aimed to examine trait-state congruence and state-situation congruence in an experimental study. An experimental manipulation of situation and behavior was necessary to ensure that sufficient congruent and incongruent personality states could be sampled. Moreover, experimental designs have the advantage that they allow causal inferences. However, a group discussion as used in most previous studies (e.g., Fleeson et al., 2002;McNiel & Fleeson, 2006) may not be well suited for our purposes because it is a situation that is difficult to control and manipulate.
We therefore sought a situation that is easier to control and manipulate and that is relevant to personality traits and personality states. We found that economic games such as the prisoner's dilemma (Flood, 1958;Rapoport, 1989;Tucker, 1980) are well-suited for this purpose because they have been associated with personality, and aspects of the situation have been previously manipulated (Kagel & McGee, 2014;Pothos et al., 2011;Zettler et al., 2013). In the prisoner's dilemma, two criminals are interrogated separately by the police for a crime. If both criminals remain silent (i.e., cooperate), they can only be convicted of a lesser crime and spend one year in prison. If one confesses and accuses the other (i.e., defects), the first one goes free while the other spends three years in prison. If both confess, they both go to prison for two years. The criminals have no way of knowing what the other will do. The dilemma in this sit-Be Yourself and Behave Appropriately: Exploring Associations Between Incongruent Personality States and Positive Affect,...

Collabra: Psychology
uation is that each criminal is always better off by defecting than by cooperating, but that mutual cooperation yields a better outcome than mutual defection. In research, the prisoner's dilemma is often reframed to have monetary rewards instead of prison sentences as payoffs (e.g., Kagel & McGee, 2014;Pothos et al., 2011).
Decisions in the prisoner's dilemma are associated with the personality dimensions agreeableness (being patient, gentle vs. being quarrelsome, stubborn) and honesty-humility (being faithful, loyal vs. being sly, hypocritical) (Kagel & McGee, 2014;Pothos et al., 2011;Zettler et al., 2013). Moreover, both the prisoner's dilemma and the personality dimensions agreeableness and honesty-humility are associated with the situation characteristics adversity (the extent to which the situation contains threats, problems, conflicts, competition, or criticism) and deception (the extent to which the situation contains mistrust, lying, or betrayal) (Rauthmann et al., 2014(Rauthmann et al., , 2016. In our study, we aimed to manipulate state agreeableness and state honesty-humility as well as perceived adversity and deception in the prisoner's dilemma. We manipulated state agreeableness and state honesty-humility together 1 by instructing participants to act either agreeably and honestly or disagreeably and dishonestly. We manipulated perceived adversity and deception together by manipulating the game's payoffs and the description and behavior of the other player. We then examined how trait-state congruence and state-situation congruence in the prisoner's dilemma were associated with relevant outcomes. Because trait-state congruence and state-situation congruence have been theoretically and empirically linked to a variety of outcomes (e.g., Gallagher et al., 2011;McNiel & Fleeson, 2006;Zelenski et al., 2012), we examined three different outcomes in our study: positive affect, tiredness, and cognitive performance.

Hypotheses
First, we expected the manipulation to be successful (H1). That is, we hypothesized that participants instructed to behave honestly and agreeably would be more honest, share more often, and report higher levels of state agreeableness and state honesty-humility than participants instructed to behave dishonestly and disagreeably. Moreover, we hypothesized that participants in the more adverse and deceptive condition would report higher levels of adversity and deception than participants in the less adverse and deceptive condition. In addition, we expected the experimental conditions to not be significantly associated with state emotionality, state extraversion, state conscientious, and state openness, nor with the situation characteristics duty, intellect, mating, positivity, negativity, and sociality. Sec-ond, because main effects of personality states and situation characteristics on state affect are well documented in the literature regardless of trait-state congruence and state-situation congruence (e.g., Ching et al., 2014;Kritzler et al., 2020;Sherman et al., 2015), we expected to replicate these main effects (H2). That is, we expected higher levels of state agreeableness and state honesty-humility to be associated with more positive affect, and higher levels of perceived adversity and deception to be associated with more negative affect. Finally, and most importantly, we expected trait-congruent personality states (H3) and situation-congruent personality states (H4) to be related to more positive affect, less tiredness, and better cognitive performance than trait-incongruent and situation-incongruent states.

Methods
Data collection was approved by the local ethics committee of the Ruhr University Bochum (Protocol 544: "Consequences of incongruent behaviors").
The preregistration of sample sizes, hypotheses, and primary analyses is available on the Open Science Framework (Link: https://osf.io/hnu4b/). A summary of deviations from this preregistration, as well as study materials, data, analysis scripts, and supplementary materials are also available on the Open Science Framework (Link: https://osf.io/ 8tbep/). We report how we determined our sample size, all data exclusions, all manipulations, and all measures in the study (Simmons et al., 2012).

Participants
We determined a required sample size of 152 participants based on an interaction effect of ηp2 = .05 reported by Zelenski et al. (2012). However, because we suspected that effects might be smaller in our study, we aimed for a sample size of 200 participants with complete and valid data.
Participants were recruited via CloudResearch's Prime Panels (Chandler et al., 2019). In total, 917 participants landed on the study entry page but 610 were excluded because they did not consent to participating in the study (N = 380), did not fulfill eligibility criteria (N = 35), did not pass the screening (N = 37), answered instructed response items incorrectly (N = 49), or dropped out early in the study. Figure 1 details the flow of participants through the study as well as exclusion criteria and final sample sizes. Because some participants terminated the study before completion (96 participants dropped out before the Stroop task 2 ) and we preregistered using pairwise deletion, the sample sizes for the analyses differed depending on the measures included. Roughly speaking, we had a final sample of about 200 participants for analyses with Stroop performance as We manipulated state agreeableness and state honesty-humility, as well as adversity and deception together because the two dimensions, respectively, are most likely highly correlated in the prisoner's dilemma and it would thus have been difficult and unrealistic to manipulate them separately (Ashton et al., 2014;Rauthmann & Sherman, 2016a).
Exploratory analyses of predictors of attrition revealed a significant correlation between progress in the study and trait extraversion, r = .22, p < .001, but not with any other personality variables, demographic variables, nor with the experimental conditions.

Experimental Paradigm
Our experimental paradigm was a modified version of the classical prisoner's dilemma (Flood, 1958;Tucker, 1980Tucker, , 1983. We reframed the game to be about a jackpot of coins instead of prison sentences. Participants played 15 rounds against a computer-simulated player. In every round, the players had the opportunity to distribute a jackpot between them. Each player independently decided to either split the jackpot or steal the jackpot. If both chose to split, the jackpot was split between them. If one chose to steal while the other chose to split, the former received to jackpot whereas the latter received nothing. If both chose to steal, the jackpot was lost and both received nothing. To allow for more variation in honest and agreeable behavior, the players communicated to each other what they were planning to do before making their final decision-but they could lie or change their mind. We measured the number of honest decisions (i.e., trials in which the participants' communicated intention and the final decision matched) and the number of share decisions (i.e., trials in which the participants' final decision was to share the jackpot) as indices of participants' behaviors during the game.
We aimed to manipulate behavior and situation independently in a 2 (behavior condition: high agreeableness and honesty vs. low agreeableness and honesty) × 2 (situation condition: high adversity and deception vs. low adversity and deception) between-person design. Behavior during the game was manipulated by directly instructing participants to act either honestly and agreeably or dishonestly and disagreeably. 3 The situation was manipulated by varying payoffs and information about the other player. In the low adversity and deception condition, payoffs signaled that stealing is not worthwhile (Zettler et al., 2013) and the other player was framed as and behaved like a trustworthy partner. In the high adversity and deception condition, pay-offs signaled that stealing pays off (Zettler et al., 2013) and the other player was framed as and behaved like an untrustworthy opponent. We tested an earlier version of the experimental paradigm in a preliminary study. Details on this pretest can be found in the Online Supplementary Materials (OSM).

Personality Traits
Personality traits were operationalized as self-reported HEXACO personality traits (Ashton et al., 2004;Ashton & Lee, 2001) measured with the 24-item Brief HEXACO Inventory (De Vries, 2013). Participants indicated on a scale from 1 (disagree strongly) to 5 (agree strongly) how much they agreed with descriptive statements such as "I think science is boring," an item for Openness. Scale scores were computed by reverse-coding items when applicable and then averaging over all items that belonged to a scale. The personality trait scales showed rather low internal consistencies (Honesty-Humility: = .61, Emotionality: = .28, eXtraversion: = .59; Agreeableness: = .22, Conscientiousness: = .52, Openness: = .54) which is to be expected for short scales that are designed to maximize coverage of the construct and convergent validity instead of internal consistency (De Vries, 2013;Ziegler et al., 2014). However, the internal consistencies for Agreeableness and Emotionality were even lower than expected. This limitation is further addressed in the discussion.

Personality States
Personality states were operationalized as self-reported HEXACO personality states measured with bipolar items adopted from Sherman et al. (2015) and Churchyard (2013). All personality states were measured with one bipolar item each adopted from Sherman et al. (2015). State honesty-humility and state agreeableness were additionally measured with three items from Churchyard (2013), and all four items were combined into one scale score for these states. Participants indicated on a 7-point bipolar scale how they behaved in the game situation (e.g., for one of the agreeableness items, the scale ranged from cold, quarrelsome to warm, agreeable). Scale scores (when applicable) were computed by reverse-coding items when applicable and then averaging over all items that belonged to a scale.

Situation Characteristics
Situation characteristics were operationalized as per- The exact wording of the instructions was: "Your task is to behave honestly and cooperatively with your [partner/opponent]. Please try to be true to your word most of the time. Of course, you do not always have to do as you proposed, but please try to be honest and cooperative as much as you can." in the high agreeableness and honesty-condition and "Your task is to behave dishonestly and uncooperatively with your [partner/opponent]. Please try to play dirty most of the time. Of course, you do not always have to deceive your [partner/opponent], but please try to be dishonest and uncooperative as much as you can." in the low agreeableness and honesty-condition.

Figure 1. Detailed flow of participants through the study including exclusion criteria, dropouts, and final sample sizes
ceived situation characteristics from the Situational Eight DIAMONDS taxonomy (Rauthmann et al., 2014). The situation characteristics adversity and deception were measured with three items each taken from the S8* questionnaire (Rauthmann & Sherman, 2016a). The other situation characteristics were measured with one item each taken from the S8-I questionnaire (Rauthmann & Sherman, 2016b). Participants indicated on a scale from 1 (not at all) to 7 (totally) how well certain statements described the game situation they had just encountered (e.g., for deception, one item was "It is possible to deceive someone."). Scale scores (when applicable) were computed by reverse-coding items when applicable and then averaging over all items that belonged to a scale.

Positive Affect and Tiredness
Positive affect was operationalized as the good-bad dimension and tiredness was operationalized as the active-tired dimension of the three-dimensional model of affect (e.g., Matthews et al., 1990;Steyer et al., 1994;Wilhelm & Schoebi, 2007). These dimensions assess the valence (from unpleasant to pleasant) and the energetic arousal (from tired/without energy to awake/full of energy) of the current mood. Positive affect and tiredness were measured with a short form of the multidimensional mood state questionnaire (Steyer, 2021;Steyer et al., 1994) comprising four items each. Participants indicated on a scale from 1 (not at all) to 7 (totally) how intensely they had experienced moods characterized by certain adjectives at the end of the game (e.g., "content" and "tired"). Scale scores were computed by reverse-coding items when applicable and then averaging over all items that belonged to a scale.

Stroop Task
Participants performed a numerical Stroop task (Henik & Tzelgov, 1982) as a measure of cognitive performance. In this task, participants were presented two numbers (ranging from 1 to 9) and indicated which number is numerically larger. The two numbers always had a numerical difference of one and were presented in different physical sizes: In neutral trials, both numbers had the same physical size and only differed in their numerical size. In congruent trials, Be Yourself and Behave Appropriately: Exploring Associations Between Incongruent Personality States and Positive Affect,...

Collabra: Psychology
the numerically larger number was also displayed physically larger. In incongruent trials, the numerically smaller number was displayed physically larger.
Participants first completed a trial round with immediate feedback. The main task then consisted of 75 trials with 15 neutral, 30 congruent, and 30 incongruent trials presented in a random order. Participants were instructed to answer as fast as possible but correctly. We sampled three types of information during the main Stroop task: reaction time, trial type (congruent, incongruent, or neutral), and correctness of the answer.

Comprehension, Attention, and Careless Responding
We included a short screening consisting of four synonym tasks (see Chandler et al., 2019) at the beginning of the study to assess language comprehension and attentiveness. Additionally, we included two instructed response items (e.g., "To secure data quality, please choose answer option 4"; Meade & Craig, 2012) to further identify careless responders. Only participants who passed the screening and provided correct responses on the instructed response items were included in the final sample.

Procedure
The study was implemented using the online survey platform Qualtrics (Qualtrics, 2020). To be eligible, participants had to be at least 18 years old, live in the United States, access the study with a laptop or desktop computer, and consent to participate. After providing informed consent, the participants completed the screening task. Participants who did not fulfill eligibility criteria or did not answer the screening questions correctly were screened out and did not complete the rest of the study.
Participants who passed the screening then answered questions regarding their personality traits and demographics. They then played the prisoner's dilemma game which took on average 14 minutes (Q 1 = 5.65, Q 2 = 7, Q 3 = 8.19). Next, participants answered questions regarding their personality states, perceived situation characteristics, and their positive affect and tiredness at the end of the game in randomized order. Finally, participants performed the numerical Stroop task (duration ~ 7 minutes). In the end, participants were given the chance to comment on the study, created their individual code for further reference, and were compensated for study participation. Importantly, participants' compensations did not depend on the results of the prisoner's dilemma game. Overall, participants took on average 33 minutes (median = 24, SD = 85) to complete the study.  Aust & Barth, 2020). Detailed information about the planned analysis strategy can be found in the preregistration. Here, we only report on the analyses that were ultimately realized. For the manipulation check (H1), we conducted analyses of variance with the different situation characteristics and personality states as dependent variables and effects-coded variables indicating the experimental situation and behavior conditions and their interaction as independent variables. The main effects of personality states and situation characteristics on the positive affect dimension (H2) were examined in a multiple regression analysis. The planned analyses for the congruence hypotheses (H3 and H4) depended on the results from the manipulation check. In short, we planned to perform analyses of variance with the experimental groups and median-split personality traits whenever a manipulation was successful. In addition, and regardless of the success of the manipulations, we conducted response surface analyses (based on polynomial regression) with self-reported situation perceptions, personality states, and personality traits. Moreover, we conducted specification curve analyses for all analyses with cognitive performance indicators as dependent variables. Details on response surfaces analysis and specification curve analysis can be found in the respective sections. Because participants were nested within experimental groups, we controlled for these groups in all analyses regarding congruence hypotheses. We additionally estimated models controlling for the results of the game (i.e., points won and difference between participants' points and the computer's points).

Collabra: Psychology
compare the results to previous studies and fit patterns to use the best practice response surface analyses) and four different outcomes (i.e., positive affect, tiredness, Stroop reaction times, and Stroop error rates). We thus examined a total of 6 (congruence) × 2 (congruence operationalization) × 4 (outcomes) = 48 associations between congruence and relevant outcomes. Because a maximum of eight of these were used to test the same hypothesis, we used a Bonferroni-corrected significance level of = .00625 for these analyses.

Response Surface Analysis
A congruence effect is typically defined as a pattern in which there are certain combinations of two predictors (e.g., equal levels of personality trait and personality state) that are associated with optimal values of an outcome whereas any deviations from these optimal combinations are associated with decreases in the outcome e.g., Schönbrodt, 2016). As discussed above, trait-state congruence has been examined with a variety of statistical operationalizations including interactions, difference scores, and quadratic associations. Importantly, each of these may be indicative of congruence effects, but only under certain circumstances. For example, a significant interaction does not necessarily indicate a congruence effect, but an interaction that causes the direction of the relationship to change (e.g., Jacques-Hamilton et al., 2018) could indicate a congruence effect (compare Figures  2A and 2B). Similarly, a significant negative quadratic association does not necessarily indicate a congruence effect (e.g., Cooper et al., 2018), but a significant negative quadratic association with a peak near the trait level does (compare Figures 2C and 2D). Thus, interactions and quadratic associations may be indicative of congruence effects but only under certain conditions. Response surface analysis-which includes interactions and quadratic associations-facilitates the testing of congruence effects because it precisely defines these conditions. In response surface analyses, a congruence effect is represented by a specific and more complex pattern of associations called a fit pattern (see Figure 2E).
Response surface analyses (Cronbach, 1958;J. R. Edwards, 2002J. R. Edwards, , 2007Humberg, Nestler, et al., 2019) are based on polynomial regressions that include main effects, squared effects, and interactions. Parameters of the model are then statistically and graphically interpreted, and the predicted values are plotted as a three-dimensional surface to facilitate their interpretation. A significant congruence effect is represented by a specific pattern of associations in a polynomial regression (J. R. Edwards, 2007;see Humberg, Dufner, et al., 2019). This fit pattern is defined by specific parameters of the response surface which test the shape of the surface above the line of congruence (i.e., where X = Y) and the line of incongruence (i.e., where X = -Y) and examine the position of the ridge of the surface (the specific criteria for a fit pattern in a broad sense are a 4 < 0, a 2 = 0, a 3 = 0, and a 5 = 0; see Humberg, Nestler, et al., 2019;Nestler et al., 2019). A significant fit pattern indicates that the highest values of a dependent variable (e.g., positive affect) are predicted when the values of both predictor variables are equal, and that the predicted values decrease the more the two predictor variables deviate from each other. As mentioned before, we used both interactions and fit patterns as operationalizations of congruence in this study. Figures 2E and 2F compare response surfaces for congruence operationalized as an interaction (2E) and as a fit pattern (2F): The two approaches are similar (in fact, interactions are a part of the fit pattern), but still differ meaningfully.
For the response surface analyses, personality traits were transformed from a 5-point scale to a 7-point scale. Personality traits, personality states, and situation characteristics were centered on the midpoints of their respective scales (J. R. Edwards & Parry, 1993;Humberg, Nestler, et al., 2019).

Specification Curve Analysis
We conducted specification-curve analyses (Simonsohn et al., 2020) to account for the large number of researcher degrees of freedom in identifying and handling outliers and calculating the summary statistic in the Stroop task (Simmons et al., 2011). The specification curve analyses involved three steps. First, we determined the set of reasonable specifications. We specified different choices for identifying outliers, handling these outliers, selecting the trials to be considered for the calculation of the summary statistic, calculating the summary statistic, and including covariates. In sum, we analyzed a total of 8896 reaction time specifications and 2224 error rate specifications. Details on the set of reasonable specifications can be found in the OSM.
In the next step, the analyses were conducted separately with each specification as the dependent variable and the results were graphically illustrated in descriptive specification curves. These curves displayed the effects of interest (i.e., interaction coefficients or presence of fit patterns) from all specifications ranked by size of the effect. Additionally, these plots indicate which analytic decisions lead to which effects. Descriptive specification curves were used to examine the range of the effects across all specifications, the proportion of significant effects, and how the analytic decisions impacted the estimated effects.
Finally, to come to an overall conclusion regarding the presence of the hypothesized effect, we applied a permutation technique to test how inconsistent the results were with the null hypothesis of no effect across the specification curve. We created 500 data sets by shuffling the values within the dependent variables (i.e., the different reaction time and error rate specifications) and repeated the response surface analyses with these shuffled data. Because of the random distribution of reaction times or error rates, the null hypothesis of no effect is by definition true in these shuffled datasets. The number of shuffled data with at least as many significant specifications as the original data was then used to calculate a p value of the permutation test. Additionally, we graphically compared the specification curve of the effects obtained in the original data to the range of the expected-under-the-null specification curves from the shuffled datasets.
Be Yourself and Behave Appropriately: Exploring Associations Between Incongruent Personality States and Positive Affect,...

Figure 2. Exemplary response surface plots of congruence operationalized as an interaction (A) and congruence operationalized as a fit pattern as defined in response surface analyses (B)
Note. Association between two predictor variables X and Y and an outcome variable Z are plotted as a three-dimensional surface. The color of the surface indicates the predicted levels of the outcome variable with greener colors indicating higher levels and orange to red colors indicating lower levels. The blue lines represent the line of congruence (LOC; i.e., X = Y) and the line of incongruence (LOIC; i.e., X = -Y). Table 1 displays means, standard deviations, and bivariate correlations of the relevant study variables. Because of the high number of different analyses, not all results can be shown and discussed in detail in the main manuscript. Here, we focus on the overarching pattern of results. The online supplementary materials (OSM) entail additional results, tables, and figures.

Manipulation Check
As expected, participants in the high agreeableness and honesty condition reported significantly higher levels of state agreeableness (M = 5.23, SD = 1.40) and state honestyhumility (M = 5.30, SD = 1.51) than participants in the low agreeableness and honesty condition (M agree = 4.32,  However, contrary to our hypotheses, participants in the high agreeableness and honesty condition also reported significantly higher levels of state openness and perceived positivity and significantly lower levels of state emotionality than participants in the low agreeableness and honesty condition (see Table 2). Moreover, we found that participants' concrete behaviors in the game (i.e., deciding to share and being honest when communicating the intended decision) were significantly associated with both the behavior conditions, the situation conditions, and with their interaction. Thus, the manipulations also had unspecific effects on top of the expected effects.
In sum, the manipulation had both specific effects and unspecific effects. Because the the manipulation had the expected specific effects (albeit with small effect sizes in some cases), we decided to perform the planned analyses for the case of a successful manipulation. However, we would also like to explicitly point out that these results should be interpreted with caution due to the unspecific effects.

Associations Between Congruence and Positive Affect
Analyses of variance predicting positive affect from interactions between behavior conditions and median-split personality traits and from interactions between behavior conditions and situation conditions did not yield any significant interaction effects (all Fs < 2.28, all ps > .132). Instead, only the behavior conditions were significantly associated with positive affect (all Fs > 12.89, all ps < .001) such that participants in the high agreeableness and honestycondition (M = 3.83, SD = 0.87) reported significantly more positive affect than participants in the low agreeableness and honesty-condition (M = 3.36, SD = 1.14, d = 0.47, 95% CI [0.21, 0.72]). Including results of the game as covariates did not change anything about this pattern of results. Detailed results from these analyses can be found in the OSM. Overall, the analyses of variance did not yield any evidence for trait-state or state-situation congruence effects with respect to positive affect.
Next, we conducted response surface analyses with selfreported personality traits, personality states, and situation perceptions. Table 3 summarizes the results from the polynomial regressions regarding associations between trait-state congruence and state-situation congruence and positive affect. The polynomial regressions showed that only higher levels of state agreeableness and state honestyhumility were consistently significantly associated with more positive affect across all models (bs ranging from 0.21 to 0.29). In contrast, neither personality traits nor situation characteristics nor any of the trait-state interactions, state-situation interactions, or curvilinear coefficients were significantly associated with positive affect in any of the models (Table 3). Figure 3 shows the response surface plots for the associations between personality traits, personality states, and positive affect (Panels A and B) and between personality states, situation characteristics, and positive affect (Panels C to F). The surfaces were mostly steep along the personality-state axis. This illustrates the results from polynomial regressions that personality states were the only significant predictors of positive affect. Additionally, none of the response surfaces fulfilled criteria for a fit pattern (i.e., a 2 = 0, a 3 = 0, a 4 < 0, and a 5 = 0). Thus, trait-state congruence and state-situation congruence were not significantly associated with positive affect.

Associations Between Congruence and Tiredness
Analyses of variance predicting tiredness from interactions between behavior conditions and median-split personality traits and from interactions between behavior conditions and situation conditions did not yield any significant interaction effects (all Fs < 2.20, all ps > .139). Instead, only the behavior conditions were consistently associated with reverse-coded tiredness (all Fs > 5.84, all ps = .016) such that participants in the high agreeableness and honesty condition (M = 3.54, SD = 0.78) reported significantly more active mood-that is, less tiredness-than participants in the low agreeableness and honesty condition (M = 3.25, SD = 1.02, d = 0.33, 95% CI [0.08, 0.58]). Including results of the game as covariates did not change anything about this pattern of results. Detailed results from these analyses can be found in the OSM. Overall, the analyses of variance did not yield any evidence for trait-state or state-situation congruence effects with respect to tiredness.
Next, we conducted response surface analyses with selfreported personality traits, personality states, and situation perceptions and found a very similar pattern of results (Table 4): Higher levels of state agreeableness and state honesty-humility were associated with more active mood in all models except one (bs ranging from 0.13 to 0.26). In contrast, neither personality traits nor situation characteristics nor any of the trait-state interactions, state-situation interactions, or curvilinear coefficients were significantly associated with tiredness in any of the models (Table 4). Figure 4 shows the response surface plots for the associations between personality traits, personality states, and tiredness (Panels A and B) and between personality states, situation characteristics, and tiredness (Panels C to F). Again, these surfaces were-if at all-mostly steep along the axis of personality states as these were the only significant predictors in the models. The surfaces also did not fulfill criteria for a congruence pattern. Thus, congruence Be Yourself and Behave Appropriately: Exploring Associations Between Incongruent Personality States and Positive Affect,... between personality traits and personality states and congruence between personality states and situation characteristics was not associated with tiredness.

Associations Between Congruence and Stroop Performance
Effects in the Stroop task are often found either regarding reaction times or regarding error rates (Zelenski et al., 2012). We therefore examined both dependent variables separately using specification-curve analyses (Simonsohn et al., 2019).

Stroop Effect
As expected, we found significant Stroop effects in the numerical Stroop task. Participants had significantly longer reaction times in incongruent trials than in congruent trials (median difference across all specifications: 106.71ms) and made more errors in incongruent trials than in congruent trials (median difference across all specifications: 3%). We found these significant Stroop effects in all specifications.
Moreover, permutation tests demonstrated that the null hypothesis of no Stroop effect had to be rejected across the whole specification curve, p < .001 for reaction times and p = .014 for error rates. More details on the Stroop effect can be found in the OSM.

Associations Between Congruence and Reaction Times
We tested the congruence hypotheses both with analyses of variance including interactions between median-split personality traits, behavior conditions, and situation conditions and with response surface analyses examining interactions and fit patterns between self-reported personality traits, personality states, and situation characteristics. Here, we summarize the overall pattern of results. Detailed tables and figures concerning these analyses can be found in the OSM.
In a first step, we conducted separate analyses of variance and response surface analyses for all specifications of the dependent variables and examined interaction effects and fit patterns across the different specifications in de-Be Yourself and Behave Appropriately: Exploring Associations Between Incongruent Personality States and Positive Affect,...

Figure 3. Response surface plots for associations between trait-state congruence (Panels A and B) and state-situation congruence (Panels C to F) and predicted values of positive affect
Note. Situation characteristics were reverse-coded (indicated by (r)) such that high values represent more desirable values and are congruent with high values of the personality states. The response surface parameters are listed below the titles. The blue lines represent the line of congruence (LOC; i.e., X = Y) and the line of incongruence (LOIC; i.e., X = -Y). scriptive specification curves. Figure 5 shows two examples of such descriptive specification curves for the interactions between median-split trait honesty-humility and the behavior conditions (from an analysis of variance) and between self-reported trait honesty-humility and state honesty-humility (from a polynomial regression). Results of the remaining associations can be found in the OSM. Overall, the descriptive specification curves indicated that none or only very few interactions were significantly associated with reaction times in the Stroop task. The only exception was a significant interaction between median-split trait agreeableness and the behavior conditions, which, however, had the wrong sign and thus indicated slower reaction times in congruent conditions. Figures 5C and 5D indicate which analytic decisions corresponded to the effects depicted in Figures 5A and 5B. This view can be used to understand how the analytic decisions affect the size of the effect. Only the decision which conditions (incongruent trials only, congruent trials only, neutral trials only, or all trials) were used to calculate reaction times consistently affected the size of the effect: Interaction coefficients were typically lowest when only neutral trials were analyzed and highest when only incongruent trials were analyzed. In contrast, neither the inclusion of results of the game as covariates nor the summary statistics or choice of cutoffs had a systematic effect on the results.
In the second step, the descriptive specification curves were compared to under-the-null specification curves in permutations tests. Table 5 summarizes the results from these permutation tests for all analyses. Apart from the already discussed significant but wrong-directed interaction, none of the permutation tests for reaction times were significant. That is, the result supported the null hypothesis of no effect across the specification curve. Moreover, Fig-Be Yourself and Behave Appropriately: Exploring Associations Between Incongruent Personality States and Positive Affect,... ures 5E and 5F graphically compare the effects obtained in the original data to the 2.5th and 97.5th percentiles of the ranked effects in the shuffled datasets. As the effects found in the original data were well within the range of effects obtained with the shuffled datasets (in which the null hypothesis of no effect is per definition true), the plots further support the conclusion that there was no interaction effect. Overall, the specification curve analyses clearly indicated that trait-state congruence and state-situation congruence were not significantly associated with reaction times in the Stroop task.

Associations Between Congruence and Error Rates
We repeated the specification curve analyses with error rates as the dependent variables. Figures 6A and 6B again show two examples of descriptive specification curves, the remaining figures can be found in the OSM. Overall, none or only very few interaction coefficients were significantly associated with error rates in the Stroop task. Moreover, none of the analytic decisions consistently affected the size or direction of the interaction effects ( Figures 6C and 6D and ad-ditional figures in the OSM).
Permutation tests confirmed that neither interactions nor fit patterns between personality traits and behavior conditions, between situation and behavior conditions, between personality traits and personality states, or between personality states and situation characteristics were associated with error rates in the Stroop task (Table 5). P values of the permutation test were all bigger than p = .030 and the effects in the real data were well within the 2.5th and 97.5th percentile of the effects found in the shuffled data ( Figures 6E and 6F). Thus, neither trait-state congruence nor state-situation congruence was significantly associated with error rates in the Stroop task.

Discussion
We examined whether two forms of behavioral congruence, trait-state congruence and state-situation congruence, were associated with positive affect, tiredness, and performance in a Stroop task. Overall, the results led to three main findings: First, the manipulation check demonstrated that although the manipulations had the expected Be Yourself and Behave Appropriately: Exploring Associations Between Incongruent Personality States and Positive Affect,...

Figure 4. Response surface plots for associations between trait-state congruence (Panels A and B) and state-situation congruence (Panels C to F) and predicted values of reverse-coded tiredness
Note. Situation characteristics were reverse-coded (indicated by (r)) such that high values represent more desirable values and are congruent with high values of the personality states. The response surface parameters are listed below the titles. The blue lines represent the line of congruence (LOC; i.e., X = Y) and the line of incongruence (LOIC; i.e., X = -Y). effects, they also had unintended unspecific effects. Second, both analyses of variance with experimental conditions and response surface analyses with self-reported situation characteristics and personality states did not yield any trait-state congruence or state-situation congruence effects. Third, positive affect and tiredness were only consistently associated with behavior (i.e., experimental behavior conditions or personality states), whereas Stroop performance was not associated with any of the predictors.

Experimental Manipulations
This study was intended to experimentally manipulate trait-state congruence and state-situation congruence to examine their effects on mood and cognitive performance. We aimed to manipulate the behavior and personality states in a prisoner's dilemma game by directly instructing partic-ipants to act either honestly and agreeably or dishonestly and disagreeably. Direct instructions as manipulations of personality states have already been used in a few studies (e.g., McNiel et al., 2010;Zelenski et al., 2012) and we therefore expected them to be effective. Interestingly, the instructions did have the intended effects on state agreeableness and state honesty-humility, but they also had unintended effects on state openness, state conscientiousness, and state emotionality. The manipulation had therefore rather unspecific effects, affecting not only the targeted personality states but also most of the other personality states. As far as we are aware, previous studies using direct instructions to manipulate personality states did not report whether their manipulations also affected other personality states and it is therefore unclear whether this is a general problem with this type of manipulation or specific to our study. If such manipulations indeed affected not Be Yourself and Behave Appropriately: Exploring Associations Between Incongruent Personality States and Positive Affect,...

Figure 5. Results from specification curve analyses of the associations between the interaction of trait honesty-humility and state honesty-humility (A, C, E) and of state honesty-humility and reverse-coded adversity (B, D, F) and reaction times in the Stroop task
Note. A and B represent descriptive specification curves with specifications ranked by size and significant specifications marked in blue. C and D present the analytics choices that led to the effects displayed in A and B. E and F compare the effect observed in the real data (blue and grey dots) to the 2.5th and 97.5th percentiles (dashed lines) of the effects observed in the shuffled data.
only the desired variables but also undesired variables, this would be a serious threat to the internal validity of these designs. We therefore strongly recommend that future studies using similar manipulations examine their effects on all personality states to test the specificity of the manipulation.
Additionally, we aimed to manipulate perceived adversity and deception of the game by manipulating the game's payoff structure and the description and behavior of the computer-simulated player. To our best knowledge, no one has manipulated such economic games with the intent to manipulate perceived characteristics of the game before. Our manipulation was therefore novel and exploratory. We observed two main problems with the situation manipulation: First, although the manipulation was significantly associated with the perception of adversity (but not of deception), the effect was rather small. The effect of the manipulation therefore might have been too small to show the hypothesized effects even if the other problems had not existed. Second, as situation perceptions and personality states were interrelated, the effects of the manipulations often showed trends towards interactions between the behavior and situation conditions. Thus, it may be particularly difficult to manipulate situation perception and personality states independently from each other.
Be Yourself and Behave Appropriately: Exploring Associations Between Incongruent Personality States and Positive Affect,...  Moreover, the ulterior motive of these manipulations was to manipulate congruence between personality states and personality trait and between personality states and situation characteristics. Descriptive analyses (see OSM) showed that difference scores between the variables (used as an approximation of congruence) differed less between the experimental conditions than the manipulated variables themselves. For example, although state agreeableness was significantly lower in the low agreeableness and honesty condition, participants on average behaved more agreeably than their trait level in all experimental conditions. However, these difference scores should be interpreted with caution because they assume strong equivalence between the trait and state scales (e.g., Schönbrodt, 2016) which may not be given in this study. Overall, experimentally manipulating congruence may thus be even more challenging as it may require more than just manipulating the involved variables (J. R. Edwards et al., 2006) and may be more difficult to prove.

Collabra: Psychology
Despite the many difficulties in manipulating personality states, situation characteristics, and trait-state and state-situation congruence we have already discussed, we are convinced that such manipulations are important. Experimental studies with valid and reliable manipulations can further the understanding of personality in many ways. In the context of this study, it is particularly important to note that such experimental studies could deepen our understanding of personality processes and the causal relationships between personality states and other constructs such as well-being, motivation, or authenticity (e.g., Fleeson & Law, 2015;McCabe & Fleeson, 2016). Furthermore, such research could also help to inform the validity of manipulations in the existing literature (e.g., Fleeson & Wilt, 2010;Jacques-Hamilton et al., 2018;Margolis & Lyubomirsky, 2020;McCabe & Fleeson, 2016). Finally, experimental studies that manipulate personality states and situation characteristics independently from each other may provide important insight into their associations and Be Yourself and Behave Appropriately: Exploring Associations Between Incongruent Personality States and Positive Affect,...

Associations Between Congruence and Positive Affect and Tiredness
We examined how interactions and fit patterns as operationalizations of congruence were associated with positive affect and tiredness. Theories such as trait-activation theory (Tett & Burnett, 2003;Tett & Guterman, 2000) and the contra-trait effort hypothesis (Gallagher et al., 2011) predicted that behaving congruently should be associated with more positive affect and less tiredness, whereas behaving incongruently should be associated with less positive affect and more tiredness. However, we found that neither interactions nor fit patterns were associated with either positive affect or tiredness. Across all models, only state agreeableness and state honesty-humility were consistently significantly associated with positive affect and tiredness.
These findings have both theoretical and practical implications: First, our findings are consistent with those from previous studies that also did not find any interactions or congruence effects but only main effects of personality states (e.g., Fleeson et al., 2002;Fleeson & Wilt, 2010;Mc-Niel et al., 2010;Zelenski et al., 2012). This supports the concept of trait-state isomorphism (Fleeson et al., 2002): Personality states are functionally equivalent to personality traits, that is, they have the same correlates and consequences. Similarly, research on congruence and authenticity has also pointed towards a state-content significance hypothesis (Fleeson & Wilt, 2010) such that only the absolute levels of personality states were associated with authenticity, but not their congruence with personality traits. Future research should thus further examine these ideas of trait-state isomorphism (e.g., van Allen & Zelenski, 2018) and state-content significance. However, it is also possible that our sample might have been too small to detect congruence effects. Within-situation effects are typically rather small and unfold their power by accumulating over many situations (Funder & Ozer, 2019;Rauthmann et al., 2015). Particularly interactions and fit patterns are statistically more complex and thus require a bigger sample size than main effects to be reliably detected. We used a previously reported effect size for power considerations for this study (Zelenski et al., 2012), but this effect was based on an interaction effect between behavioral conditions and extreme groups of trait extraversion in an analysis of variance. It may therefore not have been the best benchmark for the analyses we conducted, particularly for the response surface analyses.
Notably, significant congruence effects were most often reported in experience sampling studies (e.g., Jacques-Hamilton et al., 2018;Pickett, Hofmans, Debusscher, et al., 2020;Pickett, Hofmans, Feldt, et al., 2020; but see also Margolis & Lyubomirsky, 2020), whereas null effects were most often reported in experimental lab studies (e.g., Fleeson et al., 2002;McNiel et al., 2010;McNiel & Fleeson, 2006). One of the systematic differences between these types of studies were their sample sizes with experimental studies typically examining at most 200 participants and experience sampling studies typically examining more than 1,000 different observations. Thus, one might suspect that these differ-ences in sample sizes (and accordingly in statistical power) might at least partially explain the differences in findings.

Associations Between Congruence and Stroop Performance
We also examined how interactions and fit patterns as operationalizations of congruence were associated with reaction times and error rates in a Stroop task. Theories such as the contra-trait effort hypothesis (Gallagher et al., 2011) suggested that behaving incongruently depletes self-control and that this self-control is then unavailable for subsequent tasks requiring mental effort. However, we found that neither interactions nor fit patterns were associated with reaction times or error rates in the Stroop task.
These results can be interpreted in different ways: First, the hypothesis may simply be false and subsequent performance is not impaired by incongruent personality states. As this hypothesis is based on the ego depletion literature (Baumeister et al., 1998(Baumeister et al., , 2007 which has recently been heavily criticized (Carter et al., 2015;Friese et al., 2019;Hagger et al., 2016;Lurquin et al., 2016), there may not be a sound theoretical basis for hypothesizing that congruence is associated with cognitive performance. Second, self-control includes different responses such as controlling thoughts, regulating emotions, or guiding behavior (Baumeister et al., 2007) and the task may not have measured the aspects of self-control that are impacted by incongruent personality states (Gallagher et al., 2011). Third, it could be argued that a certain amount or duration of incongruent personality states is needed to exert enough self-control to have an effect (Gallagher et al., 2011). Our situation may have been too short or the incongruence too small to see the hypothesized effect.
Overall, we did not find any evidence that trait-state congruence or state-situation congruence were significantly associated with cognitive performance in a Stroop task. There are several possible explanations, and based on our study, we unfortunately cannot conclude which of these explanations is correct. However, since the theoretical basis for these hypotheses is currently under heavy criticism, future research seems best advised to wait and see what the ego depletion debate yields before further applying these theories to incongruent personality states.

Importance of Statistical Operationalizations
Our study emphasizes the importance of operationalizations and analytic decisions. Congruence is a concept that has been examined using a multitude of operationalizations from difference scores to interactions to polynomial regressions and response surface analyses (e.g., J. R. Edwards, 1993;Humberg, Dufner, et al., 2019). In our study, we found that although the grand conclusions were the same regardless of the operationalizations, operationalizing congruence as interaction effects and fit patterns did not always yield the exact same results. As discussed before, a significant interaction coefficient, for example, only indicates that the relationship between two variables depends on the level of a third variable. This alone does not necessarily indicate a congruence effect. Therefore, it is important to carefully Be Yourself and Behave Appropriately: Exploring Associations Between Incongruent Personality States and Positive Affect,...

Collabra: Psychology
decide on the operationalization to use and to be aware of the implications of these decisions. Additionally, the theoretical concept of congruence needs to be further refined so that it becomes clearer which operationalizations are appropriate under which circumstances and which empirical findings are informative for these theories, and which are not. As response surface analysis includes and differentiates between the different operationalizations (e.g., Schönbrodt, 2016), it could be a valuable instrument in this endeavor.
Moreover, our study also emphasized the importance of analytic decisions and being aware of researcher degrees of freedom (Simmons et al., 2011). We conducted specification curve analyses (Simonsohn et al., 2020) to account for the multitude of possible analytic decisions regarding the calculation of reaction time and error rate indices in the Stroop task. Specification curve analyses can reveal whether an effect (a) is present across the majority of specifications and thus quite robust, (b) is only present in a subset of specifications and thus depends on known or unknown boundary conditions, or (c) is not present across the majority of specifications and thus most likely does not exist. In our specification curve analyses, the great majority of specifications did not yield any significant congruence or interaction effects and we thus concluded that there is most likely no association between behavioral congruence and performance in the Stroop task. However, we still found that the size of the effects depended on the analytic decisions. Particularly the decision over which type of trials the reaction times were computed (i.e., all trials vs. only congruent/incongruent/neutral trials) was associated with the size of the interaction effects. Additionally, there were some specifications, albeit very few, that resulted in significant effects. These were not enough to lead to a different conclusion, but if we had arbitrarily selected just one specification for the analyses and incidentally selected one of these specifications with a significant effect, our conclusions would have been different. Therefore, it is important to test the complete specification curve to be able to conclude that these are most likely just false positives. Overall, then, this study emphasizes that researchers should be aware of the importance and implications of their decisions and actively incorporate the implications of these decisions into their research more often-for example by using specification curve analyses (Simonsohn et al., 2020).

Limitations
Beyond the well-known and often discussed limitations of self-reports and WEIRD samples (Henrich et al., 2010), this study had a few rather specific limitations. First, the study was conducted online, which allowed us to collect a bigger and more diverse sample than possible in the lab, but also led to complications. We had, for example, less control over the participants and their environments and possible distractions in such an online study. This might be particularly important when cognitive tasks such as the Stroop task are included. Moreover, situation research in an online study may also bring about some challenges because it is unknown how comparable online situations are to "real" situations in the lab or even in everyday life. It is therefore possible that participants would have perceived and reacted to a comparable prisoner's dilemma game in the lab differently than to the online game. Researching situations online could be a great opportunity for situation research as it makes larger and more diverse samples accessible, however, future research should aim to learn more about differences and similarities of situations and situation perception online and offline.
Second, we did not assess whether participants believed to be playing the prisoner's dilemma game against a real person and therefore could not control whether this belief (or differences in the beliefs among the participants) affected their behavior during the game. Whereas there is evidence that economic games work similar on the internet as in the lab (Amir et al., 2012;Baumert et al., 2014) and even when playing against virtual humans or real humans (Nouri & Traum, 2013), there is also some evidence that beliefs about the other players affects behavior in economic games (Eckel & Wilson, 2006;Johnson & Mislin, 2011). Third, as discussed above, we determined the sample size with a power analysis based on a previously reported effect size (Zelenski et al., 2012) but this effect size may not have been the best benchmark for our analyses. Particularly the response surface analyses may have suffered from a loss of power to detect higher order effects such as interactions or quadratic effects. Finally, although internal consistencies are expected to be lower in short scales that aim to increase coverage of the construct, the alpha coefficients for the personality traits Emotionality and Agreeableness were unusually low.

Conclusion
The present study aimed to examine effects of trait-state and state-situation congruence on mood and cognitive performance. The resulting conclusions and future directions are twofold: First, we found that neither trait-state congruence nor state-situation congruence were associated with positive affect, tiredness, or performance in a Stroop task. Instead, positive affect and tiredness were only associated with personality states. Stroop performance was not consistently associated with any of the predictors at all. This study thus seems to support hypotheses like trait-state isomorphism or state content significance rather than congruence hypotheses. However, in light of the limitations of this study, more studies that are carefully designed, carefully operationalized, and well-powered are needed to examine effects of congruence on relevant outcomes such as mood and tiredness-particularly in personality domains other than extraversion. Second, as the manipulations had rather unspecific effects on personality states and perceived situation characteristics, our study emphasized the difficulty of manipulating personality states, situation characteristics, and trait-state and state-situation congruence in general. We need experimental research on personality states and situation characteristics to advance the understanding of personality processes, situations, and their interplay, and to do that we need good manipulations. Thus, although it will likely be a challenging endeavor, future research should aim to establish valid and reliable manipulations of personality states, perceived situation characteris-Be Yourself and Behave Appropriately: Exploring Associations Between Incongruent Personality States and Positive Affect,...

Collabra: Psychology
tics, and trait-state and state-situation congruence.

Contributions
Contributed to conception and design: SK, ML Contributed to acquisition of data: SK, ML Contributed to analysis and interpretation of data: SK Drafted and/or revised the article: SK, ML Approved the submitted version for publication: SK, ML