Processing of Emotions in Speech in Forensic Patients With Schizophrenia: Impairments in Identification, Selective Attention, and Integration of Speech Channels

Leshem, Rotem; Icht, Michal; Bentzur, Roni; Ben-David, Boaz M.

doi:10.3389/fpsyt.2020.601763

ORIGINAL RESEARCH article

Front. Psychiatry, 13 November 2020

Sec. Forensic Psychiatry

Volume 11 - 2020 | https://doi.org/10.3389/fpsyt.2020.601763

Processing of Emotions in Speech in Forensic Patients With Schizophrenia: Impairments in Identification, Selective Attention, and Integration of Speech Channels

$\nRotem Leshem$ Rotem Leshem¹

Michal Icht²

Roni Bentzur³

Boaz M. Ben-David^4,5,6^*

¹Department of Criminology, Bar-Ilan University, Ramat Gan, Israel
²Department of Communication Disorders, Ariel University, Ariel, Israel
³Psychiatric Division, Sheba Medical Center, Tel Hashomer, Israel
⁴Baruch Ivcher School of Psychology, Interdisciplinary Center (IDC), Herzliya, Israel
⁵Department of Speech-Language Pathology, University of Toronto, Toronto, ON, Canada
⁶Toronto Rehabilitation Institute, University Health Networks (UHN), Toronto, ON, Canada

Individuals with schizophrenia show deficits in recognition of emotions which may increase the risk of violence. This study explored how forensic patients with schizophrenia process spoken emotion by: (a) identifying emotions expressed in prosodic and semantic content separately, (b) selectively attending to one speech channel while ignoring the other, and (c) integrating the prosodic and the semantic channels, compared to non-clinical controls. Twenty-one forensic patients with schizophrenia and 21 matched controls listened to sentences conveying four emotions (anger, happiness, sadness, and neutrality) presented in semantic or prosodic channels, in different combinations. They were asked to rate how much they agreed that the sentences conveyed a predefined emotion, focusing on one channel or on the sentence as a whole. Forensic patients with schizophrenia performed with intact identification and integration of spoken emotions, but their ratings indicated reduced discrimination, larger failures of selective attention, and under-ratings of negative emotions, compared to controls. This finding doesn't support previous reports of an inclination to interpret social situations in a negative way among individuals with schizophrenia. Finally, current results may guide rehabilitation approaches matched to the pattern of auditory emotional processing presented by forensic patients with schizophrenia, improving social interactions and quality of life.

Introduction

Schizophrenia is a severe mental disorder that involves a wide range of deficits in cognitive, perceptual, and emotional processes (1–5). Individuals with schizophrenia show deficiencies in different dimensions of social cognition, characterized by an impaired ability to decode (perceive) verbal and non-verbal emotional expressions. In many studies, they have been reported to misattribute negative valence to ambiguous or neutral stimuli (6–12). These tendencies could heighten the risk of violence in schizophrenia (13). Indeed, individuals with schizophrenia are four to six times more likely to commit a violent crime than individuals without schizophrenia (14, 15). This group of violent offenders who have been diagnosed with schizophrenia (hereafter referred to as “forensic patients”) are the focus of interest in both research and prevention efforts in recent years (16, 17).

The current study explores whether forensic patients with schizophrenia process spoken emotion in a similar fashion as their non-clinical peers. Specifically, we target the ability to identify and integrate the emotional content of semantics (literal content) and prosody (tone of speech) of spoken sentences. There is previous evidence in the literature to suggest reduced emotional processing of semantics and prosody both in individuals with schizophrenia and in violent offenders. To the best of our knowledge, no study to date has specifically tested processing emotional content and prosody in spoken sentences among the intersecting population of violent offenders with schizophrenia. Furthermore, the majority of research tools used thus far with this clinical population did not directly assess the integration of information in both auditory channels, a routine task in daily social interactions. The current study attempts to address that gap in the existing research.

Perception of Social Cues in Schizophrenia and Violent Behavior

The relationship between psychotic disorders and violent behaviors is complex and inconclusive (18, 19). Psychotic disorders (including schizophrenia) form the most notable group of disorders in forensic psychiatry services (20), with over 70% of men in high-security hospitals falling within this diagnostic group (21). Of psychotic disorders, schizophrenia is notable, with a high estimated prevalence rate of violent behaviors, ranging from 15.3 to 19.1% in this population [13; (22, 23)].

Research has identified multiple risk factors for aggressive and violent behavior related to schizophrenia (23–25). Deficits in affective processing are suggested as one of the main precursors to violent behavior (26). This type of difficulty is also one of the key features of schizophrenia as defined by the DSM [DSM-5, (27)] and has been identified in various studies [e.g., (6, 28, 29)]. Specifically, individuals with schizophrenia demonstrate problems in the perception of emotional material, verbal as well as non-verbal (6, 7), and they tend to misidentify neutral cues as negatively-valenced (30). For example, patients with schizophrenia have been found to be poorer than controls at recognizing emotions in facial expressions, and have misattributed emotions to neutral expressions (10). As suggested by Weiss et al. (31), misinterpretation of social emotional cues (e.g., angry or fearful facial expressions) along with a negative bias (the tendency to negatively interpret social situations) impairs adaptive behavior in daily life situations. This, in turn, may increase the risk of violent and criminal behavior in schizophrenia (13).

Perception of Emotions in Speech

Spoken communication, and specifically the processing of emotions in spoken language, have an important role in daily social interactions (32, 33). Spoken emotion processing is crucial for the apprehension of other's feelings and development of empathy, which in turn can dampen violence toward another person (8, 12, 34–36). Indeed, when the listener does not fully apprehend the emotion conveyed by the speaker, miscommunication can ensue, with possible negative implications for the quality of social interactions (37) and aggressive and violent behavior.

The perception of spoken emotions involves the integration of several modalities, including visual and auditory channels. In the absence of visual cues (e.g., when talking over the phone) or when visual information is degraded [e.g., due to visual sensory degradation: (38, 39); or due to visual processing impairments that are well-established in schizophrenia: (40)], the ability to derive emotional meaning in spoken language relies on how it is conveyed in two auditory speech channels—the semantic channel (the meaning of the words) and the prosodic channel (the tone of speech, intonation of voice, and indexical cues).

The literature has identified three main components of processing emotional speech in healthy young adults (32, 41–43): (a) Identification of emotions. Listeners successfully identify the emotions expressed in the semantic and prosodic content when presented separately; (b) Selective attention. Listeners fail to selectively attend to one auditory channel while actively ignoring the other, when the task calls for it; (c) Channel integration. Listeners process the emotional content as a whole, affected by the emotions conveyed in both the prosodic and semantic channels. Most notably, the prosody of speech appears to have a much larger impact on emotional judgment than semantics [see, (44, 45)]. Let us now briefly describe what is currently known of these components among forensic patients with schizophrenia.

Identification of Emotions in Forensic Patients With Schizophrenia

Restricted identification of emotions has been well-documented in schizophrenia (46). These deficits were documented not only in the visual modality [facial emotion recognition; e.g., (8)], but also in the auditory modality. Specifically, there is evidence to suggest deficits in identification of emotional prosodies (9) in both pre-attentive and attentive processes [for a review, see (47)]. These were more predominately reported among male patients (48), with specific difficulties in processing negative emotions [sadness, fear, anger; (49)]. Deficits in prosodic processing for patients with schizophrenia were attributed by some researchers to early auditory dysfunction, such as deficits in basic pitch perception and auditory sensory memory (50–52).

Only a limited number of studies investigated the identification of emotional semantic content in schizophrenia (53), generally reporting impairments (1, 54). For example, when asked to identify the semantic emotions of spoken sentences pronounced with neutral prosody, patients with schizophrenia made more errors than controls [(55); averaging across study conditions].

Deficits in identification of spoken emotions (semantics and prosody) for people with schizophrenia were associated with impaired social functioning (56, 57). However, to the best of our knowledge, the literature is silent regarding identification of spoken emotions by forensic patients with schizophrenia. Most studies that tested this population focused on recognition of emotional facial expressions, indicating consistent impairments (13, 18, 31, 58–60). The current study aims to test the identification of prosodic and semantic emotions separately in this population.

Selective Attention in Forensic Patients With Schizophrenia

Attentional deficits, specifically in selective attention, are typical of schizophrenia. These have been identified mainly via research utilizing the visual color-word Stroop test [e.g., (61–63)]. Inflated Stroop effects in schizophrenia reflect a failure to inhibit the salient, yet irrelevant, channel (word semantics) while focusing on the less salient, yet relevant, channel [word font color; for a discussion on the nature of Stroop effects in clinical populations, see (39, 64, 65)]. Another line of research tested selective attention using cross-modality visual-auditory stimuli. Larger failures of selective attention were documented for individuals with schizophrenia, with a complex effect of emotional voice on facial expression processing [(66, 67); for a review, see (47)]. These failures appear to occur already at the perceptual level, with information leakage from one channel to the other [see (68)]. There are only a few studies that tested inhibition deficits in the auditory domain alone (unimodality) for individuals with schizophrenia. Presented with spoken emotion sentences (1, 55), individuals with schizophrenia showed larger failures than controls to selectively attend to one channel (semantics or prosody) while ignoring the other.

Deficits in selective attention and inhibition of irrelevant information may (at least partly) explain violent behaviors in individuals with schizophrenia. Within the context of criminal behavior, selective attention has been associated with behavioral regulation (58, 69, 70). Accordingly, responding to social situations in a flexible and adaptive manner involves efficient inhibition of irrelevant information. Failing to ignore an irrelevant emotional cue, specifically in social situations, may lead to an inappropriate or extreme reaction, including aggressive behavior [(71); with incarcerated offenders, (58)]. For example, recidivism of aggressive behavior was found to be related to reduced selective attention among forensic patients with schizophrenia (72). The current study tests whether this subgroup performs differently in selective attention and inhibition of emotional speech channels (prosodic and semantic emotions) than controls.

Integration of Channels in Forensic Patients With Schizophrenia

Many daily situations involve the integration of information conveyed concurrently by multiple sensory channels, e.g., visual and auditory. For example, processing emotional face-voice information involves the integration of affective cues conveyed by the two sensory modalities into a unified, multisensory percept (73). Impairment of multisensory integration is a well-known characteristic of schizophrenia (74–76). However, to the best of our knowledge, integration across auditory channels, in general, or of prosodic and semantic content, specifically, has not yet been examined in forensic patients with schizophrenia.

The Current Study

The current study aimed to test, for the first time, the perception of emotions in spoken language in people diagnosed with schizophrenia who committed severe violent offenses. To this end, the Test of Rating of Emotions in Speech [T-RES, (41)] was used to separately gauge the apprehension of semantics and prosody, and their relative roles in processing of spoken emotions, as depicted in Figures 1, 2.

FIGURE 1

Figure 1. General design of T-RES stimuli. All combinations of prosody and semantics (16) are presented in each emotional rating block (note: neutral semantics spoken with neutral prosody was deemed uninformative and confusing and was not presented). A, example of congruent stimulus (happy semantics and happy prosody); B, example of incongruent stimulus (happy semantics and angry prosody); C, example of baseline semantics (happy semantics and neutral prosody); D, example of baseline prosody (neutral semantics and happy prosody).

FIGURE 2

Figure 2. General design of T-RES: Rating tasks and rating blocks.

In this test, participants listen to sentences that present emotional semantic and prosodic content in different combinations, both congruent and incongruent. In three separate tasks, listeners are asked to rate the extent to which they agree that a sentence conveys a predefined emotion, while focusing on either the semantic or the prosodic channel, or on both. The performance on each of these tasks directly tests three distinct components of processing of emotional speech: (a) Identification of emotions in the tone of speech (prosody) and semantics, (b) Selective attention by focusing on one channel while ignoring the other, and (c) Integration of the prosody and semantic content, thereby processing the spoken emotion sentence as a whole. The literature reviewed thus far led to the following predictions:

Impairment in Identification of Semantics and Prosodic Emotional Cues

Based on the literature, we hypothesized that forensic patients with schizophrenia would show impairments in identification of emotions presented in the semantic and prosodic channels—that is, assigning lower emotional ratings (i.e., less intense emotions) than their peers. To test this, performance on the baseline condition (in which one channel conveys neutrality) was gauged. For example, we tested whether forensic patients with schizophrenia would correctly identify the happy emotional semantic content of the sentence “I won the lottery today” spoken with neutral prosody. Similarly, we tested whether they would correctly identify the happy emotional prosody of the neutral semantic sentence “Red pipes are metallic” spoken with happy prosody (see white cells C and D in Figure 1). A group difference in these measures, if found, would suggest that forensic patients with schizophrenia process emotions in the prosodic or semantic channels differently than controls.

Failure in Selective Attention

We hypothesized that forensic patients with schizophrenia would fail to selectively attend to a specific channel (prosody or semantics) while actively ignoring the other, to a larger extent than controls. To test this, listeners were asked to rate the emotions presented in one channel (e.g., semantics) while ignoring the other channel (prosody) that conveys a different emotion (for semantic and prosodic rating of incongruent spoken sentences, see black cells in Figure 1).

Integration of Channels

In light of missing evidence in the literature, a hypothesis was not made as to whether forensic patients with schizophrenia would be less biased to the prosodic channel than controls when asked to integrate both prosodic and semantic channels. This was tested directly by the prosodic dominance measure, in which ratings of sentences that convey a designated emotion only in prosody are compared with those that convey this emotion only in semantics (for incongruent sentences, see black cells in Figure 1).

Materials and Methods

The study received ethics approval from the medical center and two academic institutes affiliated with the authors. The study was carried out in accordance with the Declaration of Helsinki, and informed consent was obtained from all individual participants.

Participants

The clinical group consisted of 21 male participants diagnosed with schizophrenia with a violent criminal record, who volunteered to participate with no monetary compensation (two additional participants had been excluded: one due to his age, 64 years, which exceeded the inclusion criteria; another failed to follow task instructions). They were recruited from the Maximum Secure Unit (MSU), a unique setting in a national mental health center in central Israel. All were under court-ordered compulsory hospitalization due to severe violent behaviors (including murder and rape). Based on the MSU's medical records (obtained by the MSU department heads), all had been diagnosed with ICD-10 schizophrenia (mean of duration from initial diagnosis = 8.6 years, SD = 6.6 years, range = 1–21 years), and nine of the 21 participants had reported a history of substance addiction prior to incarceration. All participants were stable, had no change to their treatment regimen during the last 4 months, and possessed the capacity to provide informed consent.

The control group consisted of 21 male volunteers from the general population that matched the clinical participants in socio-demographic characteristics (see Table 1). They were recruited by advertisements in and around the campus (including a local mall) and received the equivalent of $25 to compensate for their participation time.

TABLE 1

Table 1. Participants' background data.

Inclusion Criteria

Participants in both groups reported normal hearing (with no reported pathologies or history of hearing disorders), normal or corrected-to-normal vision, and no history of head trauma, neurological illness, or current substance use. To evaluate their basic cognitive auditory span, which may affect spoken language processing (77), the auditory forward digit span was administered to all participants, with the expected reduced performance for the clinical group (see Table 1).

Measures and Tools: Test of Rating of Emotions in Speech (T-RES)

The Hebrew version of the T-RES (78) was used, with the following emotions: anger, happiness, sadness, and neutrality. The T-RES consists of three tasks. Two of the tasks relate to selective attention: (a) prosodic rating, in which listeners are requested to rate the emotion based only on prosodic information; and (b) semantic rating, in which listeners are requested to rate the emotion based only on semantic information. The third task was a general rating, an integration task in which listeners are requested to rate the emotion of the sentence as a whole. All spoken sentence stimuli had been pre-recorded by a professional female actress.

Stimuli

Figure 1 presents the makeup of the T-RES stimuli: the 15 spoken sentences in each semantic category are represented once in each of the tested prosodies, generating a 4 (semantic) × 4 (prosody) matrix. The cell marked “A” represents a congruent stimulus; e.g., a semantically happy sentence spoken with happy (congruent) prosody. Incongruent stimuli are represented by the cell marked “B”; e.g., a semantically happy sentence spoken with angry (incongruent) prosody. Baseline sentences present neutral content in one channel and emotional content in the other. In semantic baseline sentences, cell “C,” semantically emotional sentences (e.g., happy) are spoken with neutral prosody. In prosodic baseline sentences, cell “D,” semantically neutral sentences are spoken with emotional prosody (e.g., happy). For a full description of the characteristics of the spoken sentences and how they were constructed, see the research of Ben-David et al. (32, 42, 79)

Apparatus

The spoken sentences were presented on a 2.20 GHz Intel personal computer, using a 15.4-in. LCD monitor, via professional AKG K240 headphones, at a comfortable listening level (as confirmed by each participant). A research assistant was present throughout the experimental session, which lasted about 30 min.

Procedure

Upon arrival, all participants received an explanation of the experimental tasks and those wishing to participate signed an informed consent form. The T-RES session was conducted only after participants were found to meet the inclusion/exclusion criteria. Subsequently, all participants were tested individually in a quiet room: the participants with forensic schizophrenia were tested in the MCU and the control participants were tested at the academic institute.

In the T-RES, each sentence is rated on three separate rating blocks, as depicted in Figure 2. For each trial, using a 6-point Likert scale, listeners are asked to rate “How much do you agree that the speaker conveys______ (anger, sadness, or happiness)? From 1—strongly disagree to 6—strongly agree.”

The experimental session began with the general rating task for all participants. For a randomly chosen half of the participants in each group, this was followed by the semantic rating task and then the prosodic rating task. This order was reversed for the other half of the participants. The order of the three emotion-rating blocks was counterbalanced by using the Latin square design, and the order of the trials in each block was fully randomized. In sum, each sentence was presented three times in each task, once in each of three rating blocks (anger, sadness, and happiness), with a total of 135 trials per session (conducted in under 25 min). The full description of the T-RES stimuli, design, and task is specified in previous works [e.g., (42)]. Reliability and validity of the tool are fully detailed in (32).

Statistical Analyses

All of the following analyses used mixed-model repeated-measures ANOVAs (GLM) with average ratings as the dependent variable, Group (x2: forensic patients with schizophrenia vs. control) and Native Language (x2: native Hebrew speaker or not) as between-participants variables, and Target Emotion (x3: anger, sadness, or happiness) as a within-participants variable. Each test included one other within-participants variable. In prosodic- and semantic-rating tasks, Target Channel (x2: prosodic vs. semantic) was also used as a between-participants variable. Partial eta squared (η_p²) was used as the measure for power in all statistically significant tests. As separate analyses did not find that criminally-related background characteristics (e.g., murder conviction and incarceration in a secure ward) impacted performance in the T-RES among the forensic patients with schizophrenia, they will not be further discussed.

Results

Identifications of Emotions Presented in the Prosodic and Semantic Channels

The first analysis tested whether both groups could correctly identify emotions in the prosody and semantic channels, respectively (prosodic- and semantic-rating tasks). This was tested in baseline sentences, when the to-be-ignored channel was neutral (represented by white cells in Figure 1). The tested variable was Emotion Identification, which was the difference between ratings of target-emotion-present trials (in which the target emotion was present in the attended channel) and target-emotion-absent trials (in which the target emotion was absent from the attended channel). The data is presented in the upper section of Table 2, and graphically displayed in Figure 3A.

TABLE 2

Table 2. Summary of ratings (Means and SDs), averaged across target emotions, for the forensic patients with schizophrenia and the control group, with F values of the comparison.

FIGURE 3

Figure 3. A graphic description of ratings in the T-RES tasks, separately for forensic patients with schizophrenia and controls. The error bars are standard errors of their respective means. (A) Identification, comparing target emotion-present and target-emotion-absent trials in the prosodic and semantic rating tasks; (B) Selective Attention, comparing congruent and incongruent trails, in the prosodic and semantic rating tasks; (C) Integration, presenting three types of target-emotion-present trials in the general rating task; (D) Integration, comparing an average of target-emotion-present trials with target-emotion-absent trials in the general rating task.

A main effect for Emotion Identification was found, F_(1,38) = 379.7, p < 0.001, η_p² = 0.91, that significantly interacted with Group, F_(1,19) = 12.5, p = 0.001, η_p² = 0.25, indicating a larger effect for the control group than the clinical group (clinical group: F_(1,19) = 76.9, p < 0.001, η_p² = 0.80; control group: F_(1,19) = 794.7, p < 0.001, η_p² = 0.98). Target Channel (Prosody or Semantics), Target Emotion (Anger, Happiness, or Sadness), and Native Language (Native Hebrew speaker or Non-Native Hebrew Speaker) were each not found to generate a significant interaction with Group membership (clinical or control) and Emotion Identification (F < 1.3, p > 0.25).

In sum, the analyses indicated that both groups clearly identified the presented emotions in both prosody and semantics. However, participants in the control group were better able than the clinical group to distinguish between target-emotion-present (sentences that present the rated emotion in the target channel) and target-emotion-absent trials (sentences that do not present the rated emotion).

Selective Attention to the Prosodic or the Semantic Channel

Selective attention was gauged by comparing average ratings of congruent sentences (presenting the rated-emotion in both channels) with incongruent sentences (presenting the rated-emotion only in the target channel), denoting the Selective Attention variable. The data is presented in midsection of Table 2 and graphically displayed in Figure 3B.

A significant main effect for Selective Attention, denoting failures of selective attention, was indicated, F_(1,38) = 29.3, p < 0.001, $η_{p}^{2}$ = 0.44, with larger failures found in the clinical group than in the control group (a significant interaction of Selective Attention and Group variables), F_(1,38) = 14.5, p = 0.001, $η_{p}^{2}$ = 0.28. A main effect for Group, F_(1,38) = 22.7, p < 0.001, $η_{p}^{2}$ = 0.37, indicated that the clinical group generally provided lower ratings (regardless of the stimulus type) than the control group. That is, averaged across congruent and incongruent sentences, forensic patients with schizophrenia gauged the rated emotion as less intense than controls.

Failures of selective attention were significantly higher when listeners were asked to ignore the prosody and focus on the semantics than vice versa (an interaction of Selective Attention and Target Channel) across both groups, F_(1,38) = 15.9, p < 0.001, $η_{p}^{2}$ = 0.30), and separately for the clinical group, F_(1,19) = 13.4, p = 0.002, η_p² = 0.41, but not for the control group, F_(1,19) = 3.0, p = 0.10 (see also a marginally significant triple interaction for Selective Attention, Group, and Target Channel, F_(1,38) = 3.84, p = 0.057, $η_{p}^{2}$ = 0.09). Target Emotion (Anger, Happiness, or Sadness) and Native Language (Native Hebrew speaker or Non-Native Hebrew Speaker) were each not found to generate a significant interaction with Group membership (clinical or control) and Emotion Identification (F < 0.35, p > 0.7).

To conclude, it appears that failures of selective attention were substantially more prominent for the clinical group than for the control group, with larger failures in inhibiting the prosodic than the semantic information.

Integration of Channels and Channel Dominance

Figure 3C presents a graphic description of ratings of Trial Types in the general rating task, averaged across the three emotion rating blocks, separately for forensic patients with schizophrenia and control groups. From left-to-right, Figure 3C presents average ratings for congruent trials (the rated emotion appears in both channels), prosody trials (the rated emotion appears only in the prosody) and semantic trials (the rated emotion appears only in the semantics). There are two highly notable features of Figure 3C: (a) the similarity in the trend congruent > prosody > semantic trials in both groups; (b) higher ratings indicated by the control group, in all target-emotion-present trials (indicating more intense emotional ratings).

The statistical analyses supported these trends, with a significant linear trend (congruent > prosody > semantic) across groups, F_(1,38) = 164.8, p < 0.001, $η_{p}^{2}$ = 0.81, that did not interact significantly with Group membership, F_(1,38) = 1.0, p = 0.32. Across Trial Types and Target Emotions, the clinical group provided lower ratings than the control group, F_(1,38) = 10.7, p = 0.002, $η_{p}^{2}$ = 0.22. Notably, this effect of Group interacted significantly with the Target Emotion (Anger, Happiness, or Sadness), F_(2,76) = 11.1, p < 0.001, $η_{p}^{2}$ = 0.23. In other words, the clinical group provided lower ratings than the control group, indicating less intense emotional ratings, but the extent of this effect was dependent on the specific target emotion. In separate analyses conducted for each target emotion, the group difference in ratings was significant for the two negative emotions [Anger: F_(1,38) = 20.9, p < 0.001, $η_{p}^{2}$ = 0.36; Sadness: F_(1,38) = 7.7, p = 0.009, $η_{p}^{2}$ = 0.17), but not for the positive one (Happiness: F_(1,38) = 0.26, p = 0.61]. Additionally, Native Language did not interact with the linear trend or Group, nor did we find a significant interaction of the three (F < 1, p > 0.33 for all).

Finally, Figure 3D presents ratings for target-emotion-absent trials (the target emotion is absent from the semantics and the prosody) alongside target-emotion-present trials (average of target-emotion-congruent, prosody, and semantic trials). Analysis showed that discrimination, the difference between target-emotion-present and -absent trials, was reduced for the clinical group relative to the control group, F_(1,38) = 13.5, p = 0.001, $η_{p}^{2}$ = 0.26 (a significant interaction of Group and Trial Type).

In sum, the group of forensic patients with schizophrenia rated the negative emotions tested (Anger and Sadness) as less intense (lower ratings for target-emotion-present trials) than the control group. However, the positive emotion tested (Happiness) was rated as similarly intense (similar ratings for target-emotion-present trials) in both groups. In other words, the clinical group integrated the prosodic and semantic channels similarly to the control group, but under-rated the negative emotional information. Their ratings also indicated lower discrimination between target-emotion-present and target-emotion-absent trials—i.e., confusion in emotional ratings.

Discussion

The present study aimed to examine the processing of emotions in spoken language (conveyed by the semantic and prosodic channels) in violent offenders diagnosed with schizophrenia. Three distinct components of auditory emotional processing were assessed: identification, selective attention, and integration. To this end, we used the T-RES, a tool dedicated to examining the processing of spoken emotions. The results indicated that forensic patients with schizophrenia successfully identified spoken emotions, but discriminated less effectively between emotions than controls. They also demonstrated larger failures to inhibit prosodic information while focusing on the semantics. Although they integrated the prosodic and semantic channels similarly to the controls, the forensic patients with schizophrenia under-rated negative emotional information (anger and sadness).

Intact Identification of Emotions, but Reduced Discrimination

The findings of the current study indicate that forensic patients with schizophrenia were able to identify the presented emotions in both prosody and semantics. That is, ratings related to the degree of agreement that the target emotion was present were significantly higher (4.5–5.5) when indeed the target emotion was present (in either channel), than when it was absent (2.2–2.5). These results provide strong evidence to the preserved emotion identification abilities of forensic patients with schizophrenia, as the great majority of T-RES sentences (20 of 24) convey the target semantic emotions in an implicit manner (e.g., “You've won first place”), rather than explicitly, as tested in previous studies with this population [e.g., “I am happy to come dining with you” in (1)]. The current results were somewhat surprising, as deficits in identification of emotional prosodies [e.g., (9, 48)] and semantics (1, 53, 55) are considered well-known characteristics of schizophrenia.

Although identification of spoken emotions in the current study was intact for forensic patients with schizophrenia, they showed reduced ability to discriminate between emotions, relative to controls. Namely, their ratings indicated smaller differences between sentences that presented the rated emotions and sentences that did not (target-emotion-present vs. -absent). This pattern echoes previous findings (80) in which forensic patients with schizophrenia were better than non-forensic patients with schizophrenia at identification of facial emotional expressions, but less accurate at assessing their emotional intensity [for a similar effect with reduced feature discriminability in the presence of emotional words, see (81)].

Larger Failures of Selective Attention and Prosodic Dominance

In the current study, forensic patients with schizophrenia were found to perform with substantially larger failures of selective attention than matched controls. As aforementioned, such failures have been previously documented in the auditory domain for patients with schizophrenia [e.g., (1, 55)]. The current study expands this evidence, for the first time, to the unique group of forensic patients with schizophrenia. Failure to inhibit irrelevant auditory information (e.g., an emotional cue available in a social interaction) may lead to deficits in behavioral regulation, impulse control, and aggressive behaviors (58, 72). This, in turn, may lead to the criminal behavior that has been documented in forensic patients with schizophrenia.

Methodologically, it is also noteworthy that the majority of previous studies that found selective attention deficits in forensic patients with schizophrenia used neuropsychological tasks (e.g., Stroop, Go-no-go). In contrast, the current study showed similar evidence using an ecological task that mimics daily social behavior—the processing of emotions in spoken sentences. Therefore, increased failures of selective attention, as documented in the current study, can be more easily generalized to daily life situations for forensic patients with schizophrenia.

Failures of selective attention found in the current study were more prominent when the clinical participants were asked to inhibit the prosodic than the semantic information. This may hint that the prosodic channel is more dominant than the semantic one, when the task calls for selective attention. A prosodic bias may indeed be related to violent behaviors. Consider, e.g., the semantically neutral everyday sentence “Hi neighbor, could you please place the garbage in the container?” spoken with a stern, serious prosody. As violent offenders may display a “hostile attribution bias,” a tendency to view neutral expressions and behaviors as hostile [(82); for a review, see (26)], failing to inhibit the (negative) prosodic cues may lead to inappropriate social reactions for forensic patients with schizophrenia [see also (83)]. Indeed, poor executive functioning (e.g., inhibition) has also been associated with the risk of aggressive-behavior recidivism in schizophrenic patients (72).

Preserved Integration of Prosodic and Semantic Information, but Under-Rating of Negative Emotions

The current study is the first to demonstrate a preserved ability of forensic patients with schizophrenia to integrate emotional information presented in two separate auditory channels: prosody and semantics. As deficits in multisensory integration are common in schizophrenia (74–76), the current data may suggest that performance is preserved when uni-sensory (auditory) integration is called for. This preserved ability has clinical importance, given that the stimuli used by the T-RES are spoken sentences rather than single words [e.g., (53)]. This may be especially challenging, considering the attentional and verbal working memory deficits often reported in this population (84) and documented in the current study (see digit span data in Table 1). The presence of underlying challenges in these executive functions amplifies the strength of the finding of preserved (uni-sensory) channel integration.

One of the indicators of preserved integration is congruency supremacy (41). Indeed, in the current study, congruent sentences (which present the same emotion in both channels) received higher emotional ratings (indicating the most intense emotion) than all other rated-emotion-present trials (prosodic and semantic trials) among both groups, replicating previous findings with the T-RES paradigm. This effect somewhat echoes previous findings on schizophrenia by Brazo et al. (1). In their study, although individuals with schizophrenia were less accurate than their matched controls at categorizing spoken sentences conveying emotion, they benefitted from the redundancy of information in sentences with congruent prosody-semantics more than controls [for a discussion of redundancy gains in congruent presentation, see (85, 86)].

Interestingly, in the present study, forensic patients with schizophrenia under-rated negative (spoken) emotional information, somewhat in contrast with evidence in the literature on a negative bias in recognition of facial expressions [visual information; see (31)]. In our data, when asked to rate anger or sadness, the clinical group provided lower ratings than their peers, but no such differences were found for the positive emotion. For example, when asked to rate a spoken sentence that conveys happiness and anger in different channels, the clinical group provided lower anger ratings than the control group, while no significant group differences were documented for happiness ratings. This may suggest that forensic patients with schizophrenia have specific difficulties in processing spoken negative affect, unlike spoken positive affect. A study by Klumpp et al. (87) similarly found that, among patients with schizophrenia, negative semantics elicited a unique evoked response potential (N400) that did not occur with positive semantics.

Alternatively, one may relate the reduced ratings in emotional discrimination and integration that was documented for the clinical group as reflecting a flat effect—an experience of reduced emotional intensity, a well-known schizophrenia symptom [for a discussion, see (88)]. However, if forensic patients with schizophrenia were to show a flat affect, then lower ratings should have been reflected on all emotional rating scales. As the current data indicated lower ratings only on the negative emotions (see Figure 3), our findings do not appear to support the notion of a flat affect effect among the clinical group.

Caveats and Future Directions

A possible limitation of the current study concerns the clinical sample that included only male offenders. However, males represent the majority of offenders in secure mental wards (89). There is also evidence in the literature to suggest that males may be especially susceptible to dysfunction in emotional processing, whereas recognition of affective prosody and emotional semantics may be preserved in females [e.g., (55)]. Future studies may wish to include female offenders as well, evaluating possible gender differences. In addition, as the subgroup of forensic patients with schizophrenia differs from non-violent patients with schizophrenia, further studies should compare performance between the two groups.

A few limitations also relate to the T-RES instrument itself. First, the sentences were recorded by one professional female actress, rather than different speakers. Although this may potentially decrease the generalizability of the data, we argue that this also minimizes confounding factors. Second, the current study tested only Hebrew speakers. Since the perception of emotions in speech may be affected by cultural variables (90, 91), future studies may wish to examine the validity of the results when testing individuals from various cultures (or languages) with appropriate stimuli (92). Third, the T-RES evaluates the processing of basic and concrete emotions. Possibly, group differences may be more pronounced if more abstract and complex emotions (e.g., boredom, envy) would be tested. Future studies may wish to examine the processing of such emotions as well.

Clinical Implications

The current study's results may be useful to guide new rehabilitation approaches matched to the pattern of auditory emotional processing presented by forensic patients with schizophrenia. Forensic patients with schizophrenia may respond poorly to verbally-mediated treatment programs, as they processes spoken emotions differently than intended by the speaker. This should be acknowledged by the therapist. Moreover, targeted programs could focus on remediation of difficulties in discrimination between emotions, failures in inhibiting prosodic information, and the tendency to under-rate negative emotional information. These programs could use explicit or implicit methods to train participants to pay attention to emotional features they may have missed; relying on the preserved abilities of forensic patients with schizophrenia to identify spoken emotions and to integrate the semantic and prosodic speech channels. For example, we suggest tailoring Social Cognition Training Programs, which have been found to show promise in improving prosodic-affect recognition in schizophrenia [for reviews, see (93, 94)].

The results also support the use of the T-RES as a sensitive tool in identifying the nuances of components underlying the processing of spoken emotions in various clinical populations. Recently, in response to COVID-19 social restrictions, a remote adaptation (an online version) of the T-RES has been validated, iT-RES (95), increasing the feasibility of the test. We suggest incorporating the iT-RES to the arsenal of assessment tools for forensic patients with schizophrenia, to better portray idiosyncratic emotion processing performance, even in tele-health. As suggested by Leshem et al. (26), identifying difficulties in spoken emotion processing might also assist in prevention of recidivism in forensic populations.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics Statement

The studies involving human participants were reviewed and approved by IRB, Shaar Menashe Mental Health Center IRB, Psychology, Bar-Ilan University IRB, Psychology, Interdisciplinary Center (IDC) Herzliya. The patients/participants provided their written informed consent to participate in this study.

Author Contributions

The manuscript was written by RL, MI, and BB-D. Research design was conducted by RL and BB-D. Data was collected under the supervision of RB, RL, and BB-D. Data analysis was conducted by BB-D. BB-D was the corresponding author for the paper. All authors contributed to the article and approved the submitted version.

Funding

The corresponding author's lab was partially supported by a grant from the Israeli Science Foundation (ISF; 861/18).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

We thank Ms. Maya Mentzel and Mr. Wil Shapiro for their invaluable contribution to data gathering.

References

1. Brazo P, Beaucousin V, Lecardeur L, Razafimandimby A, Dollfus S. Social cognition in schizophrenic patients: the effect of semantic content and emotional prosody in the comprehension of emotional discourse. Front Psychiatry. (2014) 5:120, 1–8. doi: 10.3389/fpsyt.2014.00120

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Green MF, Penn DL, Bentall R, Carpenter WT, Gaebel W, Gur RC, et al. Social cognition in schizophrenia: an NIMH workshop on definitions, assessment, research opportunities. Schizoph Bull. (2008) 34:1211–20. doi: 10.1093/schbul/sbm145

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Ochsner KN. The social-emotional processing stream: five core constructs and their translational potential for schizophrenia and beyond. Biol Psychiatry. (2008) 64:48–61. doi: 10.1016/j.biopsych.2008.04.024

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Orellana G, Slachevsky A. Executive functioning in schizophrenia. Front Psych. (2013) 4:35. doi: 10.3389/fpsyt.2013.00035

PubMed Abstract | CrossRef Full Text | Google Scholar

5. van't Wout M, Aleman A, Kessels RP, Larøi F, Kahn RS. Emotional processing in a non-clinical psychosis-prone sample. Schizoph Res. (2004) 68:271–81. doi: 10.1016/j.schres.2003.09.006

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Edwards J, Jackson HJ, Pattison PE. Emotion recognition via facial expression and affective prosody in schizophrenia: a methodological review. Clin Psychol Rev. (2002) 22:789–832. doi: 10.1016/S0272-7358(02)00130-7

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Edwards J, Pattison PE, Jackson HJ, Wales RJ. Facial affect and affective prosody recognition in first-episode schizophrenia. Schizoph Res. (2001) 48:235–53. doi: 10.1016/S0920-9964(00)00099-2

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Kohler CG, Walker JB, Martin EA, Healey KM, Moberg PJ. Facial emotion perception in schizophrenia: a meta-analytic review. Schizoph Bull. (2010) 36:1009–19. doi: 10.1093/schbul/sbn192

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Leitman DI, Foxe JJ, Butler PD, Saperstein A, Revheim N, Javitt DC. Sensory contributions to impaired prosodic processing in schizophrenia. Biol Psych. (2005) 58:56–61. doi: 10.1016/j.biopsych.2005.02.034

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Premkumar P, Cooke MA, Fannon D, Peters E, Michel TM, Aasen I, et al. Misattribution bias of threat-related facial expressions is related to a longer duration of illness and poor executive function in schizophrenia and schizoaffective disorder. Europ Psychiatry. (2008) 23:14–19. doi: 10.1016/j.eurpsy.2007.10.004

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Ross ED, Orbelo DM, Cartwright J, Hansel S, Burgard M, Testa JA, et al. Affective-prosodic deficits in schizophrenia: profiles of patients with brain damage and comparison with relation to schizophrenic symptoms. J Neurol Neurosurg Psychiatry. (2001) 70:597–604. doi: 10.1136/jnnp.70.5.597

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Trémeau F. A review of emotion deficits in schizophrenia. Dial Clin Neurosci. (2006) 8:59–70.

PubMed Abstract | Google Scholar

13. De Sanctis P, Foxe JJ, Czobor P, Wylie GR, Kamiel SM, Huening J, et al. Early sensory–perceptual processing deficits for affectively valenced inputs are more pronounced in schizophrenia patients with a history of violence than in their non-violent peers. Soc Cogn Affect Neurosci. (2013) 8:678–87. doi: 10.1093/scan/nss052

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Hodgins S. Violent behaviour among people with schizophrenia: a framework for investigations of causes, effective treatment prevention. Philos Trans R Society B. (2008) 363:2505–18. doi: 10.1098/rstb.2008.0034

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Mullen PE. Schizophrenia violence: from correlations to preventive strategies. Adv Psychiatric Treatment. (2006) 12:239–248. doi: 10.1192/apt.12.4.239

CrossRef Full Text | Google Scholar

16. de Tribolet-Hardy F, Habermeyer E. Schizophrenic patients between general and forensic psychiatry. Front Public Health. (2016) 4:135. doi: 10.3389/fpubh.2016.00135

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Ghoreishi A, Kabootvand S, Zangani E, Bazargan-Hejazi S, Ahmadi A, Khazaie H. Prevalence attributes of criminality in patients with schizophrenia. J Injury Violence Res. (2015) 7:7. doi: 10.5249/jivr.v7i1.635

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Demirbuga S, Sahin E, Ozver I, Aliustaoglu S, Kandemir E, Varkal MD, et al. Facial emotion recognition in patients with violent schizophrenia. Schizoph Res. (2013) 144:142–5. doi: 10.1016/j.schres.2012.12.015

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Sedgwick O, Young S, Baumeister D, Greer B, Das M, Kumari V. Neuropsychology and emotion processing in violent individuals with antisocial personality disorder or schizophrenia: the same or different? A systematic review and meta-analysis Australian and New Zealand. J Psychiatry. (2017) 51:1178–97. doi: 10.1177/0004867417731525

CrossRef Full Text | Google Scholar

20. Walsh E, Buchanan A, Fahy T. Violence and schizophrenia: examining the evidence. Br J Psychiatry. (2002) 180:490–5. doi: 10.1192/bjp.180.6.490

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Coid J, Kahtan N, Gault S, Jarman B. Ethnic differences in admissions to secure forensic psychiatry services. Br J Psychiatry. (2000) 177:241–7. doi: 10.1192/bjp.177.3.241

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Swanson JW, Swartz MS, Elbogen EB. Effectiveness of atypical antipsychotic medications in reducing violent behavior among persons with schizophrenia in community-based treatment. Schizophrenia Bull. (2004) 30:3–20. doi: 10.1093/oxfordjournals.schbul.a007065

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Swanson JW, Swartz MS, Van Dorn RA, Elbogen EB, Wagner HR, Rosenheck RA, et al. A national study of violent behavior in persons with schizophrenia. Arch General Psychiatry. (2006) 63:490–9. doi: 10.1001/archpsyc.63.5.490

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Coid JW, Ullrich S, Kallis C, Keers R, Barker D, Cowden F, et al. The relationship between delusions and violence: findings from the East London first episode psychosis study. JAMA Psychiatry. (2013) 70:465–71. doi: 10.1001/jamapsychiatry.2013.12

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Fazel S, Gulati G, Linsell L, Geddes JR, Grann M. Schizophrenia and violence: systematic review and meta-analysis. PLoS Med. (2009) 6:e1000120. doi: 10.1371/journal.pmed.1000120

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Leshem R, van Lieshout PH, Ben-David S, Ben-David BM. Does emotion matter? The role of alexithymia in violent recidivism: A systematic literature review. Crim Behav Mental Health. (2019) 29:94–110. doi: 10.1002/cbm.2110

PubMed Abstract | CrossRef Full Text | Google Scholar

27. American Psychiatric Association. Diagnostic and Statistical Manual of Mental Disorders (DSM-5^®). American Psychiatric Pub (2013).

Google Scholar

28. Hooker C, Park S. Emotion processing and its relationship to social functioning in schizophrenia patients. Psychiatry Res. (2002) 112:41–50. doi: 10.1016/S0165-1781(02)00177-4

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Pinheiro AP, Del Re E, Mezin J, Nestor PG, Rauber A, McCarley RW, et al. Sensory-based and higher-order operations contribute to abnormal emotional prosody processing in schizophrenia: an electrophysiological investigation. Psychol Med. (2013) 43:603–18. doi: 10.1017/S003329171200133X

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Kohler CG, Turner TH, Bilker WB, Brensinger CM, Siegel SJ, Kanes SJ, et al. Facial emotion recognition in schizophrenia: intensity effects and error pattern. Am J Psychiatry. (2003) 160:1768–74. doi: 10.1176/appi.ajp.160.10.1768

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Weiss EM, Kohler CG, Nolan KA, Czobor P, Volavka J, Platt MM, et al. The relationship between history of violent and criminal behavior and recognition of facial expression of emotions in men with schizophrenia and schizoaffective disorder. Aggr Behav. (2006) 32:187–94. doi: 10.1002/ab.20120

CrossRef Full Text | Google Scholar

32. Ben-David BM, Ben-Itzchak E, Zukerman G, Yahav G, Icht M. The perception of emotions in spoken language in undergraduates with high functioning autism spectrum disorder: a preserved social skill. J Autism Dev Disorders. (2020) 50:741–56. doi: 10.1007/s10803-019-04297-2

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Ben-David BM, Thayapararajah A, van Lieshout PH. A resource of validated digital audio recordings to assess identification of emotion in spoken language after a brain injury. Brain Injury. (2013) 27:248–50. doi: 10.3109/02699052.2012.740648

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Kroner DG, Forth AE, Mills JF. Endorsement and processing of negative affect among violent psychopathic offenders. Personal Individ Differ. (2005) 38:413–23. doi: 10.1016/j.paid.2004.04.019

CrossRef Full Text | Google Scholar

35. Miller LA, Collins RL, Kent TA. Language and the modulation of impulsive aggression. J Neuropsych Clin Neurosci. (2008) 20:261–73. doi: 10.1176/jnp.2008.20.3.261

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Zucchelli MM, Ugazio Cognitive-emotional G, and inhibitory deficits as a window to moral decision-making difficulties related to exposure to violence. Front Psychol. (2019) 10:1427. doi: 10.3389/fpsyg.2019.01427

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Hudepohl MB, Robins DL, King TZ, Henrich CC. The role of emotion perception in adaptive functioning of people with autism spectrum disorders. Autism. (2015) 19:107–12. doi: 10.1177/1362361313512725

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Ben-David BM, Schneider BA. A sensory origin for aging effects in the color-word Stroop task: Simulating age-related changes in color-vision mimic age-related changes in Stroop. Aging Neuropsychol Cogn. (2010) 17:730–46. doi: 10.1080/13825585.2010.510553

PubMed Abstract | CrossRef Full Text

39. Ben-David BM, Schneider BA. A sensory origin for aging effects in the color-word Stroop task: an analysis of studies. Aging Neuropsychol Cogn. (2009) 16:505–34. doi: 10.1080/13825580902855862

CrossRef Full Text

40. Silverstein S, Keane BP, Blake R, Giersch A, Green M, Kéri S. Vision in schizophrenia: why it matters. Front Psychol. (2015) 6:41. doi: 10.3389/fpsyg.2015.00041

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Ben-David BM, Multani N, Shakuf V, Rudzicz F, van Lieshout PH. Prosody and semantics are separate but not separable channels in the perception of emotional speech: test for rating of emotions in speech. J Speech Lang Hear Res. (2016) 59:72–89. doi: 10.1044/2015_JSLHR-H-14-0323

PubMed Abstract | CrossRef Full Text | Google Scholar

42. Ben-David BM, Gal-Rosenblum S, van Lieshout PHHM, Shakuf V. Age-related differences in the perception of emotion in spoken language: the relative roles of prosody and semantics. J Speech Lang Hear Res. (2019) 62:1188–202. doi: 10.1044/2018_JSLHR-H-ASCC7-18-0166

PubMed Abstract | CrossRef Full Text | Google Scholar

43. Oron Y, Levy O, Avivi-Reich M, Shakuf V, Goldfarb A, Handzel O, et al. Tinnitus affects the relative roles of semantics and prosody in the perception of emotions in spoken language. Int J Audiol. (2020) 59:195–207. doi.org/10.1080/14992027.2019.1677952 doi: 10.1080/14992027.2019.1677952

PubMed Abstract | CrossRef Full Text | Google Scholar

44. Jacob H, Brück C, Plewnia C, Wildgruber D. Cerebral processing of prosodic emotional signals: evaluation of a network model using rTMS. PLoS ONE. (2014) 9:e105509. doi: 10.1371/journal.pone.0105509

PubMed Abstract | CrossRef Full Text | Google Scholar

45. Mehrabian A, Wiener M. Decoding of inconsistent communications. J Personal Soc Psychol. (1967) 6:109–14. doi: 10.1037/h0024532

PubMed Abstract | CrossRef Full Text | Google Scholar

46. Aleman A, Kahn RS. Strange feelings: do amygdala abnormalities dysregulate the emotional brain in schizophrenia? Progress Neurobiol. (2005) 77:283–98. doi: 10.1016/j.pneurobio.2005.11.005

PubMed Abstract | CrossRef Full Text | Google Scholar

47. Lin Y, Ding H, Zhang Y. Emotional prosody processing in schizophrenic patients: a selective review and meta-analysis. J Clin Med. (2018) 7:363. doi: 10.3390/jcm7100363

PubMed Abstract | CrossRef Full Text | Google Scholar

48. Bozikas VP, Kosmidis MH, Anezoulaki D, Giannakou M, Andreou C, Karavatos A. Impaired perception of affective prosody in schizophrenia. J Neuropsych Clin Neurosci. (2006) 18:81–5. doi: 10.1176/jnp.18.1.81

PubMed Abstract | CrossRef Full Text | Google Scholar

49. Lado-Codesido M, Perez CM, Mateos R, Olivares JM, Caballero AG. Improving emotion recognition in schizophrenia with “VOICES”: an on-line prosodic self-training. PLoS ONE. (2019) 14:e0210816. doi: 10.1371/journal.pone.0210816

PubMed Abstract | CrossRef Full Text | Google Scholar

50. Javitt DC, Liederman E, Cienfuegos A, Shelley AM. Panmodal processing imprecision as a basis for dysfunction of transient memory storage systems in schizophrenia. Schizophr Bull. (1999) 25:763–75. doi: 10.1093/oxfordjournals.schbul.a033417

PubMed Abstract | CrossRef Full Text | Google Scholar

51. Rabinowicz EF, Silipo G, Goldman R, Javitt DC. Auditory sensory dysfunction in schizophrenia: imprecision or distractibility? Arch Gen Psychiatry. (2000) 57:1149–55. doi: 10.1001/archpsyc.57.12.1149

PubMed Abstract | CrossRef Full Text | Google Scholar

52. Strous RD, Cowan N, Ritter W, Javitt DC. Auditory sensory (“echoic”) memory dysfunction in schizophrenia. Am J Psychiatry. (1995) 152:1517–19. doi: 10.1176/ajp.152.10.1517

PubMed Abstract | CrossRef Full Text | Google Scholar

53. Roux P, Christophe A, Passerieux C. The emotional paradox: dissociation between explicit and implicit processing of emotional prosody in schizophrenia. Neuropsychologia. (2010) 48:3642–9. doi: 10.1016/j.neuropsychologia.2010.08.021

PubMed Abstract | CrossRef Full Text | Google Scholar

54. Condray R, Steinhauer SR, van Kammen DP, Kasparek A. The language system in schizophrenia: effects of capacity and linguistic structure. Schizophr Bull. (2002) 28:475–90. doi: 10.1093/oxfordjournals.schbul.a006955

PubMed Abstract | CrossRef Full Text | Google Scholar

55. Scholten MRM, Aleman A, Kahn RS. The processing of emotional prosody and semantics in schizophrenia: relationship to gender and IQ. Psychol Med. (2008) 38:887–98. doi: 10.1017/S0033291707001742

PubMed Abstract | CrossRef Full Text | Google Scholar

56. Frommann N, Stroth S, Brinkmeyer J, Wölwer W, Luckhaus C. Facial affect recognition performance and event-related potentials in violent and non-violent schizophrenia patients. Neuropsychobiology. (2013) 68:139–45. doi: 10.1159/000353252

PubMed Abstract | CrossRef Full Text | Google Scholar

57. Poole JH, Tobias FC, Vinogradov S. The functional relevance of affect recognition errors in schizophrenia. J Int Neuropsychol Soc. (2000) 6:649–58. doi: 10.1017/S135561770066602X

PubMed Abstract | CrossRef Full Text | Google Scholar

58. Hoaken PN, Allaby DB, Earle J. Executive cognitive functioning and the recognition of facial expressions of emotion in incarcerated violent offenders, non-violent offenders, and controls. Aggr Behav. (2007) 33:412–21. doi: 10.1002/ab.20194

PubMed Abstract | CrossRef Full Text | Google Scholar

59. Tang DY, Liu AC, Lui SS, Lam BY, Siu BW, Lee TM, et al. Facial emotion perception impairments in schizophrenia patients with comorbid antisocial personality disorder. Psych Res. (2016) 236:22–7. doi: 10.1016/j.psychres.2016.01.005

PubMed Abstract | CrossRef Full Text | Google Scholar

60. Wolfkühler W, Majorek K, Tas C, Küper C, Saimed N, Juckel G, et al. Emotion recognition in pictures of facial affect: Is there a difference between forensic and non-forensic patients with schizophrenia? Europ J Psychiatry. (2012) 26:73–85. doi: 10.4321/S0213-61632012000200001

CrossRef Full Text | Google Scholar

61. Barch DM, Carter CS, Hachten PC, Usher M, Cohen JD. The “benefits” of distractibility: mechanisms underlying increased Stroop effects in schizophrenia. Schizophr Bull. (1999) 25:749–62. doi: 10.1093/oxfordjournals.schbul.a033416

PubMed Abstract | CrossRef Full Text | Google Scholar

62. Henik A, Salo R. Schizophrenia and the stroop effect. Behav Cogn Neurosci Rev. (2004) 3:42–59. doi: 10.1177/1534582304263252

PubMed Abstract | CrossRef Full Text | Google Scholar

63. Perlstein WM, Carter CS, Barch DM, Baird JW. The Stroop task and attention deficits in schizophrenia: a critical evaluation of card and single-trial Stroop methodologies. Neuropsychology. (1998) 12:414–25. doi: 10.1037/0894-4105.12.3.414

PubMed Abstract | CrossRef Full Text | Google Scholar

64. Ben-David BM, Nguyen LL, van Lieshout PH. Stroop effects in persons with traumatic brain injury: Selective attention, speed of processing, or color-naming? A meta-analysis. J Int Neuropsychol Soc. (2011) 17:354–63. doi: 10.1017/S135561771000175X

PubMed Abstract | CrossRef Full Text | Google Scholar

65. Ben-David BM, Tewari A, Shakuf V, Van Lieshout PH. Stroop effects in Alzheimer's disease: selective attention speed of processing, or color-naming? A meta-analysis. J Alzheimer's Dis. (2014) 38:923–38. doi: 10.3233/JAD-131244

PubMed Abstract | CrossRef Full Text | Google Scholar

66. de Gelder B, Vroomen J, de Jong SJ, Masthoff ED, Trompenaars FJ, Hodiamont P. Multisensory integration of emotional faces and voices in schizophrenics. Schizophr Res. (2005) 72:195–203. doi: 10.1016/j.schres.2004.02.013

PubMed Abstract | CrossRef Full Text | Google Scholar

67. de Jong JJ, Hodiamont PP, Van den Stock J, de Gelder. B. Audiovisual emotion recognition in schizophrenia: reduced integration of facial and vocal affect. Schizophr Res. (2009) 107:286–93. doi: 10.1016/j.schres.2008.10.001

PubMed Abstract | CrossRef Full Text | Google Scholar

68. Zvyagintsev M, Parisi C, Chechko N, Nikolaev AR, Mathiak Attention K, and multisensory integration of emotions in schizophrenia. Front Hum Neurosci. (2013) 7:674. doi: 10.3389/fnhum.2013.00674

PubMed Abstract | CrossRef Full Text | Google Scholar

69. Leshem R. Using dual process models to examine impulsivity throughout neural maturation. Dev Neuropsychol. (2016) 41:125–43. doi: 10.1080/87565641.2016.1178266

PubMed Abstract | CrossRef Full Text | Google Scholar

70. Riggs NR, Shin HS, Unger JB, Spruijt-Metz D, Pentz MA. Prospective associations between bilingualism and executive function in Latino children: Sustained effects while controlling for biculturalism. J Immigr Minority Health. (2014) 16:914–21. doi: 10.1007/s10903-013-9838-0

PubMed Abstract | CrossRef Full Text | Google Scholar

71. Miyake A, Friedman NP, Emerson MJ, Witzki AH, Howerter A, Wager TD. The unity and diversity of executive functions and their contributions to complex “frontal lobe” tasks: a latent variable analysis. Cogn Psychol. (2000) 41:49–100. doi: 10.1006/cogp.1999.0734

PubMed Abstract | CrossRef Full Text | Google Scholar

72. Nazmie IF, Nebi MR, Zylfije H, Bekim H. Poor executive functioning associated with the risk of aggressive behavior recidivism in the forensic community in schizophrenic patients. Int J Biomed Sci. (2013) 3:94–99.

Google Scholar

73. Pourtois G, Dhar M. Integration of face and voice during emotion perception: is there anything gained for the perceptual system beyond stimulus modality redundancy? In: Integrating Face and Voice in Person Perception. New York, NY: Springer (2013). p. 181–206. doi: 10.1007/978-1-4614-3585-3_10

CrossRef Full Text | Google Scholar

74. Ross LA, Saint-Amour D, Leavitt VM, Molholm S, Javitt DC, Foxe JJ. Impaired multisensory processing in schizophrenia: deficits in the visual enhancement of speech comprehension under noisy environmental conditions. Schizophr Res. (2007) 97:173–83. doi: 10.1016/j.schres.2007.08.008

PubMed Abstract | CrossRef Full Text | Google Scholar

75. Szycik GR, Münte TF, Dillo W, Mohammadi B, Samii A, Emrich HM, et al. Audiovisual integration of speech is disturbed in schizophrenia: an fMRI study. Schizophr Res. (2009) 110:111–8. doi: 10.1016/j.schres.2009.03.003

PubMed Abstract | CrossRef Full Text | Google Scholar

76. Williams LE, Light GA, Braff DL, Ramachandran VS. Reduced multisensory integration in patients with schizophrenia on a target detection task. Neuropsychologia. (2010) 48:3128–36. doi: 10.1016/j.neuropsychologia.2010.06.028

PubMed Abstract | CrossRef Full Text | Google Scholar

77. Nitsan G, Wingfield A, Lavie L, Ben-David BM. Differences in working memory capacity affect online spoken word recognition: evidence from eye-movements. Invited Paper Trends Hear. (2019) 23:1–12. doi: 10.1177/2331216519839624

PubMed Abstract | CrossRef Full Text | Google Scholar

78. Shakuf V, Gal-Resenbaum S, Ben-David BM. The psychophysics of aging. In emotional speech, older adults attend to semantic, while younger adults to the prosody. In: Skotnikova I, Korolkova O, Blinnikova I, Doubrovski V, Shendyapin V, Volkova N, editor. Fechner Day 2016. Moscow: International Society for Psychophysics (2016). p. 89.

Google Scholar

79. Ben-David BM, van Lieshout PH, Leszcz T. A resource of validated affective and neutral sentences to assess identification of emotion in spoken language after a brain injury. Brain Injury. (2011) 25:206–20. doi: 10.3109/02699052.2010.536197

PubMed Abstract | CrossRef Full Text | Google Scholar

80. Silver H, Goodman C, Knoll G, Isakov V, Modai I. Schizophrenia patients with a history of severe violence differ from nonviolent schizophrenia patients in perception of emotions but not cognitive function. J Clin Psychiatry. (2005) 66:300–8. doi: 10.4088/JCP.v66n0305

PubMed Abstract | CrossRef Full Text | Google Scholar

81. Ben-David BM, Chajut E, Algom D. The pale shades of emotion: A signal detection theory analysis of the emotional Stroop task. Psychology. (2012) 3:537. doi: 10.4236/psych.2012.37079

CrossRef Full Text | Google Scholar

82. Nasby W, Hayden B, DePaulo BM. Attributional bias among aggressive boys to interpret unambiguous social stimuli as displays of hostility. J Abnormal Psychol. (1980) 89:459. doi: 10.1037/0021-843X.89.3.459

PubMed Abstract | CrossRef Full Text | Google Scholar

83. Harris ST, Oakley C, Picchioni MM. A systematic review of the association between attributional bias/interpersonal style, and violence in schizophrenia/psychosis. Aggr Violent Behav. (2014) 19:235–41. doi: 10.1016/j.avb.2014.04.009

CrossRef Full Text | Google Scholar

84. Pinheiro AP, Rezaii N, Rauber A, Liu T, Nestor PG, McCarley RW, et al. Abnormalities in the processing of emotional prosody from single words in schizophrenia. Schizophr Res. (2014) 152:235–41. doi: 10.1016/j.schres.2013.10.042

PubMed Abstract | CrossRef Full Text | Google Scholar

85. Ben-David BM, Algom D. Species of redundancy in visual target detection. J Exp Psychol. (2009) 35:958–76. doi: 10.1037/a0014511

PubMed Abstract | CrossRef Full Text | Google Scholar

86. Ben-David BM, Eidels A, Donkin C. Effects of aging and distractors on detection of redundant visual targets and capacity: do older adults integrate visual targets differently than younger adults? PLoS ONE. (2014) 9:1–29. doi: 10.1371/journal.pone.0113551

PubMed Abstract | CrossRef Full Text | Google Scholar

87. Klumpp H, Keller J, Miller GA, Casas BR, Best JL, Deldin PJ. Semantic processing of emotional words in depression and schizophrenia. Int J Psychophysiol. (2010) 75:211–5. doi: 10.1016/j.ijpsycho.2009.12.004

PubMed Abstract | CrossRef Full Text | Google Scholar

88. Gur RE, Kohler CG, Ragland JD, Siegel SJ, Lesko K, Bilker WB, et al. Flat affect in schizophrenia: relation to emotion processing and neurocognitive measures. Schizophrenia Bull. (2006) 32:279–87. doi: 10.1093/schbul/sbj041

PubMed Abstract | CrossRef Full Text | Google Scholar

89. Kohen D. (Ed.). Oxford Textbook of Women and Mental Health. Oxford: Oxford University Press. (2010). doi: 10.1093/med/9780199214365.001.0001

CrossRef Full Text

90. Ishii K, Reyes JA, Kitayama S. Spontaneous attention to word content versus emotional tone: differences among three cultures. Psychol Sci. (2003) 14:39–46. doi: 10.1111/1467-9280.01416

PubMed Abstract | CrossRef Full Text | Google Scholar

91. Kitayama S, Ishii Word K, and voice: Spontaneous attention to emotional utterances in two languages. Cogn Emotion. (2002) 16:29–59. doi: 10.1080/0269993943000121

CrossRef Full Text | Google Scholar

92. Icht M, Ben-David BM. Oral-diadochokinesis rates across languages: English and Hebrew norms. J Commun Disorders. (2014) 48:27–37. doi: 10.1016/j.jcomdis.2014.02.002

PubMed Abstract | CrossRef Full Text | Google Scholar

93. Kurtz MM, Richardson CL. Social cognitive training for schizophrenia: a meta-analytic investigation of controlled research. Schizophr Bull. (2012) 38:1092–104. doi: 10.1093/schbul/sbr036

PubMed Abstract | CrossRef Full Text | Google Scholar

94. Tan BL, Lee SA, Lee J. Social cognitive interventions for people with schizophrenia: a systematic review. Asian J Psychiatry. (2018) 35:115–31. doi: 10.1016/j.ajp.2016.06.013

PubMed Abstract | CrossRef Full Text | Google Scholar

95. Ben-David BM, Mentzel M, Icht M, Gilad M, Dor YI, Ben-David S, et al. Challenges and opportunities for telehealth assessment during COVID-19: iT-RES, adapting a remote version of the Test for Rating Emotions in Speech. Int J Audiol. (2020). doi: 10.1080/14992027.2020.1833255

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: forensic patients with schizophrenia, emotions, speech processing, selective attention, prosody, cognition

Citation: Leshem R, Icht M, Bentzur R and Ben-David BM (2020) Processing of Emotions in Speech in Forensic Patients With Schizophrenia: Impairments in Identification, Selective Attention, and Integration of Speech Channels. Front. Psychiatry 11:601763. doi: 10.3389/fpsyt.2020.601763

Received: 01 September 2020; Accepted: 16 October 2020;
Published: 13 November 2020.

Edited by:

Athanassios Douzenis, National and Kapodistrian University of Athens, Greece

Reviewed by:

Sudarsana Reddy Kadiri, Aalto University, Finland
Marije E. Keulen-de Vos, Forensic Psychiatric Center (FPC), Netherlands

Copyright © 2020 Leshem, Icht, Bentzur and Ben-David. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Boaz M. Ben-David, boaz.ben.david@idc.ac.il

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.