Angry, old, male – and trustworthy? How expressive and person voice characteristics shape listener trust

This study examined how trustworthiness impressions depend on vocal expressive and person characteristics and how their dependence may be explained by acoustical profiles. Sentences spoken in a range of emotional and conversational expressions by 20 speakers differing in age and sex were presented to 80 age and sex matched listeners who rated speaker trustworthiness. Positive speaker valence but not arousal consistently predicted greater perceived trustworthiness. Additionally, voices from younger as compared with older and female as compared with male speakers were judged more trustworthy. Acoustic analysis highlighted several parameters as relevant for differentiating trustworthiness ratings and showed that effects largely overlapped with those for speaker valence and age, but not sex. Specifically, a fast speech rate, a low harmonic-to-noise ratio, and a low fundamental frequency mean and standard deviation differentiated trustworthy from untrustworthy, positive from negative, and younger from older voices. Male and female voices differed in other ways. Together, these results show that a speaker’s expressive as well as person characteristics shape trustworthiness impressions and that their effect likely results from a combination of low-level perceptual and higher-order conceptual processes.


Introduction
Trust is essential for human cooperation to emerge both at a small scale, such as among family members and friends, and at a larger scale, such as among groups within a society. Thus, recognizing trustworthy partners who serve not only themselves but are likely to engage in reciprocity is of individual and communal benefit. Although it is well known that humans aim to maximize this benefit by gauging the looks and behaviors of others [1,2], the underlying mechanisms are still poorly understood. Moreover, it remains unclear whether and how vocal signals are relevant. Here, we sought to better understand their role and to explore what aspects of a speaker's voice bias listener trust. Specifically, we investigated how expressed affect and speaker person characteristics (i.e., age, sex) shape trustworthiness perceptions and examined the acoustic patterns that explain their influence. a1111111111 a1111111111 a1111111111 a1111111111 a1111111111 The vocal apparatus, that is the instrument from which the human voice emanates, is a complex system comprising a wind generator-the lungs, a string-like vibrator-the vocal folds, and a sequence of resonance chambers such as the pharynx, mouth and nasal cavities [3]. Relatively fixed structural aspects (e.g., vocal tract length, vocal fold size) as well as dynamic bodily activity (e.g., breathing, muscle tension) modulate the workings of the vocal apparatus and thus how the voice sounds. Resulting inter-and intra-individual differences can be quantified by a range of acoustic measures, also referred to as acoustic parameters, that describe a sound's frequency, intensity, and duration characteristics.
A substantial body of research has demonstrated statistical relationships between certain acoustic profiles and the affective state that listeners attribute to a speaker [4][5][6]. For example, fundamental frequency (F0), which is perceived as pitch or voice melody, is higher for elation as compared with other emotions such as anger and sadness [4]. Additionally, there is evidence that voice acoustics predict listener impressions of more stable person characteristics [7][8][9][10][11][12][13][14] including the extent to which a speaker appears trustworthy. Specifically, a lower F0 mean in speech has been repeatedly linked to increased perceived kindness or trustworthiness [8,[15][16][17][18]. Additionally, it has been shown that utterances rated as high and low on trustworthiness differ in F0 contour [19]. For example, in one study highly trustworthy pronunciations of the word "hello" were characterized by a rising F0 from the first to the second syllable. Moreover, when applied to other words, this rising F0 contour was found to influence listeners in choosing the more trustworthy of two voices [20].
However, not all research corroborates these observations and attests to a simple relationship between trustworthiness and F0. For example, in some hands F0 mean correlated positively with rated trustworthiness [20] and in other hands it was irrelevant [21]. Moreover, the direction of the F0 effect was shown to depend on context or the framing of the judgement task [22][23][24][25]. For example, in economic or mating-related contexts, voices with a higher F0 seemed more trustworthy than voices with a lower F0 irrespective of speaker sex. However, when probed more generally, impressions of trustworthiness appeared to differ by sex with low F0 female and high F0 male voices being preferred [23].
Although past investigations of voice acoustics offered insights into the mechanisms underpinning vocal trustworthiness, more research is needed on how acoustically derived information about the speaker is relevant. Specifically, it is of interest to determine the extent to which expressive and person characteristics, and thus potentially higher-order conceptual processes, shape trustworthiness impressions. Evidence that they might comes from the face processing literature. Research on dynamic face characteristics indicates that positive or happy expressions are rated as more trustworthy than neutral expressions, which are perceived as more trustworthy than negative expressions such as anger [26,27]. The role of stable face characteristics has been explored by manipulating person characteristics such as sex and age. Although women consistently seem more trustworthy than men [28,29], results for age are mixed. In the context of a trust game, participants expressed greater trust towards older as compared with younger individuals [30], yet their trustworthiness ratings did not map onto actual behavior in the game. Moreover, other work assessing trustworthiness impressions found them to be independent of age [31] or to favour younger over older adults [32].
Effects of both speaker expressive and person characteristics on perceived trustworthiness have been explained by the "emotion over-generalization" hypothesis, first conceived as metaphorical generalization by Secord [33,34]. According to this framework, emotional expressions are important for signalling probable behaviors and because positive affect is believed to increase the likelihood of a benevolent intent it purportedly increases trust. Additionally, the similarity between emotional expressions and person characteristics is thought to explain why the latter are co-opted when judging whom to trust [35][36][37]. For example, average female faces have thinner inner eyebrows and rounder jaws than average male faces and this, among other features, boosts perceptions of positive affect or surprise and therefore, so the hypothesis suggests, of trustworthiness. Support for this, albeit not entirely consistent [38], comes from a number of studies on faces [37,39].
To date, only a couple of studies considered expressive and person characteristics in the context of vocal first impressions [10,40,41] and none, that we could identify, examined trustworthiness. Hence, we sought to address this issue. We invited 20 speakers, differing in age and sex, to produce two sentences in accordance with specific expressive instructions (see below). Each expression was subjected to acoustic analysis and to a trustworthiness rating by a group of 80 listeners. Statistical analyses tested the following three predictions. (1) Expression valence was expected to correlate with perceived trustworthiness. As shown for faces, positive voices should be rated as more trustworthy than negative voices. We also pursued a possible, and not previously delineated, relation between the arousal and trustworthiness of vocal expressions. Besides valence, arousal, or a person's level of energy, forms a major dimension in a basic affective space [42,43] and serves as a relevant cue to behavioral intentions. It may, therefore, also play a role in biasing trust. (2) A second hypothesis was that basic person characteristics like sex and age would modulate perceived trustworthiness. Again, this prediction derived from previous work on faces showing that person characteristics bias trust preferences [28,30]. In line with extant findings, we expected female voices to receive higher trustworthiness ratings than male voices. However, due to inconsistent results no directional predictions could be made for age effects. (3) Last, and in line with the emotion over-generalization hypothesis, we expected that the voice acoustics predicting higher perceived trustworthiness should also be the voice acoustics characterizing the expressive and person characteristics that are most trusted. In other words, if a low F0 mean is associated with high perceived trustworthiness, then the affective expression, sex, and age receiving the highest trustworthiness ratings should have a lower F0 mean than the affective expression, sex, and age receiving the lowest trustworthiness ratings.

Materials and methods
This research was approved by the Institutional Review Board of the National University of Singapore and conducted following established guidelines. Participants were informed about the study procedure and signed a consent form before commencing their participation.

Participants
This study was conducted at the National University of Singapore and approved by its ethics review board. Singapore has a dominantly Chinese population and uses English as its official language. Participants of this study were Singaporean residents who spoke English as their native language. Younger participants comprised 20 women with a mean age of 21.1 years (SD 2.05, [19][20][21][22][23][24][25][26][27] and 20 men with a mean age of 23.7 years (SD 3.3, 20-32). Older participants comprised 45 individuals of which five were excluded from data analysis because they could not follow task instructions (N = 2), because they could not hear the materials even after adjusting the volume (N = 1), or because they failed to complete the full session (N = 2). This left 20 older women with a mean age of 68 years (SD 5, 60-77) and 20 older men with mean age of 67.9 years (SD 7.15, 60-91). An ANOVA with the participants' age as the dependent variable and Sex and Age Group as the independent variables revealed the expected Age Group main effect (F[1,75] = 1768.45, p < .0001) with the Sex effect (p = .27) and the interaction between Sex and Age Group (p = .209) being non-significant. Older participants completed the Mini Mental State Examination (MMSE) [44] and a hearing threshold test that measured age-related decline in cognitive functioning and hearing, respectively. Older women scored an average of 29.2 points (SD 1.58) on the MMSE (normal range [24][25][26][27][28][29][30] and had a mean hearing threshold of 32.8 dB (SD 7.47). Older men scored an average of 29.4 (SD 0.9) on the MMSE and had a mean hearing threshold of 38.5 dB (SD 10). Hearing thresholds were determined by calculating mean hearing thresholds across both ears, and across all frequencies measured, i.e. 1000 Hz, 2000 Hz, 4000 Hz, 750 Hz, 500 Hz, 250 Hz, and 125 Hz. According to WHO standards (http://goo.gl/buEL92), two elderly participants had normal hearing (� 25dB), 28 had a slight hearing impairment (26 to 40dB), 9 had a moderate impairment (41 to 60dB) and one had a severe impairment (61 to 80dB) that was corrected with a hearing aid. To compensate for possible hearing difficulties, we asked participants to adjust the volume of sound presentations to a comfortable level.

Stimuli
The stimuli used in this study were recorded from 20 Singaporean native English speakers with acting experience and are available on the Open Science Framework (https://osf.io/j3hfg/ ?view_only=77db09592fe44a79837f61ba8146fdb8). Speakers included five young women with a mean age of 22.2 years (range 21-24), five young men with a mean age of 23.8 years (range [23][24][25], five older women with a mean age of 69.2 years (range 62-85), and five older men with a mean age of 63 years (range 45-79). The older individuals were recruited from a local amateur-acting association, whereas the younger individuals were recruited through campus advertising. Acting experience ranged from 1 to 30 years (mean 5.75, SD 3.8) for the older speakers and from 1 to 6 years (mean 2.5, SD 1.65) for the younger speakers.
Speakers attended individual recording sessions. They were given two English sentences ("The politician faced his audience.", "The message reached its target.") that the authors of this study deemed to be affectively neutral but that, depending on the context, could be potentially relevant for a range of affective experiences. They were asked to produce these sentences with a range of expressive intentions. These expressive intentions were introduced to generate utterances with varying intonations and were deemed potentially relevant for modulating perceived trustworthiness by the authors of this manuscript. The specific voice conditions included six emotional expressions (i.e., content, happy, proud, afraid, angry, and sad) and four conversational expressions (i.e., confident, stating, doubtful, and questioning). Additionally, we asked speakers to attempt to sound trustworthy, untrustworthy, and neutral. For each expressive intention, speakers were given an expression label along with a short sentence describing a scenario for them to interpret vocally (e.g., trustworthy-You are talking to a stranger whom you wish to trust in you.). Speakers were asked to warm-up their voice by reading the sentences aloud neutrally. Subsequently, they could tackle the individual expressive intentions in their preferred order and repeat attempts until they were satisfied with their portrayal. Once they were satisfied with their portrayal, recordings were made with a few repetitions from which we later picked the clearest exemplar for each speaker, expressive intention, and sentence (e.g., no microphone artefacts, clear pronunciation, identifiable expressive intention). Recordings were conducted in a sound attenuated chamber and digitized at a 16 bit/44.1 kHz sampling rate. They were processed off-line using Adobe Audition 2.0. Specifically, individual sentences were cut into separate files and their intensity was normalized at the root-mean-square value.
In a preliminary rating study, 30 individuals (15 women, average age of 22 years (SD 3.77)) listened to each expression (2 sentences x 13 expressions x 20 speakers = 520) and rated its perceived valence and arousal on 7-point scales ranging from very negative to very positive and from very calm to very aroused. Valence and arousal ratings were subjected to separate ANO-VAs with Expression (13 levels) as a repeated measures factor. This produced a significant Using Praat [45], we analysed the acoustic stimulus properties. Specifically, we measured eight parameters including sound duration (i.e., duration of the entire utterance), which may be considered a measure of speech rate as the number of syllables was kept constant across speakers and expressions (10 and 7 for sentences 1 and 2, respectively). The other parameters comprised harmonic-to-noise ratio (HNR; the periodicity of the signal; Harmonicity settings = 0.01/75/0.1/4. and F0 standard deviation, intensity range (Intensity settings = 75/0/yes, Minimum Pitch = 75 Hz, Time Step = 0), jitter (i.e., short-term variability/perturbations in F0; Voice report settings = 0/0/75/600/1.3/1.6/0.03/0.45), and shimmer (i.e., short-term variability/perturbations in voice intensity; Voice report settings = 0/0/75/600/1.3/1.6/0.03/0. 45).
Measurement values of all parameters were normalized (mean 0, SD 1) and subjected to a series of cumulative link mixed models (i.e., ordered logit regression models) using the clmm function from the Ordinal package (Christensen, 2018) in R (R Core Team, 2015) with its default settings. In a first set of models, valence was the dependent variable, the individual parameters were modelled as fixed main effect and the intercepts of Rater and Speaker were modelled as random effects. For valence, we observed, in order of effect size, a negative relationship with Duration (β = -0.514, SE = 0.018, Z = -28. Last we explored whether and how the two sex and age groups differed on the six acoustic parameters explored here. To this end, we fitted six linear mixed effect models using the lme4 package [46] in R [47]. An acoustic parameter served as the dependent variable, Speaker Age and Speaker Sex served as fixed main effects (in consideration of group size, the interaction between Speaker Age and Speaker Sex was not modelled), and the intercepts of Speaker and Expression served as random effects.

Procedure
After giving informed consent, participants provided their demographic information and received instructions from the experimenter. Specifically, they were informed that they would hear a series of utterances (2 sentences x 13 expressions x 20 speakers = 520) over headphones and should judge how trustworthy or untrustworthy the speaker sounded using a 7-point scale ranging from 1 (very untrustworthy) to 7 (very trustworthy). This rating scale was shown on screen following each sentence until participants selected a number and submitted their selection by pressing the enter key on a standard keyboard. The next trial started after 0.5 seconds. Each participant heard all utterances in random order.

Listener agreement
The data are available on the Open Science Framework (https://osf.io/j3hfg/?view_only= 77db09592fe44a79837f61ba8146fdb8). We computed intraclass correlation coefficients (ICC) to determine listener agreement. To this end, we used the icc function from the irr package [48] in R [47] to examine two-way consistency across raters. This revealed that there was high agreement among participants in how they evaluated the voices (ICC(C,80) = .967).

Trustworthiness and speaker affective expression
First, we examined the relationship between perceived trustworthiness-our ordinal dependent variable-and the affective rating of each item averaged across all raters (see Methods). Specifically, we fitted a cumulative link mixed model (i.e., ordered logit regression model) using the clmm function in the Ordinal package [49] in R [47]. Our model included Trustworthiness as the dependent variable, either normalized Valence or Arousal (normed item values, mean 0, SD 1) as the fixed effect and the intercepts of Listener (40 levels) and Speaker (20 levels) as random effects. This analysis was conducted for young listeners only because only young listeners participated in the preliminary rating study from which we derived affective item norms. As

Trustworthiness and person characteristics
The relationship between perceived trustworthiness and the person characteristics of speakers and listeners is illustrated in Fig 2 (bottom row). Trustworthiness ratings were again analysed using a CLM model with Listener Age, Listener Sex, Speaker Age and Speaker Sex as fixed effects. We modelled all main effects as well as the interactions between Listener Age and Speaker Age and between Listener Sex and Speaker Sex. We refrained from modelling other interactions because they were not relevant for our hypotheses and because we wished to keep the speaker number contributing to each effect at an acceptable level (N�10). The intercepts of Listener (80 levels) and Speaker (20 levels) were random effects.
As expected, there were significant effects of Speaker Age (β = 0.596 SE = 0.141, Z = 4.21, p < .0001) and Speaker Sex (β = -0.349, SE = 0.141, Z = -2.47, p = .013) pointing to a role of these variables in modulating listener trust. Trustworthiness ratings were higher for younger than for older voices and for female than for male voices. Additionally, there was a marginal Listener Age effect indicating that older listeners tended to give higher trustworthiness ratings than younger listeners (β = 0.388, SE = 0.204, Z = 1.9, p = .057). The Listener Sex effect was non-significant (β = 0.147, SE = 0.72, Z = 0.72, p = .472). Of the interactions, the one involving Listener Sex and Speaker Sex was significant (β = 0.111, SE = 0.035, Z = 3.21, p = .001). Follow-up analyses, however, suggested that the trust advantage for female over male voices was

Acoustic features of trustworthiness
As for the stimulus valence/arousal analyses reported in the methods section, we entered the normalized acoustic parameters into separate CLM model as the fixed main effect. Trustworthiness ratings served as the dependent variable and the intercepts of Listener and Speaker

Discussion
This study examined whether and in what way a speaker's expressive and person characteristics shape listener trust.
Looking at expressive characteristics, we found a positive relationship between the valence of vocal expressions and rated trustworthiness. More positive voices were associated with increased perceived trustworthiness across our two stimulus sentences and across all speakers for the range of expressions that each speaker produced (Fig 2). Thus, we successfully replicate a phenomenon previously reported for faces [26,27]. Moreover, by providing an effect size estimate of 0.56 units on a 7-point rating scale, we highlight a potential relevance for this phenomenon in everyday life.
Notably, the arousal of vocal expressions predicted perceived trustworthiness inconsistently with a positive relation for one ("The politician faced his audience.") and a negative relation for the other ("The message reached its target.") sentence suggesting substantial dependence on verbal context and potential subtle differences in affective biases between the sentences. Moreover, a larger intensity range was, of all acoustic parameters, most strongly associated with arousal but had the smallest predictive value for the trustworthiness scores. Thus, it seems that the role of arousal in vocal trust may be negligible. Moreover, although valence and arousal conceptually overlap as indicated by a large correlation of rating results, they clearly dissociate in their significance for emerging first impressions.
The present study examined the role of person characteristics for trust from voices by looking at speaker sex and age effects. Extending evidence from faces [28,29], vocal recordings of women were rated as more trustworthy than those of men. Although this pattern showed consistently across the two sentences used here (betas = -.43, -.28), it was significant for only one of them, that is when speakers said "The politician faced his audience." but not when they said "The message reached its target.". Thus, although the female trustworthiness advantage may be a fairly robust phenomenon, its presence is moderated by other variables. For example, its strength may change as a function of whether trust concerns truthfulness or capability. In line with this, prevailing stereotypes characterize women as warmer but less competent than men [50].
Unlike our speaker sex effect, our speaker age effect showed consistently across the two sentences. Additionally, it was larger (β = 0.6) suggesting a greater relevance of age than sex in everyday trust. The age effect direction was both in conflict and in agreement with past research, which has provided a mixed pattern so far. Specifically, it was in conflict with work by Bailey and colleagues who found that older adult faces elicited higher trustworthiness ratings than younger adult faces [30]. However, those ratings were obtained after participants engaged in a trust-game with older and younger players and may, hence, have been context specific. Moreover, the age of the stimulus faces was left unspecified and may have been closer to middle age than to old age. This possibility agrees with research on political voting showing a preference for candidates in their 40s and 50s over younger and older candidates [15].
The present results corroborate evidence that younger faces appear more trustworthy than older faces [32] and that perceived vocal babyishness positively predicts ratings of warmth, honesty, and kindness [40]. Additionally, they converge with a more general age bias and a "youth ideal" that appeals on multiple levels and that readily engages processing resources. For example, research has shown that compared to expressions from older adults, expressions from younger adults more readily capture attention [51] and evoke better nonverbal perception [6]. Additionally, in surveys, younger individuals are seen as having better interpersonal skills as well as greater potential for professional development [52].
By examining both younger and older, female and male listeners, the present study also explored a potential role of peer status for perceived trustworthiness. Previous research identified own-age and own-sex preferences. Interpersonal attitudes have been characterized as more positive towards own as compared to other age individuals [52,53]. Additionally, children develop an own-sex bias that weakens after adolescence, but nevertheless persists into adulthood [54]. Together, this evidence raises the possibility that own-age and own-sex individuals are perceived as more trustworthy than those of another age and sex.
In line with prior work on facial age [31], however, the present study rejects this possibility in the context of vocal age. Younger and older listeners showed a similar age bias suggesting that youth is more relevant than peer status when it comes to trust. Yet, there was some indication for a role of peer effects in modulating sex-based preferences. Specifically, the preference for female over male voices tended to be larger in women than in men. Although awaiting replication, this tendency agrees with a recent study presenting untrustworthy, neutral, and trustworthy female and male faces [55]. Compared to men, women rated trustworthy faces as more trustworthy especially when they were female.
Both speaker expressive and person characteristics shaped voice acoustics and, through this, biased perceptions of trustworthiness. To see how, we measured voice acoustics and linked them to both the trustworthiness ratings and to the speaker variables (i.e., expression, age, sex). In doing so, we followed previous work on emotion expression [4] and examined a range of parameters describing frequency, intensity, and temporal sound properties.
Replicating earlier results [15][16][17][18], F0 mean negatively predicted perceived trustworthiness. Although this relationship has not been shown consistently [20,21], the present demonstration across a substantial speaker range gives additional credence. Apart from F0 mean, other parameters emerged as relevant. Specifically, ratings of trustworthiness were significantly axes). The individual points in the scatter plots represent the 13 expressions for the 2 sentences produced by each speaker. These points are summarized by a regression line for each speaker. The error bars in the bar-graphs represent the 95% confidence intervals. https://doi.org/10.1371/journal.pone.0210555.g003 Voice characteristics shape listener trust higher for vocalizations with a shorter duration, a lower HNR, a smaller F0 SD, more shimmer, and-but for one sentence only-more jitter. Interestingly, duration (β = -.61) and HNR (β = -.43) turned out to be the strongest predictors with an effect size larger than that of other parameters including F0 mean (β = -.4), which has been the focus of research so far.
Looking at the acoustics of speaker affective expression, we found both overlap and dissociation with those of trustworthiness. Like trustworthy vocalizations, more positive vocalizations were associated with a faster speech rate, a lower HNR, a smaller F0 SD, and a lower F0 mean. Unlike trustworthy vocalizations, however, they showed a larger intensity range, a very small and negative relationship with jitter, and a non-significant relationship with shimmer. As for speaker affective expression, person characteristics produced mixed results. Compared to older voices, younger voices had a faster speech rate, a lower HNR, a smaller F0 SD, and a lower F0 mean, but also-contrary to prediction-a larger intensity range and a marginally smaller F0 range. Compared to male voices, female voices had a higher F0 mean, less jitter, and less shimmer-all results in opposition with the acoustics of trustworthiness.
Overall, the present study offers some support for the emotion over-generalization hypothesis [39,37], but see [38]. Perceived speaker valence, the most relevant dimension of positive affect positively predicted perceived speaker trustworthiness. Moreover, speech rate, HNR, F0 mean, and F0 SD which most strongly predicted valence, also most strongly predicted trustworthiness and, importantly, their prediction was in the same direction. Additionally, speaker age effects largely aligned with the emotion overgeneralization hypothesis. Young voices were more trusted than old voices and speech rate, HNR, F0 mean and F0 SD differences overlapped with the acoustic differences between positive and negative voices. Yet, for other parameters the patterns diverged between positive affect on the one hand and trustworthiness and age on the other hand. Additionally, speaker sex effects offered no support for the emotion overgeneralization hypothesis suggesting that positive affect is limited in explaining first impressions of trustworthiness. Instead, these impressions seem to emerge multi-causally. Some of their mechanisms may be largely perceptual and 'hard-wired' and tap on the biological convergence of expressive and person characteristics. However, others may depend on the more flexible, higher-order conceptual knowledge that individuals acquire across their lifetime (e.g., stereotypes).
Although the present study provides insights into how vocal expressive and person characteristics shape trustworthiness impressions, it also raises questions for future research. One such question concerns the speaker sex and age effects observed here. The current speaker sample, albeit comparable to much published work [15][16][17][18][19][20][21][22][23][24][25], was small and one must worry that outliers biased overall effects and between speaker comparisons. We addressed this issue by controlling for individual speakers in our statistical analysis and by exploring whether effects emerge consistently for speakers across, as well as within, sex and age groups (Fig 2). As this was the case, we are confident in our results. Nevertheless, one may wish to replicate them with a larger and more representative sample.
As second question is how age affects both the vocal conveyance of trustworthiness and its perception. Looking at vocal production, it is still unclear how ageing changes the voice and the mechanisms of vocal expression. For example, although there is evidence that F0 mean increases in older adults [56] this evidence is inconsistent [57] and may interact with generational or cultural differences in life-style (e.g., smoking, singing) [57,58]. Additionally, it would be interesting to consider age differences in the motivation and ability to pose expressions and to examine spontaneous utterances as a function of age. Looking at vocal perception, the sensory, cognitive and emotional changes associated with ageing should be explored for their relevance for trustworthiness perception. For example future work may consider how age related hearing loss, decreasing executive functions [6], or a greater emphasis on positive experiences [59] may boost perceptions of trustworthiness in older adults.
Last, it would be important to pursue the role of voices for perceiving trustworthiness in everyday life. One issue is whether perceptions are misguiding or represent useful biases in situations with limited information. Clearly, we cannot be exceedingly accurate in judging trustworthiness from nonverbal cues only [1], but see [60]. Yet, there is empirical data suggesting that such judgements may not be entirely off the mark. Trustworthiness appears to have a stable personality basis [61] and observers appear to gauge some rudimentary aspect of this basis from faces more accurately than would be expected by chance [62][63][64]. Thus, one next step would be to explore the validity of trust impressions from the voice. Additionally, it would be relevant to examine how impressions inform actual behaviors.

Conclusions
To conclude, the present study pursued voice-based trustworthiness impressions from a fresh angle. In addition to exploring the sound properties of trustworthiness, it examined the role of vocal expressive and person characteristics and linked those back to relevant voice acoustics. In doing so, it produced original evidence that positive voices sound more trustworthy than negative voices. Additionally, it delineated speaker age and sex effects according to which trustworthiness is higher for younger as compared with older and female as compared with male individuals. Last, this study highlighted speech rate, HNR, F0 mean, and F0 SD as trust's most important acoustic dimensions and showed that they map onto a subset of expressive and person characteristics providing some, albeit imperfect, support for the emotion overgeneralization hypothesis. Notably, a single acoustic parameter-fast speech-was as if not more important than speaker affect, age, and sex in predicting listener trust thus underlining that trust likely arises from both simple perceptual as well as higher-order conceptual processes.