Abstract
Speech perception in difficult listening conditions depends highly on the temporal processing ability of the auditory pathway. In the present study, we investigate the inter-subject variability of young normal-hearing listeners in the identification of time-compressed speech and how the ability to identify time-compressed speech, as assessed by the speech reception threshold (SRTrate: the speech rate at which 50% of the speech is perceived correctly) is associated with the ability to identify speech in unmodulated (SRTunmod) and modulated noise (SRTmod). These tasks are highly dependent on the temporal processing abilities of the auditory pathway. We observed a large inter-subject variability in the SRTrate and found that it is significantly correlated with the SRT when listening to unmodulated and modulated noise. Furthermore, we found that listeners who are better at perceiving speech at high rates are better in listening to speech in modulated noise. This effect persisted even when controlling for their ability to perceive speech in unmodulated noise. In addition, we also found that an increase in speech rate from 2.7 to 6.6 syllables per second resulted in a reduction in glimpsing of 5.3 dB when listening to speech in a 4-Hz amplitude-modulated masker, even though speech in quiet was 100% intelligible at both rates. These results indicate that the ability of young normal-hearing individuals to efficiently process temporal features of speech is an imperative factor when listening to speech in difficult listening situations.
Similar content being viewed by others
Data Availability
Data is available from the authors upon reasonable request.
References
Ahissar E, Nagarajan S, Ahissar M et al (2001) Speech comprehension is correlated with temporal response patterns recorded from auditory cortex. Proc Natl Acad Sci 98:13367–13372
American National Standards Institude (1999) ANSI S3.1 1–1999: Maximum permissible ambient noise levels for audiometric test rooms. 1999
Baer T, Moore BCJ (1994) Effects of spectral smearing on the intelligibility of sentences in the presence of interfering speech. J Acoust Soc Am 95:2277–2280. https://doi.org/10.1121/1.408640
Boersma P, Weenink D (2020) Praat: doing phonetics by computer [Computer program]. In: http://www.praat.org
Bosker HR, Ghitza O (2018) Entrained theta oscillations guide perception of subsequent speech: behavioural evidence from rate normalisation. Lang Cogn Neurosci 33:955–967. https://doi.org/10.1080/23273798.2018.1439179
Carbonell KM (2017) Reliability of individual differences in degraded speech perception. J Acoust Soc Am 142:EL461–EL466. https://doi.org/10.1121/1.5010148
Cooke M (2006) A glimpsing model of speech perception in noise. J Acoust Soc Am 119:1562–1573. https://doi.org/10.1121/1.2166600
Dias JW, McClaskey C, m, Harris KC, (2019) Time-compressed speech identification is predicted by auditory neural processing, perceptuomotor speed, and executive functioning in younger and older listeners. J Assoc Res Otolaryngol 20:73–88. https://doi.org/10.1007/s10162-018-00703-1
Dietrich S, Hertrich I, Ackermann H (2013a) Ultra-fast speech comprehension in blind subjects engages primary visual cortex, fusiform gyrus, and pulvinar — a functional magnetic resonance imaging (fMRI) study. BMC Neurosci 14:74. https://doi.org/10.1186/1471-2202-14-74
Dietrich S, Hertrich I, Ackermann H (2013b) Training of ultra-fast speech comprehension induces functional reorganization of the central-visual system in late-blind humans. Front Hum Neurosci 7:1–15. https://doi.org/10.3389/fnhum.2013.00701
Ding N, Patel AD, Chen L et al (2017) Temporal modulations in speech and music. Neurosci Biobehav Rev 81:181–187. https://doi.org/10.1016/j.neubiorev.2017.02.011
Dubno JR, Horwitz AR, Ahlstrom JB (2003) Recovery from prior stimulation: masking of speech by interrupted noise for younger and older adults with normal hearing. J Acoust Soc Am 113:2084–2094. https://doi.org/10.1121/1.1555611
Dubno JR, Horwitz AR, Ahlstrom JB (2002) Benefit of modulated maskers for speech recognition by younger and older adults with normal hearing. J Acoust Soc Am 111:2897-2907. https://doi.org/10.1121/1.1480421
Edwards E, Chang EF (2013) Syllabic (~2 -5 Hz ) and fluctuation (~1-10 Hz ) ranges in speech and auditory processing. Hear Res 305:113–114. https://doi.org/10.1016/j.heares.2013.08.017
Festen JM, Plomp R (1990) Effects of fluctuating noise and interfering speech on the speech-reception threshold for impaired and normal hearing. J Acoust Soc Am 88:1725–1736. https://doi.org/10.1121/1.400247
Fogerty D, Xu J, Gibbs BE (2016) Modulation masking and glimpsing of natural and vocoded speech during single-talker modulated noise: effect of the modulation spectrum. J Acoust Soc Am 140:1800–1816. https://doi.org/10.1121/1.4962494
Francart T, van Wieringen A, Wouters J (2008) APEX 3: a multi-purpose test platform for auditory psychophysical experiments. J Neurosci Methods 172:283–293. https://doi.org/10.1016/j.jneumeth.2008.04.020
Fu Q-J, Galvin JJ, Wang X (2001) Recognition of time-distorted sentences by normal-hearing and cochlear-implant listeners. J Acoust Soc Am 109:379–384. https://doi.org/10.1121/1.1327578
Füllgrabe C, Berthommier F, Lorenzi C (2006) Masking release for consonant features in temporally fluctuating background noise. Hear Res 211:74–84. https://doi.org/10.1016/j.heares.2005.09.001
Füllgrabe C, Moore BCJ, Stone MA (2015) Age-group differences in speech identification despite matched audiometrically normal hearing: contributions from auditory temporal processing and cognition. Front Aging Neurosci 6:347. https://doi.org/10.3389/fnagi.2014.00347
Ghitza O (2012) On the role of theta-driven syllabic parsing in decoding speech: intelligibility of speech with a manipulated modulation spectrum. Front Psychol 3:1–12. https://doi.org/10.3389/fpsyg.2012.00238
Ghitza O, Greenberg S (2009) On the possible role of brain rhythms in speech perception: intelligibility of time-compressed speech with periodic and aperiodic insertions of silence. Phonetica 66:113–126. https://doi.org/10.1159/000208934
Gnansia D, Pressnitzer D, Péan V et al (2010) Intelligibility of interrupted and interleaved speech for normal-hearing listeners and cochlear implantees. Hear Res 265:46–53. https://doi.org/10.1016/j.heares.2010.02.012
Goossens T, Vercammen C, Wouters J, Van WA (2018) Neural envelope encoding predicts speech perception performance for normal-hearing and hearing-impaired adults. Hear Res 370:189–200. https://doi.org/10.1016/j.heares.2018.07.012
Goossens T, Vercammen C, Wouters J, Van WA (2017) Masked speech perception across the adult lifespan: impact of age and hearing impairment. Hear Res 344:109–124. https://doi.org/10.1016/j.heares.2016.11.004
Gordon-Salant F, S; Friedman PJ, Sarah A (2007) Recognition of time-compressed and natural speech with selective temporal enhancements by young and elderly listeners. J Speech, Lang Hear Res 50:1181–1193
Gordon-Salant S, Friedman SA (2011) Recognition of rapid speech by blind and sighted older adults. J Speech, Lang Hear Res 54:622–631. https://doi.org/10.1044/1092-4388(2010/10-0052)
Gordon-Salant S, Fritzgibbons PJ (1993) Temporal factors and speech recognition performance in young and elderly listeners. J Speech Hear Res 36:1276–1285
Guest H, Munro KJ, Prendergast G et al (2018) Impaired speech perception in noise with a normal audiogram: no evidence for cochlear synaptopathy and no relation to lifetime noise exposure. Hear Res 364:142–151. https://doi.org/10.1016/j.heares.2018.03.008
Hertrich I, Dietrich S, Ackermann H (2018) Cortical phase locking to accelerated speech in blind and sighted listeners prior to and after training. Brain Lang 185:19–29. https://doi.org/10.1016/j.bandl.2018.07.002
Holmes E, Griffiths TD (2019) ‘Normal’ hearing thresholds and fundamental auditory grouping processes predict difficulties with speech-in-noise perception. Sci Rep 9:1–11. https://doi.org/10.1038/s41598-019-53353-5
Kidd GR, Watson CS, Gygi B (2007) Individual differences in auditory abilities. J Acoust Soc Am 122:418–435. https://doi.org/10.1121/1.2743154
Liu S, Del Rio E, Bradlow AR, Zeng F-G (2004) Clear speech perception in acoustic and electric hearing. J Acoust Soc Am 116:2374–2383. https://doi.org/10.1121/1.1787528
Lorenzi C, Gilbert G, Carn H et al (2006) Speech perception problems of the hearing impaired reflect inability to use temporal fine structure. Proc Natl Acad Sci 103:18866–18869. https://doi.org/10.1073/pnas.0607364103
Macpherson A, Akeroyd MA (2014) Variations in the slope of the psychometric functions for speech intelligibility: a systematic survey. Trends Hear 18:10–16. https://doi.org/10.1177/2331216514537722
Mattys SL, Davis MH, Bradlow AR, Scott SK (2012) Speech recognition in adverse conditions : A review Speech recognition in adverse conditions: a review. Lang Cogn Process 27:953–978. https://doi.org/10.1080/01690965.2012.705006
Meng Q, Wang X, Cai Y et al (2019) Time-compression thresholds for Mandarin sentences in normal-hearing and cochlear implant listeners. Hear Res 374:58–68. https://doi.org/10.1016/j.heares.2019.01.011
Middelweerd MJ, Festen JM, Plomp R (1990) Difficulties with speech intelligibility in noise in spite of a normal pure-tone audiogram: Original papers. Audiology 29:1–7. https://doi.org/10.3109/00206099009081640
Miller GA, Licklider JCR (1950) The intelligibility of interrupted speech. J Acoust Soc Am 22:167–173
Moore BCJ (2003) Temporal integration and context effects in hearing. J Phon 31:563–574. https://doi.org/10.1016/S0095-4470(03)00011-1
Moulines E, Charpentier F (1991) Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones. Speech Commun 9:453–467
Nelson PB, Jin S, Carney AE, Nelson DA (2003) Understanding speech in modulated interference: Cochlear implant users and normal- hearing listeners, J Acoust Soc Am 113:961–968. https://doi.org/10.1121/1.1531983
Niemeyer W, Starlinger I (1981) Do the blind hear better? Investigations on auditory processing in congenital or early acquired blindness II. Central functions. Audiology 20:510–515. https://doi.org/10.3109/00206098109072719
Oldfield RC (1971) The assessment and analysis of handedness: the Edinburg Inventory. Neuropsychologia 9:97–113
Penn LR, Ayasse ND, Wingfield A, Ghitza O (2018) The possible role of brain rhythms in perceiving fast speech: evidence from adult aging. J Acoust Soc Am 144:2088–2094. https://doi.org/10.1121/1.5054905
Phatak SA, Grant KW (2012) Phoneme recognition in modulated maskers by normal-hearing and aided hearing-impaired listeners. J Acoust Soc Am 132:1646–1654. https://doi.org/10.1121/1.4742718
Plomp R. (1983) The Role of Modulation in Hearing. In: Klinke R., Hartmann R. (eds) HEARING — Physiological Bases and Psychophysics. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-69257-4_39
Poeppel D (2003) The analysis of speech in different temporal integration windows: cerebral lateralization as “asymmetric sampling in time.” Speech Commun 41:245–255. https://doi.org/10.1016/S0167-6393(02)00107-3
R Core Team (2019) R: A language and environment for statistical computing. R Foundation for Statistical Computing. Vienna, Austria. URL https://www.R-project.org/
Rhebergen KS, Versfeld NJ, Dreschler WA (2006) Extended speech intelligibility index for the prediction of the speech reception threshold in fluctuating noise. J Acoust Soc Am 120:3988–3997 https://doi.org/10.1121/1.2358008
Rosen S (1992) Temporal information in speech: acoustic, auditory and linguistic aspects. Phil Trans R Soc Lond B 336:367–373
Ruggles D, Bharadwaj H, Shinn-cunningham BG (2011) Normal hearing is not enough to guarantee robust encoding of suprathreshold features important in everyday communication. Proc Natl Acad Sci 108:15516–15521. https://doi.org/10.1073/pnas.1108912108
Saberi K, Perrott DR (1999) Cognitive restoration of reversed speech. Nature 398:760–761
Schlueter A, Lemke U, Kollmeier B, Holube I (2014) Intelligibility of time-compressed speech: The effect of uniform versus non-uniform time-compression algorithms. J Acoust Soc Am 135:1541–1555. https://doi.org/10.1121/1.4863654
Shannon R V, Zeng F, Kamath V et al (1995) Speech recognition with primarily temporal cues. Science 270:303–304
Shen Y, Pearson DV (2019) Efficiency in glimpsing vowel sequences in fluctuating makers: Effects of temporal fine structure and temporal regularity. J Acoust Soc Am 145:2518–2529. https://doi.org/10.1121/1.5098949
Smith ZM, Delgutte B, Oxenham AJ (2002) Chimaeric sounds reveal dichotomies in auditory perception. Nature 416:87–90
Stone MA, Füllgrabe C, Moore BCJ (2012) Notionally steady background noise acts primarily as a modulation masker of speech. J Acoust Soc Am 132:317–326. https://doi.org/10.1121/1.4725766
Teng X, Tian X, Poeppel D (2016) Testing multi-scale processing in the auditory system. Sci Rep 6:1–13. https://doi.org/10.1038/srep34390
Van Hirtum T, Moncada-Torres A, Ghesquière P, Wouters J (2019) Speech envelope enhancement instantaneously effaces atypical speech perception in dyslexia. Ear Hear 40:1242–1252. https://doi.org/10.1097/AUD.0000000000000706
van Wieringen A, Wouters J (2008) LIST and LINT : Sentences and numbers for quantifying speech understanding in severely impaired listeners for Flanders and the Netherlands. Int J Audiol 47:348–355. https://doi.org/10.1080/14992020801895144
Vanthornhout J, Decruy L, Wouters J et al (2018) Speech intelligibility predicted from neural entrainment of the speech envelope. J Assoc Res Otolaryngol 19:181–191. https://doi.org/10.1007/s10162-018-0654-z
Varnet L, Ortiz-barajas MC, Erra RG et al (2017) A cross-linguistic study of speech modulation spectra. J Acoust Soc Am 142:1976–1989
Venables WN, Ripley BD (2002) Modern applied statistics with S, Fourth Edition. Springer New York
Versfeld NJ, Dreschler WA (2002) The relationship between the intelligibility of time-compressed speech and speech in noise in young and elderly listeners. J Acoust Soc Am 111:401–408. https://doi.org/10.1121/1.1426376
Viemeister NF, Wakefield H (1991) Temporal integration and multiple looks. J Acoust Soc Am 90:858–865
Wightman FL, Kistler DJ, O’Bryan A (2010) Individual differences and age effects in a dichotic informational masking paradigm. J Acoust Soc Am 128:270–279. https://doi.org/10.1121/1.3436536
Wingfield A, Peelle JE, Grossman M (2003) Speech rate and syntactic complexity as multiplicative factors in speech comprehension by young and older adults. Aging, Neuropsychol Cogn 10:310–322. https://doi.org/10.1076/anec.10.4.310.28974
Zeng F-G, Oba S, Garde S et al (1999) Temporal and speech processing deficits in auditory neuropathy. NeuroReport 10:3429–3435
Ziegler JC, Pech-georgel C, George F, Lorenzi C (2009) Speech-perception-in-noise deficits in dyslexia. Dev Sci 5:732–745. https://doi.org/10.1111/j.1467-7687.2009.00817.x
Acknowledgements
All subjects are thanked for taking the time to participate. Iris Van de Ryck and Jente Verbesselt are thanked for their help with the data collection. Students from the Master’s program Speech Language Pathology and Audiology Sciences at the KU Leuven are thanked for their help with the segmentation of the speech corpus. Associate editor Prof. Dr. Christian Lorenzi and two anonymous reviewers are thanked for their constructive feedback.
Funding
FWO (G.0662.13) and the HermesFonds (IWT-14124) funded this research.
Author information
Authors and Affiliations
Contributions
R. G. and J. W. elaborated the concept of the study; R. G., A. V. W., and J. W. designed the experiments R. G. and A. V. W. oversaw the segmentation of the speech corpus; R. G. was responsible for the data collection. R. G and J. W. analyzed the results; R. G drafted the paper with input from all other authors.
Corresponding author
Ethics declarations
Competing Interests
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Gransier, R., van Wieringen, A. & Wouters, J. The Intelligibility of Time-Compressed Speech Is Correlated with the Ability to Listen in Modulated Noise. JARO 23, 413–426 (2022). https://doi.org/10.1007/s10162-021-00832-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10162-021-00832-0