The Intelligibility of Time-Compressed Speech Is Correlated with the Ability to Listen in Modulated Noise

Gransier, Robin; van Wieringen, Astrid; Wouters, Jan

doi:10.1007/s10162-021-00832-0

The Intelligibility of Time-Compressed Speech Is Correlated with the Ability to Listen in Modulated Noise

Research Article
Published: 07 March 2022

Volume 23, pages 413–426, (2022)
Cite this article

Journal of the Association for Research in Otolaryngology Aims and scope Submit manuscript

447 Accesses
1 Citation
Explore all metrics

Abstract

Speech perception in difficult listening conditions depends highly on the temporal processing ability of the auditory pathway. In the present study, we investigate the inter-subject variability of young normal-hearing listeners in the identification of time-compressed speech and how the ability to identify time-compressed speech, as assessed by the speech reception threshold (SRT_rate: the speech rate at which 50% of the speech is perceived correctly) is associated with the ability to identify speech in unmodulated (SRT_unmod) and modulated noise (SRT_mod). These tasks are highly dependent on the temporal processing abilities of the auditory pathway. We observed a large inter-subject variability in the SRT_rate and found that it is significantly correlated with the SRT when listening to unmodulated and modulated noise. Furthermore, we found that listeners who are better at perceiving speech at high rates are better in listening to speech in modulated noise. This effect persisted even when controlling for their ability to perceive speech in unmodulated noise. In addition, we also found that an increase in speech rate from 2.7 to 6.6 syllables per second resulted in a reduction in glimpsing of 5.3 dB when listening to speech in a 4-Hz amplitude-modulated masker, even though speech in quiet was 100% intelligible at both rates. These results indicate that the ability of young normal-hearing individuals to efficiently process temporal features of speech is an imperative factor when listening to speech in difficult listening situations.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Time-Compressed Speech Identification Is Predicted by Auditory Neural Processing, Perceptuomotor Speed, and Executive Functioning in Younger and Older Listeners

Article 19 November 2018

James W. Dias, Carolyn M. McClaskey & Kelly C. Harris

Frequency Following Responses to Tone Glides: Effects of Age and Hearing Loss

Article 12 July 2023

Michelle R. Molis, William J. Bologna, … Curtis J. Billings

Age-related changes in auditory processing and speech perception: cross-sectional and longitudinal analyses

Article 24 January 2017

Harvey Babkoff & Leah Fostick

Data Availability

Data is available from the authors upon reasonable request.

References

Ahissar E, Nagarajan S, Ahissar M et al (2001) Speech comprehension is correlated with temporal response patterns recorded from auditory cortex. Proc Natl Acad Sci 98:13367–13372
Article CAS Google Scholar
American National Standards Institude (1999) ANSI S3.1 1–1999: Maximum permissible ambient noise levels for audiometric test rooms. 1999
Baer T, Moore BCJ (1994) Effects of spectral smearing on the intelligibility of sentences in the presence of interfering speech. J Acoust Soc Am 95:2277–2280. https://doi.org/10.1121/1.408640
Article CAS PubMed Google Scholar
Boersma P, Weenink D (2020) Praat: doing phonetics by computer [Computer program]. In: http://www.praat.org
Bosker HR, Ghitza O (2018) Entrained theta oscillations guide perception of subsequent speech: behavioural evidence from rate normalisation. Lang Cogn Neurosci 33:955–967. https://doi.org/10.1080/23273798.2018.1439179
Article Google Scholar
Carbonell KM (2017) Reliability of individual differences in degraded speech perception. J Acoust Soc Am 142:EL461–EL466. https://doi.org/10.1121/1.5010148
Cooke M (2006) A glimpsing model of speech perception in noise. J Acoust Soc Am 119:1562–1573. https://doi.org/10.1121/1.2166600
Article PubMed Google Scholar
Dias JW, McClaskey C, m, Harris KC, (2019) Time-compressed speech identification is predicted by auditory neural processing, perceptuomotor speed, and executive functioning in younger and older listeners. J Assoc Res Otolaryngol 20:73–88. https://doi.org/10.1007/s10162-018-00703-1
Article PubMed Google Scholar
Dietrich S, Hertrich I, Ackermann H (2013a) Ultra-fast speech comprehension in blind subjects engages primary visual cortex, fusiform gyrus, and pulvinar — a functional magnetic resonance imaging (fMRI) study. BMC Neurosci 14:74. https://doi.org/10.1186/1471-2202-14-74
Article PubMed PubMed Central Google Scholar
Dietrich S, Hertrich I, Ackermann H (2013b) Training of ultra-fast speech comprehension induces functional reorganization of the central-visual system in late-blind humans. Front Hum Neurosci 7:1–15. https://doi.org/10.3389/fnhum.2013.00701
Article Google Scholar
Ding N, Patel AD, Chen L et al (2017) Temporal modulations in speech and music. Neurosci Biobehav Rev 81:181–187. https://doi.org/10.1016/j.neubiorev.2017.02.011
Article PubMed Google Scholar
Dubno JR, Horwitz AR, Ahlstrom JB (2003) Recovery from prior stimulation: masking of speech by interrupted noise for younger and older adults with normal hearing. J Acoust Soc Am 113:2084–2094. https://doi.org/10.1121/1.1555611
Article PubMed Google Scholar
Dubno JR, Horwitz AR, Ahlstrom JB (2002) Benefit of modulated maskers for speech recognition by younger and older adults with normal hearing. J Acoust Soc Am 111:2897-2907. https://doi.org/10.1121/1.1480421
Edwards E, Chang EF (2013) Syllabic (~2 -5 Hz ) and fluctuation (~1-10 Hz ) ranges in speech and auditory processing. Hear Res 305:113–114. https://doi.org/10.1016/j.heares.2013.08.017
Article PubMed Google Scholar
Festen JM, Plomp R (1990) Effects of fluctuating noise and interfering speech on the speech-reception threshold for impaired and normal hearing. J Acoust Soc Am 88:1725–1736. https://doi.org/10.1121/1.400247
Article CAS PubMed Google Scholar
Fogerty D, Xu J, Gibbs BE (2016) Modulation masking and glimpsing of natural and vocoded speech during single-talker modulated noise: effect of the modulation spectrum. J Acoust Soc Am 140:1800–1816. https://doi.org/10.1121/1.4962494
Article PubMed PubMed Central Google Scholar
Francart T, van Wieringen A, Wouters J (2008) APEX 3: a multi-purpose test platform for auditory psychophysical experiments. J Neurosci Methods 172:283–293. https://doi.org/10.1016/j.jneumeth.2008.04.020
Article PubMed Google Scholar
Fu Q-J, Galvin JJ, Wang X (2001) Recognition of time-distorted sentences by normal-hearing and cochlear-implant listeners. J Acoust Soc Am 109:379–384. https://doi.org/10.1121/1.1327578
Article CAS PubMed Google Scholar
Füllgrabe C, Berthommier F, Lorenzi C (2006) Masking release for consonant features in temporally fluctuating background noise. Hear Res 211:74–84. https://doi.org/10.1016/j.heares.2005.09.001
Article PubMed Google Scholar
Füllgrabe C, Moore BCJ, Stone MA (2015) Age-group differences in speech identification despite matched audiometrically normal hearing: contributions from auditory temporal processing and cognition. Front Aging Neurosci 6:347. https://doi.org/10.3389/fnagi.2014.00347
Article PubMed PubMed Central Google Scholar
Ghitza O (2012) On the role of theta-driven syllabic parsing in decoding speech: intelligibility of speech with a manipulated modulation spectrum. Front Psychol 3:1–12. https://doi.org/10.3389/fpsyg.2012.00238
Article Google Scholar
Ghitza O, Greenberg S (2009) On the possible role of brain rhythms in speech perception: intelligibility of time-compressed speech with periodic and aperiodic insertions of silence. Phonetica 66:113–126. https://doi.org/10.1159/000208934
Article PubMed Google Scholar
Gnansia D, Pressnitzer D, Péan V et al (2010) Intelligibility of interrupted and interleaved speech for normal-hearing listeners and cochlear implantees. Hear Res 265:46–53. https://doi.org/10.1016/j.heares.2010.02.012
Article PubMed Google Scholar
Goossens T, Vercammen C, Wouters J, Van WA (2018) Neural envelope encoding predicts speech perception performance for normal-hearing and hearing-impaired adults. Hear Res 370:189–200. https://doi.org/10.1016/j.heares.2018.07.012
Article PubMed Google Scholar
Goossens T, Vercammen C, Wouters J, Van WA (2017) Masked speech perception across the adult lifespan: impact of age and hearing impairment. Hear Res 344:109–124. https://doi.org/10.1016/j.heares.2016.11.004
Article PubMed Google Scholar
Gordon-Salant F, S; Friedman PJ, Sarah A (2007) Recognition of time-compressed and natural speech with selective temporal enhancements by young and elderly listeners. J Speech, Lang Hear Res 50:1181–1193
Article Google Scholar
Gordon-Salant S, Friedman SA (2011) Recognition of rapid speech by blind and sighted older adults. J Speech, Lang Hear Res 54:622–631. https://doi.org/10.1044/1092-4388(2010/10-0052)
Article Google Scholar
Gordon-Salant S, Fritzgibbons PJ (1993) Temporal factors and speech recognition performance in young and elderly listeners. J Speech Hear Res 36:1276–1285
Article CAS Google Scholar
Guest H, Munro KJ, Prendergast G et al (2018) Impaired speech perception in noise with a normal audiogram: no evidence for cochlear synaptopathy and no relation to lifetime noise exposure. Hear Res 364:142–151. https://doi.org/10.1016/j.heares.2018.03.008
Article PubMed PubMed Central Google Scholar
Hertrich I, Dietrich S, Ackermann H (2018) Cortical phase locking to accelerated speech in blind and sighted listeners prior to and after training. Brain Lang 185:19–29. https://doi.org/10.1016/j.bandl.2018.07.002
Article PubMed Google Scholar
Holmes E, Griffiths TD (2019) ‘Normal’ hearing thresholds and fundamental auditory grouping processes predict difficulties with speech-in-noise perception. Sci Rep 9:1–11. https://doi.org/10.1038/s41598-019-53353-5
Article CAS Google Scholar
Kidd GR, Watson CS, Gygi B (2007) Individual differences in auditory abilities. J Acoust Soc Am 122:418–435. https://doi.org/10.1121/1.2743154
Article PubMed Google Scholar
Liu S, Del Rio E, Bradlow AR, Zeng F-G (2004) Clear speech perception in acoustic and electric hearing. J Acoust Soc Am 116:2374–2383. https://doi.org/10.1121/1.1787528
Article PubMed Google Scholar
Lorenzi C, Gilbert G, Carn H et al (2006) Speech perception problems of the hearing impaired reflect inability to use temporal fine structure. Proc Natl Acad Sci 103:18866–18869. https://doi.org/10.1073/pnas.0607364103
Article CAS PubMed PubMed Central Google Scholar
Macpherson A, Akeroyd MA (2014) Variations in the slope of the psychometric functions for speech intelligibility: a systematic survey. Trends Hear 18:10–16. https://doi.org/10.1177/2331216514537722
Article Google Scholar
Mattys SL, Davis MH, Bradlow AR, Scott SK (2012) Speech recognition in adverse conditions : A review Speech recognition in adverse conditions: a review. Lang Cogn Process 27:953–978. https://doi.org/10.1080/01690965.2012.705006
Article Google Scholar
Meng Q, Wang X, Cai Y et al (2019) Time-compression thresholds for Mandarin sentences in normal-hearing and cochlear implant listeners. Hear Res 374:58–68. https://doi.org/10.1016/j.heares.2019.01.011
Article PubMed Google Scholar
Middelweerd MJ, Festen JM, Plomp R (1990) Difficulties with speech intelligibility in noise in spite of a normal pure-tone audiogram: Original papers. Audiology 29:1–7. https://doi.org/10.3109/00206099009081640
Article CAS PubMed Google Scholar
Miller GA, Licklider JCR (1950) The intelligibility of interrupted speech. J Acoust Soc Am 22:167–173
Article Google Scholar
Moore BCJ (2003) Temporal integration and context effects in hearing. J Phon 31:563–574. https://doi.org/10.1016/S0095-4470(03)00011-1
Article Google Scholar
Moulines E, Charpentier F (1991) Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones. Speech Commun 9:453–467
Article Google Scholar
Nelson PB, Jin S, Carney AE, Nelson DA (2003) Understanding speech in modulated interference: Cochlear implant users and normal- hearing listeners, J Acoust Soc Am 113:961–968. https://doi.org/10.1121/1.1531983
Niemeyer W, Starlinger I (1981) Do the blind hear better? Investigations on auditory processing in congenital or early acquired blindness II. Central functions. Audiology 20:510–515. https://doi.org/10.3109/00206098109072719
Article CAS Google Scholar
Oldfield RC (1971) The assessment and analysis of handedness: the Edinburg Inventory. Neuropsychologia 9:97–113
Article CAS Google Scholar
Penn LR, Ayasse ND, Wingfield A, Ghitza O (2018) The possible role of brain rhythms in perceiving fast speech: evidence from adult aging. J Acoust Soc Am 144:2088–2094. https://doi.org/10.1121/1.5054905
Article PubMed PubMed Central Google Scholar
Phatak SA, Grant KW (2012) Phoneme recognition in modulated maskers by normal-hearing and aided hearing-impaired listeners. J Acoust Soc Am 132:1646–1654. https://doi.org/10.1121/1.4742718
Article PubMed Google Scholar
Plomp R. (1983) The Role of Modulation in Hearing. In: Klinke R., Hartmann R. (eds) HEARING — Physiological Bases and Psychophysics. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-69257-4_39
Poeppel D (2003) The analysis of speech in different temporal integration windows: cerebral lateralization as “asymmetric sampling in time.” Speech Commun 41:245–255. https://doi.org/10.1016/S0167-6393(02)00107-3
Article Google Scholar
R Core Team (2019) R: A language and environment for statistical computing. R Foundation for Statistical Computing. Vienna, Austria. URL https://www.R-project.org/
Rhebergen KS, Versfeld NJ, Dreschler WA (2006) Extended speech intelligibility index for the prediction of the speech reception threshold in fluctuating noise. J Acoust Soc Am 120:3988–3997 https://doi.org/10.1121/1.2358008
Rosen S (1992) Temporal information in speech: acoustic, auditory and linguistic aspects. Phil Trans R Soc Lond B 336:367–373
Article CAS Google Scholar
Ruggles D, Bharadwaj H, Shinn-cunningham BG (2011) Normal hearing is not enough to guarantee robust encoding of suprathreshold features important in everyday communication. Proc Natl Acad Sci 108:15516–15521. https://doi.org/10.1073/pnas.1108912108
Article PubMed PubMed Central Google Scholar
Saberi K, Perrott DR (1999) Cognitive restoration of reversed speech. Nature 398:760–761
Article CAS Google Scholar
Schlueter A, Lemke U, Kollmeier B, Holube I (2014) Intelligibility of time-compressed speech: The effect of uniform versus non-uniform time-compression algorithms. J Acoust Soc Am 135:1541–1555. https://doi.org/10.1121/1.4863654
Article PubMed Google Scholar
Shannon R V, Zeng F, Kamath V et al (1995) Speech recognition with primarily temporal cues. Science 270:303–304
Shen Y, Pearson DV (2019) Efficiency in glimpsing vowel sequences in fluctuating makers: Effects of temporal fine structure and temporal regularity. J Acoust Soc Am 145:2518–2529. https://doi.org/10.1121/1.5098949
Article PubMed PubMed Central Google Scholar
Smith ZM, Delgutte B, Oxenham AJ (2002) Chimaeric sounds reveal dichotomies in auditory perception. Nature 416:87–90
Article CAS Google Scholar
Stone MA, Füllgrabe C, Moore BCJ (2012) Notionally steady background noise acts primarily as a modulation masker of speech. J Acoust Soc Am 132:317–326. https://doi.org/10.1121/1.4725766
Article PubMed Google Scholar
Teng X, Tian X, Poeppel D (2016) Testing multi-scale processing in the auditory system. Sci Rep 6:1–13. https://doi.org/10.1038/srep34390
Article CAS Google Scholar
Van Hirtum T, Moncada-Torres A, Ghesquière P, Wouters J (2019) Speech envelope enhancement instantaneously effaces atypical speech perception in dyslexia. Ear Hear 40:1242–1252. https://doi.org/10.1097/AUD.0000000000000706
Article PubMed Google Scholar
van Wieringen A, Wouters J (2008) LIST and LINT : Sentences and numbers for quantifying speech understanding in severely impaired listeners for Flanders and the Netherlands. Int J Audiol 47:348–355. https://doi.org/10.1080/14992020801895144
Article PubMed Google Scholar
Vanthornhout J, Decruy L, Wouters J et al (2018) Speech intelligibility predicted from neural entrainment of the speech envelope. J Assoc Res Otolaryngol 19:181–191. https://doi.org/10.1007/s10162-018-0654-z
Article PubMed PubMed Central Google Scholar
Varnet L, Ortiz-barajas MC, Erra RG et al (2017) A cross-linguistic study of speech modulation spectra. J Acoust Soc Am 142:1976–1989
Article Google Scholar
Venables WN, Ripley BD (2002) Modern applied statistics with S, Fourth Edition. Springer New York
Versfeld NJ, Dreschler WA (2002) The relationship between the intelligibility of time-compressed speech and speech in noise in young and elderly listeners. J Acoust Soc Am 111:401–408. https://doi.org/10.1121/1.1426376
Article PubMed Google Scholar
Viemeister NF, Wakefield H (1991) Temporal integration and multiple looks. J Acoust Soc Am 90:858–865
Article CAS Google Scholar
Wightman FL, Kistler DJ, O’Bryan A (2010) Individual differences and age effects in a dichotic informational masking paradigm. J Acoust Soc Am 128:270–279. https://doi.org/10.1121/1.3436536
Article PubMed PubMed Central Google Scholar
Wingfield A, Peelle JE, Grossman M (2003) Speech rate and syntactic complexity as multiplicative factors in speech comprehension by young and older adults. Aging, Neuropsychol Cogn 10:310–322. https://doi.org/10.1076/anec.10.4.310.28974
Article Google Scholar
Zeng F-G, Oba S, Garde S et al (1999) Temporal and speech processing deficits in auditory neuropathy. NeuroReport 10:3429–3435
Article CAS Google Scholar
Ziegler JC, Pech-georgel C, George F, Lorenzi C (2009) Speech-perception-in-noise deficits in dyslexia. Dev Sci 5:732–745. https://doi.org/10.1111/j.1467-7687.2009.00817.x
Article Google Scholar

Download references

Acknowledgements

All subjects are thanked for taking the time to participate. Iris Van de Ryck and Jente Verbesselt are thanked for their help with the data collection. Students from the Master’s program Speech Language Pathology and Audiology Sciences at the KU Leuven are thanked for their help with the segmentation of the speech corpus. Associate editor Prof. Dr. Christian Lorenzi and two anonymous reviewers are thanked for their constructive feedback.

Funding

FWO (G.0662.13) and the HermesFonds (IWT-14124) funded this research.

Author information

Authors and Affiliations

ExpORL, Department of Neurosciences, KU Leuven, Box 721, Herestraat 49, Leuven, 3000, Belgium
Robin Gransier, Astrid van Wieringen & Jan Wouters

Authors

Robin Gransier
View author publications
You can also search for this author in PubMed Google Scholar
Astrid van Wieringen
View author publications
You can also search for this author in PubMed Google Scholar
Jan Wouters
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

R. G. and J. W. elaborated the concept of the study; R. G., A. V. W., and J. W. designed the experiments R. G. and A. V. W. oversaw the segmentation of the speech corpus; R. G. was responsible for the data collection. R. G and J. W. analyzed the results; R. G drafted the paper with input from all other authors.

Corresponding author

Correspondence to Robin Gransier.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Gransier, R., van Wieringen, A. & Wouters, J. The Intelligibility of Time-Compressed Speech Is Correlated with the Ability to Listen in Modulated Noise. JARO 23, 413–426 (2022). https://doi.org/10.1007/s10162-021-00832-0

Download citation

Received: 13 January 2021
Accepted: 15 December 2021
Published: 07 March 2022
Issue Date: June 2022
DOI: https://doi.org/10.1007/s10162-021-00832-0

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

The Intelligibility of Time-Compressed Speech Is Correlated with the Ability to Listen in Modulated Noise

Abstract

Access this article

Similar content being viewed by others

Time-Compressed Speech Identification Is Predicted by Auditory Neural Processing, Perceptuomotor Speed, and Executive Functioning in Younger and Older Listeners

Frequency Following Responses to Tone Glides: Effects of Age and Hearing Loss

Age-related changes in auditory processing and speech perception: cross-sectional and longitudinal analyses

Data Availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing Interests

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

The Intelligibility of Time-Compressed Speech Is Correlated with the Ability to Listen in Modulated Noise

Abstract

Access this article

Similar content being viewed by others

Time-Compressed Speech Identification Is Predicted by Auditory Neural Processing, Perceptuomotor Speed, and Executive Functioning in Younger and Older Listeners

Frequency Following Responses to Tone Glides: Effects of Age and Hearing Loss

Age-related changes in auditory processing and speech perception: cross-sectional and longitudinal analyses

Data Availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing Interests

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation