Speech Confusion Index (Ø): A Recognition Rate Indicator for Dysarthric Speakers

Kayasith, Prakasith; Theeramunkong, Thanaruk; Thubthong, Nuttakorn

doi:10.1007/11816508_60

Prakasith Kayasith^21,22,
Thanaruk Theeramunkong²¹ &
Nuttakorn Thubthong²³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4139))

Included in the following conference series:

International Conference on Natural Language Processing (in Finland)

1589 Accesses
2 Citations

Abstract

This paper presents an automated method to help us assess speech quality of a dysarthric speaker, instead of traditional manual methods that are laborious and subjective. The assessment result can also be a good indicator for predicting the accuracy of speech recognition that the speaker can benefit from the current speech technology. The so-called speech confusion index (Ø) is proposed to measure the severity of speech disorder. Based on the dynamic time wrapping (DTW) technique with adaptive slope constraint and accumulate mismatch score, Ø is developed as a measure of difference between two speech signals. Compared to the manual methods, i.e. articulatory and intelligibility tests, the proposed indicator was shown to be more predictive on recognition rate obtained from HMM and ANN. The evaluation was done in terms of three measures, root-mean-square difference, correlation coefficient and rank-order inconsistency. The experimental results on the control set showed that Ø achieved better prediction than both articulatory and intelligibility tests with the average improvement of 9.56% and 7.86%, respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Deller, J., Hsu, D., Ferrier, L.: On the use of hidden Markov Modeling for Recognition of Dysarthric Speech. Computer Methods and Programs in Biomedicine 35, 125–139 (1991)
Article Google Scholar
Kotler, A., Thomas-Stonel, N.: Effects of speech training on the accuracy of speech recognition for an individual with speech impairment. Journal of Augmentative and Alternative Communication 12, 71–80 (1997)
Article Google Scholar
Rosen, K., Yampolsky, S.: Automatic Speech Recognition and a Review of Its Functioning with Dysarthric Speech. Journal of Augmentative and Alternative Communication 16, 46–60 (2000)
Google Scholar
Thubthong, N., Kayasith, P.: Incorporated Tone model speech recognition for Thai dysarthria. In: Proc. of 11th International Society for Augmentive and Alternate Communication, Natal, Brazil (2004)
Google Scholar
Kent, R.D.: Hearing and believing: Some limits to auditory-perceptual assessment of speech and voice disorders. Journal of Speech and Hearing Disorders 7, 7–23 (1996)
Google Scholar
Corman, T.H., Leiserson, C.E., Rivet, R.L.: Introduction to Algorithms. MIT Press, Cambridge (1990)
Google Scholar
Itakura, F.: Minimum Prediction Residual Principle Applied to Speech Recognition. IEEE Transaction on acoustics, speech, and signal processing ASSP-23(1), 67–72 (1975)
Article Google Scholar
Kayasith, P., Thubthong, N., Theeramunkong, T.: Consistency Score: Dysarthric Speech Indicator for Modren Speech Technologies. In: Proc. of 12th International Society for Augmentive and Alternate Communication (ISAA 2006), Dusseldorf German (2006)
Google Scholar
Shriberg, L., Kwiatkowski, J.: Phonological disorders III: A procedure for assessing severity of involvement. Journal of Speech and Hearing Disorders 47(3), 256–270 (1982)
Google Scholar
Shriberg, L., Austin, D., Lewis, B.A., McSweeny, J.L., Wilson, D.L.: The percentage of consonants correct (PCC) metric. Extensions and reliability data. Journal of Speech, Language, and Hearing Research 40, 708–722 (1997)
Google Scholar
Kayasith, P., Thubthong, N.: Computerized Intelligibility Test for Thai Speech Disorder. In: US - Thailand Symposium on Biomedical Engineering, Bangkok Thailand (2005)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Information and Computer Technology, Sirindhorn International Institute of Technology (SIIT), Thammasat University, Klong Luang, Pathumthani, 12121, Thailand
Prakasith Kayasith & Thanaruk Theeramunkong
Assistive Technology Center, National Electronics and Computer Technology Center (NECTEC), Thailand Science Park, Klong Luang, Pathumthani, 12120, Thailand
Prakasith Kayasith
Acoustics and Speech Research Laboratory (ASRL), Department of Physics, Faculty of Science, Chulalongkorn University, Bangkok, 10330, Thailand
Nuttakorn Thubthong

Authors

Prakasith Kayasith
View author publications
You can also search for this author in PubMed Google Scholar
Thanaruk Theeramunkong
View author publications
You can also search for this author in PubMed Google Scholar
Nuttakorn Thubthong
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Turku Centre for Computer Science (TUCS), Department of Information Technology, University of Turku, Joukahaisenkatu 3-5 B, FIN-20520, Turku, Finland
Tapio Salakoski
Turku Centre for Computer Science (TUCS) and Department of IT, University of Turku, Lemminkäisenkatu 14 A, 20520, Turku, Finland
Filip Ginter & Sampo Pyysalo &
Department of Information Technology, University of Turku, Lemminkäisenkatu 14–18 A, FIN-20520, Turku, Finland
Tapio Pahikkala

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kayasith, P., Theeramunkong, T., Thubthong, N. (2006). Speech Confusion Index (Ø): A Recognition Rate Indicator for Dysarthric Speakers. In: Salakoski, T., Ginter, F., Pyysalo, S., Pahikkala, T. (eds) Advances in Natural Language Processing. FinTAL 2006. Lecture Notes in Computer Science(), vol 4139. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11816508_60

Download citation

DOI: https://doi.org/10.1007/11816508_60
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-37334-6
Online ISBN: 978-3-540-37336-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics