Evaluation of Supervised Learning Algorithms Based on Speech Features as Predictors to the Diagnosis of Mild to Moderate Intellectual Disability

Aggarwal, Gaurav; Singh, Latika

doi:10.1007/s13319-018-0207-6

Evaluation of Supervised Learning Algorithms Based on Speech Features as Predictors to the Diagnosis of Mild to Moderate Intellectual Disability

3DR Express
Published: 08 November 2018

Volume 9, article number 55, (2018)
Cite this article

3D Research

Gaurav Aggarwal^1,2 &
Latika Singh¹

209 Accesses
9 Citations
Explore all metrics

Abstract

Due to age-bound onset of symptoms used for diagnosis of mild to moderate intellectual disability, early diagnosis of these problems has long been a difficult issue. The diagnosis includes tests pertaining to intellectual functioning and adaptive behaviours including communication skills etc. In this paper, it is proposed to use speech features as an early indicator of the disorder which can be used to train machine learning algorithms for differentiating between speech of normally developing children and children with intellectual disability. In this paper, speech abnormalities are quantified using acoustic parameters including Linear Predictive Cepstral Coefficients, Mel Frequency Cepstral Coefficients and spectral features in speech samples of 48 participants (24 with intellectual disability and 24 age-matched controls). A training dataset was created by extracting these features which was used for learning by various classifiers. The experiments show promising results where Support Vector Machine gives an accuracy of 98%. Consequently, a well-trained classification algorithm can be used as an aid in early detection of mild to moderate intellectual disability.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Educational data mining: prediction of students' academic performance using machine learning algorithms

Article Open access 03 March 2022

Fundamentals of Artificial Neural Networks and Deep Learning

A comprehensive survey on automatic speech recognition using neural networks

Article 15 August 2023

References

Abbeduto, L., & Nuccio, J. B. (1991). Relation between receptive language and cognitive maturity in persons with mental retardation. American Journal of Mental Retardation, 96, 143–149.
Google Scholar
Abbeduto, L., Furman, L., & Davies, B. (1989). Relation between the receptive language and mental age of persons with mental retardation. American Journal of Mental Retardation, 93, 535–543.
Google Scholar
Aggarwal, G., & Singh, L. (2018). Classification of intellectual disability using LPC, LPCC, and WLPCC parameterization techniques. International Journal of Computers and Applications. https://doi.org/10.1080/1206212X.2018.1475330.
Article Google Scholar
Ai, O. C., Hariharan, M., Yaacob, S., & Chee, L. S. (2012). Classification of speech dysfluencies with MFCC and LPCC features. Expert Systems with Applications, 39(2), 2157–2165.
Article Google Scholar
American Psychiatric Association. (2013). Diagnostic and statistical manual of mental disorders (DSM-5 ^® ). Arlington: American Psychiatric Publishing.
Book Google Scholar
Antoniol, G., Rollo, V. F., & Venturi, G. (2005, May). Linear predictive coding and cepstrum coefficients for mining time variant information from software repositories. In ACM SIGSOFT software engineering notes (Vol. 30, No. 4, pp. 1–5). ACM.
Batshaw, M. L. (2002). Children with disabilities (5th ed.). Baltimore: Brookes.
Google Scholar
Belva, B. C., Matson, J. L., Sipes, M., & Bamburg, J. W. (2012). An examination of specific communication deficits in adults with profound intellectual disabilities. Research in Developmental Disabilities, 33, 525–529.
Article Google Scholar
Berry, P. B. (1972). Comprehension of possessive and present continuous sentences by nonretarded, mildly retarded, and severely retarded children. American Journal of Mental Deficiency, 76, 540–544.
Google Scholar
Biagetti, G., Crippa, P., Curzi, A., Orcioni, S., & Turchetti, C. (2015, January). Speaker identification with short sequences of speech frames. In Proceedings of the international conference on pattern recognition applications and methods (Vol. 2, pp. 178–185). SciTePress, Setúbal.
Biagetti, G., Crippa, P., Falaschetti, L., Orcioni, S., & Turchetti, C. (2017). An investigation on the accuracy of truncated DKLT representation for speaker identification with short sequences of speech frames. IEEE transactions on cybernetics, 47(12), 4235–4249.
Article Google Scholar
Bourlard, H., & Wellekens, C. J. (1989). Speech pattern discrimination and multilayer perceptrons. Computer Speech & Language, 3(1), 1–19.
Article Google Scholar
Cabanas, R. (1954). Some findings in speech and voice therapy among mentally deficient children. Folia Phoniatrica et Logopaedica, 6(1), 34–37.
Article Google Scholar
Cheslock, M. A., Barton-Hulsey, A., Romski, M., & Sevcik, R. A. (2008). Using a speech-generating device to enhance communicative abilities for an adult with moderate intellectual disability. Journal of Intellectual and Developmental Disability, 46, 376–386.
Article Google Scholar
Clegg, J., Hollis, C., Mawhood, L., & Rutter, M. (2005). Developmental language disorders—A follow-up in later adult life. Cognitive, language and psychosocial outcomes. Journal of Child Psychology and Psychiatry, 46(2), 128–149.
Article Google Scholar
Davis, S. B., & Mermelstein, P. (1990). Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. In Readings in speech recognition (pp. 65–74).
Chapter Google Scholar
Griffiths, R. (1970). The ability of young children. A study in mental measurement. London: University of London Press.
Google Scholar
Harel, S., Greenstein, Y., Kramer, U., Yifat, R., Samuel, E., Nevo, Y., et al. (1996). Clinical characteristics of children referred to a child development center for evaluation of speech, language, and communication disorders. Pediatric Neurology, 15(4), 305–311.
Article Google Scholar
Hariharan, M., Chee, L. S., Ai, O. C., & Yaacob, S. (2012). Classification of speech dysfluencies using LPC based parameterization techniques. Journal of Medical Systems, 36(3), 1821–1830.
Article Google Scholar
Harrison, P., & Oakland, T. (2003). Adaptive behavior assessment system (ABAS-II). San Antonio, TX: The Psychological Corporation.
Google Scholar
Hau, C. C. (Ed.). (2015). Handbook of pattern recognition and computer vision. Singapore: World Scientific.
Google Scholar
Jothilakshmi, S., Ramalingam, V., & Palanivel, S. (2009). Unsupervised speaker segmentation with residual phase and MFCC features. Expert Systems with Applications, 36(6), 9799–9804.
Article Google Scholar
Kail, R. (1992). General slowing of information-processing by persons with mental retardation. American Journal of Mental Retardation, 97, 333–341.
Google Scholar
Koul, R., & Clapsaddle, K. C. (2006). Effects of repeated listening experiences on the perception of synthetic speech by individuals with mild-to-moderate intellectual disabilities. Augmentative & Alternative Communication, 22, 112–122.
Article Google Scholar
Lehiste, I., & Lass, N. J. (1976). Suprasegmental features of speech. Contemporary Issues in Experimental Phonetics, 225, 239.
Google Scholar
Lesser, R., & Hassip, S. (1986). Knowledge and opinions of speech therapy in teachers, doctors and nurses. Child: Care, Health and Development, 12(4), 235–249.
Google Scholar
Lynch, M. P., Oller, D. K., Eilers, R. E., & Basinger, D. (1990). Vocal development of infants with Down syndrome. In University of Wisconsin 11th annual symposium for research on child language disorders, Madison, WI.
Maber-Aleksandrowicz, S., Avent, C., & Hassiotis, A. (2016). A systematic review of animal-assisted therapy on psychosocial outcomes in people with intellectual disability. Research in Developmental Disabilities, 49, 322–338.
Article Google Scholar
McLean, L. K., Brady, N. C., McLean, J. E., & Behrens, G. A. (1999). Communication forms and functions of children and adults with severe mental retardation in community and institutional settings. Journal of Speech, Language, and Hearing Research, 42, 231–240.
Article Google Scholar
Memisevic, H., & Hadzic, S. (2013). Speech and language disorders in children with intellectual disability in Bosnia and Herzegovina. Disability, CBR & Inclusive Development, 24(2), 92–99.
Article Google Scholar
Merrill, E. C., & Jackson, T. S. (1992). Degree of associative relatedness and sentence processing by adolescents with and without mental retardation. American Journal of Mental Retardation, 97, 173–185.
Google Scholar
Merrill, E. C., & Jackson, T. S. (1992). Sentence processing by adolescents with and without mental retardation. American Journal of Mental Retardation, 97, 342–350.
Google Scholar
Merrill, E. C., & Mar, H. H. (1987). Differences between mentally retarded and nonretarded persons’ efficiency of auditory sentence processing. American Journal of Mental Deficiency, 91, 406–414.
Google Scholar
Mervis, C. B., & Becerra, A. M. (2007). Language and communicative development in Williams syndrome. Mental Retardation and Developmental Disabilities Research Reviews, 13(1), 3–15.
Article Google Scholar
Mervis, C. B., & Cicchetti, D. (1990). Early conceptual development of children with Down syndrome. In Children with Down syndrome: A developmental perspective (pp. 252–301).
Rabiner, L. R., & Juang, B. H. (1993). Fundamentals of speech recognition (Vol. 14). Englewood Cliffs: PTR Prentice Hall.
Google Scholar
Räsänen, O., & Pohjalainen, J. (2013). Random subset feature selection in automatic recognition of developmental disorders, affective states, and level of conflict from speech. In INTERSPEECH (pp. 210–214).
Rumelhart, D. E., Hinton, G. E., & Williams, R. J. (1986). Learning representations by back-propagating errors. Nature, 323(6088), 533.
Article MATH Google Scholar
Schalock, R. L., Borthwick-Duffy, S. A., Bradley, V. J., Buntinx, W. H., Coulter, D. L., Craig, E. M., et al. (2010). Intellectual disability: Definition, classification, and systems of supports. Washington, DC: American Association on Intellectual and Developmental Disabilities.
Google Scholar
Tager-Flusberg, H. E. L. E. N., & Sullivan, K. (1998). Children with mental retardation. In Handbook of mental retardation and development (p. 208–239).
Vapnik, V. N. (1999). An overview of statistical learning theory. IEEE Transactions on Neural Networks, 10(5), 988–999.
Article Google Scholar
Wechsler, D. (2012). WPPSI-IV: Wechsler Preschool and Primary Scale of Intelligence. London: Pearson, Psychological Corporation.
Google Scholar
Xiao, X., Chng, E. S., & Li, H. (2008). Normalization of the speech modulation spectra for robust speech recognition. IEEE Transactions on Audio, Speech and Language Processing, 16(8), 1662–1674.
Article Google Scholar
Zhong, J., Hu, W., Soong, F., & Meng, H. (2017). DNN i-vector speaker verification with short, text-constrained test utterances. In Proceedings of Interspeech 2017 (pp. 1507–1511).

Download references

Author information

Authors and Affiliations

The NorthCap University, Sector-23A, Huda, Gurgaon, 122017, India
Gaurav Aggarwal & Latika Singh
Manipal University Jaipur, VPO Dehmikalan, Ajmer Highway, Jaipur, 303007, India
Gaurav Aggarwal

Authors

Gaurav Aggarwal
View author publications
You can also search for this author in PubMed Google Scholar
Latika Singh
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Gaurav Aggarwal.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Aggarwal, G., Singh, L. Evaluation of Supervised Learning Algorithms Based on Speech Features as Predictors to the Diagnosis of Mild to Moderate Intellectual Disability. 3D Res 9, 55 (2018). https://doi.org/10.1007/s13319-018-0207-6

Download citation

Received: 28 August 2018
Revised: 22 October 2018
Accepted: 29 October 2018
Published: 08 November 2018
DOI: https://doi.org/10.1007/s13319-018-0207-6

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Evaluation of Supervised Learning Algorithms Based on Speech Features as Predictors to the Diagnosis of Mild to Moderate Intellectual Disability

Abstract

Access this article

Similar content being viewed by others

Educational data mining: prediction of students' academic performance using machine learning algorithms

Fundamentals of Artificial Neural Networks and Deep Learning

A comprehensive survey on automatic speech recognition using neural networks

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Evaluation of Supervised Learning Algorithms Based on Speech Features as Predictors to the Diagnosis of Mild to Moderate Intellectual Disability

Abstract

Access this article

Similar content being viewed by others

Educational data mining: prediction of students' academic performance using machine learning algorithms

Fundamentals of Artificial Neural Networks and Deep Learning

A comprehensive survey on automatic speech recognition using neural networks

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation