Abstract
Because the symptoms used to diagnose mild to moderate intellectual disability emerge only with age, early diagnosis has long been difficult. Diagnosis relies on tests of intellectual functioning and of adaptive behaviours, including communication skills. This paper proposes speech features as an early indicator of the disorder: the features are used to train machine learning algorithms that differentiate the speech of normally developing children from that of children with intellectual disability. Speech abnormalities are quantified using acoustic parameters, namely Linear Predictive Cepstral Coefficients, Mel Frequency Cepstral Coefficients and spectral features, extracted from the speech samples of 48 participants (24 with intellectual disability and 24 age-matched controls). The extracted features form a training dataset on which various classifiers were trained. The experiments show promising results, with the Support Vector Machine reaching an accuracy of 98%. Consequently, a well-trained classification algorithm can serve as an aid in the early detection of mild to moderate intellectual disability.
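To make one of the named acoustic parameters concrete, the following is a minimal sketch of how Linear Predictive Cepstral Coefficients could be computed for a single speech frame. It is not the authors' implementation. The function names, the Hamming window, the prediction order of 12 and the number of cepstral coefficients are all assumptions chosen for illustration. The sketch uses the autocorrelation method with the Levinson-Durbin recursion, followed by the standard LPC-to-cepstrum recursion.

```python
import numpy as np


def autocorrelation(frame, order):
    """Autocorrelation r[0..order] of one frame."""
    frame = np.asarray(frame, dtype=float)
    n = len(frame)
    return np.array([np.dot(frame[: n - k], frame[k:]) for k in range(order + 1)])


def levinson_durbin(r, order):
    """Solve the LPC normal equations.

    Returns (alpha, err), where x[n] is predicted as
    sum_j alpha[j-1] * x[n-j] and err is the residual energy.
    """
    a = np.zeros(order + 1)  # coefficients of A(z) = 1 + sum_j a[j] z^-j
    a[0] = 1.0
    err = float(r[0])
    for i in range(1, order + 1):
        if err <= 0.0:  # silent or degenerate frame: stop early
            break
        acc = r[i] + np.dot(a[1:i], r[i - 1:0:-1])
        k = -acc / err  # reflection coefficient
        prev = a.copy()
        for j in range(1, i):
            a[j] = prev[j] + k * prev[i - j]
        a[i] = k
        err *= 1.0 - k * k
    return -a[1:], err


def lpc_to_cepstrum(alpha, n_ceps):
    """Cepstral coefficients c[1..n_ceps] of the all-pole model 1/(1 - sum alpha_k z^-k)."""
    p = len(alpha)
    c = np.zeros(n_ceps + 1)  # c[0] (gain term) is not computed here
    for m in range(1, n_ceps + 1):
        acc = alpha[m - 1] if m <= p else 0.0
        for k in range(max(1, m - p), m):
            acc += (k / m) * c[k] * alpha[m - k - 1]
        c[m] = acc
    return c[1:]


def lpcc_features(frame, order=12, n_ceps=12):
    """LPCC vector for one frame (order and n_ceps are illustrative defaults)."""
    frame = np.asarray(frame, dtype=float)
    windowed = frame * np.hamming(len(frame))
    r = autocorrelation(windowed, order)
    alpha, _ = levinson_durbin(r, order)
    return lpc_to_cepstrum(alpha, n_ceps)


if __name__ == "__main__":
    # Synthetic stand-in for a speech frame; real input would be recorded audio.
    rng = np.random.default_rng(0)
    frame = rng.standard_normal(400)
    print(lpcc_features(frame))
```

In a pipeline of the kind the abstract describes, per-frame vectors like these would be aggregated per recording, combined with the other feature types, and passed to a classifier such as a Support Vector Machine. Those steps are omitted from this sketch.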
Cite this article
Aggarwal, G., Singh, L. Evaluation of Supervised Learning Algorithms Based on Speech Features as Predictors to the Diagnosis of Mild to Moderate Intellectual Disability. 3D Res 9, 55 (2018). https://doi.org/10.1007/s13319-018-0207-6
DOI: https://doi.org/10.1007/s13319-018-0207-6