Automatic Severity Classification of Dysarthric speech by using Self-supervised Model with Multi-task Learning

Yeo, Eun Jung; Choi, Kwanghee; Kim, Sunhee; Chung, Minhwa

arXiv:2210.15387 (cs)

[Submitted on 27 Oct 2022 (v1), last revised 28 Apr 2023 (this version, v3)]

Abstract: Automatic assessment of dysarthric speech is essential for sustained treatment and rehabilitation. However, atypical speech is difficult to collect, which often leads to data scarcity. To address this problem, we propose a novel automatic severity assessment method for dysarthric speech that combines a self-supervised model with multi-task learning. Wav2vec 2.0 XLS-R is jointly trained on two tasks: severity classification and auxiliary automatic speech recognition (ASR). For the baseline experiments, we employ hand-crafted acoustic features and machine learning classifiers such as SVM, MLP, and XGBoost. Evaluated on the Korean dysarthric speech QoLT database, our model outperforms the traditional baseline methods, with a relative increase of 1.25% in F1-score. In addition, the proposed model surpasses the same model trained without the ASR head, achieving a relative improvement of 10.61%. Furthermore, we show how multi-task learning affects severity classification performance by analyzing the latent representations and the regularization effect.
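
As a reading aid, the relative F1-score gains reported in the abstract are conventionally computed as the difference between the new score and the baseline score, divided by the baseline. The sketch below assumes that convention. The function name `relative_increase` and the example scores are hypothetical and are not taken from the paper.

```python
def relative_increase(baseline: float, new: float) -> float:
    """Return the relative change from baseline to new, in percent.

    Assumes the usual definition: (new - baseline) / baseline * 100.
    """
    if baseline == 0:
        raise ValueError("baseline must be non-zero")
    return (new - baseline) / baseline * 100.0


if __name__ == "__main__":
    # Hypothetical scores for illustration only, not results from the paper.
    print(relative_increase(0.5, 0.75))  # prints 50.0
```

Under this definition a relative gain is distinct from an absolute gain in percentage points: in the hypothetical example the score rises by 0.25 in absolute terms, which is 50% relative to the baseline.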

Comments:	Accepted to ICASSP 2023
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Sound (cs.SD); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2210.15387 [cs.CL]
	(or arXiv:2210.15387v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2210.15387

Submission history

From: Eun Jung Yeo
[v1] Thu, 27 Oct 2022 12:48:10 UTC (186 KB)
[v2] Wed, 22 Mar 2023 19:38:02 UTC (187 KB)
[v3] Fri, 28 Apr 2023 16:41:16 UTC (187 KB)
