ISCA Archive Interspeech 2012
ISCA Archive Interspeech 2012

Assessment of disordered voices using empirical mode decomposition in the log-spectral domain

Abdellah Kacha, Francis Grenez, Jean Schoentgen

Empirical mode decomposition (EMD) algorithm is proposed as an alternative to decompose the log of the magnitude spectrum of the speech signal into its harmonic, envelope and noise components and the harmonic-to-noise ratio is used to summarize the degree of disturbance in the speech signal. The empirical mode decomposition algorithm is a tool for the analysis of multi-component signals. The analysis method does not require a priori fixed basis function like conventional analysis methods (e.g. Fourier transform and wavelet transform).The proposed method is tested on synthetic vowels and natural speech. The corpus of synthetic vowels comprises 48 stimuli of synthetic sounds [a] that combine three values of vocal frequency, four levels of jitter frequency and four levels of additive noise. The corpora of natural speech comprise a concatenation of the vowel [a] with two Dutch sentences produced by 28 normophonic and 223 speakers with different degrees of dysphonia.

Index Terms: Disordered voices, empirical mode decomposition, harmonic-to-noise ratio.


doi: 10.21437/Interspeech.2012-27

Cite as: Kacha, A., Grenez, F., Schoentgen, J. (2012) Assessment of disordered voices using empirical mode decomposition in the log-spectral domain. Proc. Interspeech 2012, 66-69, doi: 10.21437/Interspeech.2012-27

@inproceedings{kacha12_interspeech,
  author={Abdellah Kacha and Francis Grenez and Jean Schoentgen},
  title={{Assessment of disordered voices using empirical mode decomposition in the log-spectral domain}},
  year=2012,
  booktitle={Proc. Interspeech 2012},
  pages={66--69},
  doi={10.21437/Interspeech.2012-27}
}