Skip to main content

Speech Emotion Recognition Based on Coiflet Wavelet Packet Cepstral Coefficients

  • Conference paper

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 484))

Abstract

A wavelet packet based adaptive filter-bank construction method is proposed for speech signal processing in this paper. On this basis, a set of acoustic features are proposed for speech emotion recognition, namely Coiflet Wavelet Packet Cepstral Coefficients (CWPCC). CWPCC extends the conventional Mel-Frequency Cepstral Coefficients (MFCC) by adapting the filter-bank structure according to the decision task; Speech emotion recognition system is constructed with the proposed feature set and Gaussian mixture model as classifier. Experimental results on Berlin emotional speech database show that the Coiflet Wavelet Packet is more suitable in speech emotion recognition than other Wavelet Packets and proposed features improve emotion recognition performance over the conventional features.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Morrison, D., Wang, R.L., De Silva, L.C.: Ensemble methods for spokenemotion recognition in call-centres. Speech Comm. 49(2), 98–112 (2007)

    Article  Google Scholar 

  2. France, D.J., Shiavi, R.G., Silverman, S., Silverman, M., Wilkes, M.: Acoustical properties of speech as indicators of depression and suicidalrisk. IEEE Transactions on Biomedical Engineering 47(7), 829–837 (2000)

    Article  Google Scholar 

  3. Caponetti, L., Buscicchio, C.A., Castellano, G.: Biologically inspired emo-tion recognition from speech. Eurasip Journal on Advances in Signal Processing

    Google Scholar 

  4. Malta, L., Miyajima, C., Kitaoka, N., Takeda, K.: Multimodal estimationof a driver’s spontaneous irritation. In: Intelligent Vehicles Symposium, pp. 573–577. IEEE (2009)

    Google Scholar 

  5. Stephane, M.: A Wavelet Tour of Signal Processing, 3rd edn. Academic Press, Burlington (2009)

    MATH  Google Scholar 

  6. Daubechies, I.: Ten Lectures on Wavelets. Society for Industrial and Applied Mathematics, Philadelphia (1992)

    Book  MATH  Google Scholar 

  7. Pavez, E., Silva, J.F.: Analysis and design of wavelet-packet cepstral co-efficients for automaticspeech recognition. Speech Comm. 54(6), 814–835 (2012)

    Article  Google Scholar 

  8. Saito, N., Coifman, R.R.: Local discriminant bases. In: SPIE 2303, Mathematical Imaging:Wavelet Applications in Signal and Image Processing, pp. 2–14 (1994)

    Google Scholar 

  9. Silva, J., Narayanan, S.S.: Discriminative wavelet packet filter bank selection for pattern recognition. IEEE Trans. Signal Process. 57(5), 1796–1810 (2009)

    Article  MathSciNet  Google Scholar 

  10. Rabiner, L., Juang, B.-H.: Fundamentals of Speech Recognition. Prentice-Hall, New Jersey (1993)

    Google Scholar 

  11. Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines. ACM Transactions on Intelligent Systems and Technology (TIST) 2(3), 1–27 (2011)

    Article  Google Scholar 

  12. Burkhardt, F., Paeschke, A., Rolfes, M., Sendlmeier, W., Weiss, B.: A database of german emotional speech. In: Proceeding INTERSPEECH 2005, ISCA, pp. 1517–1520 (2005)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Huang, Y., Wu, A., Zhang, G., Li, Y. (2014). Speech Emotion Recognition Based on Coiflet Wavelet Packet Cepstral Coefficients. In: Li, S., Liu, C., Wang, Y. (eds) Pattern Recognition. CCPR 2014. Communications in Computer and Information Science, vol 484. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-45643-9_46

Download citation

  • DOI: https://doi.org/10.1007/978-3-662-45643-9_46

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-662-45642-2

  • Online ISBN: 978-3-662-45643-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics