Synthesis of Disordered Voices

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3817)

Abstract

The presentation concerns the simulation of disordered voices. The phonatory excitation model is based on shaping functions, which are nonlinear memoryless input-output characteristics that transform a trigonometric driving function into a synthetic phonatory excitation signal. The shaping function model enables controlling the instantaneous frequency and spectral brilliance of the phonatory excitation via two separate parameters. The presentation demonstrates the synthesis of different types of dysperiodicities via a modulation of the amplitude and instantaneous frequency of the harmonic driving function. The voice disorders that are simulated are short- and long-term perturbations of the vocal frequency and cycle amplitude, biphonation, diplophonia and raucity. Acoustic noise due to turbulent airflow is modeled by means of additive white noise.
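
The signal chain described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the polynomial coefficients, the Gaussian cycle-to-cycle perturbations, and all names (`shaping_function`, `synthesize`, `jitter`, `shimmer`, `noise_std`) are assumptions chosen for illustration. A cosine driving function is passed through a memoryless polynomial characteristic; the driving amplitude and instantaneous frequency are perturbed once per cycle, and white noise is added to the output. The perturbation depths are assumed small, so that the frequency stays positive.

```python
import math
import random


def shaping_function(x, coeffs):
    """Memoryless polynomial characteristic: sum of coeffs[k] * x**k (Horner form)."""
    y = 0.0
    for c in reversed(coeffs):
        y = y * x + c
    return y


def synthesize(n_samples, fs=16000.0, f0=125.0, drive_amp=1.0,
               jitter=0.0, shimmer=0.0, noise_std=0.0,
               coeffs=(0.0, 1.0, 0.5, -0.3), seed=0):
    """Return n_samples of a synthetic excitation signal.

    jitter and shimmer are relative standard deviations of the per-cycle
    frequency and driving-amplitude perturbations (illustrative assumption).
    coeffs are placeholder values, not taken from the paper.
    """
    rng = random.Random(seed)
    phase = 0.0
    f_cycle = f0          # instantaneous frequency of the current cycle
    a_cycle = drive_amp   # driving amplitude of the current cycle
    out = []
    for _ in range(n_samples):
        driving = a_cycle * math.cos(phase)           # trigonometric driving function
        sample = shaping_function(driving, coeffs)    # nonlinear memoryless shaping
        out.append(sample + rng.gauss(0.0, noise_std))  # additive white noise
        phase += 2.0 * math.pi * f_cycle / fs
        if phase >= 2.0 * math.pi:                    # a new cycle starts
            phase -= 2.0 * math.pi
            f_cycle = f0 * (1.0 + jitter * rng.gauss(0.0, 1.0))
            a_cycle = drive_amp * (1.0 + shimmer * rng.gauss(0.0, 1.0))
    return out


clean = synthesize(1600)
rough = synthesize(1600, jitter=0.02, shimmer=0.05, noise_std=0.01)
```

In this sketch the frequency parameter sets the cycle length, while the driving amplitude changes how strongly the higher-order polynomial terms contribute, and therefore the harmonic richness of the output. Slower modulations of `f_cycle` and `a_cycle` would be one way to imitate the long-term perturbations mentioned in the abstract.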

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Hanquinet, J., Grenez, F., Schoentgen, J. (2006). Synthesis of Disordered Voices. In: Faundez-Zanuy, M., Janer, L., Esposito, A., Satue-Villar, A., Roure, J., Espinosa-Duro, V. (eds) Nonlinear Analyses and Algorithms for Speech Processing. NOLISP 2005. Lecture Notes in Computer Science, vol 3817. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11613107_20

  • DOI: https://doi.org/10.1007/11613107_20

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-31257-4

  • Online ISBN: 978-3-540-32586-4

  • eBook Packages: Computer Science, Computer Science (R0)
