Automatic Speech Recognition Based on Ultrasonic Doppler Sensing for European Portuguese

Freitas, João; Teixeira, António; Vaz, Francisco; Dias, Miguel Sales

doi:10.1007/978-3-642-35292-8_24

Part of the book series: Communications in Computer and Information Science (CCIS, volume 328)

Abstract

Conventional Automatic Speech Recognition systems rely solely on acoustic information, which makes them susceptible to environmental noise, raises privacy and information-disclosure concerns, and excludes users with speech impairments. An interface based on Ultrasonic Doppler Sensing (UDS) may address these issues, since it does not rely on the audio signal. This paper describes the first speech recognition experiments based on UDS for European Portuguese (EP). The work presented here analyzes the UDS signal and explores the recognition of EP digits and of minimal pairs of words that differ only in the nasality of one of the phones. In an isolated word recognition task, our experiments yield a best word error rate of 27.8% using data collected with the device at different distances from the speaker.

Author information

Authors and Affiliations

Microsoft Language Development Center, Lisboa, Portugal
João Freitas & Miguel Sales Dias
Dep. Electronics Telecommunications & Informatics/IEETA, University of Aveiro, Portugal
António Teixeira & Francisco Vaz
ISCTE-Lisbon University Institute/ADETTI-IUL, Lisboa, Portugal
Miguel Sales Dias

Editor information

Editors and Affiliations

Escuela Politecnica Superior, Universidad Autonoma de Madrid. C/ Francisco, Tomas y Valiente 11, 28049, Madrid, Spain
Doroteo Torre Toledano
Centro Politécnico Superior, Edificio Ada Byron, C/ María de Luna nº 1, 50018, Zaragoza, Spain
Alfonso Ortega Giménez
Universidade de Aveiro, Campus Universitário Aveiro, 3810-193, Aveiro, Portugal
António Teixeira
Escuela Politecnica Superior, Universidad Autonoma de Madrid, C/ Francisco, Tomas y Valiente 11, 28049, Madrid, Spain
Joaquín González Rodríguez
E.T.S.I.Telecomunicacion, Universidad Politécnica de Madrid, Ciudad Universitaria s/n, 28040, Madrid, Spain
Luis Hernández Gómez & Rubén San Segundo Hernández
Escuela Politecnica Superior, Universidad Autonoma de Madrid, C/ Francisco, Tomas y Valiente 11, 28049, Madrid, Spain
Daniel Ramos Castro

About this paper

Cite this paper

Freitas, J., Teixeira, A., Vaz, F., Dias, M.S. (2012). Automatic Speech Recognition Based on Ultrasonic Doppler Sensing for European Portuguese. In: Torre Toledano, D., et al. Advances in Speech and Language Technologies for Iberian Languages. Communications in Computer and Information Science, vol 328. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35292-8_24

DOI: https://doi.org/10.1007/978-3-642-35292-8_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35291-1
Online ISBN: 978-3-642-35292-8
eBook Packages: Computer Science, Computer Science (R0)
