Abstract
Conventional Automatic Speech Recognition systems solely rely on acoustic information, making them susceptible to problems like environmental noise, privacy, information disclosure and also excluding users with speech impairments. An Ultrasonic Doppler Sensing (UDS) based interface may be used to tackle these issues since it does not rely on audio signal information. This paper describes the first speech recognition experiments based on UDS for European Portuguese (EP). The work here presented analyzes the UDS signal and explores the recognition of EP digits and minimal pairs of words that only differ on nasality of one of the phones. The results of our experiments show a best word error rate of 27.8% using data collected with the device at different distances from the speaker in an isolated word recognition problem.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Srinivasan, S., Raj, B., Ezzat, T.: Ultrasonic sensing for robust speech recognition. In: Internat. Conf. on Acoustics, Speech, and Signal Processing (2010)
Toth, A.R., Kalgaonkar, K., Raj, B., Ezzat, T.: Synthesizing speech from Doppler signals. In: IEEE International Conference on Acoustics Speech and Signal Processing, pp. 4638–4641 (2010)
Freitas, J., Teixeira, A., Dias, M.S., Bastos, C.: Towards a Multimodal Silent Speech Interface for European Portuguese. In: Ipsic, I. (ed.) Speech Technologies. InTech (2011) ISBN: 978-953-307-996-7
Freitas, J., Teixeira, A., Dias, M.S.: Towards a Silent Speech Interface for Portuguese: Surface Electromyography and the nasality challenge. In: Int. Conf. on Bio-inspired Systems and Signal Processing, Vilamoura, Algarve, Portugal (2012)
Kalgaonkar, K., Raj, B., Hu, R.: Ultrasonic doppler for voice activity detection. IEEE Signal Processing Letters 14(10), 754–757 (2007)
Kalgaonkar, K., Raj, B.: Ultrasonic doppler sensor for speaker recognition. In: Internat. Conf. on Acoustics, Speech, and Signal Processing (2008)
Jennings, D.L., Ruck, D.W.: Enhancing automatic speech recognition with an ultrasonic lipmotion detector. In: Internat. Conf. on Acoustics, Speech, and Signal Processing, Detroit (1995)
Zhu, B.: Multimodal speech recognition with ultrasonic sensors. Master’s thesis. Massachusetts Institute of Technology, Cambridge, Massachusetts (2008)
Hu, R., Raj, B.: A Robust Voice Activity Detector Using an Acoustic Doppler Radar. In: IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), pp. 171–176 (2005)
Ellis, D.: Dynamic Time Warp (DTW) in Matlab. Web resource (2003), http://www.ee.columbia.edu/~dpwe/resources/matlab/dtw/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Freitas, J., Teixeira, A., Vaz, F., Dias, M.S. (2012). Automatic Speech Recognition Based on Ultrasonic Doppler Sensing for European Portuguese. In: Torre Toledano, D., et al. Advances in Speech and Language Technologies for Iberian Languages. Communications in Computer and Information Science, vol 328. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35292-8_24
Download citation
DOI: https://doi.org/10.1007/978-3-642-35292-8_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35291-1
Online ISBN: 978-3-642-35292-8
eBook Packages: Computer ScienceComputer Science (R0)