Skip to main content

Phoneme Set and Pronouncing Dictionary Creation for Large Vocabulary Continuous Speech Recognition of Vietnamese

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8082))

Abstract

This paper describes our study on solving two basic problems of large vocabulary continuous speech recognition (LVCSR) of Vietnamese, which can be used as a standard reference for Vietnamese researchers and other researchers who are interested in Vietnamese language. First, a standard phoneme set is proposed with its corresponding grapheme-to-phoneme map. This phoneme set is the core to solve other problems related to LVCSR of Vietnamese. Then the creation of standard pronouncing dictionary based on the grapheme-to-phoneme map and the analysis of Vietnamese syllable is also described. Finally, we present the results on LVCSR using different types of pronouncing dictionary, which show some interesting aspects of Vietnamese language such as the structure of Vietnamese syllable and the effect of tone in the relationship with syllable.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Chuong, N.T.: Selection of Sentence Set for Vietnamese Audio-Visual Corpus Design. In: IDAACS 2011, Praha, Czech Republic, pp. 492–495 (2011)

    Google Scholar 

  2. Vu, T.T., Nguyen, D.T., Luong, C.M., Hosom, J.P.: Vietnamese large vocabulary continuous speech recognition. In: INTERSPEECH 2005, pp. 1689–1692 (2005)

    Google Scholar 

  3. Vu, Q., Demuynck, K., Van Compernolle, D.: Vietnamese Automatic Speech Recognition: The FLaVoR Approach. In: Huo, Q., Ma, B., Chng, E.-S., Li, H. (eds.) ISCSLP 2006. LNCS (LNAI), vol. 4274, pp. 464–474. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  4. Nguyen, H.Q., Nocera, P., Castelli, E., Trinh, V.L.: Using tone information for Vietnamese continuous speech recognition. In: RIVF 2008, pp. 103–106 (2008)

    Google Scholar 

  5. Nguyen, H.Q., Trinh, V.L., Le, T.D.: Automatic Speech Recognition for Vietnamese Using HTK System. In: RIVF 2010, pp. 1–4 (2010)

    Google Scholar 

  6. Vu, Q., et al.: A Robust Transcription System for Soccer Video Database. In: ICALIP, Shanghai (2010)

    Google Scholar 

  7. Nguyen, T., Vu, Q.: Advances in Acoustic Modeling for Vietnamese LVCSR. In: IALP 2009, pp. 280–284 (2009)

    Google Scholar 

  8. Vu, N.T., Schultz, T.: Vietnamese Large Vocabulary Continuous Speech Recognition. In: ASRU IEEE, Italy, pp. 333–338 (2009)

    Google Scholar 

  9. Vu, N.T., Schultz, T.: Optimization On Vietnamese Large Vocabulary Speech Recognition. In: Workshop on Spoken Languages Technologies for Under-Resourced Languages, SLTU 2010, Penang, Malaysia (May 03, 2010)

    Google Scholar 

  10. Le, V.B., Besacier, L.: Comparison of Acoustic Modeling Techniques for Vietnamese and Khmer ASR. In: ICSLP 2006, Pittsburgh, PA (September 2006)

    Google Scholar 

  11. Nguyen, H.Q., Nocera, P., Castelli, E., Trinh, V.L.: Large vocabulary continuous speech recognition for Vietnamese, an under-resourced language. In: SLTU 2008, Ha Noi, Vietnam, May 5-7 (2008)

    Google Scholar 

  12. Le, V.B., Tran, D.D., Besacier, L., Castelli, E., Serignat, J.-F.: First steps in building a large vocabulary continuous speech recognition system for Vietnamese. In: RIVF 2005, Can Tho, Vietnam (February 2005)

    Google Scholar 

  13. Le, T., Nguyen, H., Vu, Q.: Progress in Transcription of Vietnamese Broadcast News. In: Proc. International Conference on Communications and Electronics, ICCE 2006 (October 2006)

    Google Scholar 

  14. Hoang, P.: Syllable Dictionary. Danang Publisher, Vietnam (1996)

    Google Scholar 

  15. Steve, Y., Odel, J., Ollason, D., Valtchev, V., Woodland, P.: The HTK Book, version 3.2. Cambr. Univ., UK (2002)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Nguyen, T.C., Chaloupka, J. (2013). Phoneme Set and Pronouncing Dictionary Creation for Large Vocabulary Continuous Speech Recognition of Vietnamese. In: Habernal, I., Matoušek, V. (eds) Text, Speech, and Dialogue. TSD 2013. Lecture Notes in Computer Science(), vol 8082. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40585-3_50

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-40585-3_50

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-40584-6

  • Online ISBN: 978-3-642-40585-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics