Robust Rule-Based Method for Automatic Break Assignment in Russian Texts

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3658)

Abstract

This paper presents a new rule-based approach to break assignment for Russian: a flexible and robust method for segmenting Russian texts into prosodic units. We implemented it in the recent “Orator” text-to-speech (TTS) system. The model was developed for inflective languages as an alternative to both statistical and strict rule-based algorithms. It is designed so that all potentially tunable context dependencies are exposed in the interface grammar, where linguists can easily modify them. The algorithm performs well on different kinds of texts thanks to this simple and intuitive grammar, which is built on an elaborate mechanism of morpho-grammatical analysis. The juncture correct rate ranges from more than 98% for simple literary texts to 85% for raw transcripts of spontaneous speech.
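The abstract reports a juncture correct rate but does not define it. The sketch below assumes the definition common in phrase-break prediction work: the share of word junctures whose break or no-break label matches a reference annotation. The function name and the toy data are hypothetical illustrations, not taken from the paper.

```python
from typing import Sequence


def juncture_correct_rate(predicted: Sequence[bool],
                          reference: Sequence[bool]) -> float:
    """Fraction of word junctures labelled the same way as the reference.

    Assumed definition (not from the paper): each juncture between two
    adjacent words is True if a break is placed there, False otherwise.
    """
    if len(predicted) != len(reference):
        raise ValueError("both sequences must cover the same junctures")
    if not reference:
        raise ValueError("no junctures to score")
    matches = sum(p == r for p, r in zip(predicted, reference))
    return matches / len(reference)


# Hypothetical toy data: 10 junctures, with one spurious break predicted.
reference = [False, False, True, False, False, True, False, False, False, True]
predicted = [False, False, True, False, True, True, False, False, False, True]

rate = juncture_correct_rate(predicted, reference)
print(f"{rate:.0%}")  # 9 of 10 junctures agree
```

Under this definition junctures without a break count toward the score too, so a text with few breaks can score highly even when some breaks are missed.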

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Oparin, I. (2005). Robust Rule-Based Method for Automatic Break Assignment in Russian Texts. In: Matoušek, V., Mautner, P., Pavelka, T. (eds) Text, Speech and Dialogue. TSD 2005. Lecture Notes in Computer Science, vol 3658. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11551874_46

  • DOI: https://doi.org/10.1007/11551874_46

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-28789-6

  • Online ISBN: 978-3-540-31817-0

  • eBook Packages: Computer Science, Computer Science (R0)
