ISCA Archive ICSLP 2000
ISCA Archive ICSLP 2000

First approach to the selection of lexical units for continuous speech recognition of Basque

Miren Karmele López de Ipiña, Inés Torres, Lourdes Oñederra, Amparo Varona, N. Ezeiza, M. Peñagarikano, M. Hernandez, Luis Javier Rodriguez

The selection of appropriated Lexical Units is an important issue in the Language Model (LM) generation. Word has been used classically as unit in most of the Continuous Speech Recognition systems. However, during the last years proposals of non-word units have begun to appear. Since Basque is an agglutinative language with a certain structure inside the word, the nonword units could be an adequate option. In this work, a statistical analysis of the morphological structure of Basque has been carried out. This analysis shows a slight increment of the rates of confusion in Continuous Speech Recognition Systems due to the great increment of acoustically similar and short units. Finally several proposals of Lexical Units are analyzed to deal with the problem.


doi: 10.21437/ICSLP.2000-324

Cite as: López de Ipiña, M.K., Torres, I., Oñederra, L., Varona, A., Ezeiza, N., Peñagarikano, M., Hernandez, M., Rodriguez, L.J. (2000) First approach to the selection of lexical units for continuous speech recognition of Basque. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 2, 531-534, doi: 10.21437/ICSLP.2000-324

@inproceedings{lopezdeipina00b_icslp,
  author={Miren Karmele {López de Ipiña} and Inés Torres and Lourdes Oñederra and Amparo Varona and N. Ezeiza and M. Peñagarikano and M. Hernandez and Luis Javier Rodriguez},
  title={{First approach to the selection of lexical units for continuous speech recognition of Basque}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  pages={vol. 2, 531-534},
  doi={10.21437/ICSLP.2000-324}
}