ISCA Archive ICSLP 2000

Real-time telephone transmission simulation for speech recognizer and dialogue system evaluation and improvement

Sebastian Möller, Hervé Bourlard

Recognizer performance in telephone-based spoken dialogue systems may be strongly affected by the transmission channel. To investigate the impact of different parts of the transmission channel in more detail, a simulation model is presented. It implements all transmission characteristics of modern telephone networks, based on the instrumentally measurable values used by network planners. The simulation runs in real time on programmable DSP-based hardware. It can be used to investigate recognizer performance systematically as a function of transmission channel degradations, to produce training material with specified transmission characteristics, or to estimate the impact of transmission impairments on dialogue flow and system usability. The impact of transmission channel characteristics on the performance of a speech recognizer integrated in an interactive voice server is analyzed in more detail. It turns out that specific transmission characteristics may lead to a recognition degradation that would not have been expected from the standard training material. An outlook is given on future extensions of the simulation model to better cover the effects of mobile and IP-based telephone systems.


doi: 10.21437/ICSLP.2000-186

Cite as: Möller, S., Bourlard, H. (2000) Real-time telephone transmission simulation for speech recognizer and dialogue system evaluation and improvement. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 1, 750-753, doi: 10.21437/ICSLP.2000-186

@inproceedings{moller00_icslp,
  author={Sebastian Möller and Hervé Bourlard},
  title={{Real-time telephone transmission simulation for speech recognizer and dialogue system evaluation and improvement}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  volume={1},
  pages={750--753},
  doi={10.21437/ICSLP.2000-186}
}