ISCA Archive Eurospeech 1999
ISCA Archive Eurospeech 1999

Simultaneous recognition of multiple sound sources based on 3-d n-best search using microphone array

Panikos Heracleous, Takeshi Yamada, Satoshi Nakamura, Kiyohiro Shikano

The recognition of distant talking speech in a noisy and reverberant environments is key issue in any speech recognition system. A so-called hands-free speech recognition system plays an important role in the natural and friendly human-machine interface. Considering the practical use of a speech recognition system, we realize that such a system has to deal, also, with the case of the presence of multiple sound sources, including multiple talkers, as well as other noise sources. This paper proposes a novel method which recognizes multiple talkers simultaneously in real environments by extending the 3-D Viterbi search to a 3-D N-best search algorithm. While the 3-D Viterbi method finds the most likely path in the 3-D trellis space, the proposed method considers multiple hypotheses for each direction in every frame. Combinations of the direction sequence and the phoneme sequence of multiple sources are included in the N-best list. The paper investigates the performance of the proposed method through experiments using real utterances of multiple talkers.


doi: 10.21437/Eurospeech.1999-21

Cite as: Heracleous, P., Yamada, T., Nakamura, S., Shikano, K. (1999) Simultaneous recognition of multiple sound sources based on 3-d n-best search using microphone array. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 69-72, doi: 10.21437/Eurospeech.1999-21

@inproceedings{heracleous99_eurospeech,
  author={Panikos Heracleous and Takeshi Yamada and Satoshi Nakamura and Kiyohiro Shikano},
  title={{Simultaneous recognition of multiple sound sources based on 3-d n-best search using microphone array}},
  year=1999,
  booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)},
  pages={69--72},
  doi={10.21437/Eurospeech.1999-21}
}