ISCA Archive Interspeech 2004
ISCA Archive Interspeech 2004

Evolutive speaker segmentation using a repository system

Xavier Anguera Miro, Javier Hernando Pericas

When performing blind speaker segmentation one of the main problems is not knowing how many speakers appear in a conversation and wether they appear once or more than once. In this paper, an iterative method, which is based on the Evolutive-HMM is presented. Two main improvements to this system are introduced. On one hand, a repository generic speaker is used to model all utterances and all speaker models are derived from this iteratively. Different normalization of the scores are applied to the repository and the speakers to emphasize speaker changes. On the other hand, in all cases we use Gaussian Mixture Models (GMM) for their flexibility compared to an HMM structure. This method has been successfully tested using multi-speaker speech sequences generated by concatenation of speech segments from Speecon.


doi: 10.21437/Interspeech.2004-251

Cite as: Miro, X.A., Pericas, J.H. (2004) Evolutive speaker segmentation using a repository system. Proc. Interspeech 2004, 605-608, doi: 10.21437/Interspeech.2004-251

@inproceedings{miro04_interspeech,
  author={Xavier Anguera Miro and Javier Hernando Pericas},
  title={{Evolutive speaker segmentation using a repository system}},
  year=2004,
  booktitle={Proc. Interspeech 2004},
  pages={605--608},
  doi={10.21437/Interspeech.2004-251}
}