In this paper we analyze the performance of a speaker diarization system similar to the one that ICSI submitted to the NIST RT06s evaluation benchmark. The analysis, which is based on a series of oracle experiments, provides a good understanding of the performance of each system component on a test set of twelve conference meetings used in previous NIST benchmarks. Our analysis shows that the speech activity detection component contributes most to the total diarization error rate (23%). The inability to model overlapping speech is also a large source of errors (22%), followed by the component that creates the initial system models (15%).
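The idea behind an oracle experiment can be illustrated with a minimal sketch. It assumes the standard definition of diarization error rate (missed speech, false alarm, and speaker error time divided by scored speech time) and assumes that a component's share of the error is measured as the relative drop in DER when that component is replaced by a perfect (oracle) version. The abstract does not spell out the exact attribution procedure, so the function names and all numbers below are hypothetical.

```python
def diarization_error_rate(missed, false_alarm, speaker_error, scored_speech):
    """Return DER as a fraction of scored speech time (all arguments in seconds)."""
    if scored_speech <= 0:
        raise ValueError("scored_speech must be positive")
    return (missed + false_alarm + speaker_error) / scored_speech


def relative_blame(baseline_der, oracle_der):
    """Return the share of the baseline DER that disappears with an oracle component.

    Assumption: blame is the relative DER reduction; this is an illustrative
    choice, not necessarily the paper's procedure.
    """
    if baseline_der <= 0:
        raise ValueError("baseline_der must be positive")
    return (baseline_der - oracle_der) / baseline_der


# Hypothetical error times in seconds, not taken from the paper.
baseline = diarization_error_rate(
    missed=30.0, false_alarm=20.0, speaker_error=50.0, scored_speech=1000.0
)
# Hypothetical rerun in which speech activity detection is replaced by the
# reference segmentation, so missed and false-alarm time drop to zero.
with_oracle_sad = diarization_error_rate(
    missed=0.0, false_alarm=0.0, speaker_error=60.0, scored_speech=1000.0
)

print(f"baseline DER:        {baseline:.3f}")
print(f"DER with oracle SAD: {with_oracle_sad:.3f}")
print(f"relative blame:      {relative_blame(baseline, with_oracle_sad):.2f}")
```

In this toy setup the speaker error is allowed to change once the oracle is in place, since downstream components see different input. That is why each component's blame is read off a full rerun rather than from individual error terms.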
Cite as: Huijbregts, M., Wooters, C. (2007) The blame game: performance analysis of speaker diarization system components. Proc. Interspeech 2007, 1857-1860, doi: 10.21437/Interspeech.2007-517
@inproceedings{huijbregts07_interspeech,
  author={Marijn Huijbregts and Chuck Wooters},
  title={{The blame game: performance analysis of speaker diarization system components}},
  year=2007,
  booktitle={Proc. Interspeech 2007},
  pages={1857--1860},
  doi={10.21437/Interspeech.2007-517}
}