A comparative evaluation of the zeros of z transform representation for voice source estimation

Sturmel, Nicolas; D'Alessandro, Christophe; Doval, Boris

doi:10.21437/Interspeech.2007-249

A comparative evaluation of the zeros of z transform representation for voice source estimation

Nicolas Sturmel, Christophe D'Alessandro, Boris Doval

A new method for voice source estimation is evaluated and compared to Linear Prediction (LP) inverse filtering methods (autocorrelation LPC, covariance LPC and IAIF [1]). The method is based on a causal/anticausal model of the voice source and the ZZT (Zeros of Z-Transform) representation [2] for causal/anticausal signal separation. A database containing synthetic speech with various voice source settings and natural speech with acoustic and electro-glottographic signals was recorded. Formal evaluation of source estimation is based on spectral distances. The results show that the ZZT causal/anticausal decomposition method outperforms LP in voice source estimation both for synthetic and natural signals. However, its computational load is much heavier (despite a very simple principle) and the method seems sensitive to noise and computation precision errors.

doi: 10.21437/Interspeech.2007-249

Cite as: Sturmel, N., D'Alessandro, C., Doval, B. (2007) A comparative evaluation of the zeros of z transform representation for voice source estimation. Proc. Interspeech 2007, 558-561, doi: 10.21437/Interspeech.2007-249

@inproceedings{sturmel07_interspeech,
  author={Nicolas Sturmel and Christophe D'Alessandro and Boris Doval},
  title={{A comparative evaluation of the zeros of z transform representation for voice source estimation}},
  year=2007,
  booktitle={Proc. Interspeech 2007},
  pages={558--561},
  doi={10.21437/Interspeech.2007-249}
}