ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Time-varying autoregressive tests for multiscale speech analysis

Daniel Rudoy, Thomas F. Quatieri, Patrick J. Wolfe

In this paper we develop hypothesis tests for speech waveform nonstationarity based on time-varying autoregressive models, and demonstrate their efficacy in speech analysis tasks at both segmental and sub-segmental scales. Key to the successful synthesis of these ideas is our employment of a generalized likelihood ratio testing framework tailored to autoregressive coefficient evolutions suitable for speech. After evaluating our framework on speech-like synthetic signals, we present preliminary results for two distinct analysis tasks using speech waveform data. At the segmental level, we develop an adaptive short-time segmentation scheme and evaluate it on whispered speech recordings, while at the sub-segmental level, we address the problem of detecting the glottal flow closed phase. Results show that our hypothesis testing framework can reliably detect changes in the vocal tract parameters across multiple scales, thereby underscoring its broad applicability to speech analysis.


doi: 10.21437/Interspeech.2009-725

Cite as: Rudoy, D., Quatieri, T.F., Wolfe, P.J. (2009) Time-varying autoregressive tests for multiscale speech analysis. Proc. Interspeech 2009, 2839-2842, doi: 10.21437/Interspeech.2009-725

@inproceedings{rudoy09_interspeech,
  author={Daniel Rudoy and Thomas F. Quatieri and Patrick J. Wolfe},
  title={{Time-varying autoregressive tests for multiscale speech analysis}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={2839--2842},
  doi={10.21437/Interspeech.2009-725}
}