ISCA Archive Eurospeech 1997
ISCA Archive Eurospeech 1997

Quantization of spectral sequences using variable length spectral segments for speech coding at very low bit rate

Geneviève Baudoin, Jan Cernocky, Gérard Chollet

This paper deals with the coding of spectral envelope parameters for very low bit rate speech coding (inferior to 500 bps). In order to obtain a sufficient intelligibility, segmental techniques are necessary. Variable dimension vector quantization is one of these. We propose a new interpretation of already published research from Chou- Lockabaugh [2] and Cernocky- Baudoin-Chollet [4,6] on the quantization of variable length sequences of spectral vectors, named respectively Variable to Variable length Vector Quantization (VVVQ) and Multigrams Quantization (MGQ). This interpretation gives a meaning to the Lagrange multiplier used in the optimization criterion of the VVVQ, and should allow new developments as, for example, new modelization of the probability density of the source. We have also studied the influence of the limitation of the delay introduced by the method. It was found that a maximal delay of 400 ms is generally sufficient. Finally, we propose the introduction of long sequences in the segmental codebook by linear interpolation of shorter ones.


doi: 10.21437/Eurospeech.1997-196

Cite as: Baudoin, G., Cernocky, J., Chollet, G. (1997) Quantization of spectral sequences using variable length spectral segments for speech coding at very low bit rate. Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997), 1295-1298, doi: 10.21437/Eurospeech.1997-196

@inproceedings{baudoin97_eurospeech,
  author={Geneviève Baudoin and Jan Cernocky and Gérard Chollet},
  title={{Quantization of spectral sequences using variable length spectral segments for speech coding at very low bit rate}},
  year=1997,
  booktitle={Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997)},
  pages={1295--1298},
  doi={10.21437/Eurospeech.1997-196}
}