Abstract
A new quantization method for transform coding of speech and audio signals is proposed. The spectral coefficients produced by the first transform are split into frequency bands, and the coefficients of each band are transformed again, yielding a second set of coefficients per band. The efficiency of Huffman coding in the two transform domains is then analyzed band by band, and the domain that performs better is selected as the final quantization domain for that band. In addition, a set of frequently occurring domain-selection patterns is predefined to reduce the number of side-information bits needed to indicate the selected domains. The proposed dual-domain quantization method is applied to the ITU-T G.722.1 codec, and the improvement in quantization performance is verified for various speech and audio signals.
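The band-wise domain selection described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and several details are assumptions made only for illustration: the abstract does not name the second transform, so an orthonormal DCT-II is used as a stand-in; the `code_bits` cost model (2|q| + 1 bits per quantized index) is a hypothetical substitute for looking up code lengths in the codec's Huffman tables; and the band size, step size, and pattern set are invented example values.

```python
import math

def dct2(x):
    """Orthonormal DCT-II; an assumed stand-in for the second transform."""
    n = len(x)
    out = []
    for k in range(n):
        s = sum(x[i] * math.cos(math.pi * (i + 0.5) * k / n) for i in range(n))
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        out.append(scale * s)
    return out

def code_bits(coeffs, step):
    """Hypothetical cost model: 2*|q| + 1 bits per quantized index q.
    A real codec would sum code lengths from its Huffman tables instead."""
    return sum(2 * abs(round(c / step)) + 1 for c in coeffs)

def band_costs(spectrum, band_size, step):
    """Return (first-domain bits, second-domain bits) for each band."""
    costs = []
    for start in range(0, len(spectrum), band_size):
        band = spectrum[start:start + band_size]
        costs.append((code_bits(band, step), code_bits(dct2(band), step)))
    return costs

def select_pattern(costs, patterns):
    """Choose the predefined pattern with the lowest total bit count.
    A pattern holds one flag per band: 0 = first domain, 1 = second domain.
    Side information is the index of the chosen pattern."""
    side_bits = math.ceil(math.log2(len(patterns))) if len(patterns) > 1 else 0

    def coeff_bits(pattern):
        return sum(c[d] for c, d in zip(costs, pattern))

    best = min(patterns, key=coeff_bits)
    return best, side_bits + coeff_bits(best)

# Two bands of four coefficients: a flat band, which the second transform
# compacts into a single coefficient, and a band with one isolated peak,
# which is cheaper to code in the first domain.
spectrum = [4.0, 4.0, 4.0, 4.0, 8.0, 0.0, 0.0, 0.0]
patterns = [(0, 0), (1, 1), (1, 0)]  # illustrative predefined pattern set

costs = band_costs(spectrum, band_size=4, step=1.0)
best, total = select_pattern(costs, patterns)
print(costs, best, total)
```

Restricting the choice to a small pattern set means the encoder sends only a pattern index rather than one flag per band, which is the side-information saving the abstract refers to. The price is that the best per-band combination may not be in the set.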
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hong, JS., Choi, JH., Ahn, CB., Sohn, CB., Oh, SJ., Park, H. (2005). Dual-Domain Quantization for Transform Coding of Speech and Audio Signals. In: Ho, YS., Kim, H.J. (eds) Advances in Multimedia Information Processing - PCM 2005. PCM 2005. Lecture Notes in Computer Science, vol 3767. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11581772_64
DOI: https://doi.org/10.1007/11581772_64
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-30027-4
Online ISBN: 978-3-540-32130-9
eBook Packages: Computer Science, Computer Science (R0)